Recent Releases of PDFIO
PDFIO - v0.1.15
PDFIO v0.1.15
Merged pull requests: - Set Compat for OpenSSL_jll (#108) (@mkitti)
Closed issues:
- Bad PDF files which have junk characters before header and after EOF marker error unexpected character. (#97)
- AssertionError: Invalid interval (#99)
- Error on open a file: ArgumentError: extra characters after whitespace in "4970\n396" (#109)
- zlib1 error in Windows (#110)
- assert error reading a pdf (#111)
- Still active? (#113)
- Incorrect Doc Info (#114)
- MethodError: no method matching setindex!(::Rectangle.IntervalTree{UInt16, Int64}, ::Float32, ::Rectangle.Interval{UInt16}) (#115)
Scientific Software - Peer-reviewed
- Julia
Published by github-actions[bot] over 1 year ago
PDFIO - v0.1.10
PDFIO v0.1.10
Closed issues: - How to retrieve text from pdPageExtractText(stdout, page)? (#82) - Would you give additional examples of how to work with pages? (#86) - SystemError: GetTempFileName (#87) - invalid or incomplete deflate data (#88) - Bug: error reading attached PDF, works with other PDFs. (#89) - Bug: no method matching readinternalstream_data(::IOStream, ::CosDict, ::Base.GenericIOBuffer{Array{UInt8,1}}) (#90) - ERROR: MethodError: no method matching String(::PDFIO.Cos.CosIndirectObject{CosName}) (#92)
Merged pull requests: - Install TagBot as a GitHub Action (#85) (@JuliaTagBot)
Scientific Software - Peer-reviewed
- Julia
Published by github-actions[bot] almost 6 years ago
PDFIO - v0.1.9
v0.1.9 (2019-11-30)
Closed issues:
- Getting assertation error in show_text_layout! (#81)
- _error not defined (#78)
- build fails (#76)
- Move the Zlib and OpenSSL dependency to JuliaBinaryWrappers (#73)
Merged pull requests:
- Fix for #78 (#80) (sambitdash)
- Update paper.md (#77) (alexhanna)
- Using the libraries built by binary builders (#75) (sambitdash)
Scientific Software - Peer-reviewed
- Julia
Published by julia-tagbot[bot] over 6 years ago
PDFIO - v0.1.8
v0.1.8 (2019-11-03)
Closed issues:
- Google Docs PDF fails at pdPageExtractText (#74)
- Not able to execute any functions on a basic PDF. Error: Found ' (32)' Expected '<' here (#72)
- Secured PDF document with X509 certificates (#3)
Merged pull requests:
- Enhancing the font unicode mapping framework. (#71) (sambitdash)
Scientific Software - Peer-reviewed
- Julia
Published by julia-tagbot[bot] over 6 years ago
PDFIO - v0.1.7
v0.1.7 (2019-07-12)
Merged pull requests:
- PKI Security Handler implementation (#69) (sambitdash)
Scientific Software - Peer-reviewed
- Julia
Published by julia-tagbot[bot] almost 7 years ago
PDFIO - v0.1.6 - Support for password protected PDF files
v0.1.6 (2019-07-09)
Merged pull requests:
- Support for encrypted PDF files with standard crypto handler (#67) (sambitdash)
Scientific Software - Peer-reviewed
- Julia
Published by julia-tagbot[bot] almost 7 years ago
PDFIO - v0.1.5 - Documentation Update
Documentation has been significantly updated and sample code is added to most methods.
Scientific Software - Peer-reviewed
- Julia
Published by julia-tagbot[bot] about 7 years ago
PDFIO - v0.1.4 - Release Notes
v0.1.4 (2019-06-17)
This release has the following enhancements:
- Support for validation of Digital Signatures in a PDF document.
- Performance improvement of
pdPageExtractText.
Scientific Software - Peer-reviewed
- Julia
Published by julia-tagbot[bot] about 7 years ago
PDFIO - Documentation automated
Release 0.1.3
- Documentation updated to the current and can be automatically updated with every release.
pdDocGetOutlinemethod added to get access to the document bookmarks (PDF terminology outline). a.PDOutlineandPDOutlineItemprovide access structures to traverse the outlines. You can use theAbstractTreesmodule interfaces to traverse these structures. b.pdOutlineItemGetAttrsenables you to query thePDOutlineItemsto get the details.- Page number related APIs:
a.
pdPageGetPageNumber- gets the physical page number for the current page. b.pdDocGetPageLabel- gets the logical page label given an absolute page number. - Fonts related methods:
a.
pdFontIsBold,pdFontIsItalic,pdFontIsFixedW,pdFontIsAllCap,pdFontIsSmallCap- provide the font attributes of the font. However, bold attribute of a font is just an estimate as in PDF, bold can be simulated by changing font weights, by over printing, Hence, the attribute may not be very accurate.
Scientific Software - Peer-reviewed
- Julia
Published by sambitdash about 7 years ago
PDFIO - Support for outlines and large number of bug fixes
This release has the following major changes:
- Support for extracting PDF outlines (bookmarks)
- Moving all test files to separate repository for better license management
- Large number of bug fixes affecting files created with LaTeX.
- Better font support with kerning.
Scientific Software - Peer-reviewed
- Julia
Published by sambitdash about 7 years ago
PDFIO - Fix for pdDocGetInfo()
Fix for pdDocGetInfo()
Scientific Software - Peer-reviewed
- Julia
Published by sambitdash over 7 years ago
PDFIO - Compatibility with Julia 1.0
Support for all older versions and Compat libraries have been removed.
Scientific Software - Peer-reviewed
- Julia
Published by sambitdash almost 8 years ago
PDFIO - Fixes for 0.7
0.7 now supported.
Scientific Software - Peer-reviewed
- Julia
Published by sambitdash almost 8 years ago
PDFIO - Improving more use cases for pdPageExtractText
- Text extraction from Form XObjects
- LZW filter
- Bug fixes
Scientific Software - Peer-reviewed
- Julia
Published by sambitdash about 8 years ago
PDFIO - Removed package dependencies.
PDFIO now has minimal package dependencies.
Scientific Software - Peer-reviewed
- Julia
Published by sambitdash about 8 years ago
PDFIO - Enhanced pdPageExtractText Method
pdPageExtractText has been significantly enhanced to address relatively complex layout issues in rendering text.
Scientific Software - Peer-reviewed
- Julia
Published by sambitdash over 8 years ago
PDFIO - New pdPageExtractText Method
Changes this release:
- A new
pdPageExtractTextmethod is introduced which does a cleaner text conversion for complex PDFs including non-tagged PDFs. - Bug fixes
Text conversions carried out on 25,000+ files.
Scientific Software - Peer-reviewed
- Julia
Published by sambitdash over 8 years ago
PDFIO - Text extraction from PDF files
The release provides the following functionalities.
- Has a text extraction API
pdPageExtractText(page) - Supports Unicode code extraction from font encoding as well as Unicode CMap. (does not read into the font internal encoding embedded in the font file)
- Supports Adobe’s encoding for Latin fonts (AdobeGlyphList). Symbol and ZapfDingbats encodings are supported as well.
- Does not do any special handling for tagged PDFs but tagged PDFs may behave better as the creation order and reading order of document objects are similar.
Scientific Software - Peer-reviewed
- Julia
Published by sambitdash almost 9 years ago
PDFIO - PDFIO v0.0.6
- Implementation of PDF Common Data types - Text Strings - Date - Name Tree - Number Tree
- Page Labels
- File attachments and annotations supported as custom scripts
- Cleaner implementation of
showandprintmethods of PDF Objects - Inline API documentation in REPL
Scientific Software - Peer-reviewed
- Julia
Published by sambitdash almost 9 years ago