Recent Releases of PDFIO

PDFIO - v0.1.15

Diff since v0.1.14

Merged pull requests:

  • Set Compat for OpenSSL_jll (#108) (@mkitti)

Closed issues:

  • Bad PDF files which have junk characters before header and after EOF marker error unexpected character. (#97)
  • AssertionError: Invalid interval (#99)
  • Error on open a file: ArgumentError: extra characters after whitespace in "4970\n396" (#109)
  • zlib1 error in Windows (#110)
  • assert error reading a pdf (#111)
  • Still active? (#113)
  • Incorrect Doc Info (#114)
  • MethodError: no method matching setindex!(::Rectangle.IntervalTree{UInt16, Int64}, ::Float32, ::Rectangle.Interval{UInt16}) (#115)

Scientific Software - Peer-reviewed - Julia
Published by github-actions[bot] over 1 year ago

PDFIO - v0.1.14

Diff since v0.1.13

Closed issues:

  • Error in merge_encoding! when extracting text (#103)

Merged pull requests:

  • Another approach to remove the type instability (#105) (@sambitdash)

Published by github-actions[bot] almost 4 years ago

PDFIO - v0.1.13

Diff since v0.1.12

Closed issues:

  • Support for AbstractTrees v0.3 (#98)

Published by github-actions[bot] almost 5 years ago

PDFIO - v0.1.12

Diff since v0.1.11

Closed issues:

  • Precompilation issue with Julia v1.6 (#95)

Published by github-actions[bot] over 5 years ago

PDFIO - v0.1.11

Diff since v0.1.10

Closed issues:

  • OpenSSL warning upon using PDFIO (#91)
  • New release? (#93)
  • ERROR: Found 'j(106)' Expected 'x' here (#94)

Published by github-actions[bot] over 5 years ago

PDFIO - v0.1.10

Diff since v0.1.9

Closed issues:

  • How to retrieve text from pdPageExtractText(stdout, page)? (#82)
  • Would you give additional examples of how to work with pages? (#86)
  • SystemError: GetTempFileName (#87)
  • invalid or incomplete deflate data (#88)
  • Bug: error reading attached PDF, works with other PDFs. (#89)
  • Bug: no method matching readinternalstream_data(::IOStream, ::CosDict, ::Base.GenericIOBuffer{Array{UInt8,1}}) (#90)
  • ERROR: MethodError: no method matching String(::PDFIO.Cos.CosIndirectObject{CosName}) (#92)

Merged pull requests:

  • Install TagBot as a GitHub Action (#85) (@JuliaTagBot)

Published by github-actions[bot] almost 6 years ago

PDFIO - v0.1.9

v0.1.9 (2019-11-30)

Diff since v0.1.8

Closed issues:

  • Getting assertation error in show_text_layout! (#81)
  • _error not defined (#78)
  • build fails (#76)
  • Move the Zlib and OpenSSL dependency to JuliaBinaryWrappers (#73)

Published by julia-tagbot[bot] over 6 years ago

PDFIO - v0.1.8

v0.1.8 (2019-11-03)

Diff since v0.1.7

Closed issues:

  • Google Docs PDF fails at pdPageExtractText (#74)
  • Not able to execute any functions on a basic PDF. Error: Found ' (32)' Expected '<' here (#72)
  • Secured PDF document with X509 certificates (#3)

Merged pull requests:

  • Enhancing the font unicode mapping framework. (#71) (sambitdash)

Published by julia-tagbot[bot] over 6 years ago

PDFIO - v0.1.7

v0.1.7 (2019-07-12)

Diff since v0.1.6

Merged pull requests:

  • PKI Security Handler implementation (#69) (sambitdash)

Published by julia-tagbot[bot] almost 7 years ago

PDFIO - v0.1.6 - Support for password protected PDF files

v0.1.6 (2019-07-09)

Diff since v0.1.5

Merged pull requests:

  • Support for encrypted PDF files with standard crypto handler (#67) (sambitdash)

Published by julia-tagbot[bot] almost 7 years ago

PDFIO - v0.1.5 - Documentation Update

The documentation has been significantly updated, and sample code has been added for most methods.

Published by julia-tagbot[bot] about 7 years ago

PDFIO - v0.1.4 - Release Notes

v0.1.4 (2019-06-17)

This release has the following enhancements:

  1. Support for validation of Digital Signatures in a PDF document.
  2. Performance improvement of pdPageExtractText.

Published by julia-tagbot[bot] about 7 years ago

PDFIO - Documentation automated

Release 0.1.3

  1. The documentation has been brought up to date and can now be updated automatically with every release.
  2. pdDocGetOutline method added to get access to the document bookmarks (called outlines in PDF terminology).
     a. PDOutline and PDOutlineItem provide access structures to traverse the outlines. You can use the AbstractTrees module interfaces to traverse these structures.
     b. pdOutlineItemGetAttrs enables you to query a PDOutlineItem to get its details.
  3. Page number related APIs:
     a. pdPageGetPageNumber - gets the physical page number for the current page.
     b. pdDocGetPageLabel - gets the logical page label given an absolute page number.
  4. Font related methods:
     a. pdFontIsBold, pdFontIsItalic, pdFontIsFixedW, pdFontIsAllCap, pdFontIsSmallCap - provide the attributes of a font. However, the bold attribute is only an estimate: in PDF, bold can be simulated by changing font weights or by overprinting, so the attribute may not be very accurate.

Published by sambitdash about 7 years ago

PDFIO - Support for outlines and large number of bug fixes

This release has the following major changes:

  1. Support for extracting PDF outlines (bookmarks)
  2. Moving all test files to separate repository for better license management
  3. Large number of bug fixes affecting files created with LaTeX.
  4. Better font support with kerning.

Published by sambitdash about 7 years ago

PDFIO - Fix for pdDocGetInfo()

Fix for pdDocGetInfo()

Published by sambitdash over 7 years ago

PDFIO - Compatibility with Julia 1.0

Support for all older Julia versions and for the Compat libraries has been removed.

Published by sambitdash almost 8 years ago

PDFIO - Fixes for 0.7

Julia 0.7 is now supported.

Published by sambitdash almost 8 years ago

PDFIO - Improving more use cases for pdPageExtractText

  1. Text extraction from Form XObjects
  2. LZW filter
  3. Bug fixes

Published by sambitdash about 8 years ago

PDFIO - Removed package dependencies.

PDFIO now has minimal package dependencies.

Published by sambitdash about 8 years ago

PDFIO - Enhanced pdPageExtractText Method

pdPageExtractText has been significantly enhanced to address relatively complex layout issues in rendering text.

Published by sambitdash over 8 years ago

PDFIO - New pdPageExtractText Method

Changes this release:

  1. A new pdPageExtractText method is introduced which does a cleaner text conversion for complex PDFs including non-tagged PDFs.
  2. Bug fixes

Text conversions were carried out on 25,000+ files.

Published by sambitdash over 8 years ago

PDFIO - Text extraction from PDF files

This release provides the following functionality:

  1. Has a text extraction API pdPageExtractText(page)
  2. Supports Unicode extraction from the font encoding as well as from the Unicode CMap (does not read the internal encoding embedded in the font file).
  3. Supports Adobe’s encoding for Latin fonts (AdobeGlyphList). Symbol and ZapfDingbats encodings are supported as well.
  4. Does not do any special handling for tagged PDFs but tagged PDFs may behave better as the creation order and reading order of document objects are similar.

Published by sambitdash almost 9 years ago

PDFIO - v0.0.6

  1. Implementation of PDF common data types: text strings, dates, name trees, and number trees
  2. Page Labels
  3. File attachments and annotations supported as custom scripts
  4. Cleaner implementation of show and print methods of PDF Objects
  5. Inline API documentation in REPL

Published by sambitdash almost 9 years ago