September 10, 2012. This release adds multi-column text detection and extraction. It also allows rendering and printing of very large CCITT fax images (200 MB and up) and improves text extraction from PDF documents where Unicode mapping is missing.

Library versions: jPDFProcess v4.80, jPDFAssemble v2.10, jPDFFields v3.10, jPDFImages v4.000, jPDFPrint v4.30, jPDFSecure v3.10, jPDFText v3.10

New Features

Multi-Column text detection – 2658 – [jPDFProcess, jPDFText]
Our text extraction process now supports multi-column documents (such as research or magazine articles). We think that our new text extraction algorithm is first-in-class compared to other PDF tools, and we hope you’ll enjoy it too.

Improvements

Handle very large CCITT fax images – 2991 – [jPDFProcess, jPDFImages, jPDFPrint]
Previously, pages with very large fax images (200 MB+) were displayed as red “X”s because the library ran out of memory while rendering them.

Unicode mapping missing – 2111 – [jPDFProcess, jPDFText]
Improved text extraction to handle cases where there isn’t a method available for mapping the character codes to Unicode values.

Ligature characters – 3084 – [jPDFProcess, jPDFImages, jPDFPrint]
Improved display of ligature characters such as fl and fi.

Fixes

Display error rendering page – 2698 – [jPDFProcess, jPDFImages, jPDFPrint]
Fixes a regression introduced in version 4.70. When a page in a document could not be rendered, the library used to show “Error Rendering Page” with “Error Reading File” below it. In version 4.70, such pages were blank.

Layers merged – 3018 – [jPDFProcess, jPDFAssemble]
Layers are now merged when appending documents. Previously, when a document was appended to another document, the layers in the source document were not transferred.

API Change: remove deprecated method – 3031 – [jPDFProcess]
The setDocumentInfo() method has been removed from PDFDocument. It had been deprecated for more than two years. To change document info, use the DocumentInfo object returned by getDocumentInfo(), which is a mutable object.
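
For code that still calls the removed method, the migration can be sketched roughly as follows. The PDFDocument and DocumentInfo classes below are simplified stand-ins written for this example so that it compiles on its own; the setter names (setTitle, setAuthor) and the updateInfo helper are assumptions for illustration, so check the library's Javadoc for the actual DocumentInfo API.

```java
// Minimal sketch of the migration pattern: mutate the DocumentInfo
// returned by getDocumentInfo() instead of calling setDocumentInfo().
class DocumentInfoMigration {
    // Hypothetical helper, not part of the library.
    static void updateInfo(PDFDocument doc, String title, String author) {
        // getDocumentInfo() hands back the document's own mutable object,
        // so changes made here apply to the document directly.
        DocumentInfo info = doc.getDocumentInfo();
        info.setTitle(title);
        info.setAuthor(author);
    }

    public static void main(String[] args) {
        PDFDocument doc = new PDFDocument();
        updateInfo(doc, "Release Notes", "Qoppa Software");
        System.out.println(doc.getDocumentInfo().getTitle());
    }
}

// Stand-in for the library's DocumentInfo (assumed setter names).
class DocumentInfo {
    private String title;
    private String author;

    String getTitle() { return title; }
    void setTitle(String title) { this.title = title; }
    String getAuthor() { return author; }
    void setAuthor(String author) { this.author = author; }
}

// Stand-in for the library's PDFDocument; only getDocumentInfo() is modeled.
class PDFDocument {
    private final DocumentInfo info = new DocumentInfo();

    DocumentInfo getDocumentInfo() { return info; }
}
```

Because the returned object is mutable, no call is needed to write it back to the document.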

Flatten rotated form fields – 2648 – [jPDFProcess, jPDFFields]
When form fields were flattened on rotated forms, the flattened fields would sometimes end up rotated.

Some fax images not displaying – 2808 – [jPDFProcess, jPDFImages, jPDFPrint]
Fixes a regression introduced in version 4.70 that caused some fax images to display incorrectly.


Press Release:

Qoppa Software’s fall release of Java PDF libraries and components improves PDF text extraction and rendering performance.