Home > Computing > PDF Incremental Updates Feature & PDF Text Extraction Error Reporting Implementation using Java

PDF Incremental Updates Feature & PDF Text Extraction Error Reporting Implementation using Java

Added: (Fri Jan 26 2018)

Pressbox (Press Release) - What's New in this Release?

Aspose team is pleased to announce the release of Aspose.Pdf for Java 17.12.0. While investigating a scenario where a PDF document used PDF Type 3 fonts, it was observed that the TextAbsorber class was not retrieving the text correctly. Reason was that the fonts used in the PDF, contained different encoding and it is not possible to extract text from such documents, by using Adobe Reader itself. Aspose team has realized the necessity to implement functionality in the API that such error in the document can be reported. Aspose team is pleased to inform users that text extraction error reporting has been implemented for TextAbsorber and TextFragmentAbsorber classes, which is available with Aspose.Pdf for Java 17.12. It was observed that when users load a PDF document from binary, manipulate it (i.e add some annotations) and save it to a different binary – the content of the PDF document was used to be totally changed. In order to avoid such issues, it have implemented an additional method i.e saveIncrementally() into the Document class. Now users will be able to save document into a Stream object, using Incremental Updates. As it always recommended to use latest release of API’s as they include latest features / improvements and fixes related to issues reported in earlier released versions. Some important improved features included in this release are given below

• PDF Incremental updates when load pdf document from binary
• PDF to JPEG - Missing text in output JPG
• PDF to HTML: text misplaced in resultant HTML
• HTML to PDF - Conversion process hangs
• PDF to HTML - Text changes its position
• Text absorber retrieves the garbled text
• PDF to Doc: Text in the word document are wrapped one on another
• PDF to XPS: colored images changes to greyscale
• PDF to PDF/A - Text starts appearing overlapped
• Text replacement issue: Characters are missing in replaced text
• PDF to DOCX - text is overlapping in resultant file
• PDF to HTML: text shifted to left side
• PDF to Excel - Blank File is Generated
• Remove text underline in a PDF document
• Open PDF file from stream add annotation invalidates the signature
• PDF to PNG - invisible objects become visible

Newly added documentation pages and articles

Some new tips and articles have now been added into Aspose.Pdf for Java documentation that may guide you briefly how to use Aspose.Pdf for performing different tasks like the followings.

- Saving PDF to DOCX: https://docs.aspose.com/display/pdfjava/Convert+PDF+to+other+Formats#ConvertPDFtootherFormats-SavingtoDOCX

- Convert PDF to HTML format: https://docs.aspose.com/display/pdfjava/Convert+PDF+to+HTML+format

Overview: Aspose.Pdf for Java

Aspose.Pdf is a Java PDF component to create PDF documents without using Adobe Acrobat. It supports Floating box, PDF form field, PDF attachments, security, Foot note & end note, Multiple columns document, Table of Contents, List of Tables, Nested tables, Rich text format, images, hyperlinks, JavaScript, annotation, bookmarks, headers, footers and many more. Now users can create PDF by API, XML and XSL-FO files. It also enables users to converting HTML, XSL-FO and Excel files into PDF.

More about Aspose.Pdf for Java

- Homepage of Aspose.Pdf for Java: http://www.aspose.com/products/pdf/java

- Download Aspose.Pdf for Java at: http://www.aspose.com/downloads/pdf/java

- Read online documentation of Aspose.Pdf for Java at: http://www.aspose.com/docs/display/pdfjava/Home

Contact Information
Aspose Pty Ltd
Suite 163, 79 Longueville Road
Lane Cove, NSW, 2066
Phone: 888.277.6734
Fax: 866.810.9465

Submitted by:Sher Azam
Disclaimer: Pressbox disclaims any inaccuracies in the content contained in these releases. If you would like a release removed please send an email to remove@pressbox.co.uk together with the url of the release.