• Post Reply Bookmark Topic Watch Topic
  • New Topic
programming forums Java Mobile Certification Databases Caching Books Engineering Micro Controllers OS Languages Paradigms IDEs Build Tools Frameworks Application Servers Open Source This Site Careers Other Pie Elite all forums
this forum made possible by our volunteer staff, including ...
  • Campbell Ritchie
  • Jeanne Boyarsky
  • Ron McLeod
  • Paul Clapham
  • Liutauras Vilda
  • paul wheaton
  • Rob Spoor
  • Devaka Cooray
Saloon Keepers:
  • Stephan van Hulst
  • Tim Holloway
  • Carey Brown
  • Frits Walraven
  • Tim Moores
  • Mikalai Zaikin

Convert PDF to DOCX & PDF to HTML with All Resources Embedded Using Java

Ranch Hand
Posts: 714
  • Mark post as helpful
  • send pies
    Number of slices to send:
    Optional 'thank-you' note:
  • Quote
  • Report post to moderator
What’s new in this release?

We are pleased to announce the release of Aspose.Pdf for Java 9.7.0 . Document manipulation and conversion from various formats to PDF, and conversion of PDF to other file formats have always been our main development areas. Keeping this tradition alive, this new release provides couple of new features as well as improvements in the API for complex scenarios handling for image to PDF, PDF to image, text extraction, watermark manipulation and much more. Aspose.Pdf for Java already supports rendering PDF files to Microsoft Word (DOC) format. The DocSaveOptions class makes it possible to render a PDF file to Microsoft Word format. This class also provides numerous properties that improve the process of converting PDF files to DOC format. Among these properties, Mode enables you to specify a recognition mode for PDF content. You can specify any value from the RecognitionMode enumeration for this property. Each of these values have specific benefits and limitations. However starting this new release, Aspose.Pdf for Java also offers the capabilities to convert PDF files to DOCX format. HTML to PDF and PDF to HTML are some of the features our customers use most. During conversion, all resources (fonts, images and CSS) from a PDF file are saved in a separate folder in the same directory as the output HTML. However, we were asked to provide a feature for converting a PDF file to HTML format and generate a single HTML file with all resources embedded. The current release of Aspose.Pdf for Java offers this feature. As well as the enhancements and features discussed above, there have been numerous fixes related to HTML to PDF conversion, PDF to Excel conversion, XPS to PDF conversion, PDF to TIFF conversion, text replacement, text extraction, rendering PDF files to XPS, creating TOCs in PDF files, and printing PDFs with embedded fonts. Some important new and improved features included in this release are given below

- Support PDF to DOCX conversion
- Convert PDF to Single Web Archive (MHT file)
- Memory is not released after process completion/exits.
- Memory leak issue
- NullPointerException - When converting PDF files to JPG
- TIFF to PDF - Resultant PDF is blank
- TIFF to PDF - Exception during image conversion
- Only first frame of TIFF is converted to PDF
- Tiff to PDF throws unsupported file format exception
- TIFF to PDF - Exception during conversion
- Add a watermark to the same PDF
- Saving back to the same document throws InvalidPdfFileFormatException exception
- Multipage TIFF to PDF throws exception
- Pdf to Image conversion problems for different values of DPI
- Pdf to text: space between text increased
- PDF to PDFA: PDFA compliance fails
- XPS to PDF - Exception during conversion
- Exception while extracting text from PDF file
- PDF to Image: loses PolygonAnnotation
- PDF to HTML - Font missing exception
- NullPointerException when attempting to get all text from PDF

Newly added documentation pages and articles

Some new tips and articles have now been added into Aspose.Pdf for Java documentation that may guide you briefly how to use Aspose.Pdf for performing different tasks like the followings.

- Convert PDF to DOC or DOCX format
- PDF to HTML - Single HTML with All Resources Embedded

Overview: Aspose.Pdf for Java

Aspose.Pdf is a Java PDF component to create PDF documents without using Adobe Acrobat. It supports Floating box, PDF form field, PDF attachments, security, Foot note & end note, Multiple columns document, Table of Contents, List of Tables, Nested tables, Rich text format, images, hyperlinks, JavaScript, annotation, bookmarks, headers, footers and many more. Now you can create PDF by API, XML and XSL-FO files. It also enables you to converting HTML, XSL-FO and Excel files into PDF.

More about Aspose.Pdf for Java

- Homepage of Aspose.Pdf for Java
- Download Aspose.Pdf for Java
- Read online documentation of Aspose.Pdf for Java

Contact Information
Aspose Pty Ltd
Suite 163, 79 Longueville Road
Lane Cove, NSW, 2066
Aspose – Your File Format APIs
Phone: 888.277.6734
Fax: 866.810.9465
Don't get me started about those stupid light bulbs.
    Bookmark Topic Watch Topic
  • New Topic