Q: Is it possible with jPDFWeb to convert a whole PDF document to single HTML output file?
A: Yes, Qoppa’s Java PDF library, jPDFWeb, can export each page of a PDF document as an SVG element. It is then simple to add HTML header and tags around all the SVG elements to make it a single HTML page. see sample code below.
public static void main (String [] args) { try { PDFWeb pdfWeb = new PDFWeb("C:\\mydoc.pdf", null); // create a writer for the output file File outputFile = new File("mydoc.html"); PrintWriter outWriter = new PrintWriter(outputFile, "UTF-8"); writeHTML(pdfWeb, outWriter); System.out.println("Done! Look at file " + outputFile.getAbsolutePath()); } catch (Throwable t) { t.printStackTrace(); } } |
where writeHTML is the following method:
// Write pages as HTML public static void writeHTML(PDFWeb pdfweb, PrintWriter outWriter) throws PDFException, IOException { // write HTML page header outWriter.println( "<!DOCTYPE html>\n" + "<!-- Created by Qoppa Software's jPDFWeb (http://www.qoppa.com) -->\n" + "<html>\n" + " <head>\n" + " <meta name=\"generator\" content=\"jPDFWeb : Java PDF to HTML5 Conversion\">\n" + " <meta name=\"author\" content=\"Qoppa Software\">\n" + " <meta name=\"website\" content=\"http://www.qoppa.com\">\n" + " <meta charset=\"utf-8\">\n" + " <style type=\"text/css\">\n" + " body {background-color:lightgray; text-align:center;}\n" + " .pageRect {fill:white;}\n" + " #topSvg {padding:5px; margin:auto;}\n" + " </style>\n" + " </head>\n" + " <body>"); // Convert pages for(int i = 0; i < pdfweb.getPageCount(); i++) { pdfweb.savePageAsSVG(i, outWriter); outWriter.println("<br>"); } // End HTML page outWriter.println("</body>"); outWriter.println("</html>"); } |
Try jPDFWeb Live Demo
https://www.qoppa.com/pdfhtml/demo/html5-pdf-viewer-live-demo/
Choose the option “View Whole Doc as HTML”.
You can upload your own PDF document and see how our PDF library convert your PDFs!