My goal is to display pdf documents in my JavaFX application. After researching I came across Mozilla's PDFJS library and found it pretty useful. What I'm doing is reading PDF file as a byte array from JAVA and calling the js code in Web view. Here is the code.
JAVA CODE
byte[] data = Files.readAllBytes(Paths.get("D:\\test\\test.pdf"));
String base64 = Base64.getEncoder().encodeToString(data);
btn.setOnMouseClicked(e -> {
String js = "openFileFromBase64('" + base64 + "')";
engine.executeScript(js);
});
Javascript Code
<script>
var openFileFromBase64 = function(data) {
var arr = base64ToArrayBuffer(data);
PDFViewerApplication.open(arr);
}
function base64ToArrayBuffer(base64) {
var binary_string = window.atob(base64);
var len = binary_string.length;
var bytes = new Uint8Array(new ArrayBuffer(len));
alert(len);
for (var i = 0; i < len; i++) {
bytes[i] = binary_string.charCodeAt(i);
}
return bytes.buffer;
}
Problem
I'm able to render some pdf files just fine but I've some pdf files that won't open correctly. So I did some analysis and found following:-
The problematic pdf files open correctly in adobe reader, firefox and edge browser.
I thought the problem might be PDFJS library. So I tried placing my pdf file in WEB directory and opened viewer.html. To my surprise the pdf displayed correctly.
I thought maybe some bytes are lost when I send data from Java to Javascript.So I printed number of bytes on each end and they match.
Finally, I thought that java might be messing up the encoding. So I read the file from java and wrote the bytes to separate file and the file generated is correct.
I'm trying to understand if I overlooked something. Any suggestions are appreciated. Thanks in advance.
Here is how my pdf looks like after it is rendered:-