PDFBox 2.0.0 replacement for PDPageContentStream.appendRawCommands

2015-12-18 Thread Peter Prusinowski
Hello, is there an alternative command for PDPageContentStream.appendRawCommands in version 2.0.0 ? I am using this command for linebreaks and saw that its deprectaed in this version. Thanks, Peter - To unsubscribe, e-mail:

Re: PDFBox 2.0.0 replacement for PDPageContentStream.appendRawCommands

2015-12-18 Thread Tilman Hausherr
Am 18.12.2015 um 13:50 schrieb Peter Prusinowski: Hello, is there an alternative command for PDPageContentStream.appendRawCommands in version 2.0.0 ? I am using this command for linebreaks and saw that its deprectaed in this version. You don't need extra line breaks, the methods already do

Issues with extraction content of PDF files

2015-12-18 Thread Zheng Lin Edwin Yeo
Hi, I'm indexing some PDF documents in Solr. However, for certain PDF files, there are chinese text in the documents, but after indexing, what is indexed in the content is either a series of "??" or an empty content. i've also tried on the Tika app, and I get the same results. What could be

Re: Issues with extraction content of PDF files

2015-12-18 Thread Tilman Hausherr
Am 18.12.2015 um 18:57 schrieb Zheng Lin Edwin Yeo: I've shared one of the file with the issue on dropbox, which you can access via the link here: https://www.dropbox.com/s/rufi9esmnsmzhmw/Desmophen%2B670%2BBAe.pdf?dl=0 Adobe Reader is also unable to extract text.

RE: Issues with extraction content of PDF files

2015-12-18 Thread Allison, Timothy B.
Colleagues, So that you don't have to do the initial diagnosis at least. From [0]: >>That said, PDFBox 2.0-RC2 extracts no text and warns: WARNING: No Unicode >>mapping for CID+71 (71) in font 505Eddc6Arial >>So, if the file has no Unicode mapping for the font, I doubt they'll be able >>to

Re: PDFBox 2.0.0 replacement for PDPageContentStream.appendRawCommands

2015-12-18 Thread Peter Prusi
Which Methods do that ? Am 18.12.2015 um 16:50 schrieb Tilman Hausherr: Am 18.12.2015 um 13:50 schrieb Peter Prusinowski: Hello, is there an alternative command for PDPageContentStream.appendRawCommands in version 2.0.0 ? I am using this command for linebreaks and saw that its deprectaed in

Re: PDFBox 2.0.0 replacement for PDPageContentStream.appendRawCommands

2015-12-18 Thread Tilman Hausherr
Am 18.12.2015 um 19:46 schrieb Peter Prusi: Which Methods do that ? The methods you use to do things, e.g. showText. They call the private method writeOperator(), which is this: private void writeOperator(String text) throws IOException {