Re: exception while feeding converted text from pdf

2008-05-15 Thread Brian Carmalt
Hello Cam, Are you writing your xml by hand, as in no xml writer? That can cause problems. In your exception it says latitude 59, the should have converted to 'amp;'(I think). If you can use Java6, there is a XMLStreamWriter in java.xml.stream that does automatic special character escaping. This

RE: exception while feeding converted text from pdf

2008-05-15 Thread Mendes, Richard
Are you writing your xml by hand, as in no xml writer? That can cause problems. In your exception it says latitude 59, the should have converted to 'amp;'(I think). If you can use Java6, there is a XMLStreamWriter in java.xml.stream that does automatic special character escaping. This can

exception while feeding converted text from pdf

2008-05-14 Thread Cam Bazz
Hello, I made a simple java program to convert my pdfs to text, and then to xml file. I am getting a strange exception. I think the converted files have some errors. should I encode the txt string that I extract from the pdfs in a special way? Best, -C.B. EVERE:

Re: exception while feeding converted text from pdf

2008-05-14 Thread Shalin Shekhar Mangar
Yes, you need to XML encode your test. If you use SolrJ to add documents to Solr, it will take care of the encoding for you. On Wed, May 14, 2008 at 9:53 PM, Cam Bazz [EMAIL PROTECTED] wrote: Hello, I made a simple java program to convert my pdfs to text, and then to xml file. I am getting