Hi Antoine,

Thanks for the response. I did stumble upon that thread when searching for a solution. What I discovered is that even though the extracted text does not show the proper characters when viewed in the browser, it shows the proper text if I download it and open it in a text editor. Also, I can search for the non-English characters, and the snippets in the search results display them properly. I will try running "index-discovery -b" to see whether the extracted text then displays properly.
Best regards,
euler

View this message in context: http://dspace.2283337.n4.nabble.com/Issues-in-Media-Filter-PDF-Text-Extractor-PDFFilter-and-XPDF-tp4678283p4678291.html
Sent from the DSpace - Tech mailing list archive at Nabble.com.