Hello,
I'm using CharsetDetector for determine charset of any file type. But in XML type and when it have tag <?xml version="1.0" encoding="ISO-8859-1"?> tika api use this information to determine charset. This behavior not work in my scenario, because my customers sometimes sending file with one xml encoding and other real charset. Can I disable xml verification by tag xml encoding? Regards, [cid:[email protected]] Ramon Rosa da Silva Developer | Archictecture and Frameworks [email protected]<mailto:[email protected]> Skype: ramon.silva.neogrid.com
<<inline: image001.gif>>
