Hello,

I'm using CharsetDetector to determine the charset of files of any type.
But when the file is XML and contains a declaration such as
<?xml version="1.0" encoding="ISO-8859-1"?>, the Tika API uses this
information to determine the charset.
This behavior does not work in my scenario, because my customers sometimes
send files whose declared XML encoding differs from the real charset.

Can I disable the check of the encoding attribute in the XML declaration?



Regards,




Ramon Rosa da Silva

Developer | Architecture and Frameworks
[email protected]
Skype: ramon.silva.neogrid.com



