Hi,

I am crawling my site and am using Tika for binary content parsing.

But, how can I know if a certain url contains binary content or plain text ?

I can get the contentType.


So for now I am using:
if (typeStr.contains("image") || typeStr.contains("audio") ||
typeStr.contains("video") || typeStr.contains("application")) {
 return true;
}


Which is dumb code.

I will replace the plain strings with Tika's MediaType objects but still I
need better code

Does anyone have any better idea ?




Thank you for your help,
Avi

Reply via email to