Hi,
I am crawling my site and using Tika to parse binary content.
But how can I tell whether a given URL contains binary content or plain text?
I can get the contentType.
So for now I am using:
    if (typeStr.contains("image") || typeStr.contains("audio") ||
            typeStr.contains("video") || typeStr.contains("application")) {
        return true;
    }
This is crude code.
I will replace the plain strings with Tika's MediaType objects, but I still
need a better approach.
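
To make the problem concrete, here is a rough sketch of one direction I could take, written in plain Java rather than with Tika's MediaType. The class name, the method name, and the list of textual "application" subtypes are my own assumptions, not anything from Tika. It also differs from the snippet above in one respect: it treats types such as application/json and application/xml as text instead of binary.

```java
import java.util.Arrays;
import java.util.HashSet;
import java.util.Locale;
import java.util.Set;

// Hypothetical helper; the names here are illustrative and not part of Tika.
class ContentTypeCheck {

    // Assumed list of "application/*" subtypes that are really text; extend as needed.
    private static final Set<String> TEXTUAL_APPLICATION_SUBTYPES =
            new HashSet<>(Arrays.asList("xml", "json", "javascript"));

    // Returns true when the Content-Type value looks like binary content.
    static boolean isBinary(String contentType) {
        if (contentType == null || contentType.trim().isEmpty()) {
            return true; // assumption: unknown types are handed to the binary parser
        }
        // Drop parameters such as "; charset=UTF-8" and normalise the case.
        String base = contentType.split(";", 2)[0].trim().toLowerCase(Locale.ROOT);
        int slash = base.indexOf('/');
        if (slash < 0) {
            return true; // malformed value, treated like an unknown type
        }
        String type = base.substring(0, slash);
        String subtype = base.substring(slash + 1);

        if (type.equals("text")) {
            return false;
        }
        if (type.equals("application")) {
            // Structured suffixes such as "atom+xml" or "ld+json" are text formats.
            boolean textual = TEXTUAL_APPLICATION_SUBTYPES.contains(subtype)
                    || subtype.endsWith("+xml")
                    || subtype.endsWith("+json");
            return !textual;
        }
        // image/*, audio/*, video/* and anything else is treated as binary.
        return true;
    }
}
```

With this, ContentTypeCheck.isBinary("application/pdf") is true and ContentTypeCheck.isBinary("text/html; charset=UTF-8") is false, but it still depends on a hand-maintained list, which is the part I would like to avoid.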
Does anyone have a better idea?
Thank you for your help,
Avi