Hello,

I have crawled a webpage as part of my assignment (CS572). I have the
segment folder with the URL metadata and data (parsed and otherwise).

I have also merged all the segments and dumped them into an output file.

When opened in a text editor, this dump file contains some parsed content
and some encoded content, such as special characters, which is actually
data from that URL.

The problem is that I am not clear on how to use Tika here. Please help.

Thanks
Nancy
