Hello, I have crawled a webpage as part of my assignment (CS572). I have the segment folder with the URL metadata and data (parsed and otherwise).
I have also merged all the segments in order to dump them into an output file. When opened in a text editor, this dump file contains some parsed content and some encoded content, such as special characters, that is actually data from that URL. The problem is that I am not clear on how to use Tika here. Please help.

Thanks,
Nancy

