On Mon, 10 Feb 2014, Rupak Khurana wrote:
I am trying to parse out JIL(Job Information Language) scripts that happen to have Name:Value pairs. Perhaps Tika is an overkill but wanted to use its parsing ability and SAX event firing to make life easier.
Sounds like you'll want to define / identify a suitable mimetype for these, add some mime magic so they get detected, then write your own parser that spots these name/value pairs and emmits suitable sax events for you to consume
See http://tika.apache.org/1.4/parser_guide.html for a guide as to how to do all of that
Nick
