+1, I think this is a good idea. Why not make it overridable and fall back on the (existing) default mechanism for backward compatibility and for API/end-user stability?
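To make the override-with-fallback idea concrete, here is a rough sketch. It does not use the Tika API: Context, EventHandler, BuiltInHandler and parse are all hypothetical stand-ins invented for illustration, and the real handler would deal with SAX events rather than strings. The only point is the lookup pattern: use whatever the caller put in the context, otherwise keep the current behaviour.

```java
import java.util.HashMap;
import java.util.Map;

public class HandlerOverrideSketch {

    /** Hypothetical stand-in for a parse context: a type-keyed bag of user-supplied objects. */
    static final class Context {
        private final Map<Class<?>, Object> entries = new HashMap<>();

        <T> void set(Class<T> key, T value) {
            entries.put(key, value);
        }

        /** Returns the object registered under key, or defaultValue if nothing was set. */
        <T> T get(Class<T> key, T defaultValue) {
            Object value = entries.get(key);
            return value != null ? key.cast(value) : defaultValue;
        }
    }

    /** Hypothetical handler contract; a real one would receive SAX events. */
    interface EventHandler {
        String startElement(String name);
    }

    /** Stand-in for the existing built-in behaviour. */
    static final class BuiltInHandler implements EventHandler {
        @Override
        public String startElement(String name) {
            return "default:" + name;
        }
    }

    /** Parser side: prefer the caller's handler, fall back on the built-in one. */
    static String parse(String element, Context context) {
        EventHandler handler = context.get(EventHandler.class, new BuiltInHandler());
        return handler.startElement(element);
    }

    public static void main(String[] args) {
        // Nothing in the context: existing behaviour is preserved.
        Context empty = new Context();
        System.out.println(parse("p", empty)); // prints "default:p"

        // Caller supplies a handler: it takes over.
        Context custom = new Context();
        custom.set(EventHandler.class, name -> "custom:" + name);
        System.out.println(parse("p", custom)); // prints "custom:p"
    }
}
```

Existing callers that never touch the context see no change, which is what should keep the API stable for end users.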
Cheers,
Chris

On 7/7/10 8:08 AM, "Julien Nioche" <[email protected]> wrote:

Hi guys,

One of the recent changes on Tika is the possibility to specify a custom HTMLMapper via the Context - which I think is an elegant mechanism. I was wondering whether there would be a reason NOT to be able to do the same for the HTMLHandler and if nothing is passed via the Context, rely on the current implementation. This would give more control to the user on what to do with the SAX events while at the same time preserving the functionality by default.

Any thoughts on this?

Julien

--
DigitalPebble Ltd
Open Source Solutions for Text Engineering
http://www.digitalpebble.com

++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
Chris Mattmann, Ph.D.
Senior Computer Scientist
NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
Office: 171-266B, Mailstop: 171-246
Email: [email protected]
WWW: http://sunset.usc.edu/~mattmann/
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
Adjunct Assistant Professor, Computer Science Department
University of Southern California, Los Angeles, CA 90089 USA
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
