Hi Lewis 1) Why are Tika methods not declared as public? >
why should they? If they are not used outside the class then I think it is good practice to keep them private. Since TikaParser implements Parser the only method that we can expect to be called is getParse() and it is public > 2) Why doesn't TikaParser ship with a main method? > it could have one for testing but IMHO using the ParserChecker is a better way of testing as it is closer to real use > 3) We previously discussed implementing the Any23 parser plugin as a tika > wrapper, therefore it would look very similar to parse-tika? > I don't remember this but I remember suggesting that the Any23 parser should be a tika parser which is not the same as a Tika wrapper. I expect other people in Tika-land to have a use for it, and we'd get the benefit of it automatically with parse-tika > 4) I like the look of the feed plugin where it also ships with a custom > indexingfilter implementation, my thoughts were also to provide a custom > Any23IndexingFilter implementation? > Depends on what you want to do? What would we get out of Any23? How would that be used on the search side? Thanks Julien -- * *Open Source Solutions for Text Engineering http://digitalpebble.blogspot.com/ http://www.digitalpebble.com http://twitter.com/digitalpebble

