Hi,
This is just a heads-up that I will be working extensively (under a contract) on increasing crawling accuracy of our Fetcher. The results will be contributed to the project under ASF license.
Since this involves changing some of the widely-used interfaces (Parse, Protocol, HTMLParseFilter, etc), and the way they interact, if some of you intend to work on this in the nearest two weeks, or if you have been thinking about implementing some improvements, please contact me so that we can coordinate the work.
I expect to provide the first round of patches within a couple of days.
-- Best regards, Andrzej Bialecki ___. ___ ___ ___ _ _ __________________________________ [__ || __|__/|__||\/| Information Retrieval, Semantic Web ___|||__|| \| || | Embedded Unix, System Integration http://www.sigram.com Contact: info at sigram dot com
