On 10/17/07, Thorsten Scherler <[EMAIL PROTECTED]> wrote: > On Sat, 2007-10-13 at 09:24 +0200, Roland Weber wrote: > > Hello, > > > > a few years ago, the HttpComponents team at Jakarta received > > a code donation of "Norbert, the (no)robots.txt parser". It > > came without a community, nobody asked for it since, so it > > resides mostly forgotten in our SVN repository at [1]. The > > Droids lab might find this code useful. > > > > On a related matter... when we discussed the HttpComponents > > project scope in early 2005, http-spider was named as a > > potential component, see the respective section in [2]. There > > was always the disclaimer that a community would have to form > > to turn that idea into reality. We're currently more than > > busy with the Core and Client components, and we'll be pushing > > to become a TLP before the year is out. Maybe when Droids > > grows out of the lab, you'll consider us as a potential home? > > What you're doing seems to be exactly what Oleg suggested for > > http-spider. > > > Actually droids is very hybrid ATM. The code [1] is very nice since this > part (till now) is grateful ignored by droids. I will incorporate it > ASAP. > > Regarding [2] it is very interesting indeed for droids to become part of > HttpComponents. I want to rewrite the code to use spring instead of the > underlying slimed down version of nutch plugins infrastructure code to > make it more standard and understandable.
Interesting - I've always had an urge to rewrite my scraping-engine framework to use Spring rather than its own container :) *mental note to look at Droids soon* Hen --------------------------------------------------------------------- To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]