On Fri, 2008-09-12 at 10:12 -0400, Grant Ingersoll wrote:
> Additionally, how hard would it be to add Crawlers for things like:

Droids does not force you to use anything besides the queue. From there
you can do what you want.

> 
> 1. IMAP and other mail stores (even things like PST files, etc.)

One can add new protocol implementation in no time. 

> 2.  Somewhat strange:  Databases.  Just point it at a DB and have it  
> suck in tables/rows/columns

I did this in one of my droids @work. Actually it was part of the
business logic and my droid only invokes this business logic. 

> 3. Things like web APIs (Flickr, del.icio.us, etc),

That should be as well very easy. To add basic support for them would be
a couple of hours and an advance support depends on the API.

> 
> Any comments on fault tolerance and incremental crawling would also be  
> appreciated.   

That would need some small enhancements in the queue (basically adding a
comparison on the last modified date). Should not be a problem. 

> Is there anything in the current design that you think  
> prevents these things?

No. See the great example that Oleg wrapped up in no time. He
implemented a simpleRuntime without using Spring. 

Droids follows the slogan "Impossible is nothing". The great flexibility
of droids is/should/will be the great feature of Droids.

salu2

> 
> Thanks,
> Grant
> 
> On Aug 27, 2008, at 5:26 PM, Grant Ingersoll wrote:
> 
> > Is there a feature list for Droids anywhere?
> >
> > Or, can it do:
> >
> > 1. Honor robots.txt
> > 2. Crawl throttling
> > 3. Distributed crawling (i.e. give a bunch of links to it and some  
> > distributed compute resources and have it go to town)
> >
> > Thanks,
> > Grant
> >
> > ---------------------------------------------------------------------
> > To unsubscribe, e-mail: [EMAIL PROTECTED]
> > For additional commands, e-mail: [EMAIL PROTECTED]
> >
> 
> 
> 
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: [EMAIL PROTECTED]
> For additional commands, e-mail: [EMAIL PROTECTED]
> 
-- 
Thorsten Scherler                                 thorsten.at.apache.org
Open Source Java                      consulting, training and solutions


---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

Reply via email to