If you like to get some insight on the LARM crawler, feel free to read
http://cvs.apache.org/viewcvs.cgi/jakarta-lucene-sandbox/contributions/webcr
awler-LARM/doc/webcrawler_tech_overview.pdf
http://cvs.apache.org/viewcvs.cgi/jakarta-lucene-sandbox/contributions/webcr
awler-LARM/CHANGES.txt
http://cvs.apache.org/viewcvs.cgi/jakarta-lucene-sandbox/contributions/webcr
awler-LARM/README.txt
http://cvs.apache.org/viewcvs.cgi/jakarta-lucene-sandbox/contributions/webcr
awler-LARM/TODO.txt

These two threads on the lucene-dev list are especially important, as they
contain thoughts about the future directions of the crawler, as well as
further explanations that might not be included in the tech_overview
document (I still owe Otis a response on one of these):
http://nagoya.apache.org/eyebrowse/BrowseList?[EMAIL PROTECTED]
ache.org&by=thread&from=201679
http://nagoya.apache.org/eyebrowse/BrowseList?[EMAIL PROTECTED]
ache.org&by=thread&from=203151


Contact me if you have any ideas on how you could contribute to that.

Clemens

----- Original Message -----
From: "Tarek M. Nabil" <[EMAIL PROTECTED]>
To: "Lucene Developers List" <[EMAIL PROTECTED]>
Sent: Sunday, July 28, 2002 9:36 PM
Subject: RE: I need your advice


> Thanks Brian,
>
> I'm looking forward to that. So, what's the starting point? Are there are
any documents I can read?
>
> -----Original Message-----
> From: Brian Goetz [mailto:[EMAIL PROTECTED]]
> Sent: Sunday, July 28, 2002 10:21 PM
> To: Lucene Developers List
> Subject: Re: I need your advice
>
>
> > All I meant was to ask whether my current qualifications can after a
> > while permit me to be an active contributor.
>
> I don't see any reason why not.  Enthusiasm and interest is probably
> the most important qualification for contributing (assuming you are a
> competent programmer.)
>
> Lucene is a great project because the architecture is so clean and
> simple, its easy to understand immediately.
>
> There are a bunch of new subprojects going on in this group --
> crawlers, indexing of various file types (Word, PDF, HTML/XML, etc)
> which I'm sure could use contributions.
>
> --
> To unsubscribe, e-mail:
<mailto:[EMAIL PROTECTED]>
> For additional commands, e-mail:
<mailto:[EMAIL PROTECTED]>
>
>
> --
> To unsubscribe, e-mail:
<mailto:[EMAIL PROTECTED]>
> For additional commands, e-mail:
<mailto:[EMAIL PROTECTED]>
>


--
To unsubscribe, e-mail:   <mailto:[EMAIL PROTECTED]>
For additional commands, e-mail: <mailto:[EMAIL PROTECTED]>

Reply via email to