If you would like some insight into the LARM crawler, feel free to read:

http://cvs.apache.org/viewcvs.cgi/jakarta-lucene-sandbox/contributions/webcrawler-LARM/doc/webcrawler_tech_overview.pdf
http://cvs.apache.org/viewcvs.cgi/jakarta-lucene-sandbox/contributions/webcrawler-LARM/CHANGES.txt
http://cvs.apache.org/viewcvs.cgi/jakarta-lucene-sandbox/contributions/webcrawler-LARM/README.txt
http://cvs.apache.org/viewcvs.cgi/jakarta-lucene-sandbox/contributions/webcrawler-LARM/TODO.txt

These two threads on the lucene-dev list are especially important: they contain thoughts about the future direction of the crawler, as well as further explanations that might not be included in the tech_overview document (I still owe Otis a response on one of these):

http://nagoya.apache.org/eyebrowse/BrowseList?[EMAIL PROTECTED] ache.org&by=thread&from=201679
http://nagoya.apache.org/eyebrowse/BrowseList?[EMAIL PROTECTED] ache.org&by=thread&from=203151

Contact me if you have any ideas on how you could contribute to the crawler.

Clemens

----- Original Message -----
From: "Tarek M. Nabil" <[EMAIL PROTECTED]>
To: "Lucene Developers List" <[EMAIL PROTECTED]>
Sent: Sunday, July 28, 2002 9:36 PM
Subject: RE: I need your advice

> Thanks Brian,
>
> I'm looking forward to that. So, what's the starting point? Are there any documents I can read?
>
> -----Original Message-----
> From: Brian Goetz [mailto:[EMAIL PROTECTED]]
> Sent: Sunday, July 28, 2002 10:21 PM
> To: Lucene Developers List
> Subject: Re: I need your advice
>
> > All I meant was to ask whether my current qualifications can after a
> > while permit me to be an active contributor.
>
> I don't see any reason why not. Enthusiasm and interest is probably
> the most important qualification for contributing (assuming you are a
> competent programmer.)
>
> Lucene is a great project because the architecture is so clean and
> simple, it's easy to understand immediately.
>
> There are a bunch of new subprojects going on in this group --
> crawlers, indexing of various file types (Word, PDF, HTML/XML, etc)
> which I'm sure could use contributions.

--
To unsubscribe, e-mail: <mailto:[EMAIL PROTECTED]>
For additional commands, e-mail: <mailto:[EMAIL PROTECTED]>