Re: a complete solution for building a website search with lucene

2010-01-10 Thread Simon Willnauer
do advise. > > Thanks. > > > --- On *Sat, 9/1/10, Simon Willnauer *wrote: > > > From: Simon Willnauer > Subject: Re: a complete solution for building a website search with lucene > To: java-user@lucene.apache.org > Date: Saturday, 9 January, 2010, 6:16 PM > >

Re: a complete solution for building a website search with lucene

2010-01-10 Thread jyzhou817
a-user@lucene.apache.org >> Sent: Fri, January 8, 2010 5:03:41 AM >> Subject: Re: a complete solution for building a website search with lucene >> >> Hi Paul, >> >> Thanks. >> Use Nutch to do crawling. and integrate Lucene to the web application, so >> that

Re: a complete solution for building a website search with lucene

2010-01-10 Thread jyzhou817
Thanks. --- On Sat, 9/1/10, Simon Willnauer wrote: From: Simon Willnauer Subject: Re: a complete solution for building a website search with lucene To: java-user@lucene.apache.org Date: Saturday, 9 January, 2010, 6:16 PM I don't know that much about nutch but hadoop shouldn't really

Re: a complete solution for building a website search with lucene

2010-01-09 Thread Simon Willnauer
under Cygwin? >  Otis > -- > Sematext -- http://sematext.com/ -- Solr - Lucene - Nutch > > > > - Original Message >> From: "jyzhou...@yahoo.com" >> To: java-user@lucene.apache.org >> Sent: Fri, January 8, 2010 5:03:41 AM >> Subject: Re

Re: a complete solution for building a website search with lucene

2010-01-08 Thread Otis Gospodnetic
- Original Message > From: "jyzhou...@yahoo.com" > To: java-user@lucene.apache.org > Sent: Fri, January 8, 2010 5:03:41 AM > Subject: Re: a complete solution for building a website search with lucene > > Hi Paul, > > Thanks. > Use Nutch to do craw

Re: a complete solution for building a website search with lucene

2010-01-08 Thread jyzhou817
: Re: a complete solution for building a website search with lucene To: java-user@lucene.apache.org Date: Friday, 8 January, 2010, 4:27 PM Zhou, Lucene is a back-end library, it's very useful for developer but it is not a complete site-search-engine. A lucene-based site-search-engine is Nutc

Re: a complete solution for building a website search with lucene

2010-01-08 Thread Paul Libbrecht
Zhou, Lucene is a back-end library, it's very useful for developer but it is not a complete site-search-engine. A lucene-based site-search-engine is Nutch, it does crawl. Solr also provides functions close to these with a large amount of thoughts on flexible integration; crawling methods are

a complete solution for building a website search with lucene

2010-01-07 Thread jyzhou817
Hi , I am new in Lucene. To build a web search function, it need to have a backendc indexing function. But, before that, should run a Crawler? because Lucene index based on Html documents, while Crawler can change the website pages to Html documents. Am i right? If so, please anyone suggest