do advise.
>
> Thanks.
>
>
> --- On *Sat, 9/1/10, Simon Willnauer *wrote:
>
>
> From: Simon Willnauer
> Subject: Re: a complete solution for building a website search with lucene
> To: java-user@lucene.apache.org
> Date: Saturday, 9 January, 2010, 6:16 PM
>
>
a-user@lucene.apache.org
>> Sent: Fri, January 8, 2010 5:03:41 AM
>> Subject: Re: a complete solution for building a website search with lucene
>>
>> Hi Paul,
>>
>> Thanks.
>> Use Nutch to do crawling. and integrate Lucene to the web application, so
>> that
Thanks.
--- On Sat, 9/1/10, Simon Willnauer wrote:
From: Simon Willnauer
Subject: Re: a complete solution for building a website search with lucene
To: java-user@lucene.apache.org
Date: Saturday, 9 January, 2010, 6:16 PM
I don't know that much about nutch but hadoop shouldn't really
under Cygwin?
> Otis
> --
> Sematext -- http://sematext.com/ -- Solr - Lucene - Nutch
>
>
>
> - Original Message
>> From: "jyzhou...@yahoo.com"
>> To: java-user@lucene.apache.org
>> Sent: Fri, January 8, 2010 5:03:41 AM
>> Subject: Re
- Original Message
> From: "jyzhou...@yahoo.com"
> To: java-user@lucene.apache.org
> Sent: Fri, January 8, 2010 5:03:41 AM
> Subject: Re: a complete solution for building a website search with lucene
>
> Hi Paul,
>
> Thanks.
> Use Nutch to do craw
: Re: a complete solution for building a website search with lucene
To: java-user@lucene.apache.org
Date: Friday, 8 January, 2010, 4:27 PM
Zhou,
Lucene is a back-end library, it's very useful for developer but it is not a
complete site-search-engine.
A lucene-based site-search-engine is Nutc
Zhou,
Lucene is a back-end library, it's very useful for developer but it is
not a complete site-search-engine.
A lucene-based site-search-engine is Nutch, it does crawl.
Solr also provides functions close to these with a large amount of
thoughts on flexible integration; crawling methods are
Hi ,
I am new in Lucene.
To build a web search function, it need to have a backendc indexing function.
But, before that, should run a Crawler? because Lucene index based on Html
documents, while Crawler can change the website pages to Html documents. Am i
right?
If so, please anyone suggest