Thank you,I was installed this crawler and I run it,but I would like to index the web site and not to list the visited links by the crawler,Is there a way to serch a web page by lucene witch use this crawler for visiting the pages.thanks--- On Mon 11/04, Karl Marx < [EMAIL PROTECTED] > wrote:From: Karl Marx [mailto: [EMAIL PROTECTED]]To: [EMAIL PROTECTED]: Mon, 4 Nov 2002 12:31:50 +0100Subject: Re: Indexing distant web sitesAs stated in the official FAQ Lucene doesn't implement a web-crawler, you can however use a self-made crawler or customate a crawler framework like websphinx (http://www-2.cs.cmu.edu/~rcm/websphinx/) to retrieve html documents from a site and then feed them to Lucene.mvh karl �ieOn Monday, Nov 4, 2002, at 11:49 Europe/Oslo, Friaa Nafaa wrote:> Hello,is there any way to index web sites by lucene, assuming we know > only the url of the site ? :--&gt;In local use we passe to lucene the > full arborexcence or directory of our site (contain all the documents) > and we begin the indexing operation, but when I would like to index a > distant site on the web... what i do ?For exemple I installed Lucene > on my computer and I would like to index the site : > http://www.excite.com ...Thanks>> _______________________________________________> Join Excite! - http://www.excite.com> The most personalized portal on the Web!--To unsubscribe, e-mail: For additional commands, e-mail:
_______________________________________________ Join Excite! - http://www.excite.com The most personalized portal on the Web!
