Hello Fred,

You can inspect a Lucene index with Luke (http://www.getopt.org/luke/), 
or from the commandline (see 
https://issues.apache.org/jira/browse/NUTCH-330 to install the patch).

We also experienced some issues with Hadoop on WinXP/Nutch 0.8, you 
might want to look at https://issues.apache.org/jira/browse/NUTCH-266

HTH,
Renaud


Fred Tyre wrote:
> I have installed nutch 0.8 on a Windows XP machine.
>
> I hacked my way through the crawling/indexing and feel like I am right on
> the verge of getting this working.
>
> However, I cannot get any search results via the web page.
>
> Is there any way I can verify the indexes that were generated on the command
> line?
>
> When I run "bin/nutch readlinkdb ..." nutch generates a text file with 4,454
> lines of links and anchor text.
>
> So up to that point it should be working (hopefully).
>
> Please help.
>
> Thanks.
>
> Here is the Tomcat Log...
>
> 2006-08-01 17:47:31,656 INFO  NutchBean - creating new bean
> 2006-08-01 17:47:31,671 INFO  NutchBean - opening indexes in
> file:/C:/Program%20Files/Apache%20Software%20Foundation/Tomcat%205.5/webapps
> /nutch-0.8/WEB-INF/classes/crawl/indexes
> 2006-08-01 17:47:31,765 INFO  Configuration - found resource
> common-terms.utf8 at
> file:/C:/Program%20Files/Apache%20Software%20Foundation/Tomcat%205.5/webapps
> /nutch-0.8/WEB-INF/classes/common-terms.utf8
> 2006-08-01 17:47:31,796 INFO  NutchBean - opening segments in
> file:/C:/Program%20Files/Apache%20Software%20Foundation/Tomcat%205.5/webapps
> /nutch-0.8/WEB-INF/classes/crawl/segments
> 2006-08-01 17:47:31,828 INFO  SummarizerFactory - Using the first summarizer
> extension found: Basic Summarizer
> 2006-08-01 17:47:31,828 INFO  NutchBean - opening linkdb in
> file:/C:/Program%20Files/Apache%20Software%20Foundation/Tomcat%205.5/webapps
> /nutch-0.8/WEB-INF/classes/crawl/linkdb
> 2006-08-01 17:47:31,843 INFO  NutchBean - query request from 127.0.0.1
> 2006-08-01 17:47:31,859 INFO  NutchBean - query: forums
> 2006-08-01 17:47:31,859 INFO  NutchBean - lang: en
> 2006-08-01 17:47:31,890 INFO  NutchBean - searching for 20 raw hits
> 2006-08-01 17:47:31,984 INFO  NutchBean - total hits: 0
> 2006-08-01 17:50:30,343 INFO  NutchBean - query request from 127.0.0.1
> 2006-08-01 17:50:30,343 INFO  NutchBean - query: the
> 2006-08-01 17:50:30,343 INFO  NutchBean - lang: en
> 2006-08-01 17:50:30,343 INFO  NutchBean - searching for 20 raw hits
> 2006-08-01 17:50:30,343 INFO  NutchBean - total hits: 0
> 2006-08-01 17:50:35,390 INFO  NutchBean - query request from 127.0.0.1
> 2006-08-01 17:50:35,390 INFO  NutchBean - query: publishing
> 2006-08-01 17:50:35,390 INFO  NutchBean - lang: en
> 2006-08-01 17:50:35,390 INFO  NutchBean - searching for 20 raw hits
> 2006-08-01 17:50:35,390 INFO  NutchBean - total hits: 0
>
> Sincerely,
> Fred
>
>   
>> <><><><><><><><><><><><><><><><><><
>>     
>    Fred Tyre
>    Information Services
>    Heartland Communications, Inc.
>    515-574-2147
>    [EMAIL PROTECTED]
>   
>> <><><><><><><><><><><><><><><><><><
>>     
>
>
>
>
>   

-- 
Renaud Richardet
COO America
Wyona Inc.  -   Open Source Content Management   -   Apache Lenya
office +1 857 776-3195                     mobile +1 617 230 9112
renaud.richardet <at> wyona.com              http://www.wyona.com


-------------------------------------------------------------------------
Take Surveys. Earn Cash. Influence the Future of IT
Join SourceForge.net's Techsay panel and you'll get the chance to share your
opinions on IT & business topics through brief surveys -- and earn cash
http://www.techsay.com/default.php?page=join.php&p=sourceforge&CID=DEVDEV
_______________________________________________
Nutch-general mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/nutch-general

Reply via email to