Copying from Hadoop to local and then performing a search on the index is a
question that needs to be posted to the list. My guess would be that you
have an older version of the code and there were some bugs copying crc
files.  I think I remember something about that on the list a little while
back.  So you might want to update and rebuild your code base.
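
To see what actually landed on the local disk after the copy, here is a
minimal sketch in Python. It only walks the copied directory and reports
checksum sidecar files, assuming they follow the ".<name>.crc" pattern seen
in the log (for example .index.done.crc). The function names and the default
directory "crawled" are made up for illustration; nothing here is part of
Nutch or Hadoop, and whether a missing sidecar matters depends on the
version you are running.

```python
import os
import sys


def is_crc_sidecar(name):
    """True for names shaped like Hadoop checksum sidecars, e.g. '.index.done.crc'."""
    return name.startswith(".") and name.endswith(".crc")


def find_crc_files(root):
    """Return sorted paths (relative to root) of all '.<name>.crc' files."""
    found = []
    for dirpath, _dirnames, filenames in os.walk(root):
        for name in filenames:
            if is_crc_sidecar(name):
                full = os.path.join(dirpath, name)
                found.append(os.path.relpath(full, root))
    return sorted(found)


def files_missing_crc(root):
    """Return sorted paths (relative to root) of files with no '.<name>.crc' next to them."""
    missing = []
    for dirpath, _dirnames, filenames in os.walk(root):
        names = set(filenames)
        for name in filenames:
            if is_crc_sidecar(name):
                continue  # the sidecar itself needs no sidecar
            if "." + name + ".crc" not in names:
                full = os.path.join(dirpath, name)
                missing.append(os.path.relpath(full, root))
    return sorted(missing)


if __name__ == "__main__":
    # Hypothetical default: the local target of the -copyToLocal command.
    crawl_dir = sys.argv[1] if len(sys.argv) > 1 else "crawled"
    print("checksum files found:")
    for path in find_crc_files(crawl_dir):
        print("  " + path)
    print("files without a checksum sidecar:")
    for path in files_missing_crc(crawl_dir):
        print("  " + path)
```

Run it with the local crawl directory as the only argument. It is a
read-only report and does not change or repair anything.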

If you want to do a crawl and search without using Hadoop, follow the Nutch
0.8 tutorial on the website (not the wiki) for a regular crawl.  You would also
want to set fs.default.name to local and comment out the rest of the
options in the hadoop-site.xml file.  Also make sure to set searcher.dir in the
nutch-site.xml file in the WEB-INF/classes directory to the absolute path of
the crawl directory, as below.

Dennis

<?xml version="1.0"?>
<?xml-stylesheet type="text/xsl" href="configuration.xsl"?>
<configuration>
  <property>
    <name>fs.default.name</name>
    <value>local</value>
  </property>
  <property>
    <name>searcher.dir</name>
    <value>C:\TESTBED\NUTCH\CRAWLED</value>
  </property>
</configuration>

-----Original Message-----
From: lin yuan [mailto:[EMAIL PROTECTED] 
Sent: Friday, April 07, 2006 4:33 AM
To: [email protected]
Subject: please help!! It always return 0 hit.

Hi Dennis,
 Following your tutorial
(http://wiki.apache.org/nutch/NutchHadoopTutorial),
I have set up Nutch and Hadoop, and so far so good. But when I perform a search,
it always returns 0 hits.
  So I want to do a search without Hadoop, and used the following command:
    bin/hadoop dfs -copyToLocal crawled crawled

 It seems that there is something wrong. Would you give me some tips to debug
it? I use Nutch 0.8, revision 392087.
 The output said:

060407 172334 parsing jar:file:/nutch/search/lib/hadoop-0.1-dev.jar!/hadoop-default.xml
060407 172334 parsing file:/nutch/search/conf/hadoop-site.xml
060407 172334 No FS indicated, using default:boxA:9000
060407 172334 Client connection to 127.0.0.1:9000: starting
060407 172335 Problem opening checksum file: /user/nutch/crawled/indexes/part-00000/index.done.  Ignoring with exception java.rmi.RemoteException: java.io.IOException: Cannot open filename /user/nutch/crawled/indexes/part-00000/.index.done.crc
        at org.apache.hadoop.dfs.NameNode.open(NameNode.java:120)
        at sun.reflect.GeneratedMethodAccessor45.invoke(Unknown Source)
        at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
        at java.lang.reflect.Method.invoke(Method.java:585)
        at org.apache.hadoop.ipc.RPC$Server.call(RPC.java:237)
        at org.apache.hadoop.ipc.Server$Handler.run(Server.java:216)
.
060407 172335 Problem opening checksum file: /user/nutch/crawled/indexes/part-00001/index.done.  Ignoring with exception java.rmi.RemoteException: java.io.IOException: Cannot open filename /user/nutch/crawled/indexes/part-00001/.index.done.crc
        at org.apache.hadoop.dfs.NameNode.open(NameNode.java:120)
        at sun.reflect.GeneratedMethodAccessor45.invoke(Unknown Source)
        at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
        at java.lang.reflect.Method.invoke(Method.java:585)
        at org.apache.hadoop.ipc.RPC$Server.call(RPC.java:237)
        at org.apache.hadoop.ipc.Server$Handler.run(Server.java:216)
.
060407 172335 Problem opening checksum file: /user/nutch/crawled/indexes/part-00002/index.done.  Ignoring with exception java.rmi.RemoteException: java.io.IOException: Cannot open filename /user/nutch/crawled/indexes/part-00002/.index.done.crc
        at org.apache.hadoop.dfs.NameNode.open(NameNode.java:120)
        at sun.reflect.GeneratedMethodAccessor45.invoke(Unknown Source)
        at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
        at java.lang.reflect.Method.invoke(Method.java:585)
        at org.apache.hadoop.ipc.RPC$Server.call(RPC.java:237)
        at org.apache.hadoop.ipc.Server$Handler.run(Server.java:216)
.
060407 172335 Problem opening checksum file: /user/nutch/crawled/indexes/part-00003/index.done.  Ignoring with exception java.rmi.RemoteException: java.io.IOException: Cannot open filename /user/nutch/crawled/indexes/part-00003/.index.done.crc
        at org.apache.hadoop.dfs.NameNode.open(NameNode.java:120)
        at sun.reflect.GeneratedMethodAccessor45.invoke(Unknown Source)
        at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
        at java.lang.reflect.Method.invoke(Method.java:585)
        at org.apache.hadoop.ipc.RPC$Server.call(RPC.java:237)
        at org.apache.hadoop.ipc.Server$Handler.run(Server.java:216)
.
060407 172335 Problem opening checksum file: /user/nutch/crawled/indexes/part-00004/index.done.  Ignoring with exception java.rmi.RemoteException: java.io.IOException: Cannot open filename /user/nutch/crawled/indexes/part-00004/.index.done.crc
        at org.apache.hadoop.dfs.NameNode.open(NameNode.java:120)
        at sun.reflect.GeneratedMethodAccessor45.invoke(Unknown Source)
        at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
        at java.lang.reflect.Method.invoke(Method.java:585)
        at org.apache.hadoop.ipc.RPC$Server.call(RPC.java:237)
        at org.apache.hadoop.ipc.Server$Handler.run(Server.java:216)
.
060407 172335 Problem opening checksum file: /user/nutch/crawled/indexes/part-00005/index.done.  Ignoring with exception java.rmi.RemoteException: java.io.IOException: Cannot open filename /user/nutch/crawled/indexes/part-00005/.index.done.crc
        at org.apache.hadoop.dfs.NameNode.open(NameNode.java:120)
        at sun.reflect.GeneratedMethodAccessor45.invoke(Unknown Source)
        at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
        at java.lang.reflect.Method.invoke(Method.java:585)
        at org.apache.hadoop.ipc.RPC$Server.call(RPC.java:237)
        at org.apache.hadoop.ipc.Server$Handler.run(Server.java:216)
.
060407 172335 Problem opening checksum file: /user/nutch/crawled/indexes/part-00006/index.done.  Ignoring with exception java.rmi.RemoteException: java.io.IOException: Cannot open filename /user/nutch/crawled/indexes/part-00006/.index.done.crc
        at org.apache.hadoop.dfs.NameNode.open(NameNode.java:120)
        at sun.reflect.GeneratedMethodAccessor45.invoke(Unknown Source)
        at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
        at java.lang.reflect.Method.invoke(Method.java:585)
        at org.apache.hadoop.ipc.RPC$Server.call(RPC.java:237)
        at org.apache.hadoop.ipc.Server$Handler.run(Server.java:216)
.
060407 172335 Problem opening checksum file: /user/nutch/crawled/indexes/part-00007/index.done.  Ignoring with exception java.rmi.RemoteException: java.io.IOException: Cannot open filename /user/nutch/crawled/indexes/part-00007/.index.done.crc
        at org.apache.hadoop.dfs.NameNode.open(NameNode.java:120)
        at sun.reflect.GeneratedMethodAccessor45.invoke(Unknown Source)
        at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
        at java.lang.reflect.Method.invoke(Method.java:585)
        at org.apache.hadoop.ipc.RPC$Server.call(RPC.java:237)
        at org.apache.hadoop.ipc.Server$Handler.run(Server.java:216)
.
060407 172335 Problem opening checksum file: /user/nutch/crawled/indexes/part-00008/index.done.  Ignoring with exception java.rmi.RemoteException: java.io.IOException: Cannot open filename /user/nutch/crawled/indexes/part-00008/.index.done.crc
        at org.apache.hadoop.dfs.NameNode.open(NameNode.java:120)
        at sun.reflect.GeneratedMethodAccessor45.invoke(Unknown Source)
        at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
        at java.lang.reflect.Method.invoke(Method.java:585)
        at org.apache.hadoop.ipc.RPC$Server.call(RPC.java:237)
        at org.apache.hadoop.ipc.Server$Handler.run(Server.java:216)
.
060407 172335 Problem opening checksum file: /user/nutch/crawled/indexes/part-00009/index.done.  Ignoring with exception java.rmi.RemoteException: java.io.IOException: Cannot open filename /user/nutch/crawled/indexes/part-00009/.index.done.crc
        at org.apache.hadoop.dfs.NameNode.open(NameNode.java:120)
        at sun.reflect.GeneratedMethodAccessor45.invoke(Unknown Source)
        at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
        at java.lang.reflect.Method.invoke(Method.java:585)
        at org.apache.hadoop.ipc.RPC$Server.call(RPC.java:237)
        at org.apache.hadoop.ipc.Server$Handler.run(Server.java:216)
.
060407 172335 Problem opening checksum file: /user/nutch/crawled/indexes/part-00010/index.done.  Ignoring with exception java.rmi.RemoteException: java.io.IOException: Cannot open filename /user/nutch/crawled/indexes/part-00010/.index.done.crc
        at org.apache.hadoop.dfs.NameNode.open(NameNode.java:120)
        at sun.reflect.GeneratedMethodAccessor45.invoke(Unknown Source)
        at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
        at java.lang.reflect.Method.invoke(Method.java:585)
        at org.apache.hadoop.ipc.RPC$Server.call(RPC.java:237)
        at org.apache.hadoop.ipc.Server$Handler.run(Server.java:216)
.
060407 172335 Problem opening checksum file: /user/nutch/crawled/indexes/part-00011/index.done.  Ignoring with exception java.rmi.RemoteException: java.io.IOException: Cannot open filename /user/nutch/crawled/indexes/part-00011/.index.done.crc
        at org.apache.hadoop.dfs.NameNode.open(NameNode.java:120)
        at sun.reflect.GeneratedMethodAccessor45.invoke(Unknown Source)
        at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
        at java.lang.reflect.Method.invoke(Method.java:585)
        at org.apache.hadoop.ipc.RPC$Server.call(RPC.java:237)
        at org.apache.hadoop.ipc.Server$Handler.run(Server.java:216)
.




Best regards,
   Lin Yuan

_______________________________________________
Nutch-general mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/nutch-general
