Hi Denis ,
According to your tutorial
(http://wiki.apache.org/nutch/NutchHadoopTutorial):
I have setup Nutch and Hadoop,so far so good.But when I performing a
search,It always return 0 hit.
So I want to do a search without hadoop, and used the command followed:
bin/hadoop dfs -copyToLocal crawled crawled
It seems that there is somthing wrong.would you give me some tips to debug
it? I use the nutch 0.8 392087 revision.
The output said:
060407 172334 parsing
jar:file:/nutch/search/lib/hadoop-0.1-dev.jar!/hadoop-defa
ult.xml
060407 172334 parsing file:/nutch/search/conf/hadoop-site.xml
060407 172334 No FS indicated, using default:boxA:9000
060407 172334 Client connection to 127.0.0.1:9000: starting
060407 172335 Problem opening checksum file:
/user/nutch/crawled/indexes/part-00
000/index.done. Ignoring with exception java.rmi.RemoteException:
java.io.IOExc eption: Cannot
open filename /user/nutch/crawled/indexes/part-00000/.index.done.
crc
at org.apache.hadoop.dfs.NameNode.open(NameNode.java:120)
at sun.reflect.GeneratedMethodAccessor45.invoke(Unknown Source)
at
sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAcces
sorImpl.java:25)
at java.lang.reflect.Method.invoke(Method.java:585)
at org.apache.hadoop.ipc.RPC$Server.call(RPC.java:237)
at org.apache.hadoop.ipc.Server$Handler.run(Server.java:216)
.
060407 172335 Problem opening checksum file:
/user/nutch/crawled/indexes/part-00
001/index.done. Ignoring with exception java.rmi.RemoteException:
java.io.IOExc eption: Cannot
open filename /user/nutch/crawled/indexes/part-00001/.index.done.
crc
at org.apache.hadoop.dfs.NameNode.open(NameNode.java:120)
at sun.reflect.GeneratedMethodAccessor45.invoke(Unknown Source)
at
sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAcces
sorImpl.java:25)
at java.lang.reflect.Method.invoke(Method.java:585)
at org.apache.hadoop.ipc.RPC$Server.call(RPC.java:237)
at org.apache.hadoop.ipc.Server$Handler.run(Server.java:216)
.
060407 172335 Problem opening checksum file:
/user/nutch/crawled/indexes/part-00
002/index.done. Ignoring with exception java.rmi.RemoteException:
java.io.IOExc eption: Cannot
open filename /user/nutch/crawled/indexes/part-00002/.index.done.
crc
at org.apache.hadoop.dfs.NameNode.open(NameNode.java:120)
at sun.reflect.GeneratedMethodAccessor45.invoke(Unknown Source)
at
sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAcces
sorImpl.java:25)
at java.lang.reflect.Method.invoke(Method.java:585)
at org.apache.hadoop.ipc.RPC$Server.call(RPC.java:237)
at org.apache.hadoop.ipc.Server$Handler.run(Server.java:216)
.
060407 172335 Problem opening checksum file:
/user/nutch/crawled/indexes/part-00
003/index.done. Ignoring with exception java.rmi.RemoteException:
java.io.IOExc eption: Cannot
open filename /user/nutch/crawled/indexes/part-00003/.index.done.
crc
at org.apache.hadoop.dfs.NameNode.open(NameNode.java:120)
at sun.reflect.GeneratedMethodAccessor45.invoke(Unknown Source)
at
sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAcces
sorImpl.java:25)
at java.lang.reflect.Method.invoke(Method.java:585)
at org.apache.hadoop.ipc.RPC$Server.call(RPC.java:237)
at org.apache.hadoop.ipc.Server$Handler.run(Server.java:216)
.
060407 172335 Problem opening checksum file:
/user/nutch/crawled/indexes/part-00
004/index.done. Ignoring with exception java.rmi.RemoteException:
java.io.IOExc eption: Cannot
open filename /user/nutch/crawled/indexes/part-00004/.index.done.
crc
at org.apache.hadoop.dfs.NameNode.open(NameNode.java:120)
at sun.reflect.GeneratedMethodAccessor45.invoke(Unknown Source)
at
sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAcces
sorImpl.java:25)
at java.lang.reflect.Method.invoke(Method.java:585)
at org.apache.hadoop.ipc.RPC$Server.call(RPC.java:237)
at org.apache.hadoop.ipc.Server$Handler.run(Server.java:216)
.
060407 172335 Problem opening checksum file:
/user/nutch/crawled/indexes/part-00
005/index.done. Ignoring with exception java.rmi.RemoteException:
java.io.IOExc eption: Cannot
open filename /user/nutch/crawled/indexes/part-00005/.index.done.
crc
at org.apache.hadoop.dfs.NameNode.open(NameNode.java:120)
at sun.reflect.GeneratedMethodAccessor45.invoke(Unknown Source)
at
sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAcces
sorImpl.java:25)
at java.lang.reflect.Method.invoke(Method.java:585)
at org.apache.hadoop.ipc.RPC$Server.call(RPC.java:237)
at org.apache.hadoop.ipc.Server$Handler.run(Server.java:216)
.
060407 172335 Problem opening checksum file:
/user/nutch/crawled/indexes/part-00
006/index.done. Ignoring with exception java.rmi.RemoteException:
java.io.IOExc eption: Cannot
open filename /user/nutch/crawled/indexes/part-00006/.index.done.
crc
at org.apache.hadoop.dfs.NameNode.open(NameNode.java:120)
at sun.reflect.GeneratedMethodAccessor45.invoke(Unknown Source)
at
sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAcces
sorImpl.java:25)
at java.lang.reflect.Method.invoke(Method.java:585)
at org.apache.hadoop.ipc.RPC$Server.call(RPC.java:237)
at org.apache.hadoop.ipc.Server$Handler.run(Server.java:216)
.
060407 172335 Problem opening checksum file:
/user/nutch/crawled/indexes/part-00
007/index.done. Ignoring with exception java.rmi.RemoteException:
java.io.IOExc eption: Cannot
open filename /user/nutch/crawled/indexes/part-00007/.index.done.
crc
at org.apache.hadoop.dfs.NameNode.open(NameNode.java:120)
at sun.reflect.GeneratedMethodAccessor45.invoke(Unknown Source)
at
sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAcces
sorImpl.java:25)
at java.lang.reflect.Method.invoke(Method.java:585)
at org.apache.hadoop.ipc.RPC$Server.call(RPC.java:237)
at org.apache.hadoop.ipc.Server$Handler.run(Server.java:216)
.
060407 172335 Problem opening checksum file:
/user/nutch/crawled/indexes/part-00
008/index.done. Ignoring with exception java.rmi.RemoteException:
java.io.IOExc eption: Cannot
open filename /user/nutch/crawled/indexes/part-00008/.index.done.
crc
at org.apache.hadoop.dfs.NameNode.open(NameNode.java:120)
at sun.reflect.GeneratedMethodAccessor45.invoke(Unknown Source)
at
sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAcces
sorImpl.java:25)
at java.lang.reflect.Method.invoke(Method.java:585)
at org.apache.hadoop.ipc.RPC$Server.call(RPC.java:237)
at org.apache.hadoop.ipc.Server$Handler.run(Server.java:216)
.
060407 172335 Problem opening checksum file:
/user/nutch/crawled/indexes/part-00
009/index.done. Ignoring with exception java.rmi.RemoteException:
java.io.IOExc eption: Cannot
open filename /user/nutch/crawled/indexes/part-00009/.index.done.
crc
at org.apache.hadoop.dfs.NameNode.open(NameNode.java:120)
at sun.reflect.GeneratedMethodAccessor45.invoke(Unknown Source)
at
sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAcces
sorImpl.java:25)
at java.lang.reflect.Method.invoke(Method.java:585)
at org.apache.hadoop.ipc.RPC$Server.call(RPC.java:237)
at org.apache.hadoop.ipc.Server$Handler.run(Server.java:216)
.
060407 172335 Problem opening checksum file:
/user/nutch/crawled/indexes/part-00
010/index.done. Ignoring with exception java.rmi.RemoteException:
java.io.IOExc eption: Cannot
open filename /user/nutch/crawled/indexes/part-00010/.index.done.
crc
at org.apache.hadoop.dfs.NameNode.open(NameNode.java:120)
at sun.reflect.GeneratedMethodAccessor45.invoke(Unknown Source)
at
sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAcces
sorImpl.java:25)
at java.lang.reflect.Method.invoke(Method.java:585)
at org.apache.hadoop.ipc.RPC$Server.call(RPC.java:237)
at org.apache.hadoop.ipc.Server$Handler.run(Server.java:216)
.
060407 172335 Problem opening checksum file:
/user/nutch/crawled/indexes/part-00
011/index.done. Ignoring with exception java.rmi.RemoteException:
java.io.IOExc eption: Cannot
open filename /user/nutch/crawled/indexes/part-00011/.index.done.
crc
at org.apache.hadoop.dfs.NameNode.open(NameNode.java:120)
at sun.reflect.GeneratedMethodAccessor45.invoke(Unknown Source)
at
sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAcces
sorImpl.java:25)
at java.lang.reflect.Method.invoke(Method.java:585)
at org.apache.hadoop.ipc.RPC$Server.call(RPC.java:237)
at org.apache.hadoop.ipc.Server$Handler.run(Server.java:216)
.
Best regards,
Lin Yuan
_________________________________________________________________
与联机的朋友进行交流,请使用 MSN Messenger: http://messenger.msn.com/cn
-------------------------------------------------------
This SF.Net email is sponsored by xPML, a groundbreaking scripting language
that extends applications into web and mobile media. Attend the live webcast
and join the prime developer group breaking into this new coding territory!
http://sel.as-us.falkag.net/sel?cmd=lnk&kid=110944&bid=241720&dat=121642
_______________________________________________
Nutch-general mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/nutch-general