Hello,

Because of the "File not found" errors in the nightly Nutch build when using
the local filesystem, I tried NDFS instead (with nutch-nightly from 10 October).


This produces the errors shown below.
Maybe I'm doing something wrong when running NDFS.
The server is running on Windows 2000 Server.
The directories testdata1 (incl. "data" and "tmp") and testname (incl. "image"
and the file "edits") are created OK, but everything in them is empty.

Thanks for replying

Regards
Carsten

===== ERRORS and LOGS ======================================

1st Starting DataNode:
----------------------
$ bin/nutch datanode testdata1 myServer myServer:9000
041011 130646 Opened server at 7000
041011 130647 Lost connection to namenode [myServer/10.x.x.x:9000].  Retrying...
041011 130653 Client connection to 10.x.x.x:9000: starting
...

2nd Starting NameNode:
----------------------
$ bin/nutch namenode 9000 testname
041011 130652 Server listener on port 9000: starting
041011 130653 Server handler on 9000: starting
041011 130653 Server handler on 9000: starting
041011 130653 Server handler on 9000: starting
041011 130653 Server handler on 9000: starting
041011 130653 Server handler on 9000: starting
041011 130653 Server handler on 9000: starting
041011 130653 Server handler on 9000: starting
041011 130653 Server handler on 9000: starting
041011 130653 Server handler on 9000: starting
041011 130653 Server handler on 9000: starting
041011 130653 Server connection on port 9000 from 10.x.x.x: starting
...


3rd PUTting File Error:
-----------------------
041011 130447 Client connection to 10.x.x.x:9000: starting
Exception in thread "main" java.io.IOException: Could not obtain new output block for file README.txt
        at net.nutch.fs.NDFSClient$NameNodeCaller.getNewOutputBlock(NDFSClient.java:925)
        at net.nutch.fs.NDFSClient$NDFSOutputStream.nextBlockOutputStream(NDFSClient.java:603)
        at net.nutch.fs.NDFSClient$NDFSOutputStream.<init>(NDFSClient.java:584)
        at net.nutch.fs.NDFSClient.create(NDFSClient.java:71)
        at net.nutch.util.NDFSFileSystem.create(NDFSFileSystem.java:48)
        at net.nutch.util.NDFSFileSystem.create(NDFSFileSystem.java:43)
        at net.nutch.util.NDFSFileSystem.addLocalFile(NDFSFileSystem.java:140)
        at net.nutch.fs.TestClient.put(TestClient.java:32)
        at net.nutch.fs.TestClient.main(TestClient.java:199)
041011 130532 Client connection to 10.x.x.x:9000 caught: java.net.SocketException: Connection reset
java.net.SocketException: Connection reset
        at java.net.SocketInputStream.read(SocketInputStream.java:168)
        at java.io.BufferedInputStream.fill(BufferedInputStream.java:183)
        at java.io.BufferedInputStream.read(BufferedInputStream.java:201)
        at java.io.DataInputStream.readInt(DataInputStream.java:443)
        at net.nutch.ipc.Client$Connection.run(Client.java:94)
041011 130532 Client connection to 10.x.x.x:9000: closing
041011 130533 Problem making IPC call on myServer/10.x.x.x:9000
041011 130533 Stopping client
....


4th testing Crawl Error:
------------------------
$ bin/nutch crawl -ndfs myServer:9000 myurlList.txt -depth 10
041011 141342 loading file:/E:/Search/nutch-nightly/conf/nutch-default.xml
041011 141342 Client connection to 10.x.x.x:9000: starting
041011 141342 loading file:/E:/Search/nutch-nightly/conf/crawl-tool.xml
041011 141342 loading file:/E:/Search/nutch-nightly/conf/nutch-site.xml
041011 141342 crawl started in: crawl-20041011141342
041011 141342 rootUrlFile = url
041011 141342 threads = 40
041011 141342 depth = 10
041011 141342 Client connection to 10.x.x.x:9000: starting
Exception in thread "main" java.io.IOException: Database E:\Search\nutch-nightly\crawl-20041011141342\db is not a directory.
        at net.nutch.db.WebDBWriter.<init>(WebDBWriter.java:1461)
        at net.nutch.db.WebDBWriter.createWebDB(WebDBWriter.java:1424)
        at net.nutch.tools.WebDBAdminTool.main(WebDBAdminTool.java:158)
        at net.nutch.tools.CrawlTool.main(CrawlTool.java:119)


_______________________________________________
Nutch-developers mailing list
[EMAIL PROTECTED]
https://lists.sourceforge.net/lists/listinfo/nutch-developers
