Hi,
I downloaded SMB protocol plugin from following location:
http://issues.apache.org/jira/browse/NUTCH-427
I configured it (as mentioned it in read.txt) with Nutch. But when I tried to
crawl, nothing gets crawled and get following exception in hadoop log.
2009-12-21 16:25:04,728 FATAL smb.SMB - Could not read content of protocol:
smb://10.88.45.140/shared_folder/
jcifs.smb.SmbException:
jcifs.util.transport.TransportException
java.net.SocketException: Invalid argument or cannot assign requested address
at java.net.PlainSocketImpl.socketConnect(Native Method)
at java.net.PlainSocketImpl.doConnect(PlainSocketImpl.java:333)
at java.net.PlainSocketImpl.connectToAddress(PlainSocketImpl.java:195)
at java.net.PlainSocketImpl.connect(PlainSocketImpl.java:182)
at java.net.SocksSocketImpl.connect(SocksSocketImpl.java:366)
at java.net.Socket.connect(Socket.java:525)
at java.net.Socket.connect(Socket.java:475)
at java.net.Socket.<init>(Socket.java:372)
at java.net.Socket.<init>(Socket.java:246)
at jcifs.smb.SmbTransport.negotiate(SmbTransport.java:244)
at jcifs.smb.SmbTransport.doConnect(SmbTransport.java:299)
at jcifs.util.transport.Transport.run(Transport.java:240)
at java.lang.Thread.run(Thread.java:619)
at jcifs.util.transport.Transport.run(Transport.java:256)
at java.lang.Thread.run(Thread.java:619)
at jcifs.smb.SmbTransport.connect(SmbTransport.java:289)
at jcifs.smb.SmbTree.treeConnect(SmbTree.java:139)
at jcifs.smb.SmbFile.connect(SmbFile.java:798)
at jcifs.smb.SmbFile.connect0(SmbFile.java:768)
at jcifs.smb.SmbFile.exists(SmbFile.java:1275)
at
org.apache.nutch.protocol.smb.SMBResponse.<init>(SMBResponse.java:74)
at org.apache.nutch.protocol.smb.SMB.getProtocolOutput(SMB.java:62)
at
org.apache.nutch.fetcher.Fetcher$FetcherThread.run(Fetcher.java:535)
Has anyone used SMB protocol plugin before?
Thanks,
Rupesh
DISCLAIMER
==========
This e-mail may contain privileged and confidential information which is the
property of Persistent Systems Ltd. It is intended only for the use of the
individual or entity to which it is addressed. If you are not the intended
recipient, you are not authorized to read, retain, copy, print, distribute or
use this message. If you have received this communication in error, please
notify the sender and delete all copies of this message. Persistent Systems
Ltd. does not accept any liability for virus infected mails.