Hi
Can you attach the crawl-urlfilter...
Thanks
kishore
-Original Message-
From: Jaya Ghosh [mailto:[EMAIL PROTECTED]
Sent: Tuesday, January 29, 2008 5:22 PM
To: nutch-user@lucene.apache.org
Subject: RE: Nutch Implementation query
Hello Bhupal,
Thanks for the mail. I used src/java/org/apach
Hi
http://wiki.apache.org/nutch/NutchHadoopTutorial
kishore
-Original Message-
From: [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED]
Sent: Tuesday, January 22, 2008 11:27 PM
To: nutch-user@lucene.apache.org
Subject: Re: Crawl taking too much time
Hi,
Which article? Do you have a link?
Hi
Did you go through the article in the wiki?
Thanks
kishore
-Original Message-
From: [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED]
Sent: Tuesday, January 22, 2008 8:14 AM
To: nutch-user@lucene.apache.org
Subject: Re: Crawl taking too much time
Can you please let me know how to set nutch workin
Hi Dennis
Thanks for the reply...
But both my machines have 1 GB of RAM.
Thanks
kishore
-Original Message-
From: Dennis Kubes [mailto:[EMAIL PROTECTED]
Sent: Monday, January 21, 2008 8:05 PM
To: [EMAIL PROTECTED]
Cc: nutch-user@lucene.apache.org
Subject: Re: Crawl taking too much time
Fr
Hi...
I'm running Nutch (nightly build from last week) on 2 machines: 1 master,
and both as slaves.
But the crawling takes too much time. My machines have 1 GB of RAM. Is
that the problem?
Normal crawling (i.e. without Hadoop clusters) is working fine.
Please help me.
Thanks
kishore
Hi ...
Yes... it was a Hadoop version mismatch...
Thanks
kishore
-Original Message-
From: Dennis Kubes [mailto:[EMAIL PROTECTED]
Sent: Sunday, January 20, 2008 4:56 AM
To: nutch-user@lucene.apache.org
Subject: Re: pls help: rpc version mismatch
What this states is you are using a newer versi
Hi all
I'm running Nutch on two systems...
It shows a problem starting the DataNode on the client (RPC version mismatch
exception).
The entries in the log file are these...
2008-01-18 14:05:56,250 ERROR dfs.DataNode -
org.apache.hadoop.ipc.RPC$VersionMismatch: Protocol
org.apache.hadoop.dfs.DatanodeProtoco
Hi
Put
+.
at the top of the txt file (url-filter)
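For illustration only, here is a minimal Python sketch of why a leading "+." line lets every URL through. It assumes the filter file holds first-match-wins rules, each line being "+" or "-" followed by a regular expression, with "#" lines as comments. The function names and the example.com rules are hypothetical, and this is not Nutch's own code.

```python
import re


def parse_rules(text):
    """Parse '+regex' / '-regex' lines, skipping blanks and '#' comments."""
    rules = []
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#"):
            continue
        sign, pattern = line[0], line[1:]
        if sign not in "+-":
            raise ValueError(f"rule must start with + or -: {line!r}")
        rules.append((sign == "+", re.compile(pattern)))
    return rules


def accepts(rules, url):
    """Return the verdict of the first rule whose pattern is found in url."""
    for allow, pattern in rules:
        if pattern.search(url):
            return allow
    return False  # assumption: a URL matching no rule is dropped


# Hypothetical restrictive rule set: one domain allowed, the rest rejected.
RESTRICTIVE = r"""
# accept only example.com and its subdomains
+^http://([a-z0-9]*\.)*example\.com/
-.
"""

# The same rules with '+.' placed at the top: it matches first, every time.
ACCEPT_ALL_FIRST = "+.\n" + RESTRICTIVE

if __name__ == "__main__":
    for name, text in (("restrictive", RESTRICTIVE),
                       ("+. at top", ACCEPT_ALL_FIRST)):
        rules = parse_rules(text)
        for url in ("http://www.example.com/a", "http://other.org/"):
            print(name, url, accepts(rules, url))
```

Since "." matches any character, a "+." rule at the top accepts every URL before the later rules are consulted, so it is handy for testing but it also switches off any restrictions further down.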
Thanks
kishore
From: Volkan Ebil [mailto:[EMAIL PROTECTED]
Sent: Thu 1/17/2008 3:57 PM
To: nutch-user@lucene.apache.org
Subject: Eclipse-Crawl Problem
I configured Eclipse following RunNutchInEclipse0.9 document.B
Hi
I don't know about the special character part... but you can limit the URLs using
conf/urfilter.txt...
Thanks
kishore
-Original Message-
From: Volkan Ebil [mailto:[EMAIL PROTECTED]
Sent: Tuesday, January 15, 2008 6:13 PM
To: nutch-user@lucene.apache.org
Subject: Customize Crawling..
Hi,
I
Which version did you use?
Regards
kishore
-Original Message-
From: Suherdy Yacob [mailto:[EMAIL PROTECTED]
Sent: Tuesday, January 08, 2008 5:27 PM
To: [EMAIL PROTECTED]
Subject: Help me! got a problem when running nutch in eclipse
hi,
i want to debug nutch for my final assignmen
Hi
Check out suffix-urlfilter.xml in conf directory
Thanks and regards
kishore
-Original Message-
From: Jesiel Trevisan [mailto:[EMAIL PROTECTED]
Sent: Friday, January 04, 2008 4:16 PM
To: nutch-user@lucene.apache.org
Subject: Re: How To Create a Filter to Index Files Using Nutch 0.8.1
Hello