RE: Nutch Implementation query

2008-01-29 Thread kishore.krishna2
Hi, can you attach the crawl-urlfilter... Thanks, kishore -Original Message- From: Jaya Ghosh [mailto:[EMAIL PROTECTED] Sent: Tuesday, January 29, 2008 5:22 PM To: nutch-user@lucene.apache.org Subject: RE: Nutch Implementation query Hello Bhupal, Thanks for the mail. I used src/java/org/apach

RE: Crawl taking too much time

2008-01-22 Thread kishore.krishna2
Hi http://wiki.apache.org/nutch/NutchHadoopTutorial kishore -Original Message- From: [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED] Sent: Tuesday, January 22, 2008 11:27 PM To: nutch-user@lucene.apache.org Subject: Re: Crawl taking too much time Hi, Which article? Do you have a link?

RE: Crawl taking too much time

2008-01-21 Thread kishore.krishna2
Hi, did you go through the article in the wiki? Thanks, kishore -Original Message- From: [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED] Sent: Tuesday, January 22, 2008 8:14 AM To: nutch-user@lucene.apache.org Subject: Re: Crawl taking too much time Can you please let me know how to set nutch workin

RE: Crawl taking too much time

2008-01-21 Thread kishore.krishna2
Hi Dennis, thanks for the reply... But both my machines have 1 GB of RAM. Thanks, kishore -Original Message- From: Dennis Kubes [mailto:[EMAIL PROTECTED] Sent: Monday, January 21, 2008 8:05 PM To: [EMAIL PROTECTED] Cc: nutch-user@lucene.apache.org Subject: Re: Crawl taking too much time Fr

Crawl taking too much time

2008-01-20 Thread kishore.krishna2
Hi... I'm running Nutch (nightly build, last week's) on 2 machines, 1 as master and both as slaves, but the crawling takes too much time. My machines have 1 GB of RAM; is that the problem? Normal crawling (i.e. without a Hadoop cluster) works fine. Please help me. Thanks, kishore The information con

RE: pls help: rpc version mismatch

2008-01-20 Thread kishore.krishna2
Hi... Yes, it was a Hadoop version mismatch... Thanks, kishore -Original Message- From: Dennis Kubes [mailto:[EMAIL PROTECTED] Sent: Sunday, January 20, 2008 4:56 AM To: nutch-user@lucene.apache.org Subject: Re: pls help: rpc version mismatch What this states is you are using a newer versi

pls help: rpc version mismatch

2008-01-18 Thread kishore.krishna2
Hi all, I'm running Nutch on two systems... It shows a problem starting the datanode on the client (RPC version mismatch exception). The log file entries are: 2008-01-18 14:05:56,250 ERROR dfs.DataNode - org.apache.hadoop.ipc.RPC$VersionMismatch: Protocol org.apache.hadoop.dfs.DatanodeProtoco

RE: Eclipse-Crawl Problem

2008-01-17 Thread kishore.krishna2
Hi, put +. at the top of the txt file (url-filter). Thanks, kishore From: Volkan Ebil [mailto:[EMAIL PROTECTED] Sent: Thu 1/17/2008 3:57 PM To: nutch-user@lucene.apache.org Subject: Eclipse-Crawl Problem I configured Eclipse following RunNutchInEclipse0.9 document.B
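
For anyone reading this later: the "+. at the top" advice relies on the URL filter file being an ordered list of +regex / -regex lines where the first matching line decides. Below is a minimal Python sketch of that idea. It is not Nutch's code; the names parse_rules, accepts, CATCH_ALL_FIRST and CATCH_ALL_LAST and the example.com URLs are made up for illustration, and the assumption is that a URL matching no line is dropped.

```python
import re


def parse_rules(text):
    """Parse '+regex' / '-regex' lines; blank lines and '#' comments are skipped."""
    rules = []
    for raw in text.splitlines():
        line = raw.strip()
        if not line or line.startswith("#"):
            continue
        sign, pattern = line[0], line[1:]
        if sign not in "+-":
            raise ValueError(f"rule must start with + or -: {line!r}")
        # True means "accept", False means "reject".
        rules.append((sign == "+", re.compile(pattern)))
    return rules


def accepts(rules, url):
    """The first rule whose regex is found in the URL decides; no match means reject."""
    for allow, pattern in rules:
        if pattern.search(url):
            return allow
    return False


# Hypothetical rule files: the same two lines in opposite order.
CATCH_ALL_FIRST = parse_rules("+.\n-\\.(gif|jpg)$")
CATCH_ALL_LAST = parse_rules("-\\.(gif|jpg)$\n+.")

print(accepts(CATCH_ALL_FIRST, "http://example.com/logo.gif"))
print(accepts(CATCH_ALL_LAST, "http://example.com/logo.gif"))
```

Under these assumptions, a catch-all "+." on the first line accepts every URL, so any exclusion lines below it never get a chance to apply. That is fine for getting a first crawl going, but it is probably not what you want once you need to restrict the crawl.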

RE: Customize Crawling..

2008-01-15 Thread kishore.krishna2
Hi, I don't know about the special character part... but you can limit the URLs using conf/urfilter.txt... Thanks, kishore -Original Message- From: Volkan Ebil [mailto:[EMAIL PROTECTED] Sent: Tuesday, January 15, 2008 6:13 PM To: nutch-user@lucene.apache.org Subject: Customize Crawling.. Hi, I

RE: Help me! got a problem when running nutch in eclipse

2008-01-08 Thread kishore.krishna2
Which version did you use? Regards, kishore -Original Message- From: Suherdy Yacob [mailto:[EMAIL PROTECTED] Sent: Tuesday, January 08, 2008 5:27 PM To: [EMAIL PROTECTED] Subject: Help me! got a problem when running nutch in eclipse hi, i want to debug nutch for my final assignmen

RE: How To Create a Filter to Index Files Using Nutch 0.8.1

2008-01-04 Thread kishore.krishna2
Hi, check out suffix-urlfilter.xml in the conf directory. Thanks and regards, kishore -Original Message- From: Jesiel Trevisan [mailto:[EMAIL PROTECTED] Sent: Friday, January 04, 2008 4:16 PM To: nutch-user@lucene.apache.org Subject: Re: How To Create a Filter to Index Files Using Nutch 0.8.1 Hello