Where Can I find Nutch war file??

2011-06-21 Thread Mohammad Hassan Pandi
Hi guys, there is no war file in build folder of my nutch. where can I find nutch war file to deploy on tomcat?

Re: Where Can I find Nutch war file??

2011-06-21 Thread Hasan Diwan
You'll need to build it yourself -- try, $ANT_HOME/bin/ant war or %ANT_HOME%\bin\ant war. Let me know how you get on... On 20 June 2011 23:19, Mohammad Hassan Pandi pandi...@gmail.com wrote: Hi guys, there is no war file in build folder of my nutch. where can I find nutch war file to deploy on

Re: Where Can I find Nutch war file??

2011-06-21 Thread Mohammad Hassan Pandi
Thanks for your response I got nutch-2010-07-07_04-49-04.tar.gz extracted and opened up the directory in Eclipse and run build.xml. There are several tasks in build.xml such as init, compile, compile-core, The tutorial I followed http://wiki.apache.org/nutch/NutchHadoopTutorial; says choose

Re: indexing hierarchical data, schema design

2011-06-21 Thread Khang Ich
Hi, how do you use Solr to index your documents ? The big question for me now is where can I access the current document, count how many persons are in the content,copy it n times, each containing one person and let the parsers that I already have deal with each document separately? And what

Empty indexes folder after crawling!

2011-06-21 Thread Mohammad Hassan Pandi
follwing http://wiki.apache.org/nutch/NutchHadoopTutorial I crawled lucene.apache.org with command bin/nutch crawl urlsdir -dir crawl -depth 3 and copy the whole thing to local file system by running the command bin/hadoop dfs -copyToLocal crawl /d01/local/ but the indexes folder is empty.

Re: Questions about upgrade to Nutch 1.3

2011-06-21 Thread Chip Calhoun
Ahh, thanks again. Based on your advice, I'm going back to Nutch 1.2 / Solr 1.4 and adding the Velocity contrib. Once I get that working, I'll try with Nutch 1.3 again. When I try to use Velocity now, I get this message: java.lang.RuntimeException: Can't find resource 'velocity.properties'

Re: how to classify the search results by an indexed field with lucene?

2011-06-21 Thread Joey
Hi all, Is there anyone who had ever encounted this problem before? Looking forward to your replying. :-) Thanks. Regards, Joey On 06/20/2011 02:09 PM, Joey Ma wrote: Hi all, I use lucene as the indexer in nutch 1.2. I want to get the classified search results by an indexed field, for

Re: Questions about upgrade to Nutch 1.3

2011-06-21 Thread Markus Jelsma
You can safely use 1.3 with Solr 3.1 and Velocity. I've got the stuff up and running as well. On Tuesday 21 June 2011 15:45:53 Chip Calhoun wrote: Ahh, thanks again. Based on your advice, I'm going back to Nutch 1.2 / Solr 1.4 and adding the Velocity contrib. Once I get that working, I'll

TestFetcher hangs

2011-06-21 Thread Nutch User - 1
I followed the tutorial Testing Nutch 2.0 under Eclipse (http://techvineyard.blogspot.com/2010/12/build-nutch-20.html) and ran into a problem. The JUnit test TestFetcher doesn't complete. It hangs on map 100% reduce 0%. The entire output is here (http://pastebin.com/m2XLjCTy). At first it didn't

hardware config / problems

2011-06-21 Thread BartHomer
hi everyone, i'm new to nutch and i have some trouble to get a good working nutch-cluster setup.. (nutch 1.2) my setup: 1 master (namenode, jobtracker, secondarynamenode) 2 nodes (datanode, tasktracker) all pc's are virtual machines and have 500mb ram each nutch-config: - mapred.map.tasks 2 -

Re: indexing hierarchical data, schema design

2011-06-21 Thread vinay vaish
Please don't mail me further. Thanks On 6/21/11, Khang Ich khang...@gmail.com wrote: Hi, how do you use Solr to index your documents ? The big question for me now is where can I access the current document, count how many persons are in the content,copy it n times, each containing one

helpful books or tutorials on nutch

2011-06-21 Thread Shouguo Li
hey guys i know this question has been asked several times on this mailing list but i didn't see good answers in the archive. are there any books or online tutorials that walks you through nutch with couple of real world scenarios? there are several wiki pages on nutch.apache.org, but they're too

Re: How do I debug why a url doesn't pass through generate despite being the only one?

2011-06-21 Thread Markus Jelsma
Did you rebuild the Nutch job file with the updated configuration? You were right, and indeed fixing that it now works locally. However trying it on the server it seems the configuration won't update. I'm not sure why! Where is that documented? On Mon, Jun 20, 2011 at 11:22 PM, Markus

Re: How do I debug why a url doesn't pass through generate despite being the only one?

2011-06-21 Thread Gabriele Kahlout
I tried that, but my code won't compile anymore. I was convinced the conf dir was external. On Tue, Jun 21, 2011 at 8:16 PM, Markus Jelsma markus.jel...@openindex.iowrote: Did you rebuild the Nutch job file with the updated configuration? You were right, and indeed fixing that it now works

Solrdedup NPE

2011-06-21 Thread Markus Jelsma
Hi, Any idea what the exception below can result from? The dedup queries go allright and produce normal results. Some indices will not generate this NPE. Cheers, 11/06/21 20:47:37 WARN mapred.LocalJobRunner: job_local_0001 java.lang.NullPointerException at