I checked it, it includes the same error, which I already indicated; crawl started in: crawl rootUrlDir = urls threads = 10 depth = 1 indexer=lucene Injector: starting Injector: crawlDb: crawl/crawldb Injector: urlDir: urls Injector: Converting injected urls to crawl db entries. Injector: Merging injected urls into crawl db. Injector: done Generator: Selecting best-scoring urls due for fetch. Generator: starting Generator: filtering: true Generator: normalizing: true Generator: jobtracker is 'local', generating exactly one partition. Generator: 0 records selected for fetching, exiting ... Exception in thread "main" java.lang.NullPointerException at org.apache.nutch.crawl.Crawl.main(Crawl.java:133)
Problem looks related with plugins. -- Yavuz Selim YILMAZ 2010/10/6 Ahmad Al-Amri <[email protected]> > have you checked the log file ? > > > > > > ________________________________ > From: Yavuz Selim YILMAZ <[email protected]> > To: [email protected] > Sent: Wed, October 6, 2010 9:50:18 AM > Subject: Re: Nutch-Eclipse > > I checked the post, my conf is in my classpath, the problem is not related > with that. > It looks everything is same with a fresh nutch build with ant, but there is > such error, still can't understand. > -- > > Yavuz Selim YILMAZ > > > 2010/10/6 Ahmad Al-Amri <[email protected]> > > > > > "check wether your "Working directory"(Run -> Run Configurations -> Tab > > Arguments -> Working Directory) points to the Nutch base directory (where > > your conf/nucht-site.xml is located)." > > check this: http://search-lucene.com/m/xwpG02rotOD1/v=threaded > > > > you can see hadoop.log to get what is the error exactly > > > > Regards; > > Ahmad > > > > > > > > > > ________________________________ > > From: Yavuz Selim YILMAZ <[email protected]> > > To: [email protected] > > Sent: Tue, October 5, 2010 7:23:14 PM > > Subject: Re: Nutch-Eclipse > > > > I use standard arguments with standard nutch script > > > > bin/nutch crawl urls -dir crawl -depth 2. > > -- > > > > Yavuz Selim YILMAZ > > > > > > 2010/10/5 Bahadir Cambel <[email protected]> > > > > > Could you share your script ? What are the arguments ? > > > > > > Did you try to run it via bin/nutch script ? > > > > > > Regards , > > > Bahadir Cambel > > > > > > On Tue, Oct 5, 2010 at 2:11 PM, Yavuz Selim YILMAZ > > > <[email protected]>wrote: > > > > > > > I can't compile healthy from eclipse, I tried ant and managed to > > compile, > > > > but still same error, any suggestion ? > > > > -- > > > > > > > > Yavuz Selim YILMAZ > > > > > > > > > > > > 2010/10/5 Yavuz Selim YILMAZ <[email protected]> > > > > > > > > > No, my urls folder has urls and, in my regex filter, there is just > > "+" > > > to > > > > > allow everything. > > > > > -- > > > > > > > > > > Yavuz Selim YILMAZ > > > > > > > > > > > > > > > 2010/10/5 Markus Jelsma <[email protected]> > > > > > > > > > > It seems you're trying to fetch 0 url's. Inject correct url's or > > adjust > > > > >> your url filters as not to filter out your injected url's. > > > > >> > > > > >> -----Original message----- > > > > >> From: Yavuz Selim YILMAZ <[email protected]> > > > > >> Sent: Tue 05-10-2010 13:16 > > > > >> To: user <[email protected]>; > > > > >> Subject: Nutch-Eclipse > > > > >> > > > > >> I tried to configure nutch on eclipse. > > > > >> Use Nutch 1.2 > > > > >> Eclipse Helios > > > > >> > > > > >> I got this kind of error, trying to crawl; > > > > >> > > > > >> crawl started in: crawl > > > > >> rootUrlDir = urls > > > > >> threads = 10 > > > > >> depth = 1 > > > > >> indexer=lucene > > > > >> Injector: starting > > > > >> Injector: crawlDb: crawl/crawldb > > > > >> Injector: urlDir: urls > > > > >> Injector: Converting injected urls to crawl db entries. > > > > >> Injector: Merging injected urls into crawl db. > > > > >> Injector: done > > > > >> Generator: Selecting best-scoring urls due for fetch. > > > > >> Generator: starting > > > > >> Generator: filtering: true > > > > >> Generator: normalizing: true > > > > >> Generator: jobtracker is 'local', generating exactly one > partition. > > > > >> Generator: 0 records selected for fetching, exiting ... > > > > >> Exception in thread "main" java.lang.NullPointerException > > > > >> at org.apache.nutch.crawl.Crawl.main(Crawl.java:133) > > > > >> > > > > >> I guess, I can'y configure plugin enviroment that, after build my > > > code, > > > > I > > > > >> see that, there is no class files for plugins. > > > > >> Anybody has a suggestion? I also send my classpath. > > > > >> -- > > > > >> > > > > >> Yavuz Selim YILMAZ > > > > >> > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > >

