Re: nutch 1.0 Question

2009-08-29 Thread yangfeng
Your should use JDK,not JRE and please change the JDK version to 1.6 2009/8/29 �v 磊 > Dears, > > I come across a problem when I use eclipse to import nutch 1.0. > > The problem source file is DistributedSegmentBean.java. > > The problem code is RPC.getProxy(RPCSegmentBean.class, > FetchedSegmen

add parse-wml plugin to Nutch!

2009-11-26 Thread yangfeng
hi, i have to add parse-wml plugin to Nutch, if it has been finished,pls give me some advise. Tks!

Re: newbie questions

2009-12-07 Thread yangfeng
you should add property below: hadoop.job.ugi rider,iamsolomon it's ok! 2009/12/1 Mischa Tuffield > Hello Brian, > > Getting a response from another newbie here, so I could be wrong (do excuse > if I am). > > If you are attempting to run a search index from the filesystem yo

Re: recrawl.sh stopped at depth 7/10 without error

2009-12-07 Thread yangfeng
I sill want to know the reason. 2009/12/2 BELLINI ADAM > > hi, > > anay idea guys ?? > > > > thanx > > > From: mbel...@msn.com > > To: nutch-user@lucene.apache.org > > Subject: RE: recrawl.sh stopped at depth 7/10 without error > > Date: Fri, 27 Nov 2009 20:11:12 + > > > > > > > > hi, > > >

Re: How to successfully crawl and index office 2007 documents in Nutch 1.0

2009-12-07 Thread yangfeng
docx should be parsed,A plugin can be used to Parsed docx file. you get some help info from parse-html plugin and so on. 2009/12/4 Rupesh Mankar > Hi, > > I am new to Nutch. I want to crawl and search office 2007 documents (.docx, > .pptx etc) from Nutch. But when I try to crawl, crawler throws

Nutch 1.0 wml plugin

2009-12-07 Thread yangfeng
I have completed the plugin for parsing the wml(wiredless mark language). I hope to add it to lucene, what i do?