Thank you very much, Christoph; this tutorial was very useful. I managed to do what I was trying to do, but I am changing my approach a little: I am going to transfer all the files to the server and run the indexer locally. The problem of everything being indexed still isn't resolved. It comes down to the depth of the search, but it is difficult to know the number of sub-folders to index. Thank you.
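One rough way to find the number of sub-folder levels is to walk the directory tree and record the deepest level reached. The sketch below is only an illustration in Python; the function name max_subfolder_depth is hypothetical, and treating the result as a starting point for the crawl command's -depth value is an assumption that should be checked against a small test crawl.

```python
import os


def max_subfolder_depth(root):
    """Return how many directory levels exist below root (0 if it has no sub-folders)."""
    root = os.path.abspath(root)
    # Number of path separators in the root itself, used as the zero point.
    base = root.rstrip(os.sep).count(os.sep)
    deepest = 0
    for dirpath, _dirnames, _filenames in os.walk(root):
        # Depth of this directory relative to root.
        depth = dirpath.rstrip(os.sep).count(os.sep) - base
        deepest = max(deepest, depth)
    return deepest


# Example (path taken from urls.txt in the quoted message):
# print(max_subfolder_depth("/export/home/vmf"))
```

Directories that cannot be read are skipped silently by os.walk, so the result can underestimate the real depth on a shared intranet file system.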
Valmir

On 9/19/05, Valmir Macário <[EMAIL PROTECTED]> wrote:
>
> Alexander, Christoph and All
>
> When I ran the crawl command, it gave this error:
>
> 050919 092356 impl: point=org.apache.nutch.searcher.QueryFilter class=org.apache.nutch.searcher.site.SiteQueryFilter
> 050919 092356 parsing: /files/home/vmf/nutch-0.7/plugins/query-url/plugin.xml
> 050919 092356 impl: point=org.apache.nutch.searcher.QueryFilter class=org.apache.nutch.searcher.url.URLQueryFilter
> 050919 092356 not including: /files/home/vmf/nutch-0.7/plugins/urlfilter-regex
> 050919 092356 not including: /files/home/vmf/nutch-0.7/plugins/urlfilter-prefix
> Exception in thread "main" java.lang.ExceptionInInitializerError
>     at org.apache.nutch.db.WebDBInjector.addPage(WebDBInjector.java:437)
>     at org.apache.nutch.db.WebDBInjector.injectURLFile(WebDBInjector.java:378)
>     at org.apache.nutch.db.WebDBInjector.main(WebDBInjector.java:535)
>     at org.apache.nutch.tools.CrawlTool.main(CrawlTool.java:134)
> Caused by: java.lang.RuntimeException: org.apache.nutch.net.URLFilter not found.
>     at org.apache.nutch.net.URLFilters.<clinit>(URLFilters.java:44)
>     ... 4 more
>
> I fixed it by putting this in nutch-site.xml:
>
> <property>
>   <name>plugin.includes</name>
>   <value>protocol-file|protocol-http|parse-(text|html|msword|pdf)|index-basic|query-(basic|site|url)|urlfilter-regex</value>
> </property>
>
> My urls.txt file is: file:/export/home/vmf
>
> but it is indexing everything beyond the home directory.
>
> How can I index another account on the intranet?
>
> I tried putting the IP in crawl-urlfilter.txt, but without success.
>
> Can someone give me a suggestion, please?
>
> Thanks, Valmir
>
>
> On 9/16/05, Valmir Macário <[EMAIL PROTECTED]> wrote:
> >
> > Hi all,
> >
> > I'm using Solaris and trying to index my local system. I followed all
> > the steps in the FAQ but still haven't had success. Is the FAQ missing
> > a step, or is something in it wrong? I would appreciate it if someone
> > could help me; my objective is to index a local system on an intranet.
> > Thanks
