problem in crawling......

2006-08-22 Thread Abdelhakim Diab
Hi all: What can I do if I were crawling a big list of sites and suddenly the crawler stopped for any problem ? must I return the whole process or I can continue from the point I stopped at ? Thanks in advance ...

Re: ö ü ä! German language

2006-08-22 Thread dee
maybe its a configuration problem with your tomcat or whatever you use. had the same problem after configuring my $TOMCAT/conf/server.xml. Locate the Connector tag for the web (look for 8080) and insert this parameter assignment: URIEncoding=UTF-8 as explained in Tomcat 5 FAQ at [WWW]

differ search in filesystem or webpages

2006-08-22 Thread David Podunavac
I wonder if i need 2 index directories. One for searching the internet and another one for the filesystem and what do i need to modify in order to tell the nutch-webinterface where to look for the search term? any help appreciated regards david

Re: ö ü ä! German language

2006-08-22 Thread Frank Kempf
Make sure your index.jsp and search.jsp output ISO-8859-1 content when doing a German search. See http://oe247.de and do a search for ÖNetz. Regards Frank

Re: problem in crawling......

2006-08-22 Thread Dennis Kubes
Unfortunately you have to start over. We started breaking our crawls into 100K to 500K runs because of this. Dennis Abdelhakim Diab wrote: Hi all: What can I do if I were crawling a big list of sites and suddenly the crawler stopped for any problem ? must I return the whole process or I can

Re: index/search filtering by category

2006-08-22 Thread Ernesto De Santis
Hi All Please, some body can answer my questions? I'm a nutch beginner, I hope that my questions/doubts are easy... ;) Or if my email is wrong, tell me. Or confirm me if I'm in the right way. Thanks a lot! Ernesto. Ernesto De Santis escribió: Hi I'm new in nutch, start yesterday. But I have

Re: index/search filtering by category

2006-08-22 Thread Lourival Júnior
Hi Ernesto! I know what you mean. Sometimes I get no answers too. Unfortunately, I'm new in nutch and lucene and I can't help you. Continue trying, the comunity will help you :). On 8/22/06, Ernesto De Santis [EMAIL PROTECTED] wrote: Hi All Please, some body can answer my questions? I'm a

log4j:WARN Please initialize the log4j system properly.

2006-08-22 Thread Chris Stephens
How do I get rid of this error? How do I initialize the log4j system properly? log4j:WARN No appenders could be found for logger (org.apache.catalina.startup.Embedded). log4j:WARN Please initialize the log4j system properly.

Re: index/search filtering by category

2006-08-22 Thread Chris Stephens
You can't do it unless you write a plugin to parse a custom meta tag called category. I'm trying to do something like this now, but the plugin documentation is horrible. Lourival Júnior wrote: Hi Ernesto! I know what you mean. Sometimes I get no answers too. Unfortunately, I'm new in

Re: log4j:WARN Please initialize the log4j system properly.

2006-08-22 Thread Dennis Kubes
its looking for the ${hadoop.log.dir} variable so you can either set that on the command line or you can change you log4j.properties file the conf like I did: log4j.rootLogger=INFO,console log4j.appender.console=org.apache.log4j.ConsoleAppender log4j.appender.console.target=System.err

Re: index/search filtering by category

2006-08-22 Thread Howie Wang
You're right about it being not documented well, but it's actually pretty simple to do. You need to write an indexing filter and a query filter. For the indexing filter, I would copy the index-more plugin, and change names, dirs, and build files appropriately. The main thing you'll change is the