Re: Jira - Nutch 48 - did you mean patch

2005-10-31 Thread Zaheed Haque
I just tried http://slashdot.org/search.pl?query=slashdt and it doesn't work! Or maybe the URL above is not correct? Cheers Zaheed On 10/31/05, Byron Miller [EMAIL PROTECTED] wrote: I got this to work this evening.. was a problem with patch on the system I was working on.. feel free to check it

Re: Jira - Nutch 48 - did you mean patch

2005-10-31 Thread Byron Miller
Brainfart, I meant mozdex.com, using slashdot.org as an example: http://www.mozdex.com/search.jsp?query=slashdt Try that one. --- Zaheed Haque [EMAIL PROTECTED] wrote: I just tried http://slashdot.org/search.pl?query=slashdt doesn't work! or maybe the URL above is not correct? Cheers

RE: Jira - Nutch 48 - did you mean patch

2005-10-31 Thread Rajan, Renuka
Hello all, I am a newbie to Nutch and Lucene and am experimenting with this combination to 'scrape' web pages. To this end, I need to use regular expressions in combination with Lucene to search the pages fetched by Nutch. Does Lucene support regular expressions? The book 'Lucene in Action'

Re: mapred error on windows

2005-10-31 Thread Doug Cutting
It looks like you are using ndfs but not running any datanodes. An ndfs filesystem requires one namenode and at least one datanode, typically a large number running on different machines. Look at the bin/start-all.sh script for an example of what is started in a typical mapred/ndfs

Lucene basic Document fields used by Nutch

2005-10-31 Thread Bruno Patini Furtado
Hi, I've looked into org.apache.nutch.indexer.basic.BasicIndexingFilter and saw that the fields indexed into the Lucene index by Nutch are: host, site, url, content, anchors and title. Of these, the field content is always used. But with this simple code using the Lucene API I couldn't retrieve any

Re: Jira - Nutch 48 - did you mean patch

2005-10-31 Thread Erik Hatcher
No, Lucene does not have a built-in query that uses regular expressions. It's trivial to write a custom Query class like WildcardQuery that does regular expression searching. In fact, I've created this and am contributing it to Lucene as soon as I can (slowly but surely). As for how
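For readers following the regular-expression question, here is a minimal sketch of the general idea behind a WildcardQuery-style query: walk the terms in the index, keep the ones that match the pattern, and search for those. This is not the class mentioned in the message above and it does not use the Lucene API at all. The class name RegexTermSketch, the method matchingTerms, and the sample term list are hypothetical and exist only for illustration; a plain list of strings stands in for the index's term dictionary.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;
import java.util.regex.Pattern;

// Hypothetical illustration only: a list of strings stands in for the
// term dictionary of an index. No Lucene classes are used here.
public class RegexTermSketch {

    // Returns the terms that match the regular expression in full.
    // A real query class would then search for documents containing
    // any of the returned terms.
    public static List<String> matchingTerms(List<String> indexTerms, String regex) {
        Pattern pattern = Pattern.compile(regex);
        List<String> hits = new ArrayList<>();
        for (String term : indexTerms) {
            // matches() requires the whole term to match, not a substring
            if (pattern.matcher(term).matches()) {
                hits.add(term);
            }
        }
        return hits;
    }

    public static void main(String[] args) {
        // Assumed sample terms, not taken from any real index
        List<String> terms = Arrays.asList("nutch", "lucene", "lucid", "search");
        System.out.println(matchingTerms(terms, "luc.*"));
    }
}
```

Matching against whole terms rather than whole documents is the design point: the pattern is applied to each indexed term, so it cannot span several words of the original page text.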