On 7/29/06, Sami Siren <[EMAIL PROTECTED]> wrote:
Not expert on this area but perhaps you need to upgrade lucene .jar
files that are used by luke?
I believe I was a little bit hasty with the message I sent. I took a
second look and it just might be that luke was right and the index is
invalid -
Not expert on this area but perhaps you need to upgrade lucene .jar
files that are used by luke?
--
Sami Siren
Tomi NA wrote:
I successfully used luke with indexes created with nutch 0.7.2.
I tried the same with nutch 0.8, but luke sees it as a corrupt index.
Should this be happening?
I know
please try
bin/nutch invertlinks crawl/linkdb -dir crawl/segments/
--
Sami Siren
Alexander E Genaud wrote:
Hello, I am receiving an IOException when running a Whole web crawl
via cygwin. Interestingly (to me at least), the error reads:
/crawl/segments/parse_data
rather than
/crawl
I successfully used luke with indexes created with nutch 0.7.2.
I tried the same with nutch 0.8, but luke sees it as a corrupt index.
Should this be happening?
I know this isn't the luke mailing list, but the information will
still be usefull to people using nutch.
Thanks,
t.n.a.
Hi, Matthew
I think we should use fieldName instead of field, or not...
===stemming code begin===
public TokenStream tokenStream(String field, Reader reader) {
Analyzer analyzer;
if ("anchor".equals(field)) {
analyzer = ANCHOR_ANALYZER;
}
e
Hello, I am receiving an IOException when running a Whole web crawl
via cygwin. Interestingly (to me at least), the error reads:
/crawl/segments/parse_data
rather than
/crawl/segments/20060729123456/parse_data
$ nutch-0.8/bin/nutch invertlinks crawl/linkdb crawl/segments
Exception in