Re: [Neo4j] Bug: LuceneFullTextQueryIndex service ignoring last word/term

2010-09-16 Thread Mattias Persson
I previously wrote that you could override LuceneFulltextIndexService and assign your Analyzer there, but now I see that it can't be done there... it's on a lower level. So as Peter pointed out, one option would be to go with the new index framework where you can specify an analyzer at index

Re: [Neo4j] Bug: LuceneFullTextQueryIndex service ignoring last word/term

2010-09-16 Thread Mattias Persson
Or, a slightly uglier solution: if you are going to use that analyzer for all your fulltext indexing needs then you could modify the source of the index component (very small patch), build a jar and use that instead of the standard one. I attached an example patch (for neo4j-index 1.1). 2010/9/16

Re: [Neo4j] Bug: LuceneFullTextQueryIndex service ignoring last word/term

2010-09-16 Thread Peter Neubauer
Mattias, no patch coming through ... Cheers, /peter neubauer VP Product Development, Neo Technology GTalk:      neubauer.peter Skype       peter.neubauer Phone       +46 704 106975 LinkedIn   http://www.linkedin.com/in/neubauer Twitter      http://twitter.com/peterneubauer

Re: [Neo4j] Bug: LuceneFullTextQueryIndex service ignoring last word/term

2010-09-16 Thread Mattias Persson
Oh, I forgot... attachments have a hard time survive the trip to the mailing list: Index: src/main/java/org/neo4j/index/lucene/LuceneFulltextDataSource.java === --- src/main/java/org/neo4j/index/lucene/LuceneFulltextDataSource.java

[Neo4j] Bug: LuceneFullTextQueryIndex service ignoring last word/term

2010-09-15 Thread rick.bullotta
I've noticed that when indexing full text, the last term/word is always ignored. This is a major issue, but I'm not sure if it is in the index utils or in Lucene itself. Any thoughts? Thanks, Rick ___ Neo4j mailing list

Re: [Neo4j] Bug: LuceneFullTextQueryIndex service ignoring last word/term

2010-09-15 Thread Mattias Persson
That sounds weird. Look at TestLuceneFulltextIndexService#testSimpleFulltext method, it queries for the last word and it seems to work. Could you provide more info on this? 2010/9/15 rick.bullo...@burningskysoftware.com I've noticed that when indexing full text, the last term/word is always

Re: [Neo4j] Bug: LuceneFullTextQueryIndex service ignoring last word/term

2010-09-15 Thread rick.bullotta
Using neo4j-index-1.1 and lucene-core-2.9.2, by the way. Original Message Subject: Re: [Neo4j] Bug: LuceneFullTextQueryIndex service ignoring last word/term From: Mattias Persson [1]matt...@neotechnology.com Date: Wed, September 15, 2010 10:37 am

Re: [Neo4j] Bug: LuceneFullTextQueryIndex service ignoring last word/term

2010-09-15 Thread Mattias Persson
an Analyzer/Tokenizer which gets rid of such punctuation characters? 2010/9/15 rick.bullo...@burningskysoftware.com Using neo4j-index-1.1 and lucene-core-2.9.2, by the way. Original Message Subject: Re: [Neo4j] Bug: LuceneFullTextQueryIndex service ignoring last word/term

Re: [Neo4j] Bug: LuceneFullTextQueryIndex service ignoring last word/term

2010-09-15 Thread rick.bullotta
of a custom tokenizer/analyzer we could start from? Thanks, Rick Original Message Subject: Re: [Neo4j] Bug: LuceneFullTextQueryIndex service ignoring last word/term From: Mattias Persson [1]matt...@neotechnology.com Date: Wed, September 15, 2010 11:47 am

Re: [Neo4j] Bug: LuceneFullTextQueryIndex service ignoring last word/term

2010-09-15 Thread Morten Barklund
example code of a custom tokenizer/analyzer we could start from? Thanks, Rick Original Message Subject: Re: [Neo4j] Bug: LuceneFullTextQueryIndex service ignoring last word/term From: Mattias Persson [1]matt...@neotechnology.com Date: Wed, September 15

Re: [Neo4j] Bug: LuceneFullTextQueryIndex service ignoring last word/term

2010-09-15 Thread rick.bullotta
how use a specific analyzer instead of the default one... Thanks, Rick Original Message Subject: Re: [Neo4j] Bug: LuceneFullTextQueryIndex service ignoring last word/term From: Morten Barklund [1]mor...@barklund.dk Date: Wed, September 15, 2010 12:29

Re: [Neo4j] Bug: LuceneFullTextQueryIndex service ignoring last word/term

2010-09-15 Thread Toby Matejovsky
Subject: Re: [Neo4j] Bug: LuceneFullTextQueryIndex service ignoring last word/term From: Mattias Persson [1]matt...@neotechnology.com Date: Wed, September 15, 2010 11:47 am To: Neo4j user discussions [2]u...@lists.neo4j.org Couldn't it be that sentences ends with a dot... so

Re: [Neo4j] Bug: LuceneFullTextQueryIndex service ignoring last word/term

2010-09-15 Thread Rick Bullotta
of the default one... Thanks, Rick Original Message Subject: Re: [Neo4j] Bug: LuceneFullTextQueryIndex service ignoring last word/term From: Morten Barklund [1]mor...@barklund.dk Date: Wed, September 15, 2010 12:29 pm To: Neo4j user discussions [2]u

Re: [Neo4j] Bug: LuceneFullTextQueryIndex service ignoring last word/term

2010-09-15 Thread Rick Bullotta
2:23 PM To: 'Neo4j user discussions' Subject: Re: [Neo4j] Bug: LuceneFullTextQueryIndex service ignoring last word/term Actually, I ended up coming with a workaround that involved using HTMLStripReader/HTMLStripCharFilter for pre-parsing the text before passing it into the neo .index(node,key,value

Re: [Neo4j] Bug: LuceneFullTextQueryIndex service ignoring last word/term

2010-09-15 Thread Toby Matejovsky
Message- From: user-boun...@lists.neo4j.org [mailto:user-boun...@lists.neo4j.org] On Behalf Of Rick Bullotta Sent: Wednesday, September 15, 2010 2:44 PM To: 'Neo4j user discussions' Subject: Re: [Neo4j] Bug: LuceneFullTextQueryIndex service ignoring last word/term Well, I have