Re: Lucene Unicode Usage

2005-02-10 Thread Andrzej Bialecki
Owen Densmore wrote: I'm building an index from a FileMaker database by dumping the data to a tab-separated file. Because the FileMaker output is encoded in MacRoman, and uses Mac line separators, I run a script across the tab file to clean it up: tr '\r\v' '\n ' | iconv -f MAC -t UTF-8 Thi

Re: sounds like spellcheck [auf Viren geprueft]

2005-02-10 Thread Jonathan O'Connor
Aad, Well at least that's easier. Ciao, Jonathan O'Connor XCOM Dublin Aad Nales <[EMAIL PROTECTED]> 09/02/2005 16:16 Please respond to "Lucene Users List" To Lucene Users List cc Subject Re: sounds like spellcheck [auf Viren geprueft] Jonathan O'Connor wrote: >Aad, >Are you trying t

Re: wildcards, stemming and searching

2005-02-10 Thread Erik Hatcher
How would you deal with a query like "a*z" though? I suspect, however, that you only care about suffix queries and stemming those. If thats the case, then you could subclass getWildcardQuery and do internal stemming (remove trailing wildcard, run it through the analyzer directly there and retur

Re: wildcards, stemming and searching

2005-02-10 Thread aaz
How would you deal with a query like "a*z" though? Yeah I know, a user submitting that is certainly possible. I have no idea. I am starting to think that NOT stemming on indexing might be the safest solution. - Original Message - From: "Erik Hatcher" <[EMAIL PROTECTED]> To: "Lucene User

Re: Problem searching Field.Keyword field

2005-02-10 Thread Luke Shannon
Are there any issues with having a bunch of boolean queries and than adding them to one big boolean queries (making them all required)? Or should I be looking at Query.combine()? Thanks, Luke - Original Message - From: "Erik Hatcher" <[EMAIL PROTECTED]> To: "Lucene Users List" Sent: Tu

Re: Problem searching Field.Keyword field

2005-02-10 Thread Luke Shannon
Are there any issues with having a bunch of boolean queries and than adding them to one big boolean queries (making them all required)? Or should I be looking at Query.combine()? Thanks, Luke - Original Message - From: "Erik Hatcher" <[EMAIL PROTECTED]> To: "Lucene Users List" Sent: Tu

!= queries

2005-02-10 Thread aaz
Hi, I have an index with field "documentNumber". There are 10 documents. One of the documents has documentNumber A5058970 I want to return all matches where documentNumber != A505*. I should get 9 docs back. I construct a query like wq = WildcardQuery("documentNumber","a505*") BooleanQuery bq

Re: != queries

2005-02-10 Thread Miles Barr
On Thu, 2005-02-10 at 11:02 -0700, aaz wrote: > I have an index with field "documentNumber". There are 10 documents. One of > the documents has documentNumber A5058970 > > I want to return all matches where documentNumber != A505*. I should get 9 > docs back. > > I construct a query like > > w

Re: != queries

2005-02-10 Thread aaz
Ok, that makes sense. Any suggestions on how to AND that prohibited clause with a query to get everything? - Original Message - From: "Miles Barr" <[EMAIL PROTECTED]> To: "Lucene Users List" Sent: Thursday, February 10, 2005 11:07 AM Subject: Re: != queries On Thu, 2005-02-10 at 11:02

Re: != queries

2005-02-10 Thread Jason Haruska
If this is a query you need to support often, you could create a field "x" that contains "x" in every document. Then search on that with your prohibited query. If not, you could get the document list by doing your search then removing all of those documents from a complete set outside of lucene.

Re: Problem searching Field.Keyword field

2005-02-10 Thread Paul Elschot
On Thursday 10 February 2005 18:44, Luke Shannon wrote: > Are there any issues with having a bunch of boolean queries and than adding > them to one big boolean queries (making them all required)? The 1.4.3 and earlier BooleanScorer has an out of bounds exception for "More than 32 required/prohibit

Negative Match

2005-02-10 Thread Luke Shannon
I think I found a pretty good way to do a negative match. In this query I am looking for all the Documents that have a kcfileupload field with any value except for jpg. Query negativeMatch = new WildcardQuery(new Term("kcfileupload", "*jpg*")); BooleanQuery typeNegAll = new Boole

new segment for each document

2005-02-10 Thread Ravi
How can I create a new index segment for each new document? The reason for doing this is I want to be able to retrieve a document as soon as it is added. I tried setting the minMergeFactor on the writer to one. But it did not work. Thanks in advance, Ravi. -

Re: new segment for each document

2005-02-10 Thread Daniel Naber
On Thursday 10 February 2005 22:27, Ravi wrote: > I tried setting the minMergeFactor on the writer to one. But > it did not work. I think there's an off-by-one bug so two is the smallest value that works as expected. Regards Daniel -- http://www.danielnaber.de -

Newbie questions

2005-02-10 Thread Paul Jans
Hi, A couple of newbie questions. I've searched the archives and read the Javadoc but I'm still having trouble figuring these out. 1. What's the best way to index and handle queries like the following: Find me all users with (a CS degree and a GPA > 3.0) or (a Math degree and a GPA > 3.5). 2.

Re: new segment for each document

2005-02-10 Thread Doug Cutting
Daniel Naber wrote: On Thursday 10 February 2005 22:27, Ravi wrote: I tried setting the minMergeFactor on the writer to one. But it did not work. I think there's an off-by-one bug so two is the smallest value that works as expected. You can simply create a new IndexWriter for each add and then clo

Re: Negative Match

2005-02-10 Thread Erik Hatcher
On Feb 10, 2005, at 4:06 PM, Luke Shannon wrote: I think I found a pretty good way to do a negative match. In this query I am looking for all the Documents that have a kcfileupload field with any value except for jpg. Query negativeMatch = new WildcardQuery(new Term("kcfileupload", "*jpg

Access Lucene from PHP or Perl

2005-02-10 Thread Andy
Greetings. Can anyone point me to a how-to tutorial on how to access Lucene from a web page generated by PHP pr Perl? I've been looking but couldn't find anything. Thanks a lot. And __ Do You Yahoo!? Tired of spam? Yahoo! Mail has the best spam pr

Multiple Fields with same name

2005-02-10 Thread Ramon Aseniero
Hi All, If I store multiple fields with same name for example “Author” with 3 values “bob,”jane”,”bill” once I retrieve the doc are the values in the same order? Thanks, Ramon -- No virus found in this outgoing message. Checked by AVG Anti-Virus. Version: 7.0.300 / Virus Database: 265.8

BOOLEAN EXCEPTION APPSERVER

2005-02-10 Thread Karthik N S
  Hi guys Apologies...   I am getting this error on  ' Every FIRST SEARCH  after  Startup of the WEBSERVER ' and I have declared the following code only once in the  method of execution <%@ page import="org.apache.lucene.search.BooleanQuery"%>BooleanQuery.setMaxClauseCount(Integer