Hi Guys
Apologies...
I have several MERGERINDEXES [ MGR1,MGR2,MGR3].
for searching across these MERGERINDEXES I use the following Code
IndexSearcher[] indexToSearch = new IndexSearcher[CNTINDXDBOOK];
for(int all=0;allCNTINDXDBOOK;all++){
indexToSearch[all] = new
Hi!
What is the simplest way to add synonyms for AND/OR/NOT operators?
I'd like to support two sets of operator words, so people can use either the
original english
operators and my custom ones for our local language.
Thank you for your attention!
Sanyi
As obvious as it may seem, you could always store the index ID in which
you are indexing the document in the document itself and have that
fetched with the search results, or is there something stopping you from
doing that.
Nader Henein
Karthik N S wrote:
Hi Guys
Apologies...
I
On Tuesday 21 December 2004 05:49, aurora wrote:
I'm testing the rebuilding of the index. I add several hundred documents,
optimize and add another few hundred and so on. Right now I have around
7000 files. I observed after the index gets to certain size. Everytime
after optimize, the
Karthik,
On Tuesday 21 December 2004 09:04, Karthik N S wrote:
Hi Guys
Apologies...
I have several MERGERINDEXES [ MGR1,MGR2,MGR3].
for searching across these MERGERINDEXES I use the following Code
IndexSearcher[] indexToSearch = new IndexSearcher[CNTINDXDBOOK];
On Dec 21, 2004, at 3:04 AM, Sanyi wrote:
What is the simplest way to add synonyms for AND/OR/NOT operators?
I'd like to support two sets of operator words, so people can use
either the original english
operators and my custom ones for our local language.
There are two options that I know of: 1)
Hi!
I think we're talking about different things.
My question is about using synonyms for AND/OR/NOT operators, not about
synonyms of words in the
index.
For example, in some language: AND = AANNDD; OR = OORR; NOT = NNOOTT
So, the user can enter:
(cat OR kitty) AND black AND tail
and either:
Erik Hatcher writes:
On Dec 21, 2004, at 3:04 AM, Sanyi wrote:
What is the simplest way to add synonyms for AND/OR/NOT operators?
I'd like to support two sets of operator words, so people can use
either the original english
operators and my custom ones for our local language.
There
Wow, I really did misunderstand. My apologies.
Yes, you will need to fork QueryParser.jj and install JavaCC to build
your custom parser. It should be pretty trivial to add alternatives to
AND(+)/OR/NOT(-).
Erik
On Dec 21, 2004, at 4:42 AM, Sanyi wrote:
Hi!
I think we're talking about
Hi !
Have two applications. Both are supposed
to write Lucene index files and the WebApplication is supposed to read
these index files.
Here are the questions:
1. Can two applications write index files, in the same directory, at the same
time ?
2. If two applications cannot write index
Gururaja H wrote:
Hi !
Have two applications. Both are supposed
to write Lucene index files and the WebApplication is supposed to read
these index files.
Here are the questions:
1. Can two applications write index files, in the same directory, at the same time ?
if you implement the
On Dec 21, 2004, at 5:51 AM, Gururaja H wrote:
1. Can two applications write index files, in the same directory, at
the same time ?
If you mean to the same Lucene index, the answer is no. Only a single
IndexWriter instance may be writing to an index at one time.
2. If two applications cannot
Well, I guess I'd better recognize and replace the operator synonyms to their
original format
before passing them to QueryParser. I don't feel comfortable tampering with
Lucene's source code.
Anyway, thanx for the answers.
Sanyi
--- Morus Walter [EMAIL PROTECTED] wrote:
Erik Hatcher writes:
Sanyi writes:
Well, I guess I'd better recognize and replace the operator synonyms to their
original format
before passing them to QueryParser. I don't feel comfortable tampering with
Lucene's source code.
Apart from knowing how to compile lucene (including the javacc code
generation) you
Another possibility is that you are using an older version of Lucene,
which was known to have a bug with similar symptoms. Get the latest
version of Lucene.
You shouldn't really have multiple .cfs files after optimizing your
index. Also, optimize only at the end, if you care about indexing
I sent this mail yesterday but had no luck in receiving responses. Trying it
again .
Hi all,
I am getting null pointer exception when I am sorting on a field that has null
value for some documents. Order by in sql does work on such fields and I
think it puts all results with null
I read a lot of messages that Lucene can index a DB because it use that
INPUTSTREAM type
I don't understand how to do this. For example if I've a forum with
Mysql and a lot of files on my web, for every search I've to select the
index that I want use in my search, true? But I don't know how to
I want to be able to use stopwords in exact phrase searches. I have
looked at Nutch and used the same approach (replace common words with
n-grams. Look at net.nutch.analysis.CommonGrams).
So if to,be,or and not are stop words, for the string to be
or not to be, the analyzer produces the
On Dec 21, 2004, at 10:39 AM, Daniel Cortes wrote:
I read a lot of messages that Lucene can index a DB because it use
that INPUTSTREAM type
Where have you read that? This is incorrect.
I don't understand how to do this. For example if I've a forum with
Mysql and a lot of files on my web, for
On Dec 21, 2004, at 10:41 AM, Ravi wrote:
I want to be able to use stopwords in exact phrase searches. I have
looked at Nutch and used the same approach (replace common words with
n-grams. Look at net.nutch.analysis.CommonGrams).
So if to,be,or and not are stop words, for the string to be
or
Depending on what you are doing, there are some problems with
MultiSearcher. See
http://issues.apache.org/bugzilla/show_bug.cgi?id=31841 for a
description of the issues and possible patch(es) to fix.
Chuck
-Original Message-
From: Erik Hatcher [mailto:[EMAIL PROTECTED]
Sent:
Hello
I'll just paste the relevant MySQL code, you add the calls to it per
your needs..it has no checking of anything so better add that as well...
It's possible I didnt copy/paste everything but you should get the idea
where this is going...
-pedja
--
import
Thanks for the heads up. I'm using Lucene 1.4.2.
I tried to do optimize() again but it has no effect. Adding a just tiny
dummy document would get rid of it.
I'm doing optimize every few hundred documents because I tried to simulate
incremental update. This lead to another question I would
Right now I am incrementally adding about 100 documents to the index a day
and then optimize after that. I find that optimize essentially rebuilding
the entire index into a single file. So the size of disk write is
proportion to the total index size, not to the size of documents
Are you also using the position increment of 0 for the gram tokens
like Nutch does?
Yes.
I don't think considering only gram tokens will work for me because
Nutch uses only bi-grams. It can only have one gram per token. In my
case I have more than one and even if I get only the grams, I still
Hello,
I think some of these questions my be answered in the jGuru FAQ
So my question is would it be an overkill to optimize everyday?
Only if lots of documents are being added/deleted, and you end up with
a lot of index segments.
Is
there
any guideline on how often to optimize?
Hi Folks,
I am pleased to announce the availability of dotLucene 1.4.3 RC1 build-001
This is the first Release Candidate release of version 1.4.3 of Jakarta
Lucene ported to C# and is intended to be Final.
Please visit http://www.sourceforge.net/projects/dotlucene/ to learn more
about dotLucene
27 matches
Mail list logo