Hi all,
I am building a search interface where the user looks for product
descriptions that fit his or her specs. One potential spec is, for
instance, length. The product description holds two fields:
length_from = 1
and
length_to = 10
If the query value is 5 the search should be successful. Here
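The matching rule described in this post (a query value falls between length_from and length_to) could be sketched in plain Java as follows. This is only an illustration: the class name LengthRange is hypothetical, and treating both ends as inclusive is an assumption, since the post does not say how the boundaries should behave.

```java
// Hypothetical helper; the field names mirror length_from / length_to from the post.
final class LengthRange {
    final int lengthFrom;
    final int lengthTo;

    LengthRange(int lengthFrom, int lengthTo) {
        if (lengthFrom > lengthTo) {
            throw new IllegalArgumentException("lengthFrom must not exceed lengthTo");
        }
        this.lengthFrom = lengthFrom;
        this.lengthTo = lengthTo;
    }

    // Assumption: both boundaries are inclusive.
    boolean matches(int queryValue) {
        return lengthFrom <= queryValue && queryValue <= lengthTo;
    }
}
```

With length_from = 1 and length_to = 10, `new LengthRange(1, 10).matches(5)` evaluates to true.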
Not sure that I get everything:
In the framework that we have built we use a 'simple' object mapping
that connects a database table with an object and implicitly with a
cache. It is built on top of JDBC.
The key fields of the database are used to create a DbKey element, a
simple array of
In my Clipper days I could build an index on English words using a
technique that was called soundex. Searching in that index resulted in
hits of words that sounded the same. From what I remember this technique
only worked for English. Has it ever been generalized?
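For reference, the classic American Soundex that the post recalls can be sketched like this. It is a minimal version written from the commonly published rules, not the Clipper implementation; the class name Soundex and the handling of non A-Z characters (they are skipped) are assumptions. The letter groups are tuned to English pronunciation, which is why the codes do not carry over well to other languages.

```java
import java.util.Locale;

final class Soundex {
    // Codes for 'A'..'Z': '0' marks vowels and Y, '-' marks H and W.
    private static final String CODES = "0123012-02245501262301-202";

    static String encode(String word) {
        StringBuilder out = new StringBuilder();
        char lastCode = '0';
        for (char raw : word.toUpperCase(Locale.ROOT).toCharArray()) {
            if (raw < 'A' || raw > 'Z') {
                continue; // assumption: anything outside A-Z is ignored
            }
            char code = CODES.charAt(raw - 'A');
            if (out.length() == 0) {
                out.append(raw); // the first letter is kept as-is
                lastCode = code;
                continue;
            }
            if (code == '-') {
                continue; // H and W do not separate letters with equal codes
            }
            if (code != '0' && code != lastCode) {
                out.append(code); // new consonant group
            }
            lastCode = code; // vowels reset the group, so a repeat is coded again
            if (out.length() == 4) {
                break;
            }
        }
        if (out.length() == 0) {
            return "";
        }
        while (out.length() < 4) {
            out.append('0'); // pad short codes with zeros
        }
        return out.toString();
    }
}
```

Both "Robert" and "Rupert" encode to R163, which is the "sounds the same" behaviour described above.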
What I am trying to solve is
Morus Walter wrote:
Unfortunately the website seems to be down.
Do you have the URL? The sources are of course very welcome as well.
Cheers,
Aad
-
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail:
Thanks for the reference to Metaphone et al. This is the direction I am
looking for. What I don't get is why so much of the 'knowledge' of these
algorithms is stored in the 'process'. I guess it has to be performance.
cheers,
Aad
Jonathan O'Connor wrote:
Aad,
Are you trying to check the spelling of English words by Dutch children?
Uh no, I am trying to correct the spelling of Dutch words by Dutch
children who, as most children do, make phonetic spelling mistakes.
Yep,
This is how we do it.
We have a search.xml that maps database fields to search fields and a
parameter part that describes the 'click for detailed result url' and
the parameter names (based on the search fields). In this xml we also
describe how the different fields should be stored. We have
If that is a general thought then I will plan for some time to put this
in action.
Cheers,
Aad
David Spencer wrote:
Nice, very similar to what I was thinking of, where the most
significant difference is probably just that I was thinking of a batch
indexer, not one embedded in a web container.
Nope,
it is very possible. We have an index that holds the search info for
documents, messages in discussion threads, filled in forms etc. etc.
each having their own structure.
cheers,
Aad
Karl Koch wrote:
Hello all,
perhaps not such a sophisticated question:
I would like to have a very
Sorry if this is a mispost but I have been visiting Amazon daily the
last few weeks and I still can't get the Lucene book there. How will I
survive the holidays? :-)
But seriously when can we expect the release?
Thanks, but it's the text I am interested in. And the local bookshops
don't carry the book yet.
Hi,
From: Aad Nales [mailto:[EMAIL PROTECTED]
Sorry if this is a mispost but I have been visiting Amazon
daily the last few weeks and I still can't get the Lucene
book there. How will I
James,
How do you kick off your reindex? Could it be a session timeout?
cheers,
Aad
Hello,
I am a Java/Lucene/Tomcat newbie. I know that does not bode well as a
start to a post, but I really am in dire straits as far as Lucene goes,
so bear with me. I am working on indexing and replacing
Paul,
We are doing similar stuff. We actually do create a hash of database
name, table name and id to form a unique id. So far I have not had any
problems with it.
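A minimal sketch of such an id, assuming SHA-256 as the hash and a NUL character as the separator between the three parts. The post does not say which hash function or encoding is actually used, so both are assumptions, and the class name UniqueId is hypothetical.

```java
import java.nio.charset.StandardCharsets;
import java.security.MessageDigest;
import java.security.NoSuchAlgorithmException;

final class UniqueId {
    // Builds one id out of database name, table name and row id.
    static String of(String database, String table, String id) {
        // The separator keeps ("ab", "c") and ("a", "bc") from colliding.
        String joined = database + "\0" + table + "\0" + id;
        try {
            byte[] digest = MessageDigest.getInstance("SHA-256")
                    .digest(joined.getBytes(StandardCharsets.UTF_8));
            StringBuilder hex = new StringBuilder(digest.length * 2);
            for (byte b : digest) {
                hex.append(String.format("%02x", b));
            }
            return hex.toString();
        } catch (NoSuchAlgorithmException e) {
            throw new IllegalStateException("SHA-256 not available", e);
        }
    }
}
```

The resulting string can be stored as an untokenized key field in the index, so a document can later be found or deleted by that single value.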
Cheers,
Aad
Hi,
I'm creating an index from several database tables. Every item within
every table has a unique id which is saved
Steve,
The behavior that you describe is as expected. I have tackled a similar
problem to yours by creating a proxy object that acts as a gatekeeper to
all IndexReader, IndexSearcher and IndexWriter operations. With fully
synchronized access to all methods of the proxy you will not run into
any
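The gatekeeper idea from this post might look roughly like the sketch below. It is deliberately written without Lucene types: T stands in for whatever index handle is being guarded, and the names IndexGate and withIndex are hypothetical, not taken from the poster's code.

```java
import java.util.function.Function;

// Hypothetical gatekeeper; T stands in for the guarded index handle.
final class IndexGate<T> {
    private final T index;

    IndexGate(T index) {
        this.index = index;
    }

    // Every read or write goes through this one synchronized method,
    // so no two operations touch the index at the same time.
    synchronized <R> R withIndex(Function<T, R> operation) {
        return operation.apply(index);
    }
}
```

Note that this serializes searches as well as updates. That is the price of the "fully synchronized" approach described above; it trades concurrency for simplicity.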
that responded to my original mail,
your great suggestions have greatly improved the user and technical
performance of my solution.
--
Aad Nales
is
restarted or something similar? If not, is there any way to figure out
where the lock came from? E.g. a timestamp as part of the filename?
BTW: thanks for the earlier responses, I have a working solution now.
Aad
--
Aad Nales
[EMAIL PROTECTED], +31-(0)6 54 207 340
--- Aad Nales [EMAIL PROTECTED] wrote:
Hi,
From earlier postings I understand that when Lucene opens an
IndexWriter or executes a number of specific methods on an IndexReader,
a lock is created in .../tomcat/temp. My question is: are these files
ever deleted after something has gone wrong
when I open a writer?
2. Can I have one or multiple readers open when I open a writer?
And if not: I am writing an application that does regular updates on the
index. What kind of strategy could you advise? Should I use
ResourcePooling at all?
TIA,
Aad Nales
--
Aad Nales
[EMAIL PROTECTED], +31
the next query is fired?
2. the same but now during a search?
Thanks for any insights,
Aad
--
Aad Nales
[EMAIL PROTECTED], +31-(0)6 54 207 340
Also,
you can use an alternative spellchecker for the 'checking part' and
use the Ngram algorithm for the 'suggestion' part. Only if the spell
'check' declares a word illegal would the 'suggestion' part perform its
magic.
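The two-stage idea could be sketched as follows. A plain dictionary lookup stands in for the alternative spellchecker, and candidates are ranked by the number of shared letter bigrams. This is a simplified stand-in for illustration, not the NGramSpeller code discussed in this thread; the class and method names are hypothetical.

```java
import java.util.ArrayList;
import java.util.Collection;
import java.util.Collections;
import java.util.HashMap;
import java.util.HashSet;
import java.util.List;
import java.util.Map;
import java.util.Set;
import java.util.TreeSet;

final class TwoStageSpeller {
    private final Set<String> dictionary;

    TwoStageSpeller(Collection<String> words) {
        this.dictionary = new TreeSet<>(words);
    }

    // Stage 1, the 'check': a dictionary lookup stands in for a real spellchecker.
    boolean isLegal(String word) {
        return dictionary.contains(word);
    }

    // Stage 2, the 'suggestion': only runs when the check rejects the word.
    List<String> suggest(String word, int max) {
        if (isLegal(word)) {
            return Collections.emptyList();
        }
        Set<String> grams = ngrams(word, 2);
        Map<String, Integer> score = new HashMap<>();
        List<String> candidates = new ArrayList<>();
        for (String entry : dictionary) {
            Set<String> shared = ngrams(entry, 2);
            shared.retainAll(grams);
            if (!shared.isEmpty()) {
                score.put(entry, shared.size());
                candidates.add(entry);
            }
        }
        // Most shared bigrams first; ties are broken alphabetically.
        candidates.sort((a, b) -> {
            int d = score.get(b) - score.get(a);
            return d != 0 ? d : a.compareTo(b);
        });
        return new ArrayList<>(candidates.subList(0, Math.min(max, candidates.size())));
    }

    // All distinct substrings of length n; empty for words shorter than n.
    static Set<String> ngrams(String w, int n) {
        Set<String> out = new HashSet<>();
        for (int i = 0; i + n <= w.length(); i++) {
            out.add(w.substring(i, i + n));
        }
        return out;
    }
}
```

A correctly spelled word never reaches the n-gram stage, so the suggestion cost is only paid for words the checker rejects.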
cheers,
Aad
Doug Cutting wrote:
David Spencer wrote:
[1] The
David,
Perhaps I misunderstand something, so please correct me if I do. I used
http://www.searchmorph.com/kat/spell.jsp to look for 'conts' without
changing any of the default values. What I got as results did not
include 'const' which has quite a high frequency in your index and
should have a
likely ;-)
2. something in the code..
-Original Message-
From: Andrzej Bialecki [mailto:[EMAIL PROTECTED]
Sent: Wednesday, 15 September, 2004 12:23
To: Lucene Users List
Subject: Re: NGramSpeller contribution -- Re: combining open office
spellchecker with Lucene
Aad Nales wrote:
David
based on the spellchecker of OpenOffice. My question is: has anybody
tried this before?
Cheers,
Aad
--
Aad Nales
[EMAIL PROTECTED], +31-(0)6 54 207 340