simultanous search and indexing

2004-10-17 Thread Miro Max
hi,

i'm using servlet to search my index and i wish to be
able to create an index at the same time.

do i have to use threads - i'm beginner

thx






___
Gesendet von Yahoo! Mail - Jetzt mit 100MB Speicher kostenlos - Hier anmelden: 
http://mail.yahoo.de

-
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]



Re: StopWord elimination pls. HELP

2004-10-18 Thread Miro Max
thans for your help

 --- Morus Walter [EMAIL PROTECTED] schrieb: 
 Miro Max writes:
 
  String cont = rs.getString(x);
  d.add(Field.Text(cont, cont));
  writer.addDocument(d);
  
  to get results from a database into lucene index.
 but
  when i check println(d) i can see the german
 stopwords
  too. how can i eliminate this?
  
 Stopwords in an analyzer don't make the stopwords
 disappear from the document,
 they only prevent them from beeing indexed.
 So you will allways see stopwords in the document
 (before indexing and,
 if the field is stored, when the document is
 retrieved from the index).
 
 A meaningful check, if stopwords are recognized,
 would be to search for
 a stopword. You shouldn't find anything...
 
 HTH
   Morus
 

-
 To unsubscribe, e-mail:
 [EMAIL PROTECTED]
 For additional commands, e-mail:
 [EMAIL PROTECTED]
 
  






___
Gesendet von Yahoo! Mail - Jetzt mit 100MB Speicher kostenlos - Hier anmelden: 
http://mail.yahoo.de

-
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]



Re: StopWord elimination pls. HELP

2004-10-18 Thread Miro Max
thans for your help

 --- Morus Walter [EMAIL PROTECTED] schrieb: 
 Miro Max writes:
 
  String cont = rs.getString(x);
  d.add(Field.Text(cont, cont));
  writer.addDocument(d);
  
  to get results from a database into lucene index.
 but
  when i check println(d) i can see the german
 stopwords
  too. how can i eliminate this?
  
 Stopwords in an analyzer don't make the stopwords
 disappear from the document,
 they only prevent them from beeing indexed.
 So you will allways see stopwords in the document
 (before indexing and,
 if the field is stored, when the document is
 retrieved from the index).
 
 A meaningful check, if stopwords are recognized,
 would be to search for
 a stopword. You shouldn't find anything...
 
 HTH
   Morus
 

-
 To unsubscribe, e-mail:
 [EMAIL PROTECTED]
 For additional commands, e-mail:
 [EMAIL PROTECTED]
 
  






___
Gesendet von Yahoo! Mail - Jetzt mit 100MB Speicher kostenlos - Hier anmelden: 
http://mail.yahoo.de

-
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]



how to find coherent terms

2004-10-20 Thread Miro Max
Hello,

i've to realize one function in my project and i hope
i can find someone who can help me.

the idee is about search of coherent terms!

my imagination:

1. search for a specific term_a
2. result: hits from lucene
   resultlist:
   term_a term_b term_c term_d
   term_b term_a term_e
   term_e term_a term_b term_f
3. now i can see that the term_a is in a speciall
relation to term_b - but how can i check this with
lucene? is this supported by any function of lucene or
does exist any other api?

thx

miro






___
Gesendet von Yahoo! Mail - Jetzt mit 100MB Speicher kostenlos - Hier anmelden: 
http://mail.yahoo.de

-
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]



Highlighter problem: null as result

2004-10-27 Thread Miro Max
Hello,

i'm trying to use highlighter from sandbox and
actually i've got a problem with some results getting
from highlighter.

normaly when i search in my index for ex. motor i
get 
circa 150 results -- this results are ok.
but when i use highlighter i get some results as
null values from the field content.

is this a bug in the highlighter class?

greetings

jose






___
Gesendet von Yahoo! Mail - Jetzt mit 100MB Speicher kostenlos - Hier anmelden: 
http://mail.yahoo.de

-
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]



searchbean example + servlet

2004-11-03 Thread Miro Max
Hi all,

where can i find the servlet for the searchbean
classes?
thé directory on cvs is now empty

ciao

miro






___
Gesendet von Yahoo! Mail - Jetzt mit 100MB Speicher kostenlos - Hier anmelden: 
http://mail.yahoo.de

-
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]



Faster highlighting with TermPositionVectors PROBLEM

2004-11-12 Thread Miro Max
Hi all,

i'm trying to implement this Method and i need some
help.

i've downloaded the new lucene-cvs version, compiled
it with ant and i get the following error message:

The method getOffsets(int) is undefined for the type
Object.

Can anyone tell me why?

thx

miro






___
Gesendet von Yahoo! Mail - Jetzt mit 100MB Speicher kostenlos - Hier anmelden: 
http://mail.yahoo.de

-
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]



how to get most frequented terms from hits

2004-12-28 Thread Miro Max
Hello,

is ist possible to get most frequented terms from
hits?

thx

miro




___ 
Gesendet von Yahoo! Mail - Jetzt mit 250MB Speicher kostenlos - Hier anmelden: 
http://mail.yahoo.de

-
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]



list of most frequent words from Hits

2005-01-20 Thread Miro Max
Hello,

i need to create a list of most frequent words from
the result set.
actually i'm creating new RAMDirectory to add the
Hits. From this i get the list of the most frequent
words.

is this ok? what happens when i create 10 querys at
the same time? does new query overwrite the old
RAMDirectory? is this solution consistent or does
anyone got other way to realize this?

thanks

miro






___ 
Gesendet von Yahoo! Mail - Jetzt mit 250MB Speicher kostenlos - Hier anmelden: 
http://mail.yahoo.de

-
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]



Numbers in Index

2005-02-14 Thread Miro Max
hi,

actually i'm using standard analyzer during my index
process. but when i browse the index with luke there
also numbers inside.

which analyzer should i use to eliminate this from my
index or should i specify this in my stopword list?

thx

miro






___ 
Gesendet von Yahoo! Mail - Jetzt mit 250MB Speicher kostenlos - Hier anmelden: 
http://mail.yahoo.de

-
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]



select where from query type in lucene

2005-02-17 Thread Miro Max
Hi,

i've problem with my my classes using lucene.
my index looks like:

type   |   content
-
document   |  x
document   |  x
view   |  x
view   |  x
dbentry|  x
dbentry|  x

my question now:

how can i search for content where type=document or
(type=document OR type=view).
actually i can do it with: (type:document OR
type:entry) AND queryText as QueryString.
but does exist any other better way to realize this?

thx

miro




___ 
Gesendet von Yahoo! Mail - Jetzt mit 250MB Speicher kostenlos - Hier anmelden: 
http://mail.yahoo.de

-
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]



how to get stored fields

2005-02-17 Thread Miro Max
Hello again,

i'm indexing my content as unstored fiels. now i want
to get this fields matching to the query and copy it
to a new index.

do i have to reconstruct this content or can i copy
this content as field to a new index --

Field f = hits.doc(i).getField(content);
d.add(f);

miro







___ 
Gesendet von Yahoo! Mail - Jetzt mit 250MB Speicher kostenlos - Hier anmelden: 
http://mail.yahoo.de

-
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]