Adding Fields to Document (with same name)

2005-02-01 Thread TheRanger
Hi,

what happens when I add two fields with the same name to one Document?

Document doc = new Document();
doc.add(Field.Text("bla", "this is my first text"));
doc.add(Field.Text("bla", "this is my second text"));

Will the second text overwrite the first, because only one field can be held
with the same name in one document?

Will the first and the second text be merged, when I search in the field bla
(e.g. with query "bla:text") ?

I am working on XML indexing and did not get an error when having repeated
XML fields. Now I am wondering...

Karl

-- 
Sparen beginnt mit GMX DSL: http://www.gmx.net/de/go/dsl

-
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]



Exchange Scoring algorithm

2004-02-22 Thread TheRanger
Hello group,

how can I exchange the existing scoring (the way the score is calculated
using TF/IDF, term boost, etc) with my own score. I know there is a way to
overwrite the Similariy class. However I am missing a good example. Could somebody
provide a good example which would show how to exchange the existing
DefaultSimilarity with MySimilarity. I think this would be necessary to implement
common Relevance Feedback algorthms. Did somebody do that and can give me some
hints?

Kind Regards,
Karl

-- 
GMX ProMail (250 MB Mailbox, 50 FreeSMS, Virenschutz, 2,99 EUR/Monat...)
jetzt 3 Monate GRATIS + 3x DER SPIEGEL +++ http://www.gmx.net/derspiegel +++


-
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]



Define own Query / Document Scoring - How?

2004-01-22 Thread TheRanger
Hi,

we all know the formular of how Lucene is weighting documents. Does Lucene
also weigthen the query?

How can this formular be exchanged with another formular? Is it possible to
do Standard_Rocchio by calculating a new weight for each term in a new query
becased on the documents retrieved (based on their terms weights)?

Can somebody give me a hint of how this could work? Maybe a short code
sniplet?

Cheers,
Karl

-- 
+++ GMX - die erste Adresse für Mail, Message, More +++
Bis 31.1.: TopMail + Digicam für nur 29 EUR http://www.gmx.net/topmail


-
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]



Indexing of deep structured XML

2004-01-16 Thread TheRanger
Hello all,

it is obviously possible to index the follwoing XML structure in Lucene:


  
  
  
  


by mapping all the xml tags (name, street, postcode and city) it to the
documents (address) fields directly. However is it also possible to map these?


  
  
  

  


Here we have a hierarchy in area (niceplace) which I want to preserve.
Suppose that the meaning of niceplace in an area is different from the niceplace
in the first xml structure (closer specified). I want to preserve this. 

Is there a way to index with Lucene means? If not, are there any attempt of
people doing this or does somebody have ideas how this could be solved?

Cheers,
Karl

-- 
+++ GMX - die erste Adresse für Mail, Message, More +++
Neu: Preissenkung für MMS und FreeMMS! http://www.gmx.net



-
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]