Adding Fields to Document (with same name)
Hi,

what happens when I add two fields with the same name to one Document?

    Document doc = new Document();
    doc.add(Field.Text("bla", "this is my first text"));
    doc.add(Field.Text("bla", "this is my second text"));

Will the second text overwrite the first, because a document can hold only one field per name? Or will the first and the second text be merged when I search in the field bla (e.g. with the query "bla:text")?

I am working on XML indexing and did not get an error when indexing repeated XML fields. Now I am wondering...

Karl

To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]
Exchange Scoring algorithm
Hello group,

how can I replace the existing scoring (the way the score is calculated using TF/IDF, term boost, etc.) with my own? I know it is possible to override the Similarity class, but I am missing a good example. Could somebody provide one that shows how to replace the existing DefaultSimilarity with MySimilarity?

I think this would be necessary to implement common relevance feedback algorithms. Has somebody done that and can give me some hints?

Kind regards,
Karl
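To make the question concrete, here is a minimal plain-Java sketch of what "exchanging the scoring" means: a base class holds the tf and idf factors, and a subclass overrides only the factor it wants to change. The class names SimpleSimilarity, BinaryTfSimilarity and ToyScorer are hypothetical stand-ins written for this illustration; they are not Lucene classes, and the formulas are only one common TF/IDF variant, not necessarily what a given Lucene version computes.

```java
import java.util.List;
import java.util.Map;

// Hypothetical stand-in for a similarity class (NOT Lucene's API).
class SimpleSimilarity {
    // Term-frequency factor: grows with the square root of the raw count.
    double tf(int freq) {
        return Math.sqrt(freq);
    }

    // Inverse document frequency: rarer terms get a larger weight.
    double idf(int docFreq, int numDocs) {
        return Math.log((double) numDocs / (docFreq + 1)) + 1.0;
    }
}

// A custom variant that overrides only the factor it wants to change:
// a term counts once, no matter how often it occurs in the document.
class BinaryTfSimilarity extends SimpleSimilarity {
    @Override
    double tf(int freq) {
        return freq > 0 ? 1.0 : 0.0;
    }
}

class ToyScorer {
    // Sums tf * idf over the query terms that occur in the document.
    // termFreqs: term -> count inside the document being scored.
    // docFreqs:  term -> number of documents containing the term.
    static double score(List<String> queryTerms,
                        Map<String, Integer> termFreqs,
                        Map<String, Integer> docFreqs,
                        int numDocs,
                        SimpleSimilarity similarity) {
        double total = 0.0;
        for (String term : queryTerms) {
            int freq = termFreqs.getOrDefault(term, 0);
            if (freq == 0) {
                continue; // term does not occur in this document
            }
            int docFreq = docFreqs.getOrDefault(term, 0);
            total += similarity.tf(freq) * similarity.idf(docFreq, numDocs);
        }
        return total;
    }
}
```

Passing a BinaryTfSimilarity instead of a SimpleSimilarity to ToyScorer.score changes the ranking without touching the scoring loop. In Lucene the analogous step would presumably be to subclass DefaultSimilarity and register the subclass through a setSimilarity call at both index and search time, but that should be checked against the javadoc of the version in use.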
Define own Query / Document Scoring - How?
Hi,

we all know the formula Lucene uses to weight documents. Does Lucene also weight the query? How can this formula be replaced with another one?

Is it possible to do standard Rocchio by calculating a new weight for each term in a new query, based on the documents retrieved (that is, on their term weights)? Can somebody give me a hint of how this could work? Maybe a short code snippet?

Cheers,
Karl
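For reference, the Rocchio update asked about here can be sketched independently of Lucene. The plain-Java example below treats the query and each document as a map from term to weight and computes q' = alpha * q + beta * centroid(relevant) - gamma * centroid(nonRelevant). The class name Rocchio, the method names and the choice to drop terms whose weight ends up at or below zero are assumptions made for this illustration; how the term weights are obtained from an index is left open.

```java
import java.util.HashMap;
import java.util.List;
import java.util.Map;

// Hypothetical helper for this sketch; not part of Lucene.
class Rocchio {

    // Returns the reweighted query vector:
    //   q' = alpha * q + beta * centroid(relevant) - gamma * centroid(nonRelevant)
    static Map<String, Double> reweight(Map<String, Double> query,
                                        List<Map<String, Double>> relevant,
                                        List<Map<String, Double>> nonRelevant,
                                        double alpha, double beta, double gamma) {
        Map<String, Double> result = new HashMap<>();
        // Scale the original query weights by alpha.
        for (Map.Entry<String, Double> entry : query.entrySet()) {
            result.merge(entry.getKey(), alpha * entry.getValue(), Double::sum);
        }
        // Move towards the relevant documents and away from the others.
        addCentroid(result, relevant, beta);
        addCentroid(result, nonRelevant, -gamma);
        // Assumption of this sketch: terms with non-positive weight are dropped.
        result.values().removeIf(weight -> weight <= 0.0);
        return result;
    }

    // Adds factor * (average term weight over docs) to the target vector.
    private static void addCentroid(Map<String, Double> target,
                                    List<Map<String, Double>> docs,
                                    double factor) {
        if (docs.isEmpty()) {
            return; // no feedback documents of this kind
        }
        for (Map<String, Double> doc : docs) {
            for (Map.Entry<String, Double> entry : doc.entrySet()) {
                target.merge(entry.getKey(),
                             factor * entry.getValue() / docs.size(),
                             Double::sum);
            }
        }
    }
}
```

Each weight in the returned map could then be used as the boost of the corresponding term when building the follow-up query. Terms that appear only in the relevant documents enter the new query with a positive weight, which is how Rocchio expands the query.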
Indexing of deep structured XML
Hello all,

it is obviously possible to index a flat XML structure in Lucene, for example an address with the tags name, street, postcode and city, by mapping all the XML tags directly to the fields of the document (address).

However, is it also possible to map a nested structure? There we have a hierarchy in area (niceplace) which I want to preserve. Suppose that the meaning of niceplace inside an area is different from (more closely specified than) the niceplace in the first XML structure. I want to preserve this distinction.

Is there a way to index this with Lucene's own means? If not, have people attempted this, or does somebody have ideas on how it could be solved?

Cheers,
Karl
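One common way to keep such a hierarchy in a flat field model is to use the full element path as the field name, so that a niceplace directly under address and a niceplace under area end up in different fields. The sketch below only shows that flattening step with the JDK's DOM parser. The class name XmlPathFlattener, the dot separator and the sample element layout used in the note after the code are assumptions made for illustration; turning the resulting map into Lucene fields is left out.

```java
import java.io.ByteArrayInputStream;
import java.nio.charset.StandardCharsets;
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;
import javax.xml.parsers.DocumentBuilderFactory;
import org.w3c.dom.Element;
import org.w3c.dom.Node;
import org.w3c.dom.NodeList;

// Hypothetical helper for this sketch; not part of Lucene.
class XmlPathFlattener {

    // Maps every leaf element to a field named after its full path,
    // e.g. "address.area.niceplace". Repeated paths keep all their values.
    static Map<String, List<String>> flatten(String xml) {
        try {
            Element root = DocumentBuilderFactory.newInstance()
                    .newDocumentBuilder()
                    .parse(new ByteArrayInputStream(xml.getBytes(StandardCharsets.UTF_8)))
                    .getDocumentElement();
            Map<String, List<String>> fields = new LinkedHashMap<>();
            collect(root, root.getTagName(), fields);
            return fields;
        } catch (Exception e) {
            throw new IllegalArgumentException("Could not parse XML", e);
        }
    }

    // Walks the tree depth-first and records the text of leaf elements.
    private static void collect(Element element, String path,
                                Map<String, List<String>> fields) {
        boolean hasChildElement = false;
        NodeList children = element.getChildNodes();
        for (int i = 0; i < children.getLength(); i++) {
            Node child = children.item(i);
            if (child.getNodeType() == Node.ELEMENT_NODE) {
                hasChildElement = true;
                Element childElement = (Element) child;
                collect(childElement, path + "." + childElement.getTagName(), fields);
            }
        }
        if (!hasChildElement) {
            String text = element.getTextContent().trim();
            if (!text.isEmpty()) {
                fields.computeIfAbsent(path, key -> new ArrayList<>()).add(text);
            }
        }
    }
}
```

For an invented input such as `<address><niceplace>park</niceplace><area><niceplace>lake</niceplace></area></address>`, the two niceplace values land under the separate keys address.niceplace and address.area.niceplace, so a search can target either one by field name.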