Yes. That is wha I am trying. But for some reason it is not working.. Does these fields should be lower case only. ?
Andrzej Bialecki <[EMAIL PROTECTED]> wrote: Stefan Neufeind wrote: > sudhendra seshachala wrote: > >> I am experiencing a similar problem. >> What I have done is as follows. >> I have different parse-plugin for each site ( I have 3 sites to crawl and >> fetch data). But I capture data into same format I call it datarepository. >> I have one index-plugin which indexes on data repository and one >> query-plugin on the data repository, >> I dont have to run multiple instances. I just run one instance of search >> engine. >> However the parse configuration is different for each site so I run >> different crawler for each site >> Then I index and merge all of them. So far the results are good if not "WOW". >> I still have to figure a way of ranking the page. For example I would like >> to be able to apply ranking on the data repository. Let me know If I was >> clear... >> > > Hi, > > not sure if I got you right with your last point, but it just came to my > mind: > It would be nice to be able to have something like > "If it's from indexA, give it 100 extra-points - if from indexB give it > 50 extra-points". Or some "if indexA give it 20% extra-weight" or so. > But I don't believe this is easily doable. Or is it? > > I got a similar problem with languages: give priority to documents in > German and English. But somewhere after those results also list > documents in other languages. So I'd need to be able to give > "extra-points" on a "per-language"-basis, based on the indexed > language-field, right? > This is not only doable, but fairly easy - just add these fields to the index through a custom IndexingFilter plugin, and then implement a corresponding QueryPlugin that will expand your query appropriately - this "prioritization" that you describe is equivalent to adding a non-required and non-prohibited clause to a Lucene query. Please see how it's done in the existing index-more/query-more and index-basic/query-basic plugins. -- Best regards, Andrzej Bialecki <>< ___. ___ ___ ___ _ _ __________________________________ [__ || __|__/|__||\/| Information Retrieval, Semantic Web ___|||__|| \| || | Embedded Unix, System Integration http://www.sigram.com Contact: info at sigram dot com Sudhi Seshachala http://sudhilogs.blogspot.com/ __________________________________________________ Do You Yahoo!? Tired of spam? Yahoo! Mail has the best spam protection around http://mail.yahoo.com
