Yes, indeed. Maybe I should come up with such an Analyzer in a boilerpipe-lucene package...
Christian Am 14.12.2009 um 16:15 schrieb Ted Dunning: > Storing the original would be an excellent idea and would be quite doable. > > 2009/12/14 Christian Kohlschütter <[email protected]> > >> However it would also be great (in order to increase recall) to also store >> non-content and just add some kind of static boosting for content blocks >> over non-content blocks. I am not sure whether this will work right now >> using an Analyzer. What you could do though, is to store the text into >> separate fields ("content"/"boilerplate") and add field-specific boosts at >> query time. >> > > > > -- > Ted Dunning, CTO > DeepDyve -- Christian Kohlschütter [email protected] L3S Research Center Forschungszentrum L3S / Leibniz Universität Hannover http://www.L3S.de/~kohlschuetter
