Can i use the highlighter even if i don't store text in clucene index?

Le samedi 07 août 2010 à 09:50 +0200, Ben van Klinken a écrit :
> Check out the highlighter code in the contribs folder. Should do what u need.
> 
> Ben
> 
> On Saturday, August 7, 2010,  <e...@telexis.com> wrote:
> > Sorry if duplicate post, I wasn't subscribed when I did the 1st one, so not 
> > sure if it was aloud or blocked by spam filters.  This is my first "post", 
> > so I hope I'm doing this correctly.  I've only used Google Groups or 
> > on-line forums in the past, never email based lists.
> >
> > I'm using CLucene to index a custom ISAM legacy database of mine.  The 
> > database holds emails and other messages.  I've got the indexing working, 
> > and also the searching.  However, our current (very slow) search method 
> > provides a "summary" for each search result.  Basically fragments of the 
> > text around the words that were found.
> >
> > I have to read the records from my database for each result found by 
> > CLucene anyway, because I don't want to store all that data in Lucene, so I 
> > was thinking I would just scan my text manually to provide the same 
> > "summary" I'm currently providing.  However, now I realize that isn't as 
> > simple as it was before, because I don't always know what words/phrases (I 
> > think Lucene calls them "terms") were found.  For example, if the user does 
> > a fuzzy or wildcard search.  Without recreating the CLucene logic, how can 
> > I do this?
> >
> > I understand that there are some additional libraries (maybe only for the 
> > Java version?) that do "highlighting", which is similar to what I want to 
> > do.  I haven't found any source code for them yet, and even if I did, I'm 
> > not sure I could figure out how to extract that and recreate for my 
> > purposes (they may assume the text is stored in Lucene, or that I want the 
> > results in HTML format).  My scenario seems to be unique, because I'm not 
> > working with a website, or indexing basic files.
> >
> > If I could find the section of code in CLucene where it "finds" results for 
> > my query, then I would know what words it found for each document.  I'm not 
> > too worried about showing the "most relavent" fragments at this point, 
> > although that would be a nice feature down the road.
> >
> > p.s.-I'm loving CLucene so far, and if I can get past this last hurdle, I 
> > should be set =]
> >
> > -Eric Selk
> >
> >
> > ------------------------------------------------------------------------------
> > This SF.net email is sponsored by
> >
> > Make an app they can't live without
> > Enter the BlackBerry Developer Challenge
> > http://p.sf.net/sfu/RIM-dev2dev
> > _______________________________________________
> > CLucene-developers mailing list
> > CLucene-developers@lists.sourceforge.net
> > https://lists.sourceforge.net/lists/listinfo/clucene-developers
> >
> 
> ------------------------------------------------------------------------------
> This SF.net email is sponsored by 
> 
> Make an app they can't live without
> Enter the BlackBerry Developer Challenge
> http://p.sf.net/sfu/RIM-dev2dev 
> _______________________________________________
> CLucene-developers mailing list
> CLucene-developers@lists.sourceforge.net
> https://lists.sourceforge.net/lists/listinfo/clucene-developers



------------------------------------------------------------------------------
This SF.net email is sponsored by 

Make an app they can't live without
Enter the BlackBerry Developer Challenge
http://p.sf.net/sfu/RIM-dev2dev 
_______________________________________________
CLucene-developers mailing list
CLucene-developers@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/clucene-developers

Reply via email to