Any easy link to the bug report of this utf8 lucene issue?

On 3/31/06, Erik Hatcher <[EMAIL PROTECTED]> wrote:
>
> On Mar 30, 2006, at 4:10 PM, mike c wrote:
> > Hi Erik,
> > Thanks for pointing this out - as I just got Ferret working with
> > indexes created using Nutch.  Any recommendations on how to address
> > this issue?
>
> This is a particularly insidious issue.  Java Lucene is not using
> pure UTF-8, whereas ports like Ferret are.  But changing Java Lucene
> is a big deal and does introduce a (slight) performance hit
> apparently.  The plan is for Java Lucene to be corrected in this
> regard at some point in the future, perhaps as soon as Lucene 2.0.
>
> But for now, I don't know of a way to address this issue.  I gave up
> on Ferret for the time being because of this incompatibility and am
> now prototyping with Solr while still using my custom XML-RPC search
> server for now.
>
>         Erik
>
>
>
> >
> > -Mike
> >
> > On 3/30/06, Erik Hatcher <[EMAIL PROTECTED]> wrote:
> >> There is one incompatibility between Ferret and Java Lucene of note.
> >> It is the "UTF-8" issue that has surfaced with regards to Java
> >> Lucene.  All can be well between Java Lucene and Ferret, until
> >> characters in another range are indexed, and then Ferret will blow up
> >> trying to search the index.  Maybe this has been worked around in a
> >> more recent version of Ferret than I've tried?
> >>
> >>         Erik
> >>
> >>
> >> On Mar 30, 2006, at 2:50 PM, mike c wrote:
> >>
> >>> Thanks.  I'll try it out.  In the mean time, if I get Ferret working
> >>> I'll post an update.
> >>>
> >>> -Mike
> >>>
> >>> On 3/30/06, Steven Yelton <[EMAIL PROTECTED]> wrote:
> >>>> I use WEBrick instead of tomcat to query and serve search
> >>>> results.  I
> >>>> used ruby's 'rjb' to bridge the gap.
> >>>>
> >>>> http://raa.ruby-lang.org/project/rjb/
> >>>>
> >>>> There may be more direct ways (ruby<->lucene), but this was
> >>>> quick and
> >>>> easy and still has decent performance.
> >>>>
> >>>> Steven
> >>>>
> >>>> mike c wrote:
> >>>>
> >>>>> Hi all,
> >>>>> I was wondering if anyone is using Nutch (for crawling) with
> >>>>> Ferret
> >>>>> (indexing / searching).  Basically, my front-end is built using
> >>>>> Ruby
> >>>>> on Rails that's why I'm asking.  I have the Nutch crawler up and
> >>>>> running fine, but can't seem to figure out how to integrate the
> >>>>> two.
> >>>>> Any help is appreciated.
> >>>>>
> >>>>> Regards,
> >>>>> Mike
> >>>>>
> >>>>>
> >>>>
> >>>
> >>>
> >>> -------------------------------------------------------
> >>> This SF.Net email is sponsored by xPML, a groundbreaking scripting
> >>> language
> >>> that extends applications into web and mobile media. Attend the
> >>> live webcast
> >>> and join the prime developer group breaking into this new coding
> >>> territory!
> >>> http://sel.as-us.falkag.net/sel?cmd=lnk&kid0944&bid$1720&dat1642
> >>> _______________________________________________
> >>> Nutch-general mailing list
> >>> Nutch-general@lists.sourceforge.net
> >>> https://lists.sourceforge.net/lists/listinfo/nutch-general
> >>
> >>
> >
> >
> > -------------------------------------------------------
> > This SF.Net email is sponsored by xPML, a groundbreaking scripting
> > language
> > that extends applications into web and mobile media. Attend the
> > live webcast
> > and join the prime developer group breaking into this new coding
> > territory!
> > http://sel.as-us.falkag.net/sel?cmd=lnk&kid0944&bid$1720&dat1642
> > _______________________________________________
> > Nutch-general mailing list
> > Nutch-general@lists.sourceforge.net
> > https://lists.sourceforge.net/lists/listinfo/nutch-general
>
>


--
"Minds are like parachutes, they work best when open."

Bruno Patini Furtado
Software Developer
webpage: http://bpfurtado.net
software development blog: http://bpfurtado.livejournal.com

Reply via email to