[ https://issues.apache.org/jira/browse/LUCENE-180?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Grant Ingersoll resolved LUCENE-180. ------------------------------------ Resolution: Duplicate Assignee: (was: Lucene Developers) See LUCENE-826. > [PATCH] Language guesser contribution > ------------------------------------- > > Key: LUCENE-180 > URL: https://issues.apache.org/jira/browse/LUCENE-180 > Project: Lucene - Java > Issue Type: Improvement > Components: Other > Affects Versions: unspecified > Environment: Operating System: other > Platform: Other > Reporter: Jean-François Halleux > Priority: Minor > Attachments: LanguageGuesser.zip, tlg.zip > > > Hello, > I'd like to contribute this language guesser to Lucene. > It contains language guessing interfaces and classes as well as trigram > specific classes and some language reference files I generated myself using > the > trigram file generation utily in there. I included a unit test as well. > I didn't do any extensive tests on guessing quality and performance but I > would > tend to think that they are both OK for a first pass. > I thought about writing a custom Analyzer for this but realized that this > wouldn't be the way to go and that probably the language decision should be > left to the developper, definitely when the Analyzer is used to tokenize a > query. > Have fun, > Jean-François Halleux -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online. --------------------------------------------------------------------- To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]