Hi

Is it reasonable to guess language info. from target servers geographical info.?

/Jack

On 1/23/06, Jérôme Charron <[EMAIL PROTECTED]> wrote:
> > Any plan to implement this ? I mean move LanguageIdentifier class
> > intto nutch core.
>
> As I already suggested it on this list, I really would like to move the
> LanguageIdentifier class (and profiles) to
> an independant Lucene sub-project (and the MimeType repository too).
> I don't remember why but there were some objections about this...
>
> Here is a short status of what I have in mind for next improvements with the
> LanguageIdentifier / MultiLanguage support :
> * Enhance LanguageIdentifier APIs by returning something like an ordered
> LangDetail[] array when guessing language (each LangDetail should contains
> the language code and its score) - I have a prototype version of this on my
> disk but I doesn't take time to finalize it
> * I encountered some identification problems with some specific sites (with
> blogger for instance), and I plan to investigate on this point.
> * Another pending task : the analysis (and coding) of multilingual querying
> support.
>
> Regards
>
> Jérôme
>
> --
> http://motrech.free.fr/
> http://www.frutch.org/
>
>


--
Keep Discovering ... ...
http://www.jroller.com/page/jmars


-------------------------------------------------------
This SF.net email is sponsored by: Splunk Inc. Do you grep through log files
for problems?  Stop!  Download the new AJAX search engine that makes
searching your log files as easy as surfing the  web.  DOWNLOAD SPLUNK!
http://sel.as-us.falkag.net/sel?cmd=lnk&kid3432&bid#0486&dat1642
_______________________________________________
Nutch-developers mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/nutch-developers

Reply via email to