Re: cld 0.1.0 - Clojure Language Detection

2012-03-01 Thread Ulises
Apologies for hijacking this thread, but this is probably the most relevant place to point you to a simple web-frontend to cld I cooked up in a couple of hours: http://detector-de-idioma.herokuapp.com/index.html Additionally, you can call it as a service like $ url 'http://detector-de-idioma.he

Re: cld 0.1.0 - Clojure Language Detection

2012-02-29 Thread Ulises
> I'm more curious about why the output isn't even deterministic. The > same input string produced three different results. You can see in the FAQ (http://code.google.com/p/language-detection/wiki/FrequentlyAskedQuestion) that: "Langdetect uses random sampling for avoiding local noises(person nam

Re: cld 0.1.0 - Clojure Language Detection

2012-02-29 Thread Cedric Greevey
I'm more curious about why the output isn't even deterministic. The same input string produced three different results. -- You received this message because you are subscribed to the Google Groups "Clojure" group. To post to this group, send email to clojure@googlegroups.com Note that posts from

Re: cld 0.1.0 - Clojure Language Detection

2012-02-29 Thread Michael Wood
On 29 February 2012 17:48, Lee Hinman wrote: > On Tuesday, February 28, 2012 6:03:26 PM UTC-7, Robin Kraft wrote: >> >> Awesome! I'm seeing some inconsistency though. Does anyone know why a >> Bayesian classifier would produce such different results? Could it be >> because of the short input text?

Re: cld 0.1.0 - Clojure Language Detection

2012-02-29 Thread Lee Hinman
On Tuesday, February 28, 2012 6:03:26 PM UTC-7, Robin Kraft wrote: > > Awesome! I'm seeing some inconsistency though. Does anyone know why a > Bayesian classifier would produce such different results? Could it be > because of the short input text? > > (lang/detect "My name is joe") > ["af" {"af

Re: cld 0.1.0 - Clojure Language Detection

2012-02-29 Thread Robin Kraft
Awesome! I'm seeing some inconsistency though. Does anyone know why a Bayesian classifier would produce such different results? Could it be because of the short input text? (lang/detect "My name is joe") ["af" {"af" "0.8571390166207665", "lt" "0.14285675907555712"}] (lang/detect "My name is joe")

Re: [ANN] cld 0.1.0 - Clojure Language Detection

2012-02-28 Thread Raju Bitter
Tested with Korean and German, and works great! user> (cld.core/detect "한국 음식중에 김치가 제일 맛있어요.") ["ko" {"ko" "0.9998"}] cld.core=> (cld.core/detect "In München steht ein Hofbräuhaus.") ["de" {"de" "0.972552285171"}] -- You received this message because you are subscribed to the Go

Re: [ANN] cld 0.1.0 - Clojure Language Detection

2012-02-27 Thread Alex Ott
similar functionality is also available in clj-tika (https://github.com/alexott/clj-tika, and clojars) - you can detect language, mime-type of data & extract text On Tue, Feb 28, 2012 at 3:24 AM, Lee Hinman wrote: > Hi all, > I'm pleased to announce the initial 0.1.0 release of cld (Clojure > Lan

Re: [ANN] cld 0.1.0 - Clojure Language Detection

2012-02-27 Thread Devin Walters
Cool. Time to get my cores to work extracting from pastebins. :) '(Devin Walters) On Feb 27, 2012, at 8:24 PM, Lee Hinman wrote: > Hi all, > I'm pleased to announce the initial 0.1.0 release of cld (Clojure > Language Detection). CLD a tiny library wrapping language-detect[1] > that can be used

[ANN] cld 0.1.0 - Clojure Language Detection

2012-02-27 Thread Lee Hinman
Hi all, I'm pleased to announce the initial 0.1.0 release of cld (Clojure Language Detection). CLD a tiny library wrapping language-detect[1] that can be used to determine the language of a particular piece of text very quickly. You should be able to use it from Clojars[2] with the following: [cld