Re: [Haskell-cafe] NLP libraries and tools?

2011-07-10 Thread Ketil Malde
Perhaps this is interesting? On the relationship between exploratory (a.k.a. sloppy or theoretical) and rigorous math. http://arxiv.org/pdf/math/9307227v1 -k -- If I haven't seen further, it is by standing in the footprints of giants ___

Re: [Haskell-cafe] NLP libraries and tools?

2011-07-10 Thread ivan vadovic
Hi, Also a library for string normalization in the sense of stripping diacritical marks would be handy too. Does anything in this respect exist that would be usable from haskell? Thanks On Fri, Jul 01, 2011 at 02:31:34PM +0400, Dmitri O.Kondratiev wrote: Hi, Please advise on NLP libraries

Re: [Haskell-cafe] NLP libraries and tools?

2011-07-10 Thread Jason Dagit
On Sun, Jul 10, 2011 at 12:59 PM, ivan vadovic p...@pobox.sk wrote: Hi, Also a library for string normalization in the sense of stripping diacritical marks would be handy too. Does anything in this respect exist that would be usable from haskell? The closest thing I know of is this:

Re: [Haskell-cafe] NLP libraries and tools?

2011-07-09 Thread Jack Henahan
Oof, you're liable to wound my (pure) mathematician's pride with remarks like that, wren. :P Now go intone the Litany of Categories as penance. :D I'll start you off… Set, Rel, Top, Ring, Grp, Cat, Hask… On Jul 7, 2011, at 10:53 PM, wren ng thornton wrote: I can't help but be a

Re: [Haskell-cafe] NLP libraries and tools?

2011-07-09 Thread wren ng thornton
(Psst, the nlp list is n...@projects.haskell.org :) On 7/9/11 3:10 AM, Jack Henahan wrote: On Jul 7, 2011, at 10:53 PM, wren ng thornton wrote: I can't help but be a (meta)theorist. But then, I'm of the firm opinion that theory must be grounded in actual practice, else it belongs more to the

Re: [Haskell-cafe] NLP libraries and tools?

2011-07-09 Thread Jack Henahan
Heh, I just hit Reply All and I guess the address came in wrong. Ah, well. I strongly agree with you on the state of linguistics, et al. Having done little bits of work in a few of those fields (or at least work _with_ people in them), your comments are spot on. Though perhaps I wouldn't say

Re: [Haskell-cafe] NLP libraries and tools?

2011-07-07 Thread Aleksandar Dimitrov
It's actually a shame we're discussing this on -cafe and not on -nlp. Then again, maybe it's going to prompt somebody to join -nlp, and I'm gonna CC it there, because some folks over there might be interested, but not read -cafe. On Wed, Jul 06, 2011 at 07:22:41PM -0700, wren ng thornton wrote:

Re: [Haskell-cafe] NLP libraries and tools?

2011-07-07 Thread wren ng thornton
On 7/7/11 3:38 AM, Aleksandar Dimitrov wrote: On Wed, Jul 06, 2011 at 07:27:10PM -0700, wren ng thornton wrote: I definitely agree with the iteratees comment, but I'm curious about the leaks you mention. I haven't run into leakiness issues (that I'm aware of) in my use of ByteStrings for NLP.

Re: [Haskell-cafe] NLP libraries and tools?

2011-07-07 Thread wren ng thornton
On 7/7/11 3:50 AM, Aleksandar Dimitrov wrote: It's actually a shame we're discussing this on -cafe and not on -nlp. Then again, maybe it's going to prompt somebody to join -nlp, and I'm gonna CC it there, because some folks over there might be interested, but not read -cafe. Quite :) When

Re: [Haskell-cafe] NLP libraries and tools?

2011-07-06 Thread Dmitri O.Kondratiev
Hi, Continuing my search of Haskell NLP tools and libs, I wonder if the following Haskell libraries exist (googling them does not help): 1) End of Sentence (EOS) Detection. Break text into a collection of meaningful sentences. 2) Part-of-Speech (POS) Tagging. Assign part-of-speech information to

Re: [Haskell-cafe] NLP libraries and tools?

2011-07-06 Thread wren ng thornton
On 7/6/11 9:27 AM, Dmitri O.Kondratiev wrote: Hi, Continuing my search of Haskell NLP tools and libs, I wonder if the following Haskell libraries exist (googling them does not help): 1) End of Sentence (EOS) Detection. Break text into a collection of meaningful sentences. Depending on how

Re: [Haskell-cafe] NLP libraries and tools?

2011-07-06 Thread Dmitri O.Kondratiev
On Wed, Jul 6, 2011 at 8:32 PM, wren ng thornton w...@freegeek.org wrote: On 7/6/11 9:27 AM, Dmitri O.Kondratiev wrote: Hi, Continuing my search of Haskell NLP tools and libs, I wonder if the following Haskell libraries exist (googling them does not help): 1) End of Sentence (EOS)

Re: [Haskell-cafe] NLP libraries and tools?

2011-07-06 Thread Aleksandar Dimitrov
On Wed, Jul 06, 2011 at 09:32:27AM -0700, wren ng thornton wrote: On 7/6/11 9:27 AM, Dmitri O.Kondratiev wrote: Hi, Continuing my search of Haskell NLP tools and libs, I wonder if the following Haskell libraries exist (googling them does not help): 1) End of Sentence (EOS) Detection.

Re: [Haskell-cafe] NLP libraries and tools?

2011-07-06 Thread Aleksandar Dimitrov
On Wed, Jul 06, 2011 at 11:04:30PM +0400, Dmitri O.Kondratiev wrote: On Wed, Jul 6, 2011 at 8:32 PM, wren ng thornton w...@freegeek.org wrote: On 7/6/11 9:27 AM, Dmitri O.Kondratiev wrote: Hi, Continuing my search of Haskell NLP tools and libs, I wonder if the following Haskell

Re: [Haskell-cafe] NLP libraries and tools?

2011-07-06 Thread Rogan Creswick
On Wed, Jul 6, 2011 at 3:03 PM, Aleksandar Dimitrov aleks.dimit...@googlemail.com wrote: So you'd use, say, UIMA+OpenNLP to do sentence boundaries, tokens, tags, named-entities whatnot, then spit out some annotated format, read it in with Haskell, and do the logic/magic there. Have you used

Re: [Haskell-cafe] NLP libraries and tools?

2011-07-06 Thread Aleksandar Dimitrov
On Wed, Jul 06, 2011 at 03:14:07PM -0700, Rogan Creswick wrote: Have you used that particular combination yet? I'd like to know the details of how you hooked everything together if that's something you can share. (We're working on a similar Frankenstein at the moment.) These Frankensteins, as

Re: [Haskell-cafe] NLP libraries and tools?

2011-07-06 Thread Richard O'Keefe
On 7/07/2011, at 7:04 AM, Dmitri O.Kondratiev wrote: I am looking for Haskell implementation of sentence tokenizer such as described by Tibor Kiss and Jan Strunk’s in “Unsupervised Multilingual Sentence Boundary Detection”, which is implemented in NLTK: That method is multilingual but

Re: [Haskell-cafe] NLP libraries and tools?

2011-07-06 Thread wren ng thornton
On 7/6/11 5:58 PM, Aleksandar Dimitrov wrote: On Wed, Jul 06, 2011 at 09:32:27AM -0700, wren ng thornton wrote: On 7/6/11 9:27 AM, Dmitri O.Kondratiev wrote: Hi, Continuing my search of Haskell NLP tools and libs, I wonder if the following Haskell libraries exist (googling them does not

Re: [Haskell-cafe] NLP libraries and tools?

2011-07-06 Thread wren ng thornton
On 7/6/11 6:45 PM, Aleksandar Dimitrov wrote: One hint, if you ever find yourself reading in quantitative linguistic data with Haskell: forget lazy IO. Forget strict IO, except your documents aren't ever bigger than a few hundred megs. In case you're not keeping the whole document in memory,

Re: [Haskell-cafe] NLP libraries and tools?

2011-07-06 Thread wren ng thornton
On 7/6/11 8:46 PM, Richard O'Keefe wrote: I've been working over the last year+ on an optimized HMM-based POS tagger/supertagger with online tagging and anytime n-best tagging. I'm planning to release it this summer (i.e., by the end of August), though there are a few things I'd like to polish

[Haskell-cafe] NLP libraries and tools?

2011-07-01 Thread Dmitri O.Kondratiev
Hi, Please advise on NLP libraries similar to Natural Language Toolkit ( www.nltk.org) First of all I need: - tools to construct 'bag of words' ( http://en.wikipedia.org/wiki/Bag_of_words_model), which is a list of words in the article. - tools to prune common words, such as prepositions and

Re: [Haskell-cafe] NLP libraries and tools?

2011-07-01 Thread Rogan Creswick
On Fri, Jul 1, 2011 at 3:31 AM, Dmitri O.Kondratiev doko...@gmail.com wrote: Hi, Please advise on NLP libraries similar to Natural Language Toolkit There is a (slowly?) growing NLP community for haskell over at: http://projects.haskell.org/nlp/ The nlp mailing list may be a better place to

Re: [Haskell-cafe] NLP libraries and tools?

2011-07-01 Thread Dmitri O.Kondratiev
On Fri, Jul 1, 2011 at 9:34 PM, Rogan Creswick cresw...@gmail.com wrote: On Fri, Jul 1, 2011 at 3:31 AM, Dmitri O.Kondratiev doko...@gmail.com wrote: First of all I need: ... - tools to construct 'bag of words' (http://en.wikipedia.org/wiki/Bag_of_words_model), which is a list of words

Re: [Haskell-cafe] NLP libraries and tools?

2011-07-01 Thread Rogan Creswick
On Fri, Jul 1, 2011 at 12:38 PM, Dmitri O.Kondratiev doko...@gmail.com wrote: On Fri, Jul 1, 2011 at 9:34 PM, Rogan Creswick cresw...@gmail.com wrote: On Fri, Jul 1, 2011 at 3:31 AM, Dmitri O.Kondratiev doko...@gmail.com wrote: First of all I need: Unfortunately 'cabal install' fails with

Re: [Haskell-cafe] NLP libraries and tools?

2011-07-01 Thread Dmitri O.Kondratiev
On Fri, Jul 1, 2011 at 11:58 PM, Rogan Creswick cresw...@gmail.com wrote: On Fri, Jul 1, 2011 at 12:38 PM, Dmitri O.Kondratiev doko...@gmail.com wrote: On Fri, Jul 1, 2011 at 9:34 PM, Rogan Creswick cresw...@gmail.com wrote: On Fri, Jul 1, 2011 at 3:31 AM, Dmitri O.Kondratiev

Re: [Haskell-cafe] NLP libraries and tools?

2011-07-01 Thread Rogan Creswick
On Fri, Jul 1, 2011 at 2:52 PM, Dmitri O.Kondratiev doko...@gmail.com wrote: Any other then 'toktok' Haskell word tokenizer that compiles and works? I need something like: http://nltk.googlecode.com/svn/trunk/doc/api/nltk.tokenize.regexp.WordPunctTokenizer-class.html I don't think this exists