Text mining in Python

2010-03-10 Thread mk
Hello everyone, I need to do the following:
(0. transform words in a document into word roots)
1. analyze a set of documents to see which words are highly frequent
2. detect clusters of those highly frequent words
3. map the clusters to some special keywords
4. rank the documents on
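
Steps 0 and 1 of the list above can be sketched with the standard library alone. This is only a rough illustration, not a method proposed in the thread: the suffix list, the helper names (`crude_root`, `tokenize`, `frequent_roots`) and the sample documents are all made up for the example, and the suffix stripping is a deliberately naive stand-in for a real stemmer.

```python
import re
from collections import Counter

# Hypothetical, deliberately crude suffix list; a real stemmer
# (for example a Porter-style one) handles far more cases.
SUFFIXES = ("ing", "ed", "es", "s")


def crude_root(word):
    """Strip at most one common English suffix, keeping at least 3 letters."""
    for suffix in SUFFIXES:
        if word.endswith(suffix) and len(word) - len(suffix) >= 3:
            return word[: -len(suffix)]
    return word


def tokenize(text):
    """Lowercase the text and return its runs of ASCII letters."""
    return re.findall(r"[a-z]+", text.lower())


def frequent_roots(documents, top_n=5):
    """Count word roots across all documents and return the top_n most common."""
    counts = Counter()
    for doc in documents:
        counts.update(crude_root(word) for word in tokenize(doc))
    return counts.most_common(top_n)


# Made-up sample documents, for illustration only.
docs = [
    "Clustering documents by frequent words",
    "Frequent words cluster in related documents",
]

for root, count in frequent_roots(docs, top_n=4):
    print(root, count)
```

The crude stemmer over-strips some words ("related" becomes "relat"), which is usually acceptable for counting because all that matters is that variants of a word collapse to the same key.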

Re: Text mining in Python

2010-03-10 Thread Robert Kern
On 2010-03-10 12:58 PM, mk wrote:
Hello everyone, I need to do the following:
(0. transform words in a document into word roots)
1. analyze a set of documents to see which words are highly frequent
2. detect clusters of those highly frequent words
3. map the clusters to some special

Re: Text mining in Python

2010-03-10 Thread Jason Tackaberry
On Wed, 2010-03-10 at 19:58 +0100, mk wrote:
I need to do the following: [...] Is there some good open source engine out there that would be suitable to the task at hand? Anybody has experience with them?

It sounds like a full text search engine might do a bit more than you need, but based on