Hello,

is the source code of your work public? Would like to have a look.

Jörn

On 01/24/2013 08:52 PM, Lance Norskog wrote:
I wrote a document summarizer based on singular value decomposition, and did detailed testing against the first Reuters corpus. http://ultrawhizbang.blogspot.com/2012/09/document-summarization-with-lsa-1.html

On 01/24/2013 02:59 AM, Renzo wrote:
Hi all,
I'm pretty new to OpenNLP.
My interest is almost related to fetch document summaries using algorithms such as TextRank. This task requires sentence and token splitting - here's where OpenNLP enters the game. I also need some degree of POS to detect nouns, verbs and so on, in order to add some linguistic support to the ranking process.

It was fairly surprising to discover that noun tags - for example - are language dependent. Thus an "isNoun" predicate needs a specific answer for each language. It's "NN" for English, but it may be different for others.

I just wonder if there is a common (e.g. language-independent) way to answer such a kind of questions.

Furthermore, is the logical format of available binary files documented anywhere ? Is there any way to browse those files to inspect the used tag list ?
Thanks,

Renzo



Reply via email to