Josh, sorry, I replied to your question by mistake.

On Sun, Dec 11, 2011 at 12:21 PM, Johnson J <johnson.devj...@gmail.com> wrote:

> Thanks for the information, Josh. I want a model that identifies the topic
> of a given website (this is to help students identify the subject). For
> this I am using the document categorizer with my own corpus of nearly 2 GB
> (for example: science <space> text describing science).
>
> Thanks,
> Johnson.
>
> On Sun, Dec 11, 2011 at 9:37 AM, Josh Patterson <j...@cloudera.com> wrote:
>
>> working with the examples and reading:
>>
>>
>> http://sourceforge.net/apps/mediawiki/opennlp/index.php?title=Sentence_Detector
>>
>> I've noticed the section on "Training". Given that the models already
>> detect things like sentences and POS, in what circumstances would one
>> want to "train" the model further?
>>
>> Josh
>>
>> --
>> Twitter: @jpatanooga
>> Solution Architect @ Cloudera
>> hadoop: http://www.cloudera.com
>>
>
>
