2011/6/22 Jörn Kottmann <[email protected]>:
> On 6/22/11 10:27 AM, Olivier Grisel wrote:
>>
>> Currently there is only a basic command line interface. I plan to work
>> on a SWING version too and Hannes started to work on a HTML /
>> Javascript frontend.
>
> Did you have a look at the Cas Editor? It might have already many of
> the features you need.

I had some time ago but AFAIK it does not focus on line by line,
keyboard based "binary" (true / false) validation and fast keyboard
based scanning of a large corpus.

My use case if to bootstrap a OpenNLP model from the output an rough
incomplete extraction of Wikipedia with pignlproc as explained in [1],
use such (roughly) trained NameFinder model on new wikipedia text and
then use the refiner to validate the output of the NameFiner model so
as to be able to iterate later (potentially with a more active
learning strategy as in Dualist).

I will have another look at Cas Editor before embarking in the Swing
version though.

[1]
http://blogs.nuxeo.com/dev/2011/01/mining-wikipedia-with-hadoop-and-pig-for-natural-language-processing.html


-- 
Olivier
http://twitter.com/ogrisel - http://github.com/ogrisel

Reply via email to