2011/6/22 Jörn Kottmann <[email protected]>:
> On 6/22/11 10:27 AM, Olivier Grisel wrote:
>>
>> Currently there is only a basic command line interface. I plan to work
>> on a SWING version too and Hannes started to work on a HTML /
>> Javascript frontend.
>
> Did you have a look at the Cas Editor? It might have already many of
> the features you need.

I had a look some time ago, but AFAIK it does not focus on line-by-line,
keyboard-based "binary" (true / false) validation and fast keyboard-based
scanning of a large corpus.

My use case is to bootstrap an OpenNLP model from the output of a rough,
incomplete extraction of Wikipedia with pignlproc as explained in [1],
use this (roughly) trained NameFinder model on new Wikipedia text, and
then use the refiner to validate the output of the NameFinder model so as
to be able to iterate later (potentially with a more active learning
strategy as in Dualist).

I will have another look at the Cas Editor before embarking on the Swing
version, though.

[1] http://blogs.nuxeo.com/dev/2011/01/mining-wikipedia-with-hadoop-and-pig-for-natural-language-processing.html

--
Olivier
http://twitter.com/ogrisel - http://github.com/ogrisel
