On 10/01/2013 09:03 PM, Giorgio Valoti wrote:
Il giorno 01/ott/2013, alle ore 20:48, Jörn Kottmann <[email protected]> ha
scritto:
On 10/01/2013 07:36 PM, Giorgio Valoti wrote:
I'd be interested in creating an annotated corpus for Italian or at least to
begin the process. The first problem for me is finding an annotation guide.
Does anyone have some links?
Re. brat. I have a little familiarity with GATE. How do they compare? Are they
even comparable?
The nice thing about wikinews is that it is available in many languages and we
could probably develop some cross language
principles.
What do you mean by "principles"? Are you talking about the guidelines?
Yes.
I never really used GATE, not sure how that could work out. Brat is web based
and could easily be used by
multiple users. I guess that would fit our needs quite well.
How is your experience with GATE?
(Please keep in mind that I'm a GATE expert.)
It's a very mature tool, with a bunch of features and a robust architecture.
For example, I swapped out the default POS tagger with OpenNLP's in zero time.
Nothing fancy, but it just worked.
We need a tool where a user can manually annotate/correct the data. How
is you experience doing that with GATE?
To create a corpus with a group of contributors it must be easy to get
started and then contribute changes.
Jörn