you could use the annotated version of europarl from OPUS: http://www.let.rug.nl/~tiedeman/OPUS/ there is also a tool to convert to moses format: http://www.let.rug.nl/~tiedeman/OPUS/tools.php (opus2moses.pl - you will need Perl & XML::Parser)
it's all machine-annotated data - so no corrections done. but for English, for example, you can choose between the results of 3 different taggers and select the most reliable one (or combine their information). however, not all languages are tagged. good luck! jorg On Sun, 18 Jan 2009 15:30:32 +0100 "Sanne Korzec" <[email protected]> wrote: > Hi mailing, > > > > I was wondering if someone knows if there are labeled part of speech >tags > available for the europarl corpus. And if so, where to find them. > > > > Regards, > > Sanne > _______________________________________________ Moses-support mailing list [email protected] http://mailman.mit.edu/mailman/listinfo/moses-support
