2012/4/2 Orosz György <[email protected]>: > We are one step closer now, but just wondering if there is any easy way to > create a .dix file from Apertium stream format. (or any easy way to use an > analysed text file for the tagger, instead of enumerating lemmata and > paradigms.)
That usually involves much messing around with sed and the like, so here's a little program for that: https://gist.github.com/2283105 g++ apertium-cleanstream.cc -o apertium-cleanstream cat $YOURTEXT | ./apertium-cleanstream -n|sort|uniq should give you what you need. -- <Sefam> Are any of the mentors around? <jimregan> yes, they're the ones trolling you ------------------------------------------------------------------------------ This SF email is sponsosred by: Try Windows Azure free for 90 days Click Here http://p.sf.net/sfu/sfd2d-msazure _______________________________________________ Apertium-stuff mailing list [email protected] https://lists.sourceforge.net/lists/listinfo/apertium-stuff
