2012/4/2 Orosz György <[email protected]>:
> We are one step closer now, but just wondering if there is any easy way to
> create a .dix file from Apertium stream format. (or any easy way to use an
> analysed text file for the tagger, instead of enumerating lemmata and
> paradigms.)

That usually involves much messing around with sed and the like, so
here's a little program for that: https://gist.github.com/2283105

g++ apertium-cleanstream.cc -o apertium-cleanstream

cat "$YOURTEXT" | ./apertium-cleanstream -n | sort | uniq

should give you what you need.
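
If you then want to go from that output to actual .dix entries, here is a
rough Python sketch. It is not part of the gist. It assumes each line holds
one lexical unit in the usual stream form, e.g. ^houses/house<n><pl>$, and it
writes every surface form out as a full entry instead of grouping forms into
paradigms. The function names are made up for illustration. Escaped slashes,
multiwords (spaces would need <b/>) and joined analyses with + are not
handled.

```python
import re
from xml.sax.saxutils import escape

# One lexical unit per line: ^surface/analysis1/analysis2$
LU = re.compile(r'^\^([^/$]*)/(.*)\$$')
# A single tag such as <n> or <pl>
TAG = re.compile(r'<([^<>]+)>')


def lu_to_entries(line):
    """Return one <e> entry string per analysis of a lexical unit."""
    m = LU.match(line.strip())
    if not m:
        return []  # not a lexical unit, skip it
    surface, rest = m.groups()
    entries = []
    for analysis in rest.split('/'):
        if analysis.startswith('*'):
            continue  # unknown word: nothing to put in the dictionary
        lemma = analysis.split('<', 1)[0]
        symbols = ''.join('<s n="%s"/>' % escape(t)
                          for t in TAG.findall(analysis))
        entries.append('<e><p><l>%s</l><r>%s%s</r></p></e>'
                       % (escape(surface), escape(lemma), symbols))
    return entries


def entries_from_lines(lines):
    """Convert many lines, dropping duplicate entries and sorting them."""
    seen = set()
    for line in lines:
        seen.update(lu_to_entries(line))
    return sorted(seen)
```

You would still have to wrap the entries in a <section> and declare every
symbol that turns up in <sdefs> to get a dictionary that compiles.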

-- 
<Sefam> Are any of the mentors around?
<jimregan> yes, they're the ones trolling you

_______________________________________________
Apertium-stuff mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/apertium-stuff
