Per Tunedal <[email protected]> writes: > Hi, > I've successfully extracted a Swedish word list from > apertium.sv-da.sv.dix as follows: > > lt-expand apertium-sv-da.sv.dix | cut -f1 -d':' > > apertium-sv-da.sv.dix.expanded > > I would like to get English and French word lists as well. How do I > proceed with the pairs fr-es and en-es or en-ca: > > there aren't any similar files for English or French in those pairs. > Only for Spanish.
The dix file is compiled from a .metadix file. First, compile the pair, then look for a .dix file, possibly in .deps/, like .deps/en.dix or something. > BTW Would it be better to extract words from > http://wiki.apertium.org/wiki/Languages , rather than from the pairs? Probably not for those languages … though if you're only after forms anyway, you could just grab all the words from all the directories and then do cat apertium-sv-da.sv.dix.expanded apertium-swe.swe.dix.expanded > \ sort -u > combined-apertium-swe.swe.dix.expanded -Kevin
signature.asc
Description: PGP signature
------------------------------------------------------------------------------ Dive into the World of Parallel Programming. The Go Parallel Website, sponsored by Intel and developed in partnership with Slashdot Media, is your hub for all things parallel software development, from weekly thought leadership blogs to news, videos, case studies, tutorials and more. Take a look and join the conversation now. http://goparallel.sourceforge.net/
_______________________________________________ Apertium-stuff mailing list [email protected] https://lists.sourceforge.net/lists/listinfo/apertium-stuff
