I corrected the rule and it works without errors. But I still have a problem with merging lines.
Am So., 4. Nov. 2018 um 15:36 Uhr schrieb mansur <6688...@gmail.com>: > My previous letter shows that merging doesn't happen because of additional > cg-proc, because I tried to remove that part completely. > By the way, I also tried your recommendation and it gives an error: > > root@apertium:~# apertium -n -d ./apertium-tat tat-tagger-devcg file.txt > VISL CG-3 Disambiguator version 0.9.9.11656 > cg-proc: process a stream with a constraint grammar > USAGE: cg-proc -t -s -d -r rule grammar_file input_file output_file > Options: > -d: morphological disambiguation (default behaviour) > -s: specify number of sections to process > -f: set the format of the I/O stream to NUM, > where `0' is VISL format, `1' is > Apertium format and `2' is Matxin (default: 1) > -r: run only the named rule > -t: print debug output on stderr > -w: enforce surface case on lemma/baseform > (to work with -w option of lt-proc) > -n: do not print out the word form of each cohort > -1: only output the first analysis if ambiguity remains > -z: flush output on the null character > -v: version > > > ---------- Forwarded message --------- > From: mansur <6688...@gmail.com> > Date: So., 4. Nov. 2018 um 15:14 Uhr > Subject: Re: [Apertium-stuff] Stop merging lines > To: <apertium-stuff@lists.sourceforge.net> > > > Hi, Kevin! > > Yes we have the same result with > cat file.txt | sed -r 's/$/\n\n\n\n\n\n\n\n\n\n/' | apertium -n -d > ./apertium-tat tat-tagger > file.txt > > Am So., 4. Nov. 2018 um 15:00 Uhr schrieb Kevin Brubeck Unhammer < > unham...@fsfe.org>: > >> mansur <6688...@gmail.com> čálii: >> >> > Hello! >> > >> > I have a very big file (some millions of lines) with one sentence per >> line. >> > >> > When I run Apertium's tagger sometimes it merges those lines. I tried to >> > insert empty lines between real lines and it merged fewer lines. I >> inserted >> > 10 empty lines and it merges even fewer lines, but there are some >> merging >> > what is not acceptable for me. What can I do to stop merging lines? >> > >> > cat file.txt | sed -r 's/$/\n\n\n\n\n\n\n\n\n\n/' | apertium -n -d >> > ./apertium-tat tat-tagger | cg-proc ./apertium-tat/dev/mansur.bin > >> file.txt >> >> Does tat-tagger without cg-proc do it too? >> _______________________________________________ >> Apertium-stuff mailing list >> Apertium-stuff@lists.sourceforge.net >> https://lists.sourceforge.net/lists/listinfo/apertium-stuff >> >
_______________________________________________ Apertium-stuff mailing list Apertium-stuff@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/apertium-stuff