Source: dasher Severity: important Some language files, notably German, have spaces at the end of each and every paragraph in the training text. Please remove them; they severely skew the character distributions and make entering a linefeed after any character that's not a space virtually impossible.
The German text also contains embedded links: "Die haarsträubende [2]Verschwörungsbastelanweisung" That also doesn't make much sense in this context. -- System Information: Debian Release: wheezy/sid APT prefers testing APT policy: (700, 'testing'), (650, 'unstable'), (600, 'stable'), (550, 'experimental') Architecture: amd64 (x86_64) Foreign Architectures: i386 Kernel: Linux 3.2.0-3-amd64 (SMP w/1 CPU core) Locale: LANG=en_US.UTF-8, LC_CTYPE=en_US.UTF-8 (charmap=UTF-8) Shell: /bin/sh linked to /bin/dash -- To UNSUBSCRIBE, email to debian-bugs-dist-requ...@lists.debian.org with a subject of "unsubscribe". Trouble? Contact listmas...@lists.debian.org