Revision: 9707
http://languagetool.svn.sourceforge.net/languagetool/?rev=9707&view=rev
Author: dnaber
Date: 2013-03-17 12:29:04 +0000 (Sun, 17 Mar 2013)
Log Message:
-----------
add German tagset documentation again
Modified Paths:
--------------
trunk/languagetool/languagetool-language-modules/de/src/main/java/org/languagetool/tagging/de/GermanTagger.java
Added Paths:
-----------
trunk/languagetool/languagetool-language-modules/de/src/main/resources/org/languagetool/resource/de/tagset.txt
Modified:
trunk/languagetool/languagetool-language-modules/de/src/main/java/org/languagetool/tagging/de/GermanTagger.java
===================================================================
--- trunk/languagetool/languagetool-language-modules/de/src/main/java/org/languagetool/tagging/de/GermanTagger.java	2013-03-17 11:04:04 UTC (rev 9706)
+++ trunk/languagetool/languagetool-language-modules/de/src/main/java/org/languagetool/tagging/de/GermanTagger.java	2013-03-17 12:29:04 UTC (rev 9707)
@@ -39,7 +39,7 @@
 /**
  * German part-of-speech tagger, requires data file in <code>resource/de/german.dict</code>.
  * The POS tagset is described in
- * <a href="https://languagetool.svn.sourceforge.net/svnroot/languagetool/trunk/JLanguageTool/src/main/resources/org/languagetool/resource/de/tagset.txt">tagset.txt</a>
+ * <a href="https://languagetool.svn.sourceforge.net/svnroot/languagetool/trunk/languagetool/languagetool-language-modules/de/src/main/resources/org/languagetool/resource/de/tagset.txt">tagset.txt</a>
  *
  * @author Marcin Milkowski, Daniel Naber
  */
Copied:
trunk/languagetool/languagetool-language-modules/de/src/main/resources/org/languagetool/resource/de/tagset.txt
(from rev 9047,
trunk/JLanguageTool/src/main/resources/org/languagetool/resource/de/tagset.txt)
===================================================================
--- trunk/languagetool/languagetool-language-modules/de/src/main/resources/org/languagetool/resource/de/tagset.txt	(rev 0)
+++ trunk/languagetool/languagetool-language-modules/de/src/main/resources/org/languagetool/resource/de/tagset.txt	2013-03-17 12:29:04 UTC (rev 9707)
@@ -0,0 +1,32 @@
+
+The tags used by the German tagger are based on Morphy. They
+are described in the PDF "Die Wortklassensysteme von Morphy", available
+at http://www.wolfganglezius.de/lib/exe/fetch.php?media=cl:wklassen.pdf
+
+LanguageTool will tag words by returning one string per reading.
+For example, "Baum" will be tagged with these strings:
+
+SUB:AKK:SIN:MAS
+SUB:DAT:SIN:MAS
+SUB:NOM:SIN:MAS
+
+These are abbreviations for:
+
+Substantiv, Akkusativ, Singular, Maskulinum
+Substantiv, Dativ, Singular, Maskulinum
+Substantiv, Nominativ, Singular, Maskulinum
+
+For example, for a rule that matches all adjectives, use:
+
+ <token postag_regexp="yes" postag="ADJ:.*" />
+
+For a rule that matches plural nouns use:
+
+ <token postag_regexp="yes" postag="SUB:.*:PLU:.*" />
+
+For a rule that matches singular nouns use:
+
+ <token postag_regexp="yes" postag="SUB:.*:SIN:.*" />
+
+Also try running LanguageTool on the command line with the -v option;
+it will display how words have been tagged.
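
Note on the tag format added above: each reading is a colon-separated
string, and the rule examples match it with a regular expression. The
following is only a minimal sketch in plain Java, not LanguageTool code.
The class TagsetSketch and its methods are hypothetical names, and the
sketch assumes that a postag pattern has to match the whole tag string.

```java
import java.util.Arrays;
import java.util.List;
import java.util.regex.Pattern;

public class TagsetSketch {

    // Splits a Morphy-style tag such as "SUB:AKK:SIN:MAS" into its
    // colon-separated parts (word class, case, number, gender).
    public static List<String> parts(String tag) {
        return Arrays.asList(tag.split(":"));
    }

    // Assumption: the pattern is applied to the complete tag string,
    // so Pattern.matches (a full match) is used rather than a search.
    public static boolean matches(String postagRegex, String tag) {
        return Pattern.matches(postagRegex, tag);
    }

    public static void main(String[] args) {
        // The three readings listed for "Baum" in tagset.txt.
        String[] readings = {"SUB:AKK:SIN:MAS", "SUB:DAT:SIN:MAS", "SUB:NOM:SIN:MAS"};
        for (String tag : readings) {
            System.out.println(tag + " -> " + parts(tag)
                + ", singular noun: " + matches("SUB:.*:SIN:.*", tag)
                + ", plural noun: " + matches("SUB:.*:PLU:.*", tag));
        }
    }
}
```

With the three readings of "Baum", the singular-noun pattern matches each
tag and the plural-noun pattern matches none of them.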