Revision: 17733 http://sourceforge.net/p/gate/code/17733 Author: ian_roberts Date: 2014-03-24 20:45:07 +0000 (Mon, 24 Mar 2014) Log Message: ----------- More detail on annotations.
Modified Paths: -------------- gate/trunk/plugins/ANNIE/.annie-defaults-metadata/long-desc.html gate/trunk/plugins/Lang_Arabic/resources/.arabic-pipeline-metadata/long-desc.html gate/trunk/plugins/Lang_French/.french-pipeline-metadata/long-desc.html gate/trunk/plugins/Lang_German/resources/.german-pipeline-metadata/long-desc.html gate/trunk/plugins/Lang_Romanian/resources/.romanian-pipeline-metadata/long-desc.html gate/trunk/plugins/Lang_Russian/resources/.russie-inflex-metadata/long-desc.html gate/trunk/plugins/Lang_Russian/resources/.russie-metadata/long-desc.html gate/trunk/plugins/Lang_Russian/resources/.russie-ortho-inflex-metadata/long-desc.html gate/trunk/plugins/Lang_Russian/resources/.russie-ortho-metadata/long-desc.html gate/trunk/plugins/OpenNLP/resources/.opennlp-de-metadata/long-desc.html gate/trunk/plugins/OpenNLP/resources/.opennlp-metadata/long-desc.html gate/trunk/plugins/OpenNLP/resources/.opennlp-nl-metadata/long-desc.html gate/trunk/plugins/Tagger_Framework/resources/AbGene/.abgene-metadata/long-desc.html gate/trunk/plugins/Tagger_Framework/resources/Hunpos/.hunpos-en-metadata/long-desc.html gate/trunk/plugins/Tagger_Framework/resources/Hunpos/.hunpos-hu-metadata/long-desc.html gate/trunk/plugins/Tagger_Measurements/resources/.annie-measurements-metadata/long-desc.html gate/trunk/plugins/Tagger_Measurements/resources/.measurements-metadata/long-desc.html gate/trunk/plugins/Tagger_NP_Chunking/.np-chunker-metadata/long-desc.html gate/trunk/plugins/Tagger_PennBio/resources/.pennbio-metadata/long-desc.html gate/trunk/plugins/Twitter/resources/.twitie-en-metadata/long-desc.html Modified: gate/trunk/plugins/ANNIE/.annie-defaults-metadata/long-desc.html =================================================================== --- gate/trunk/plugins/ANNIE/.annie-defaults-metadata/long-desc.html 2014-03-24 19:35:52 UTC (rev 17732) +++ gate/trunk/plugins/ANNIE/.annie-defaults-metadata/long-desc.html 2014-03-24 20:45:07 UTC (rev 17733) @@ -6,3 +6,50 @@ <p>It is the prototypical information extraction pipeline distributed with the <a href="http://gate.ac.uk">GATE framework</a> and forms the base of many more complex GATE-based IE applications.</p> + +<table> + <tr> + <td colspan="2"><b>Default annotations</b></td> + </tr> + <tr> + <td><tt>:Person</tt></td> + <td rowspan="4">Standard named entity types</td> + </tr> + <tr> + <td><tt>:Location</tt></td> + </tr> + <tr> + <td><tt>:Organization</tt></td> + </tr> + <tr> + <td><tt>:Date</tt></td> + </tr> + <tr> + <td><tt>:Address</tt></td> + <td>Includes email and IP addresses as well as street addresses</td> + </tr> + <tr> + <td colspan="2"><b>Additional annotations available if selected</b></td> + </tr> + <tr> + <td><tt>:Money</tt></td> + <td>Monetary amounts</td> + </tr> + <tr> + <td><tt>:Percent</tt></td> + <td>Expressions representing percentages</td> + </tr> + <tr> + <td><tt>:Token</tt></td> + <td>The individual tokens of the text, with "category" feature for POS</td> + </tr> + <tr> + <td><tt>:SpaceToken</tt></td> + <td>The spaces between tokens</td> + </tr> + <tr> + <td><tt>:Sentence</tt></td> + <td>Sentences detected by the sentence splitter</td> + </tr> +</table> + Modified: gate/trunk/plugins/Lang_Arabic/resources/.arabic-pipeline-metadata/long-desc.html =================================================================== --- gate/trunk/plugins/Lang_Arabic/resources/.arabic-pipeline-metadata/long-desc.html 2014-03-24 19:35:52 UTC (rev 17732) +++ gate/trunk/plugins/Lang_Arabic/resources/.arabic-pipeline-metadata/long-desc.html 2014-03-24 20:45:07 UTC (rev 17733) @@ -1,4 +1,51 @@ <p>A named entity recognition pipeline that identifies basic entity types, such as <em>Person</em>, <em>Location</em>, <em>Organization</em>, <em>Money</em> amounts, <em>Time</em> and <em>Date</em> expressions. It works on documents -in the Arabic language.</p> \ No newline at end of file +in the Arabic language.</p> + +<table> + <tr> + <td colspan="2"><b>Default annotations</b></td> + </tr> + <tr> + <td><tt>:Person</tt></td> + <td rowspan="4">Standard named entity types</td> + </tr> + <tr> + <td><tt>:Location</tt></td> + </tr> + <tr> + <td><tt>:Organization</tt></td> + </tr> + <tr> + <td><tt>:Date</tt></td> + </tr> + <tr> + <td><tt>:Address</tt></td> + <td>Includes email and IP addresses as well as street addresses</td> + </tr> + <tr> + <td colspan="2"><b>Additional annotations available if selected</b></td> + </tr> + <tr> + <td><tt>:Money</tt></td> + <td>Monetary amounts</td> + </tr> + <tr> + <td><tt>:Percent</tt></td> + <td>Expressions representing percentages</td> + </tr> + <tr> + <td><tt>:Token</tt></td> + <td>The individual tokens of the text, with "category" feature for POS</td> + </tr> + <tr> + <td><tt>:SpaceToken</tt></td> + <td>The spaces between tokens</td> + </tr> + <tr> + <td><tt>:Sentence</tt></td> + <td>Sentences detected by the sentence splitter</td> + </tr> +</table> + Modified: gate/trunk/plugins/Lang_French/.french-pipeline-metadata/long-desc.html =================================================================== --- gate/trunk/plugins/Lang_French/.french-pipeline-metadata/long-desc.html 2014-03-24 19:35:52 UTC (rev 17732) +++ gate/trunk/plugins/Lang_French/.french-pipeline-metadata/long-desc.html 2014-03-24 20:45:07 UTC (rev 17733) @@ -1,4 +1,51 @@ <p>A named entity recognition pipeline that identifies basic entity types, such as <em>Person</em>, <em>Location</em>, <em>Organization</em>, <em>Money</em> amounts, <em>Time</em> and <em>Date</em> expressions. It works on documents -in the French language.</p> \ No newline at end of file +in the French language.</p> + +<table> + <tr> + <td colspan="2"><b>Default annotations</b></td> + </tr> + <tr> + <td><tt>:Person</tt></td> + <td rowspan="4">Standard named entity types</td> + </tr> + <tr> + <td><tt>:Location</tt></td> + </tr> + <tr> + <td><tt>:Organization</tt></td> + </tr> + <tr> + <td><tt>:Date</tt></td> + </tr> + <tr> + <td><tt>:Address</tt></td> + <td>Includes email and IP addresses as well as street addresses</td> + </tr> + <tr> + <td colspan="2"><b>Additional annotations available if selected</b></td> + </tr> + <tr> + <td><tt>:Money</tt></td> + <td>Monetary amounts</td> + </tr> + <tr> + <td><tt>:Percent</tt></td> + <td>Expressions representing percentages</td> + </tr> + <tr> + <td><tt>:Token</tt></td> + <td>The individual tokens of the text, with "category" feature for POS</td> + </tr> + <tr> + <td><tt>:SpaceToken</tt></td> + <td>The spaces between tokens</td> + </tr> + <tr> + <td><tt>:Sentence</tt></td> + <td>Sentences detected by the sentence splitter</td> + </tr> +</table> + Modified: gate/trunk/plugins/Lang_German/resources/.german-pipeline-metadata/long-desc.html =================================================================== --- gate/trunk/plugins/Lang_German/resources/.german-pipeline-metadata/long-desc.html 2014-03-24 19:35:52 UTC (rev 17732) +++ gate/trunk/plugins/Lang_German/resources/.german-pipeline-metadata/long-desc.html 2014-03-24 20:45:07 UTC (rev 17733) @@ -1,4 +1,51 @@ <p>A named entity recognition pipeline that identifies basic entity types, such as <em>Person</em>, <em>Location</em>, <em>Organization</em>, <em>Money</em> amounts, <em>Time</em> and <em>Date</em> expressions. It works on documents -in the German language.</p> \ No newline at end of file +in the German language.</p> + +<table> + <tr> + <td colspan="2"><b>Default annotations</b></td> + </tr> + <tr> + <td><tt>:Person</tt></td> + <td rowspan="4">Standard named entity types</td> + </tr> + <tr> + <td><tt>:Location</tt></td> + </tr> + <tr> + <td><tt>:Organization</tt></td> + </tr> + <tr> + <td><tt>:Date</tt></td> + </tr> + <tr> + <td><tt>:Address</tt></td> + <td>Includes email and IP addresses as well as street addresses</td> + </tr> + <tr> + <td colspan="2"><b>Additional annotations available if selected</b></td> + </tr> + <tr> + <td><tt>:Money</tt></td> + <td>Monetary amounts</td> + </tr> + <tr> + <td><tt>:Percent</tt></td> + <td>Expressions representing percentages</td> + </tr> + <tr> + <td><tt>:Token</tt></td> + <td>The individual tokens of the text, with "category" feature for POS</td> + </tr> + <tr> + <td><tt>:SpaceToken</tt></td> + <td>The spaces between tokens</td> + </tr> + <tr> + <td><tt>:Sentence</tt></td> + <td>Sentences detected by the sentence splitter</td> + </tr> +</table> + Modified: gate/trunk/plugins/Lang_Romanian/resources/.romanian-pipeline-metadata/long-desc.html =================================================================== --- gate/trunk/plugins/Lang_Romanian/resources/.romanian-pipeline-metadata/long-desc.html 2014-03-24 19:35:52 UTC (rev 17732) +++ gate/trunk/plugins/Lang_Romanian/resources/.romanian-pipeline-metadata/long-desc.html 2014-03-24 20:45:07 UTC (rev 17733) @@ -1,4 +1,51 @@ <p>A named entity recognition pipeline that identifies basic entity types, such as <em>Person</em>, <em>Location</em>, <em>Organization</em>, <em>Money</em> amounts, <em>Time</em> and <em>Date</em> expressions. It works on documents -in the Romanian language.</p> \ No newline at end of file +in the Romanian language.</p> + +<table> + <tr> + <td colspan="2"><b>Default annotations</b></td> + </tr> + <tr> + <td><tt>:Person</tt></td> + <td rowspan="4">Standard named entity types</td> + </tr> + <tr> + <td><tt>:Location</tt></td> + </tr> + <tr> + <td><tt>:Organization</tt></td> + </tr> + <tr> + <td><tt>:Date</tt></td> + </tr> + <tr> + <td><tt>:Address</tt></td> + <td>Includes email and IP addresses as well as street addresses</td> + </tr> + <tr> + <td colspan="2"><b>Additional annotations available if selected</b></td> + </tr> + <tr> + <td><tt>:Money</tt></td> + <td>Monetary amounts</td> + </tr> + <tr> + <td><tt>:Percent</tt></td> + <td>Expressions representing percentages</td> + </tr> + <tr> + <td><tt>:Token</tt></td> + <td>The individual tokens of the text, with "category" feature for POS</td> + </tr> + <tr> + <td><tt>:SpaceToken</tt></td> + <td>The spaces between tokens</td> + </tr> + <tr> + <td><tt>:Sentence</tt></td> + <td>Sentences detected by the sentence splitter</td> + </tr> +</table> + Modified: gate/trunk/plugins/Lang_Russian/resources/.russie-inflex-metadata/long-desc.html =================================================================== --- gate/trunk/plugins/Lang_Russian/resources/.russie-inflex-metadata/long-desc.html 2014-03-24 19:35:52 UTC (rev 17732) +++ gate/trunk/plugins/Lang_Russian/resources/.russie-inflex-metadata/long-desc.html 2014-03-24 20:45:07 UTC (rev 17733) @@ -5,3 +5,58 @@ <p>This version of the pipeline includes an <em>inflexional gazetteer</em> to recognise more morphological variants of target names.</p> + +<table> + <tr> + <td colspan="2"><b>Default annotations</b></td> + </tr> + <tr> + <td><tt>:Person</tt></td> + <td rowspan="4">Standard named entity types</td> + </tr> + <tr> + <td><tt>:Location</tt></td> + </tr> + <tr> + <td><tt>:Organization</tt></td> + </tr> + <tr> + <td><tt>:Date</tt></td> + </tr> + <tr> + <td><tt>:Address</tt></td> + <td>Includes email and IP addresses as well as street addresses</td> + </tr> + <tr> + <td colspan="2"><b>Additional annotations available if selected</b></td> + </tr> + <tr> + <td><tt>:Money</tt></td> + <td>Monetary amounts</td> + </tr> + <tr> + <td><tt>:Percent</tt></td> + <td>Expressions representing percentages</td> + </tr> + <tr> + <td><tt>:Token</tt></td> + <td>The individual tokens of the text, with "category" feature for POS</td> + </tr> + <tr> + <td><tt>:SpaceToken</tt></td> + <td>The spaces between tokens</td> + </tr> + <tr> + <td><tt>:Sentence</tt></td> + <td>Sentences detected by the sentence splitter</td> + </tr> + <tr> + <td><tt>:Lookup</tt></td> + <td>Individual gazetteer lookups – for those lookups that come from the inflectional gazetteer this includes a "lemma" feature giving the base word form</td> + </tr> + <tr> + <td><tt>:MSD</tt></td> + <td>"Morpho-Syntactic Description" for selected tokens, including features for "lemma" (the base form of inflected words) and "type" (roughly equivalent to a part of speech tag in English, though more complex as it encodes features such as gender, grammatical case, etc.)</td> + </tr> +</table> + Modified: gate/trunk/plugins/Lang_Russian/resources/.russie-metadata/long-desc.html =================================================================== --- gate/trunk/plugins/Lang_Russian/resources/.russie-metadata/long-desc.html 2014-03-24 19:35:52 UTC (rev 17732) +++ gate/trunk/plugins/Lang_Russian/resources/.russie-metadata/long-desc.html 2014-03-24 20:45:07 UTC (rev 17733) @@ -2,3 +2,57 @@ as <em>Person</em>, <em>Location</em>, <em>Organization</em>, <em>Money</em> amounts, <em>Time</em> and <em>Date</em> expressions. It works on documents in the Russian language.</p> + +<table> + <tr> + <td colspan="2"><b>Default annotations</b></td> + </tr> + <tr> + <td><tt>:Person</tt></td> + <td rowspan="4">Standard named entity types</td> + </tr> + <tr> + <td><tt>:Location</tt></td> + </tr> + <tr> + <td><tt>:Organization</tt></td> + </tr> + <tr> + <td><tt>:Date</tt></td> + </tr> + <tr> + <td><tt>:Address</tt></td> + <td>Includes email and IP addresses as well as street addresses</td> + </tr> + <tr> + <td colspan="2"><b>Additional annotations available if selected</b></td> + </tr> + <tr> + <td><tt>:Money</tt></td> + <td>Monetary amounts</td> + </tr> + <tr> + <td><tt>:Percent</tt></td> + <td>Expressions representing percentages</td> + </tr> + <tr> + <td><tt>:Token</tt></td> + <td>The individual tokens of the text, with "category" feature for POS</td> + </tr> + <tr> + <td><tt>:SpaceToken</tt></td> + <td>The spaces between tokens</td> + </tr> + <tr> + <td><tt>:Sentence</tt></td> + <td>Sentences detected by the sentence splitter</td> + </tr> + <tr> + <td><tt>:Lookup</tt></td> + <td>Individual gazetteer lookups</td> + </tr> + <tr> + <td><tt>:MSD</tt></td> + <td>"Morpho-Syntactic Description" for selected tokens, including features for "lemma" (the base form of inflected words) and "type" (roughly equivalent to a part of speech tag in English, though more complex as it encodes features such as gender, grammatical case, etc.)</td> + </tr> +</table> Modified: gate/trunk/plugins/Lang_Russian/resources/.russie-ortho-inflex-metadata/long-desc.html =================================================================== --- gate/trunk/plugins/Lang_Russian/resources/.russie-ortho-inflex-metadata/long-desc.html 2014-03-24 19:35:52 UTC (rev 17732) +++ gate/trunk/plugins/Lang_Russian/resources/.russie-ortho-inflex-metadata/long-desc.html 2014-03-24 20:45:07 UTC (rev 17733) @@ -7,3 +7,57 @@ recognise more morphological variants of target names, and an <em>orthomatcher</em> to perform basic coreference resolution based on orthographic similarity.</p> + +<table> + <tr> + <td colspan="2"><b>Default annotations</b></td> + </tr> + <tr> + <td><tt>:Person</tt></td> + <td rowspan="4">Standard named entity types</td> + </tr> + <tr> + <td><tt>:Location</tt></td> + </tr> + <tr> + <td><tt>:Organization</tt></td> + </tr> + <tr> + <td><tt>:Date</tt></td> + </tr> + <tr> + <td><tt>:Address</tt></td> + <td>Includes email and IP addresses as well as street addresses</td> + </tr> + <tr> + <td colspan="2"><b>Additional annotations available if selected</b></td> + </tr> + <tr> + <td><tt>:Money</tt></td> + <td>Monetary amounts</td> + </tr> + <tr> + <td><tt>:Percent</tt></td> + <td>Expressions representing percentages</td> + </tr> + <tr> + <td><tt>:Token</tt></td> + <td>The individual tokens of the text, with "category" feature for POS</td> + </tr> + <tr> + <td><tt>:SpaceToken</tt></td> + <td>The spaces between tokens</td> + </tr> + <tr> + <td><tt>:Sentence</tt></td> + <td>Sentences detected by the sentence splitter</td> + </tr> + <tr> + <td><tt>:Lookup</tt></td> + <td>Individual gazetteer lookups – for those lookups that come from the inflectional gazetteer this includes a "lemma" feature giving the base word form</td> + </tr> + <tr> + <td><tt>:MSD</tt></td> + <td>"Morpho-Syntactic Description" for selected tokens, including features for "lemma" (the base form of inflected words) and "type" (roughly equivalent to a part of speech tag in English, though more complex as it encodes features such as gender, grammatical case, etc.)</td> + </tr> +</table> Modified: gate/trunk/plugins/Lang_Russian/resources/.russie-ortho-metadata/long-desc.html =================================================================== --- gate/trunk/plugins/Lang_Russian/resources/.russie-ortho-metadata/long-desc.html 2014-03-24 19:35:52 UTC (rev 17732) +++ gate/trunk/plugins/Lang_Russian/resources/.russie-ortho-metadata/long-desc.html 2014-03-24 20:45:07 UTC (rev 17733) @@ -5,3 +5,57 @@ <p>This version of the pipeline includes an <em>orthomatcher</em> to perform basic coreference resolution based on orthographic similarity.</p> + +<table> + <tr> + <td colspan="2"><b>Default annotations</b></td> + </tr> + <tr> + <td><tt>:Person</tt></td> + <td rowspan="4">Standard named entity types</td> + </tr> + <tr> + <td><tt>:Location</tt></td> + </tr> + <tr> + <td><tt>:Organization</tt></td> + </tr> + <tr> + <td><tt>:Date</tt></td> + </tr> + <tr> + <td><tt>:Address</tt></td> + <td>Includes email and IP addresses as well as street addresses</td> + </tr> + <tr> + <td colspan="2"><b>Additional annotations available if selected</b></td> + </tr> + <tr> + <td><tt>:Money</tt></td> + <td>Monetary amounts</td> + </tr> + <tr> + <td><tt>:Percent</tt></td> + <td>Expressions representing percentages</td> + </tr> + <tr> + <td><tt>:Token</tt></td> + <td>The individual tokens of the text, with "category" feature for POS</td> + </tr> + <tr> + <td><tt>:SpaceToken</tt></td> + <td>The spaces between tokens</td> + </tr> + <tr> + <td><tt>:Sentence</tt></td> + <td>Sentences detected by the sentence splitter</td> + </tr> + <tr> + <td><tt>:Lookup</tt></td> + <td>Individual gazetteer lookups</td> + </tr> + <tr> + <td><tt>:MSD</tt></td> + <td>"Morpho-Syntactic Description" for selected tokens, including features for "lemma" (the base form of inflected words) and "type" (roughly equivalent to a part of speech tag in English, though more complex as it encodes features such as gender, grammatical case, etc.)</td> + </tr> +</table> Modified: gate/trunk/plugins/OpenNLP/resources/.opennlp-de-metadata/long-desc.html =================================================================== --- gate/trunk/plugins/OpenNLP/resources/.opennlp-de-metadata/long-desc.html 2014-03-24 19:35:52 UTC (rev 17732) +++ gate/trunk/plugins/OpenNLP/resources/.opennlp-de-metadata/long-desc.html 2014-03-24 20:45:07 UTC (rev 17733) @@ -3,3 +3,17 @@ <a href="http://opennlp.apache.org/">Apache OpenNLP</a>. The components are based on the maxent machine learning algorithm, and produce Token and Sentence annotations in a form compatible with other standard GATE tools.</p> + +<table> + <tr> + <td colspan="2"><b>Default annotations</b></td> + </tr> + <tr> + <td><tt>:Token</tt></td> + <td>The individual tokens of the text, with "category" feature for POS</td> + </tr> + <tr> + <td><tt>:Sentence</tt></td> + <td>Sentences detected by the sentence splitter</td> + </tr> +</table> Modified: gate/trunk/plugins/OpenNLP/resources/.opennlp-metadata/long-desc.html =================================================================== --- gate/trunk/plugins/OpenNLP/resources/.opennlp-metadata/long-desc.html 2014-03-24 19:35:52 UTC (rev 17732) +++ gate/trunk/plugins/OpenNLP/resources/.opennlp-metadata/long-desc.html 2014-03-24 20:45:07 UTC (rev 17733) @@ -4,3 +4,45 @@ <a href="http://opennlp.apache.org/">Apache OpenNLP</a>. The components are based on the maxent machine learning algorithm, and produce Token and Sentence annotations in a form compatible with other standard GATE tools.</p> + +<table> + <tr> + <td colspan="2"><b>Default annotations</b></td> + </tr> + <tr> + <td><tt>:Person</tt></td> + <td rowspan="4">Standard named entity types</td> + </tr> + <tr> + <td><tt>:Location</tt></td> + </tr> + <tr> + <td><tt>:Organization</tt></td> + </tr> + <tr> + <td><tt>:Date</tt></td> + </tr> + <tr> + <td colspan="2"><b>Additional annotations available if selected</b></td> + </tr> + <tr> + <td><tt>:Money</tt></td> + <td>Monetary amounts</td> + </tr> + <tr> + <td><tt>:Percentage</tt></td> + <td>Expressions representing percentages</td> + </tr> + <tr> + <td><tt>:Time</tt></td> + <td>Time expressions</td> + </tr> + <tr> + <td><tt>:Sentence</tt></td> + <td>Sentences detected by the sentence splitter</td> + </tr> + <tr> + <td><tt>:Token</tt></td> + <td>The individual tokens of the text, with "category" feature for POS and "chunk" feature for for the I/O/B-style chunk tags. Complete chunks derived from the tags are also available as their respective annotation types (e.g. a sequence of tokens tagged B-NP, I-NP, I-NP gives rise to an "NP" annotation spanning the sequence).</td> + </tr> +</table> Modified: gate/trunk/plugins/OpenNLP/resources/.opennlp-nl-metadata/long-desc.html =================================================================== --- gate/trunk/plugins/OpenNLP/resources/.opennlp-nl-metadata/long-desc.html 2014-03-24 19:35:52 UTC (rev 17732) +++ gate/trunk/plugins/OpenNLP/resources/.opennlp-nl-metadata/long-desc.html 2014-03-24 20:45:07 UTC (rev 17733) @@ -4,3 +4,45 @@ <a href="http://opennlp.apache.org/">Apache OpenNLP</a>. The components are based on the maxent machine learning algorithm, and produce Token and Sentence annotations in a form compatible with other standard GATE tools.</p> + +<table> + <tr> + <td colspan="2"><b>Default annotations</b></td> + </tr> + <tr> + <td><tt>:Person</tt></td> + <td rowspan="4">Standard named entity types</td> + </tr> + <tr> + <td><tt>:Location</tt></td> + </tr> + <tr> + <td><tt>:Organization</tt></td> + </tr> + <tr> + <td><tt>:Date</tt></td> + </tr> + <tr> + <td colspan="2"><b>Additional annotations available if selected</b></td> + </tr> + <tr> + <td><tt>:Money</tt></td> + <td>Monetary amounts</td> + </tr> + <tr> + <td><tt>:Percentage</tt></td> + <td>Expressions representing percentages</td> + </tr> + <tr> + <td><tt>:Time</tt></td> + <td>Time expressions</td> + </tr> + <tr> + <td><tt>:Sentence</tt></td> + <td>Sentences detected by the sentence splitter</td> + </tr> + <tr> + <td><tt>:Token</tt></td> + <td>The individual tokens of the text, with "category" feature for POS and "chunk" feature for for the I/O/B-style chunk tags. Complete chunks derived from the tags are also available as their respective annotation types (e.g. a sequence of tokens tagged B-NP, I-NP, I-NP gives rise to an "NP" annotation spanning the sequence).</td> + </tr> +</table> Modified: gate/trunk/plugins/Tagger_Framework/resources/AbGene/.abgene-metadata/long-desc.html =================================================================== --- gate/trunk/plugins/Tagger_Framework/resources/AbGene/.abgene-metadata/long-desc.html 2014-03-24 19:35:52 UTC (rev 17732) +++ gate/trunk/plugins/Tagger_Framework/resources/AbGene/.abgene-metadata/long-desc.html 2014-03-24 20:45:07 UTC (rev 17733) @@ -6,3 +6,13 @@ <p>For full details, see Tanabe and Wilbur (2002) "Tagging gene and protein names in biomedical text", <em>Bioinformatics</em> 18(8):1124&endash;1132, <a href="http://dx.doi.org/10.1093/bioinformatics/18.8.1124">doi:10.1093/bioinformatics/18.8.1124</a>.</p> + +<table> + <tr> + <td colspan="2"><b>Default annotations</b></td> + </tr> + <tr> + <td><tt>:Gene</tt></td> + <td>Expressions denoting genes</td> + </tr> +</table> Modified: gate/trunk/plugins/Tagger_Framework/resources/Hunpos/.hunpos-en-metadata/long-desc.html =================================================================== --- gate/trunk/plugins/Tagger_Framework/resources/Hunpos/.hunpos-en-metadata/long-desc.html 2014-03-24 19:35:52 UTC (rev 17732) +++ gate/trunk/plugins/Tagger_Framework/resources/Hunpos/.hunpos-en-metadata/long-desc.html 2014-03-24 20:45:07 UTC (rev 17733) @@ -3,3 +3,24 @@ <p>This application tags English language text using the <a href="http://code.google.com/p/hunpos/downloads/detail?name=en_wsj.model.gz">en_wsj</a> model from the Hunpos distribution.</p> + +<table> + <tr> + <td colspan="2"><b>Default annotations</b></td> + </tr> + <tr> + <td><tt>:Token</tt></td> + <td>The individual tokens of the text, with "category" feature for POS</td> + </tr> + <tr> + <td colspan="2"><b>Additional annotations available if selected</b></td> + </tr> + <tr> + <td><tt>:SpaceToken</tt></td> + <td>The spaces between tokens</td> + </tr> + <tr> + <td><tt>:Sentence</tt></td> + <td>Sentences detected by the sentence splitter</td> + </tr> +</table> Modified: gate/trunk/plugins/Tagger_Framework/resources/Hunpos/.hunpos-hu-metadata/long-desc.html =================================================================== --- gate/trunk/plugins/Tagger_Framework/resources/Hunpos/.hunpos-hu-metadata/long-desc.html 2014-03-24 19:35:52 UTC (rev 17732) +++ gate/trunk/plugins/Tagger_Framework/resources/Hunpos/.hunpos-hu-metadata/long-desc.html 2014-03-24 20:45:07 UTC (rev 17733) @@ -3,3 +3,24 @@ <p>This application tags Hungarian language text using the <a href="http://code.google.com/p/hunpos/downloads/detail?name=hu_szeged_kr.model.gz">hu_szeged_kr</a> model from the Hunpos distribution.</p> + +<table> + <tr> + <td colspan="2"><b>Default annotations</b></td> + </tr> + <tr> + <td><tt>:Token</tt></td> + <td>The individual tokens of the text, with "category" feature for POS</td> + </tr> + <tr> + <td colspan="2"><b>Additional annotations available if selected</b></td> + </tr> + <tr> + <td><tt>:SpaceToken</tt></td> + <td>The spaces between tokens</td> + </tr> + <tr> + <td><tt>:Sentence</tt></td> + <td>Sentences detected by the sentence splitter</td> + </tr> +</table> Modified: gate/trunk/plugins/Tagger_Measurements/resources/.annie-measurements-metadata/long-desc.html =================================================================== --- gate/trunk/plugins/Tagger_Measurements/resources/.annie-measurements-metadata/long-desc.html 2014-03-24 19:35:52 UTC (rev 17732) +++ gate/trunk/plugins/Tagger_Measurements/resources/.annie-measurements-metadata/long-desc.html 2014-03-24 20:45:07 UTC (rev 17733) @@ -6,3 +6,57 @@ <p>This pipeline combines the basic ANNIE named entity system with taggers to recognise numeric expressions (digits and words) and to annotate and normalise measurement expressions with features giving their value in SI units.</p> + +<table> + <tr> + <td colspan="2"><b>Default annotations</b></td> + </tr> + <tr> + <td><tt>:Person</tt></td> + <td rowspan="4">Standard named entity types</td> + </tr> + <tr> + <td><tt>:Location</tt></td> + </tr> + <tr> + <td><tt>:Organization</tt></td> + </tr> + <tr> + <td><tt>:Date</tt></td> + </tr> + <tr> + <td><tt>:Address</tt></td> + <td>Includes email and IP addresses as well as street addresses</td> + </tr> + <tr> + <td><tt>:Measurement</tt></td> + <td>Measurement expressions, with features giving the value and unit of the measurement, both in the original form specified in the document and in a form normalized to SI units</td> + </tr> + <tr> + <td colspan="2"><b>Additional annotations available if selected</b></td> + </tr> + <tr> + <td><tt>:Money</tt></td> + <td>Monetary amounts</td> + </tr> + <tr> + <td><tt>:Percent</tt></td> + <td>Expressions representing percentages</td> + </tr> + <tr> + <td><tt>:Token</tt></td> + <td>The individual tokens of the text, with "category" feature for POS</td> + </tr> + <tr> + <td><tt>:SpaceToken</tt></td> + <td>The spaces between tokens</td> + </tr> + <tr> + <td><tt>:Sentence</tt></td> + <td>Sentences detected by the sentence splitter</td> + </tr> + <tr> + <td><tt>:Ratio</tt></td> + <td>Expressions denoting a ratio rather than a simple measurement, typically percentages but also expressions like "300 parts per million"</td> + </tr> +</table> Modified: gate/trunk/plugins/Tagger_Measurements/resources/.measurements-metadata/long-desc.html =================================================================== --- gate/trunk/plugins/Tagger_Measurements/resources/.measurements-metadata/long-desc.html 2014-03-24 19:35:52 UTC (rev 17732) +++ gate/trunk/plugins/Tagger_Measurements/resources/.measurements-metadata/long-desc.html 2014-03-24 20:45:07 UTC (rev 17733) @@ -6,3 +6,58 @@ one unit matching results expressed in another.</p> <p>As a side-effect this pipeline also annotates tokens and sentences.</p> + +<table> + <tr> + <td colspan="4"><b>Default annotations</b></td> + </tr> + <tr> + <td colspan="2"><tt>:Measurement</tt></td> + <td colspan="2">Measurement expressions, with features:</td> + </tr> + <tr> + <td> </td> + <td colspan="2"><tt>type</tt></td> + <td>"scalar" for single measurements, or "interval" for intervals (e.g. "1 to 5 pounds")</td> + </tr> + <tr> + <td> </td> + <td colspan="2"><tt>unit</tt></td> + <td>The unit of the measurement (gram, mile, ...)</td> + </tr> + <tr> + <td> </td> + <td colspan="2"><tt>value</tt></td> + <td>The numeric value of the measurement quantity as specified in the text</td> + </tr> + <tr> + <td> </td> + <td colspan="2"><tt>normalizedUnit</tt></td> + <td>The "normalized" unit for the measurement in the SI system (kilogram, metre, etc.)</td> + </tr> + <tr> + <td> </td> + <td colspan="2"><tt>normalizedValue</tt></td> + <td>The equivalent value of the measurement in the normalized unit. For interval measurements this is replaced by a "normalizedMaxValue" and "normalizedMinValue" giving the end-points of the interval.</td> + </tr> + <tr> + <td> </td> + <td colspan="2"><tt>dimension</tt></td> + <td>Speed, volume, area, time, etc.</td> + </tr> + <tr> + <td colspan="4"><b>Additional annotations available if selected</b></td> + </tr> + <tr> + <td colspan="2"><tt>:Sentence</tt></td> + <td colspan="2">Sentences detected by the sentence splitter</td> + </tr> + <tr> + <td colspan="2"><tt>:Token</tt></td> + <td colspan="2">The individual tokens of the text</td> + </tr> + <tr> + <td colspan="2"><tt>:Ratio</tt></td> + <td colspan="2">Expressions denoting a ratio rather than a simple measurement, typically percentages but also expressions like "300 parts per million"</td> + </tr> +</table> Modified: gate/trunk/plugins/Tagger_NP_Chunking/.np-chunker-metadata/long-desc.html =================================================================== --- gate/trunk/plugins/Tagger_NP_Chunking/.np-chunker-metadata/long-desc.html 2014-03-24 19:35:52 UTC (rev 17732) +++ gate/trunk/plugins/Tagger_NP_Chunking/.np-chunker-metadata/long-desc.html 2014-03-24 20:45:07 UTC (rev 17733) @@ -2,3 +2,28 @@ phrases with a <em>NounChunk</em> annotation. This application also includes a tokeniser, sentence splitter and POS tagger as these are required by the chunking algorithm.</p> + +<table> + <tr> + <td colspan="2"><b>Default annotations</b></td> + </tr> + <tr> + <td><tt>:NounChunk</tt></td> + <td>Noun chunks discovered by the chunker</td> + </tr> + <tr> + <td colspan="2"><b>Additional annotations available if selected</b></td> + </tr> + <tr> + <td><tt>:Token</tt></td> + <td>The individual tokens of the text, with "category" feature for POS</td> + </tr> + <tr> + <td><tt>:SpaceToken</tt></td> + <td>The spaces between tokens</td> + </tr> + <tr> + <td><tt>:Sentence</tt></td> + <td>Sentences detected by the sentence splitter</td> + </tr> +</table> Modified: gate/trunk/plugins/Tagger_PennBio/resources/.pennbio-metadata/long-desc.html =================================================================== --- gate/trunk/plugins/Tagger_PennBio/resources/.pennbio-metadata/long-desc.html 2014-03-24 19:35:52 UTC (rev 17732) +++ gate/trunk/plugins/Tagger_PennBio/resources/.pennbio-metadata/long-desc.html 2014-03-24 20:45:07 UTC (rev 17733) @@ -13,3 +13,40 @@ </ul> </li> </ul> + +<table> + <tr> + <td colspan="2"><b>Default annotations</b></td> + </tr> + <tr> + <td><tt>:gene</tt></td> + <td>Expressions denoting genes</td> + </tr> + <tr> + <td><tt>:malignancy-type</tt></td> + <td>Expressions denoting malignancy types</td> + </tr> + <tr> + <td><tt>:location</tt></td> + <td rowspan="5">Expressions relating to genomic variation</td> + </tr> + <tr> + <td><tt>:state-original</tt></td> + </tr> + <tr> + <td><tt>:state-altered</tt></td> + </tr> + <tr> + <td><tt>:variation</tt></td> + </tr> + <tr> + <td><tt>:type</tt></td> + </tr> + <tr> + <td colspan="2"><b>Additional annotations available if selected</b></td> + </tr> + <tr> + <td><tt>:Token</tt></td> + <td>The individual tokens of the text</td> + </tr> +</table> Modified: gate/trunk/plugins/Twitter/resources/.twitie-en-metadata/long-desc.html =================================================================== --- gate/trunk/plugins/Twitter/resources/.twitie-en-metadata/long-desc.html 2014-03-24 19:35:52 UTC (rev 17732) +++ gate/trunk/plugins/Twitter/resources/.twitie-en-metadata/long-desc.html 2014-03-24 20:45:07 UTC (rev 17733) @@ -10,3 +10,65 @@ <em>Person</em>, <em>Location</em>, <em>Organization</em>, <em>Money</em> amounts, <em>Time</em> and <em>Date</em> expressions.</li> </ul> + +<table> + <tr> + <td colspan="2"><b>Default annotations</b></td> + </tr> + <tr> + <td><tt>:Person</tt></td> + <td rowspan="4">Standard named entity types</td> + </tr> + <tr> + <td><tt>:Location</tt></td> + </tr> + <tr> + <td><tt>:Organization</tt></td> + </tr> + <tr> + <td><tt>:Date</tt></td> + </tr> + <tr> + <td><tt>:Address</tt></td> + <td>Includes email and IP addresses as well as street addresses</td> + </tr> + <tr> + <td><tt>:Token</tt></td> + <td>The individual tokens of the text, with "category" feature for POS</td> + </tr> + <tr> + <td><tt>:Emoticon</tt></td> + <td>Emoticons such as <tt>:-)</tt></td> + </tr> + <tr> + <td><tt>:Hashtag</tt></td> + <td>Hashtags, including the leading # character</td> + </tr> + <tr> + <td><tt>:URL</tt></td> + <td>URL mentions</td> + </tr> + <tr> + <td><tt>:UserID</tt></td> + <td>The username part of @user mentions, <em>not</em> including the leading @ sign</td> + </tr> + <tr> + <td colspan="2"><b>Additional annotations available if selected</b></td> + </tr> + <tr> + <td><tt>:Money</tt></td> + <td>Monetary amounts</td> + </tr> + <tr> + <td><tt>:Percent</tt></td> + <td>Expressions representing percentages</td> + </tr> + <tr> + <td><tt>:SpaceToken</tt></td> + <td>The spaces between tokens</td> + </tr> + <tr> + <td><tt>:Sentence</tt></td> + <td>Sentences detected by the sentence splitter</td> + </tr> +</table> This was sent by the SourceForge.net collaborative development platform, the world's largest Open Source development site. ------------------------------------------------------------------------------ Learn Graph Databases - Download FREE O'Reilly Book "Graph Databases" is the definitive new guide to graph databases and their applications. Written by three acclaimed leaders in the field, this first edition is now available. Download your free book today! http://p.sf.net/sfu/13534_NeoTech _______________________________________________ GATE-cvs mailing list GATE-cvs@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/gate-cvs