Revision: 18718
          http://sourceforge.net/p/gate/code/18718
Author:   markagreenwood
Date:     2015-05-28 09:12:09 +0000 (Thu, 28 May 2015)
Log Message:
-----------
documentation for the DataSift format plugin

Modified Paths:
--------------
    userguide/trunk/misc-creole.tex
    userguide/trunk/recent-changes.tex

Modified: userguide/trunk/misc-creole.tex
===================================================================
--- userguide/trunk/misc-creole.tex     2015-05-28 08:45:57 UTC (rev 18717)
+++ userguide/trunk/misc-creole.tex     2015-05-28 09:12:09 UTC (rev 18718)
@@ -3275,6 +3275,20 @@
 }} output handler is also provided by the \verb!Format_FastInfoset! plugin.
 %
 %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
+\sect[sec:creole:datasift]{DataSift Document Format}
+%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
+%
+The \verb!Format_DataSift! plugin provides support for loading JSON files in the
+\htlink{http://datasift.com/}{DataSift} format into GATE. The format will
+automatically be used when loading documents with the \verb!datasift.json!
+extension or when the MIME type is explicitly set to \verb!text/x-json-datasift!.
+
+Documents loaded using this plugin are constructed by concatenating the
+\verb!content! property of each \verb!Interaction! map within the JSON file.
+An \verb!Interaction! annotation is created over each corresponding text span,
+and all other associated data is added to that annotation's FeatureMap.
+%
+%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
 \sect[sec:creole:csv]{CSV Document Support}
 %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
 %
@@ -3286,7 +3300,7 @@
 CSV files within the directory), which column contains the text data (note that
 the columns are numbered from 0 upwards), if the first row contains column labels,
 and if one GATE document should be created per CSV file or per row within a file.
-%
+
 %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
 \sect[sec:creole:termraider]{TermRaider term extraction tools}
 %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%

Modified: userguide/trunk/recent-changes.tex
===================================================================
--- userguide/trunk/recent-changes.tex  2015-05-28 08:45:57 UTC (rev 18717)
+++ userguide/trunk/recent-changes.tex  2015-05-28 09:12:09 UTC (rev 18718)
@@ -85,6 +85,7 @@
 data available'' would appear to annotators, this has now been fixed.
 
 A new plugin to allow for the easy use of JSON data produced by \htlink{http://datasift.com/}{DataSift}.
+See Section~\ref{sec:creole:datasift} for full usage details.
 
 \rcSubsect{May 2014}
 
@@ -146,7 +147,7 @@
 \item Integration of the Stanford POS Tagger
   (section~\ref{sec:misc:creole:stanford}), which is used by TwitIE
 \item A document normalizer plugin, predominantly to normalize punctuation such
-  as Microsoft Word ``smart quotes'' (see
+  as Microsoft Word ``smart quotes'' (see
   section~\ref{sec:misc-creole:doc-normalizer})
 \item Wrappers for the \emph{AlchemyAPI} keyword and entity extraction services
   (in the \verb!AlchemyAPI! plugin)
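
The document construction described in the new DataSift section (concatenate the content of each Interaction, annotate each span, keep the remaining data as features) can be illustrated with a small standalone sketch. This is not the plugin's code. The input layout (one JSON object per line, each holding an "interaction" map with a "content" string), the blank-line separator, and the names build_document and separator are all assumptions made for illustration.

```python
import json


def build_document(json_text, separator="\n\n"):
    """Concatenate interaction contents and record one span per interaction.

    Assumption (not confirmed by the plugin documentation): the input holds
    one JSON object per line, each with an "interaction" map that carries a
    "content" string. The separator between contents is also an assumption.
    """
    text_parts = []
    annotations = []
    offset = 0
    for line in json_text.splitlines():
        line = line.strip()
        if not line:
            continue  # skip blank lines between records
        interaction = json.loads(line).get("interaction", {})
        content = interaction.get("content")
        if content is None:
            continue  # nothing to add to the document text
        if text_parts:
            offset += len(separator)  # account for the joining separator
        start, end = offset, offset + len(content)
        # Everything except the text itself is kept as annotation features.
        features = {k: v for k, v in interaction.items() if k != "content"}
        annotations.append(
            {"type": "Interaction", "start": start, "end": end, "features": features}
        )
        text_parts.append(content)
        offset = end
    return separator.join(text_parts), annotations


if __name__ == "__main__":
    sample = (
        '{"interaction": {"content": "Hello", "id": "a1"}}\n'
        '{"interaction": {"content": "World!", "id": "b2"}}\n'
    )
    text, anns = build_document(sample)
    for ann in anns:
        print(ann["type"], ann["start"], ann["end"], repr(text[ann["start"]:ann["end"]]))
```

Each recorded span indexes back into the concatenated text, which is the property the Interaction annotations rely on. The real plugin may join contents differently or read a different JSON layout.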
