Revision: 18718
http://sourceforge.net/p/gate/code/18718
Author: markagreenwood
Date: 2015-05-28 09:12:09 +0000 (Thu, 28 May 2015)
Log Message:
-----------
documentation for the DataSift format plugin
Modified Paths:
--------------
userguide/trunk/misc-creole.tex
userguide/trunk/recent-changes.tex
Modified: userguide/trunk/misc-creole.tex
===================================================================
--- userguide/trunk/misc-creole.tex 2015-05-28 08:45:57 UTC (rev 18717)
+++ userguide/trunk/misc-creole.tex 2015-05-28 09:12:09 UTC (rev 18718)
@@ -3275,6 +3275,20 @@
}} output handler is also provided by the \verb!Format_FastInfoset! plugin.
%
%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
+\sect[sec:creole:datasift]{DataSift Document Format}
+%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
+%
+The \verb!Format_DataSift! plugin provides support for loading JSON files in
the
+\htlink{http://datasift.com/}{DataSift} format into GATE. The format will
+automatically be used when loading documents with the \verb!datasift.json!
+extension of when the MIME type is explicityl set to
\verb!text/x-json-datasift!.
+
+Documents loaded using this plugin are constructed by conconcatenating the
+\verb!content! property of each \verb!Interaction! map within the JSON file.
+An \verb!Interaction! annotation is created over the relevant text spans and
+all other associated data is added to the annotations FeatureMap.
+%
+%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
\sect[sec:creole:csv]{CSV Document Support}
%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
%
@@ -3286,7 +3300,7 @@
CSV files within the directory), which column contains the text data (note that
the columns are numbered from 0 upwards), if the first row contains column
labels,
and if one GATE document should be created per CSV file or per row within a
file.
-%
+
%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
\sect[sec:creole:termraider]{TermRaider term extraction tools}
%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
Modified: userguide/trunk/recent-changes.tex
===================================================================
--- userguide/trunk/recent-changes.tex 2015-05-28 08:45:57 UTC (rev 18717)
+++ userguide/trunk/recent-changes.tex 2015-05-28 09:12:09 UTC (rev 18718)
@@ -85,6 +85,7 @@
data available'' would appear to annotators, this has now been fixed.
A new plugin to allow for the easy use of JSON data produced by
\htlink{http://datasift.com/}{DataSift}.
+See Section~\ref{sec:creole:datasift} for full usage details.
\rcSubsect{May 2014}
@@ -146,7 +147,7 @@
\item Integration of the Stanford POS Tagger
(section~\ref{sec:misc:creole:stanford}), which is used by TwitIE
\item A document normalizer plugin, predominantly to normalize punctuation such
- as Microsoft Word ``smart quotes'' (see
+ as Microsoft Word ``ssec:creole:datasiftmart quotes'' (see
section~\ref{sec:misc-creole:doc-normalizer})
\item Wrappers for the \emph{AlchemyAPI} keyword and entity extraction services
(in the \verb!AlchemyAPI! plugin)
This was sent by the SourceForge.net collaborative development platform, the
world's largest Open Source development site.
------------------------------------------------------------------------------
_______________________________________________
GATE-cvs mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/gate-cvs