Revision: 17095
http://sourceforge.net/p/gate/code/17095
Author: adamfunk
Date: 2013-11-14 15:46:17 +0000 (Thu, 14 Nov 2013)
Log Message:
-----------
Updating the Twitter plugin documentation.
Modified Paths:
--------------
userguide/trunk/misc-creole.tex
Modified: userguide/trunk/misc-creole.tex
===================================================================
--- userguide/trunk/misc-creole.tex 2013-11-14 15:21:56 UTC (rev 17094)
+++ userguide/trunk/misc-creole.tex 2013-11-14 15:46:17 UTC (rev 17095)
@@ -3168,28 +3168,38 @@
}} output handler is also provided by the \verb!Format_FastInfoset! plugin.
%
%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
-\sect[sec:creole:tweet]{Twitter JSON Format}
+\sect[sec:creole:tweet]{Twitter processing}
%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
+The \verb!Twitter! plugin contains several tools useful for processing tweets.
+This plugin depends on the \verb!Tagger_Stanford! plugin, which must be loaded
+first.
%
+%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
+\subsect[sec:creole:tweetformat]{Twitter JSON format}
+%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
+%
This plugin contains a format analyser for JSON files from the Twitter
API\footnote{\url{https://dev.twitter.com/docs/platform-objects/tweets}}.
+Loading the plugin registers the document format with GATE, so that it will be
+automatically associated with files whose names end in ``\verb!.json!'';
+otherwise you need to specify \verb!text/x-json-twitter! for the document
+mimeType parameter. This will work both when directly creating a single new
+GATE document and when populating a corpus.
-To use it, simply load the \verb!Format_Twitter! plugin; this will register the
-document format with GATE. The format will be automatically associated with
-files whose names end in ``\verb!.json!''; otherwise you need to specify
-\verb!text/x-json-twitter! for the document mimeType parameter. This will work
-both when directly creating a single new GATE document and when populating a
-corpus.
-
Each tweet object's \verb!text! value is converted into the document content,
which is covered with a \emph{Tweet} annotation whose features represent
(recursively when appropriate, using \emph{HashMap} and \emph{List}) all the
other key-value pairs in the tweet object. \textbf{Note:} these recursive
-values are difficult to work with in JAPE and will probably be changed in the
-near future.
+values are difficult to work with in JAPE; the corpus population tool described
+next allows important key-sequences to be ``brought up'' to the top level of the
+annotation features.
Multiple tweet objects in the same JSON file are separated by blank lines (which
are not covered by \emph{Tweet} annotations).
+%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
+\subsect[sec:creole:population]{Corpus population from JSON files}
+%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
+\ldots
%
%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
\sect[sec:creole:termraider]{TermRaider term extraction tools}
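
The mapping described in the Twitter JSON format subsection above can be illustrated with a small standalone sketch. This is not GATE code and does not use the GATE API; it only mimics the documented behaviour in plain Python: each tweet's text value becomes document content, the remaining key-value pairs become (possibly nested) features of a Tweet annotation, and tweets are separated by a blank line that no annotation covers. The function names, the dictionary layout of an annotation, and the ":" joiner used when "bringing up" a key-sequence are all assumptions made for illustration.

```python
import json


def tweets_to_document(tweets):
    """Build (content, annotations) from already-parsed tweet objects.

    Hypothetical helper: mimics the documented mapping, not GATE's code.
    """
    separator = "\n\n"  # assumption: exactly one blank line between tweets
    parts = []
    annotations = []
    offset = 0
    for tweet in tweets:
        text = tweet["text"]
        # Every key except "text" becomes a feature; nested objects and
        # arrays stay nested (dict and list here, HashMap and List in GATE).
        features = {key: value for key, value in tweet.items() if key != "text"}
        if parts:
            offset += len(separator)  # the separator is left uncovered
        annotations.append({
            "type": "Tweet",
            "start": offset,
            "end": offset + len(text),
            "features": features,
        })
        parts.append(text)
        offset += len(text)
    return separator.join(parts), annotations


def bring_up(features, key_sequence, joiner=":"):
    """Copy one nested value to the top level of a feature map.

    The joined feature name (e.g. "user:name") is an illustrative choice.
    """
    value = features
    for key in key_sequence:
        value = value[key]
    flattened = dict(features)
    flattened[joiner.join(key_sequence)] = value
    return flattened


if __name__ == "__main__":
    raw = [
        '{"text": "hello", "user": {"name": "a"}}',
        '{"text": "bye", "id": 2}',
    ]
    content, annotations = tweets_to_document([json.loads(r) for r in raw])
    print(repr(content))
    for annotation in annotations:
        print(annotation)
    print(bring_up(annotations[0]["features"], ["user", "name"]))
```

Flattening a nested value to a single top-level feature is what makes it addressable from a simple feature test, which is the difficulty with recursive values that the revised note mentions for JAPE.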