Hi,
Sentiment analysis will not directly help machine translation. But, machine
translation can definitely help sentiment analysis. Most of the work in
sentiment analysis has been done in English only. After building the
sentiment analysis tool, we can integrate it with a translator to do
sentiment analysis for many languages. This is the idea that I have behind
this project.
Thus, it may be viewed as a new feature for Apertium machine translator.
Please share your ideas on this.
Cheers!
Anand Soni
On Mon, Apr 29, 2013 at 5:50 PM, <
[email protected]> wrote:
> Send Apertium-stuff mailing list submissions to
> [email protected]
>
> To subscribe or unsubscribe via the World Wide Web, visit
> https://lists.sourceforge.net/lists/listinfo/apertium-stuff
> or, via email, send a message with subject or body 'help' to
> [email protected]
>
> You can reach the person managing the list at
> [email protected]
>
> When replying, please edit your Subject line so it is more specific
> than "Re: Contents of Apertium-stuff digest..."
>
>
> Today's Topics:
>
> 1. [GSoC 2013] Interface for creating tagged corpora (Mihir Rege)
> 2. Idea for GSoC-2013 (Anand Soni)
> 3. Re: Idea for GSoC-2013 (Francis Tyers)
>
>
> ----------------------------------------------------------------------
>
> Message: 1
> Date: Sun, 28 Apr 2013 21:47:12 +0530
> From: Mihir Rege <[email protected]>
> Subject: [Apertium-stuff] [GSoC 2013] Interface for creating tagged
> corpora
> To: [email protected]
> Message-ID:
> <
> cabb7fqeesgxfw8qcuopcsjyq0r4u-hyzlm4hbyxgp+ty8na...@mail.gmail.com>
> Content-Type: text/plain; charset="iso-8859-1"
>
> Hi everybody,
>
> I have been working on the idea for creating an interface for tagged
> corpora since the past few days. I have finished the coding challenge, have
> gone through the documentation pdf and wikis and am currently working on
> designing the interface. I am posting the mockups for the interface so
> that I can get your opinion and improve on them.
>
> There are currently three major interfaces:
>
> a) Manual disambiguator
>
> b) .prob evaluator
>
> c) .tsx file editor
>
> I have put up all the mockups together at http://imgur.com/a/4uk4q#r5Ur8jTand
> have also put links in separate sections.
>
>
> *a) Manual disambiguator*
>
> Mockup: http://i.imgur.com/r5Ur8jT.png
>
> Functions:
>
> - Jump to next ambiguous lexical unit or adjacent lexical-unit using the
> keyboard or mouse.
> - A quick-view bound to a key, to hide the tags and show the raw text
> - If the .tsx file is provided, information like the coarse tags,
> forbid, enforce rules applicable can also be displayed.
> - Show statistics of disambiguation
> - Compile and apply constraint grammar rules to the buffer
> - List the applied constraint grammar rules
> - Train and test the tagger (a prompt will ask the part of the corpus to
> be used as testing data).
> - Train the tagger and export the .prob file
> - Save progress ( this will save the corpus and also create a project
> description file which will keep track of the morphological analyser,
> .tsx
> files used, so that it is easier to resume tagging)
> - The interface will be keyboard centric, though it will be equally
> functional with a mouse.
> - Default keymaps will be provided and the bindings can be changed to
> suit the user
>
> For example
>
> [P] - <previous-ambiguous>
>
> [N] - <next-ambiguous>
>
> [F] - <forward-word>
>
> [B] - <back-word>
>
> [1], [2],[3],[4] for choosing the correct lexical form.
>
> *Evaluating the tagger*
>
> Functions
>
>
> - The trained tagger can be evaluated immediately by having an option of
> setting aside x% of the corpus as testing data.
> - Else, it can be evaluated using the .prob evaluator using an unrelated
> corpus.
>
> *Loading the corpus*
>
> The available options are:
>
> 1.
>
> Load a raw-text file, morphological analyser and .tsx file (optional)
> 2.
>
> Continue on an existing project
> 3.
>
> Pull a wiki-dump and use it as the corpus [
> http://i.imgur.com/F9OXMs4.png ]
>
>
>
> *b) .prob evaluator*
>
> Mockup: http://i.imgur.com/fIo6rV9.png
>
> Functions
>
> - Input the .prob file , the manually disambiguated corpus along with
> morphologically analysed corpus or the morphological analyser for the
> language.
> - Evaluate the .prob file and display statistics about tagger accuracy
> - Generate a log file, which will basically be the diff between the
> provided tagged corpus and the corpus disambiguated by the tagger,
> making
> it easier to frame new sentences to add to the corpus, so as to give
> more
> context to the tagger
>
> *c) .tsx file editor*
>
> Mockup:
>
> TSX Viewer
>
> http://i.imgur.com/pVdsIem.png
>
> Templates
>
> http://i.imgur.com/hFGIQHR.png
>
> Functions:
>
> -
>
> Add new tags
> - categories
> - multi-categories
> - forbid
> - enforce
> - prefer
> - Templates for adding new tags
> - Change the order of the tags (as more specific categories must be
> defined before more general ones) within the same parent tag. The nodes
> in
> the xml viewer can also be made draggable within the same parent node
> to
> make it easier to change the order
> - Search within tags for faster navigation.
> - Validate the tagger definition
> - Editor features like syntax highlighting , auto-indentation and tag
> completion for manual editing in the Node Contents textview for complex
> in-place editing.
>
>
> Looking forward to hear from you. :)
>
>
> Regards,
>
> Mihir Rege,
>
> Second Year Undergraduate,
>
> Department of Computer Science and Engineering,
>
> IIT Kharagpur.
> -------------- next part --------------
> An HTML attachment was scrubbed...
>
> ------------------------------
>
> Message: 2
> Date: Mon, 29 Apr 2013 15:49:32 +0530
> From: Anand Soni <[email protected]>
> Subject: [Apertium-stuff] Idea for GSoC-2013
> To: [email protected]
> Message-ID:
> <
> cabry8bdm6apkihtvygocuqdad_1u+3r+6cy86g8omwwsukd...@mail.gmail.com>
> Content-Type: text/plain; charset="iso-8859-1"
>
> Hi all!
>
> I have been away due to my semester exams. So, Sentiment Analysis is what I
> have arrived at finally. I want to develop a sentiment analyzer for
> Apertium. Initially, I am planning to develop a trained analyzer (based on
> Supervised learning). Then if I could complete it before time, I would like
> to implement a non-supervised version of the same. However, my current
> focus is on the trained analyzer. I would be submitting a proposal on the
> same soon. I would definitely like to hear from you all on this idea.
> Cheers!
>
> Anand Soni
> India.
>
> --
> *Anand Soni
> Sophomore,
> *
> *Department of Computer Science & Engineering,*
> *IIT Bombay,
> India*
> -------------- next part --------------
> An HTML attachment was scrubbed...
>
> ------------------------------
>
> Message: 3
> Date: Mon, 29 Apr 2013 11:28:06 +0000
> From: Francis Tyers <[email protected]>
> Subject: Re: [Apertium-stuff] Idea for GSoC-2013
> To: [email protected]
> Message-ID: <[email protected]>
> Content-Type: text/plain; charset="UTF-8"
>
> Hi, how would this sentiment analyser help machine translation?
>
> Fran
>
> El dl 29 de 04 de 2013 a les 15:49 +0530, en/na Anand Soni va escriure:
> > Hi all!
> >
> >
> > I have been away due to my semester exams. So, Sentiment Analysis is
> > what I have arrived at finally. I want to develop a sentiment analyzer
> > for Apertium. Initially, I am planning to develop a trained analyzer
> > (based on Supervised learning). Then if I could complete it before
> > time, I would like to implement a non-supervised version of the same.
> > However, my current focus is on the trained analyzer. I would be
> > submitting a proposal on the same soon. I would definitely like to
> > hear from you all on this idea.
> > Cheers!
> >
> >
> > Anand Soni
> > India.
> >
> >
> > --
> > Anand Soni
> > Sophomore,
> > Department of Computer Science & Engineering,
> > IIT Bombay,
> > India
> >
> ------------------------------------------------------------------------------
> > Try New Relic Now & We'll Send You this Cool Shirt
> > New Relic is the only SaaS-based application performance monitoring
> service
> > that delivers powerful full stack analytics. Optimize and monitor your
> > browser, app, & servers with just a few lines of code. Try New Relic
> > and get this awesome Nerd Life shirt!
> http://p.sf.net/sfu/newrelic_d2d_apr
> > _______________________________________________ Apertium-stuff mailing
> list [email protected]
> https://lists.sourceforge.net/lists/listinfo/apertium-stuff
>
>
>
>
>
>
> ------------------------------
>
>
> ------------------------------------------------------------------------------
> Try New Relic Now & We'll Send You this Cool Shirt
> New Relic is the only SaaS-based application performance monitoring service
> that delivers powerful full stack analytics. Optimize and monitor your
> browser, app, & servers with just a few lines of code. Try New Relic
> and get this awesome Nerd Life shirt! http://p.sf.net/sfu/newrelic_d2d_apr
>
> ------------------------------
>
> _______________________________________________
> Apertium-stuff mailing list
> [email protected]
> https://lists.sourceforge.net/lists/listinfo/apertium-stuff
>
>
> End of Apertium-stuff Digest, Vol 72, Issue 48
> **********************************************
>
--
*Anand Soni
Sophomore,
*
*Department of Computer Science & Engineering,*
*IIT Bombay,
India*
------------------------------------------------------------------------------
Try New Relic Now & We'll Send You this Cool Shirt
New Relic is the only SaaS-based application performance monitoring service
that delivers powerful full stack analytics. Optimize and monitor your
browser, app, & servers with just a few lines of code. Try New Relic
and get this awesome Nerd Life shirt! http://p.sf.net/sfu/newrelic_d2d_apr
_______________________________________________
Apertium-stuff mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/apertium-stuff