[
https://issues.apache.org/jira/browse/OPENNLP-757?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14609845#comment-14609845
]
Mondher Bouazizi commented on OPENNLP-757:
------------------------------------------
Hi,
Actually, I used these data to make the module work first, since I haven't yet
supported semcore. However these training data are for a relatively few number
of words (57 words); therefore, I won't be using them anymore in the next step.
Nevertheless, they can be used for evaluation in case we need them.
> Supervised WSD techniques
> -------------------------
>
> Key: OPENNLP-757
> URL: https://issues.apache.org/jira/browse/OPENNLP-757
> Project: OpenNLP
> Issue Type: New Feature
> Components: wsd
> Reporter: Mondher Bouazizi
> Labels: gsoc, gsoc2015, java, nlp, wsd
> Attachments: opennlp-wsd-supervised.patch, sup.patch, supervised.patch
>
>
> The objective of Word Sense Disambiguation (WSD) is to determine which sense
> of a word is meant in a particular context. Therefore, WSD is a
> classification task, where the classes are the different senses of the
> ambiguous word.
> Different techniques are proposed in the academic literature, which fall
> mainly into two categories: Supervised and Unsupervised.
> For this component, we focus on supervised techniques: these approaches use
> machine-learning techniques to learn a classifier from labeled training sets.
> The object of this project is to create a WSD solution (for English) that
> implements some supervised techniques. For example:
> - Decision Lists
> - Decision Trees
> - Naive Bayes
> - Neural Networks
> - Exemplar-Based or Instance-Based Learning
> - Support Vector Machines
> - Ensemble Methods
> - Semi-supervised Disambiguation
> - Etc.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)