[ 
https://issues.apache.org/jira/browse/OPENNLP-758?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Mondher Bouazizi updated OPENNLP-758:
-------------------------------------
    Description: 
The objective of Word Sense Disambiguation (WSD) is to determine which sense of 
a word is meant in a particular context. Therefore, WSD is a classification 
task, where the classes are the different senses of the ambiguous word.
Different techniques are proposed in the academic literature, which fall mainly 
into two categories: Supervised and Unsupervised.
For this component, we focus on unsupervised techniques: these methods are 
based on unlabeled data, and do not exploit any manually tagged data.
The object of this project is to create a WSD solution (for English) that 
implements some unsupervised techniques. For example:
- Context Clustering
- Word Clustering
- Cooccurrence Graphs
- Overlap of Sense Definitions
- Selectional Preferences
- Structural Approaches
- Etc.

  was:
The objective of Word Sense Disambiguation (WSD) is to determine which sense of 
a word is meant in a particular context. Therefore, WSD is a classification 
task, where the classes are the different senses of the ambiguous word.
Different techniques are proposed in the academic literature, which fall mainly 
into two categories: Supervised and Unsupervised.
For this component, we focus on unsupervised techniques: these methods are 
based on unlabeled data, and do not exploit any manually tagged data.
The object of this project is to create a WSD solution (for English) that 
implements some unsupervised techniques. For example:
  . Context Clustering
  . Word Clustering
  . Cooccurrence Graphs
  . Overlap of Sense Definitions
  . Selectional Preferences
  . Structural Approaches
  . Etc.


> Unsupervised WSD techniques
> ---------------------------
>
>                 Key: OPENNLP-758
>                 URL: https://issues.apache.org/jira/browse/OPENNLP-758
>             Project: OpenNLP
>          Issue Type: New Feature
>          Components: POS Tagger, Sentence Detector, Stemmer
>            Reporter: Mondher Bouazizi
>              Labels: gsoc, gsoc2015, java, nlp, wsd
>
> The objective of Word Sense Disambiguation (WSD) is to determine which sense 
> of a word is meant in a particular context. Therefore, WSD is a 
> classification task, where the classes are the different senses of the 
> ambiguous word.
> Different techniques are proposed in the academic literature, which fall 
> mainly into two categories: Supervised and Unsupervised.
> For this component, we focus on unsupervised techniques: these methods are 
> based on unlabeled data, and do not exploit any manually tagged data.
> The object of this project is to create a WSD solution (for English) that 
> implements some unsupervised techniques. For example:
> - Context Clustering
> - Word Clustering
> - Cooccurrence Graphs
> - Overlap of Sense Definitions
> - Selectional Preferences
> - Structural Approaches
> - Etc.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to