Hi Pei, The WSD annotator relies on the semantic similarity component, which is a general purpose tool not strictly limited to ctakes or NLP. I would like to keep the semantic similarity component 'standalone', i.e. with no dependencies on ctakes, and make it redistributable on its own. If that is possible as part of ctakes, I'd love to move it. If not, I'd leave the semantic similarity and the associated WSD annotator on google code.
For those of you who want the back story: http://www.biomedcentral.com/1471-2105/13/261 http://jamia.bmj.com/content/20/5/882.long -vj On Thu, Oct 3, 2013 at 5:13 PM, Chen, Pei <[email protected]> wrote: > vj, > Were you thinking of contributing the new ytext Word Sense Disambiguation > component as well- I think that will be really cool. > --Pei > >> -----Original Message----- >> From: [email protected] [mailto:[email protected]] On Behalf Of Karthik >> Sarma >> Sent: Thursday, October 03, 2013 1:05 PM >> To: [email protected] >> Subject: Re: move ytex annotators to ctakes.apache.org? >> >> This would be quite valuable -- in particular, ytex's annotation database >> connection is much easier to use than what ships with cTAKES. There are a >> fair number of other advantages, and I think they'd all be very valuable! >> >> >> >> >> >> -- >> Karthik Sarma >> UCLA Medical Scientist Training Program Class of 20?? >> Member, UCLA Medical Imaging & Informatics Lab Member, CA Delegation >> to the House of Delegates of the American Medical Association >> [email protected] >> gchat: [email protected] >> linkedin: www.linkedin.com/in/ksarma >> >> >> On Thu, Oct 3, 2013 at 5:50 AM, vijay garla <[email protected]> wrote: >> >> > Hello All, >> > >> > I'd like to contribute ytex to ctakes. YTEX's main feature is the >> > ability to store *any* ctakes (or uima) annotation in a relational >> > database (in a relational format), and the ability to export these >> > annotations to ML packages (weka, libsvm, matlab, R). All of this is >> > purely declarative/via configuration. >> > >> > In addtion, Ytex provides the following: >> > * Negation Detection with Negex >> > * SegmentRegexAnnotator - section detection with regular expressions >> > * NamedEntityRegexAnnotator - named entity detection with regular >> > expressions >> > * Sentence Splitter - modified ctakes sentence splitter making >> > sentence split patterns configurable (not hardcoded to \n) >> > >> > YTEX currently works with ctakes 2.5; I would like to upgrade it to >> > the latest ctakes, and if the community is interested, contribute to >> > ctakes.apache.org. >> > >> > A licensing question: YTEX uses Spring (apache 2.0 license), Hibernate >> > (lgpl 2.1), & weka (gpl). Are there any issues with including these? >> > >> > Cheers >> > >> > vj >> >
