vj,

This sounds like a really nice contribution to cTAKES. The license [1] incompatibility may pose a challenge though:
For example, how tightly coupled is Weka to the annotator code? On Hibernate, I've seen other Apache projects [2,3] interface with Hibernate, so we can probably do something similar.

[1] http://www.apache.org/legal/3party.html
[2] http://tapestry.apache.org/hibernate.html
[3] http://tomee.apache.org/tomee-and-hibernate.html

--Pei

On Thu, Oct 3, 2013 at 8:50 AM, vijay garla <[email protected]> wrote:
> Hello All,
>
> I'd like to contribute ytex to ctakes. YTEX's main feature is the
> ability to store *any* ctakes (or uima) annotation in a relational
> database (in a relational format), and the ability to export these
> annotations to ML packages (weka, libsvm, matlab, R). All of this is
> purely declarative/via configuration.
>
> In addition, YTEX provides the following:
> * Negation Detection with Negex
> * SegmentRegexAnnotator - section detection with regular expressions
> * NamedEntityRegexAnnotator - named entity detection with regular
>   expressions
> * Sentence Splitter - modified ctakes sentence splitter making
>   sentence split patterns configurable (not hardcoded to \n)
>
> YTEX currently works with ctakes 2.5; I would like to upgrade it to
> the latest ctakes, and if the community is interested, contribute to
> ctakes.apache.org.
>
> A licensing question: YTEX uses Spring (apache 2.0 license), Hibernate
> (lgpl 2.1), & weka (gpl). Are there any issues with including these?
>
> Cheers
>
> vj
>
