Igor Sominsky wrote:
Thilo,
As Michael wrote, we will continue to support CFE in the future. It is extensively used in our
research and development process. In fact, CFE is the only software that we use for extracting
features for machine learning and evaluation. We are also extending CFE to include an evaluation
as a part of its functionality. For that purpose we are enabling FESL with
semantics required to
specify rules for annotation comparison. Just to clarify, you observation is correct, as of now
an evaluation is a separate piece of functionality and is not a part of CFE, although it works
off CFE's output. But as I pointed out earlier, it is being integrated with CFE
I was wondering if it's really a good idea to integrate it. It seems orthogonal
to the feature
extraction part.
Regarding EMF-based and XMLBeans parsers. Both parsers are generated from the
FESL schema file. I
do not see any problem in eliminating one parser or the other. I personally prefer to work with
EMF-based parser as it is integrated with Eclipse and allows quick turn-around cycle in the case
it is required to make changes to the schema.
The EMF stuff would make a lot of sense if you planned any Eclipse-based tooling. Outside Eclipse,
EMF-based XML parsing is difficult to get right, in my experience (because it really wasn't intended
to be used outside Eclipse). Maybe that's no longer the case, though. I haven't tried it in a long
time.
So, I'll vote +0 on this one. If the other UIMA committers are happy to pick it up, and do the work
to integrate it into the sandbox, whip the documentation into shape, maybe simplify the user-facing
xml, I won't stop you. I really think the world needs a tool like this, just not convinced this is
the best approach. Mind you, maybe there is no simpler way. But if there isn't, then I for one
still prefer a few lines of Java I've written myself over a couple of pages of xml. No offense.
--Thilo
From: Thilo Goetz <[EMAIL PROTECTED]> To: uima-dev@incubator.apache.org Date:
10/22/2008 05:36 AM
Subject: Re: [VOTE] accept the Configurable Feature Extractor (CFE) into the
sandbox
--------------------------------------------------------------------------------
Michael Tanenblatt wrote:
Igor Sominsky is on vacation now, so he cannot respond, but I think it is safe to say that he
will continue to support this for the foreseeable future. It is something that he and I use
often, and he has been continuing to enhance and support it.
On Sep 29, 2008, at 9:51 AM, Thilo Goetz wrote:
Who will maintain this code once it's in the sandbox?
Igor, are you back? Care to comment? I'd like to see some assurance that this is not just a
code drop before I vote.
I'm also confused about the evaluation part of CFE. I can see that it's useful
to have this sort
of evaluation, but should it really be part of a feature extraction package? It seems a pretty
independent sort of functionality. Or maybe I just didn't understand it.
You write in your user's guide that CFE depends on both XmlBeans and EMF. Are you using EMF for
anything but XML processing? Do you think EMF could be eliminated and completely replaced by
XmlBeans? To be clear, this is not necessary, I'm just curious.
--Thilo