The discussion about the ConceptMapper and the DictionaryAnnotator can be found 
here:
http://markmail.org/message/mfuubh5are7vs5ua

Details about the DictionaryAnnotator can be found in the DictionaryAnnotator 
documentation.

-- Michael Baessler

Michael Tanenblatt wrote:
> I am the author of ConceptMapper. It is our intention, as time permits,
> to merge these two projects. Off the top of my head, I can't
> speak to the functionality of DictionaryAnnotator, as our discussions
> about the two systems occurred quite some time ago, so I will just give
> you a summary of the features of ConceptMapper and someone else can
> supply the details of DictionaryAnnotator. Clearly, if you are confused
> about the differences, we should probably make these differences clearer
> on the sandbox site.
> 
> - ConceptMapper (CM) provides token-based dictionary lookup
> - A tokenizer's AE descriptor is supplied as a parameter to CM to be
> used for tokenizing its dictionary, thereby assuring that the dictionary
> is tokenized in the same way as the input document
> - Multi-token terms are allowed
> - Any number of synonyms can be associated with an entry
> - Numerous lookup strategies are supported, providing for simple
> contiguous-token lookup, or allowing intervening tokens to be skipped
> between tokens that make up a multi-token term. This skipping can be
> controlled, skipping only tokens with certain feature values, or
> uncontrolled, skipping any.
> - In addition to the other mechanisms for token skipping, you can supply
> a list of stop words to ignore during matching
> - The XML-based dictionary can have any arbitrary set of features
> associated with an entry, and any (or all) of those features can be
> mapped to specific features of the resultant annotation. These can be
> associated at the granularity of individual synonyms, or with an entire
> entry. Synonym-specific features will override those specified for the
> dictionary entry if they have the same feature name.
> - Lookup can also be set to allow out-of-order token lookup, thereby
> allowing {A} {B} {C} to match {C} {A} {B}
> - Result can be longest match, or all entries that match against a token
> or set of tokens
> - Can specify features from the dictionary to be written back to
> matching tokens
> - Can match against tokens' covered text, or specify a token-specific
> feature to match against
> - A stemmer can be applied to tokens before matching is performed
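
As a rough illustration of the token-based lookup described in the list
above, here is a minimal Python sketch. It is not ConceptMapper's code or
API: the function names (tokenize, build_dictionary, lookup), the
whitespace tokenizer, and the sample entries are all assumptions made for
this example. It only shows the general idea of tokenizing the dictionary
the same way as the document, skipping stop words, optionally ignoring
token order, and returning the longest match.

```python
from typing import Dict, FrozenSet, Iterable, List, Tuple

Match = Tuple[int, int, str]  # (start token, end token exclusive, entry name)


def tokenize(text: str) -> List[str]:
    # Stand-in tokenizer (assumption): lowercase and split on whitespace.
    # The same function is used for dictionary and document, so both are
    # tokenized identically.
    return text.lower().split()


def build_dictionary(entries: Dict[str, List[str]]) -> Dict[Tuple[str, ...], str]:
    # Map every tokenized synonym to the name of its entry.
    index: Dict[Tuple[str, ...], str] = {}
    for canonical, synonyms in entries.items():
        for synonym in synonyms:
            index[tuple(tokenize(synonym))] = canonical
    return index


def lookup(
    text: str,
    index: Dict[Tuple[str, ...], str],
    stop_words: Iterable[str] = (),
    order_independent: bool = False,
) -> List[Match]:
    stops: FrozenSet[str] = frozenset(stop_words)

    def normalize(tokens: Iterable[str]) -> Tuple[str, ...]:
        # Drop stop words; sort when token order should not matter.
        kept = [t for t in tokens if t not in stops]
        return tuple(sorted(kept)) if order_independent else tuple(kept)

    # Normalize dictionary keys the same way as document windows.
    normalized = {}
    for key, canonical in index.items():
        norm_key = normalize(key)
        if norm_key:  # ignore synonyms made up only of stop words
            normalized[norm_key] = canonical

    tokens = tokenize(text)
    # Remember original positions so offsets refer to the full token list.
    kept = [(i, t) for i, t in enumerate(tokens) if t not in stops]
    max_len = max((len(k) for k in normalized), default=0)

    matches: List[Match] = []
    pos = 0
    while pos < len(kept):
        # Try the longest window first, so the longest match wins.
        for length in range(min(max_len, len(kept) - pos), 0, -1):
            window = normalize(t for _, t in kept[pos:pos + length])
            if window in normalized:
                start = kept[pos][0]
                end = kept[pos + length - 1][0] + 1
                matches.append((start, end, normalized[window]))
                pos += length
                break
        else:
            pos += 1  # nothing starts at this token
    return matches


if __name__ == "__main__":
    entries = {"Colon Cancer": ["colon cancer", "cancer of the colon"],
               "Aspirin": ["aspirin"]}
    index = build_dictionary(entries)
    print(lookup("Patient has cancer of the colon and takes aspirin",
                 index, stop_words={"of", "the"}))
```

Returning every match instead of only the longest one would mean
collecting all window lengths at each position rather than stopping at
the first hit.
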
> 
> I think that covers everything. I hope this helps!
> 
> 
> On Sep 21, 2009, at 6:59 AM, Roberto Franchini wrote:
> 
>> Hi to all,
>> I'm exploring the ConceptMapper and the DictionaryAnnotator from the
>> sandbox and I can't see very big differences in their purpose.
>> Maybe the creators (donors) are going to merge these two projects, am I
>> right?
>> But, at this time, what's the best choice? Which is the better of the two?
>> Regards,
>> R.
>>
>> -- 
>> Roberto Franchini
>> http://www.celi.it
>> http://www.blogmeter.it
>> http://www.memesphere.it
>> Tel +39-011-6600814
>> jabber:[email protected] skype:ro.franchini
> 
