I am pretty sure it's a bug in OpenCalaisAnnotator (UIMA side), I ran
OpenCalaisAnnotator with CasVisualDebugger for the text:
 President Obama vows to "make BP pay" for the Gulf oil spill, and says the
US must end its fossil fuel "addiction" (first snippet on BBC website today)
and got 2 Annotations of type org.apache.uima.calais.Company, the first one
with the "fancy" -7 begin and end.
I'll open an issue on UIMA for this one (and hopefully fix it) later today.
Cheers,
Tommaso

still inspecting on what it's causing it.
Cheers,
Tommaso

2010/6/16 Tommaso Teofili <[email protected]>

> Hi Florent,
> I managed to reproduce your same error, now inspecting.
> I'll let you know.
> Thanks,
> Tommaso
>
> 2010/6/16 Tommaso Teofili <[email protected]>
>
> Hi Florent
>>
>> 2010/6/16 Florent André <[email protected]>
>>
>> Hi Tommaso, Hi all,
>>>
>>> Hope you have a more real summer weather that us...
>>>
>>
>> Yes, we have real warm sunny days at 30°C average :)
>>
>>
>>>
>>> $ sudo history | grep "useful part" :
>>>
>>> On Sun, 30 May 2010 16:39:06 +0200, Tommaso Teofili
>>> <[email protected]> wrote:
>>> > 2010/5/30 Oliver Strässer <[email protected]>
>>> >>
>>> >> Maybe we can begin the "getting started" page here ? :)
>>> >> Here comes my "really just user" questions :
>>> >>
>>> >> A/ How to use OpenCalais and AlchemyAPI services ?
>>> >>
>>> >
>>> > There is the UIMA integration which uses AlchemyAPI and OpenCalais to
>>> > enrich
>>> > graphnodes, documentation still needs to be done, I'll do it as soon as
>>> > possible but firstly I think we need to write an ontology for generated
>>> > UIMA
>>> > entities (I am doing this at the moment but I think that if Oliver has
>>> > already something in place it would be nice to have a look) so that the
>>> > graph gets enrihced in a proper RDF way.
>>> >
>>> > Take a look at this in the meantime:
>>> >
>>>
>>> http://svn.apache.org/repos/asf/incubator/clerezza/trunk/org.apache.clerezza.parent/org.apache.clerezza.uima/org.apache.clerezza.uima.metadata-generator/src/main/java/org/apache/clerezza/uima/metadatagenerator/UIMABaseMetadataGenerator.java
>>> > and at
>>> >
>>>
>>> http://svn.apache.org/repos/asf/incubator/clerezza/trunk/org.apache.clerezza.parent/org.apache.clerezza.uima/org.apache.clerezza.uima.utils/src/main/java/org/apache/clerezza/uima/utils/ExternalServicesFacade.java
>>> >
>>>
>>>
>>> I try (and get) a result from ExternalServicesFacade. I have a
>>> List<Annotation> result from esf.getCalaisAnnotations(document).
>>>
>>> But the problem is that I can't use the annot.getCoveredText() on the 2
>>> firsts of the list because annot.begin() and annot.end() contains -7. So
>>> annot.getCoveredText() send an outOfBound error...
>>>
>>
>> It's a strange behavior since tests on ESF involve also a call to
>> annot.getCoveredText() to control text inside it, without arising any issue.
>> Could you paste here which text do you pass to ESF as a parameter?
>>
>>
>>>
>>> As I'm not really involve in Clerezza and uima structure for now, do you
>>> have suggestion for this error ? It's a bug ? If yes more Clerezza part
>>> or
>>> uima one ?
>>>
>>
>> Could you explain better the use case where the method call arises this
>> issue?
>>
>>
>>>
>>> My test class in attachment if any (code on a motorcycle :) ).
>>>
>>
>> I can't find any test class in attachment (or maybe I misunderstood what
>> you mean :P).
>> However I am doing some tests to see what could cause such a issue both on
>> UIMA and Clerezza UIMA modules.
>> Cheers.
>> Tommaso
>>
>>
>>
>>>
>>> Have a good day.
>>>
>>>
>>
>

Reply via email to