Re: Annotating using DBPedia ontology

Rupert Westenthaler Wed, 11 Jan 2012 21:13:14 -0800

Hi Srecko

I think the last time I directly used this API is about 3-4 years ago, but 
after a look at the http client tutorial [1] I think the reason for your 
problem is that you do not execute the GetMethod.


Based on this tutorial the code should look like

    // Create an instance of HttpClient. 
    HttpClient client = new HttpClient();
    GetMethod get = new GetMethod(url);
    try {
        // Execute the method.
        int statusCode = client.executeMethod(get);
        if (statusCode != HttpStatus.SC_OK) {
            //handle the error
        }
        InputStream t_is = get.getResponseBodyAsStream();
        //read the data of the stream
    }

In addition you should not use a Reader if you want to read byte oriented data 
from the input stream.

hope this helps
best
Rupert

[1] http://hc.apache.org/httpclient-3.x/tutorial.html

On 11.01.2012, at 22:34, Srecko Joksimovic wrote:

> That's it. Thank you!
> I have already configured KeywordLinkingEngine when I used my own ontology.
> I think I'm familiar with that and I will try that option too.
> 
> In meanwhile I found another interesting problem. I tried to annotate
> document and web page. With web page, I tried 
> IOUtils.write(byte[], out) and I had to convert URL to byte[]:
> 
> public static byte[] GetBytesFromURL(String _url) throws IOException
> {
>       GetMethod get = new GetMethod(_url);            
>       InputStream t_is = get.getResponseBodyAsStream();               
>       byte[] buffer = new byte[1024];
>       int count = -1;                 
>       Reader t_url_reader = new BufferedReader(new
> InputStreamReader(t_is));             
>       byte[] t_bytes = IOUtils.toByteArray(t_url_reader, "UTF-8");
> 
>       return t_bytes;
> }
> 
> But, the problem is that I'm getting null for InputStream. 
> 
> Any ideas?
> 
> Best,
> Srecko
> 
> 
> 
> -----Original Message-----
> From: Rupert Westenthaler [mailto:[email protected]] 
> Sent: Wednesday, January 11, 2012 22:08
> To: Srecko Joksimovic
> Cc: [email protected]
> Subject: Re: Annotating using DBPedia ontology
> 
> 
> On 11.01.2012, at 21:41, Srecko Joksimovic wrote:
>> Hi Rupert,
>> 
>> When I load localhost:8080/engines it says this:
>> 
>> There are currently 5 active engines.
>> org.apache.stanbol.enhancer.engines.metaxa.MetaxaEngine
>> org.apache.stanbol.enhancer.engines.langid.LangIdEnhancementEngine
>> 
> org.apache.stanbol.enhancer.engines.opennlp.impl.NamedEntityExtractionEnhanc
>> ementEngine
>> 
> org.apache.stanbol.enhancer.engines.entitytagging.impl.NamedEntityTaggingEng
>> ine
>> 
> org.apache.stanbol.enhancer.engines.entitytagging.impl.NamedEntityTaggingEng
>> ine
>> 
>> Maybe this could tell you something?
>> 
> 
> This are exactly the 5 engines that are expected to run with the default
> configuration.
> Based on this the Stanbol Enhnacer should just work fine.
> 
> After looking at the the text you enhanced I noticed however that is does
> not mention
> any named entities such as Persons, Organizations and Places. So I checked
> it with
> my local Stanbol version and was also not any detected entities.
> 
> So to check if Stanbol works as expected you should try to use an other text
> the
> mentions some Named Entities such as 
> 
>    "John Smith works for the Apple Inc. in Cupertino, California."
> 
> 
> If you want to search also for entities like "Bank", "Blog", "Consumer",
> "Telephone" .
> you need to also configure a KeywordLinkingEngine for dbpedia. Part B or [3]
> provides
> more information on how to do that.
> 
> But let me mention that the KeywordLinkingEngine is more useful if used in
> combination
> with an own domain specific thesaurus rather than a global data set like
> dbpedia. When
> used with dbpedia you will also get a lot of false positives.
> 
> best
> Rupert
> 
> [3] http://incubator.apache.org/stanbol/docs/trunk/customvocabulary.html
>

Re: Annotating using DBPedia ontology

Reply via email to