Hi Rupert, Thank you for the answer. I've probably missed that.
Best, Srecko On Thu, Jan 12, 2012 at 6:12 AM, Rupert Westenthaler < [email protected]> wrote: > Hi Srecko > > I think the last time I directly used this API is about 3-4 years ago, but > after a look at the http client tutorial [1] I think the reason for your > problem is that you do not execute the GetMethod. > > Based on this tutorial the code should look like > > // Create an instance of HttpClient. > HttpClient client = new HttpClient(); > GetMethod get = new GetMethod(url); > try { > // Execute the method. > int statusCode = client.executeMethod(get); > if (statusCode != HttpStatus.SC_OK) { > //handle the error > } > InputStream t_is = get.getResponseBodyAsStream(); > //read the data of the stream > } > > In addition you should not use a Reader if you want to read byte oriented > data from the input stream. > > hope this helps > best > Rupert > > [1] http://hc.apache.org/httpclient-3.x/tutorial.html > > On 11.01.2012, at 22:34, Srecko Joksimovic wrote: > > > That's it. Thank you! > > I have already configured KeywordLinkingEngine when I used my own > ontology. > > I think I'm familiar with that and I will try that option too. > > > > In meanwhile I found another interesting problem. I tried to annotate > > document and web page. With web page, I tried > > IOUtils.write(byte[], out) and I had to convert URL to byte[]: > > > > public static byte[] GetBytesFromURL(String _url) throws IOException > > { > > GetMethod get = new GetMethod(_url); > > InputStream t_is = get.getResponseBodyAsStream(); > > byte[] buffer = new byte[1024]; > > int count = -1; > > Reader t_url_reader = new BufferedReader(new > > InputStreamReader(t_is)); > > byte[] t_bytes = IOUtils.toByteArray(t_url_reader, "UTF-8"); > > > > return t_bytes; > > } > > > > But, the problem is that I'm getting null for InputStream. > > > > Any ideas? > > > > Best, > > Srecko > > > > > > > > -----Original Message----- > > From: Rupert Westenthaler [mailto:[email protected]] > > Sent: Wednesday, January 11, 2012 22:08 > > To: Srecko Joksimovic > > Cc: [email protected] > > Subject: Re: Annotating using DBPedia ontology > > > > > > On 11.01.2012, at 21:41, Srecko Joksimovic wrote: > >> Hi Rupert, > >> > >> When I load localhost:8080/engines it says this: > >> > >> There are currently 5 active engines. > >> org.apache.stanbol.enhancer.engines.metaxa.MetaxaEngine > >> org.apache.stanbol.enhancer.engines.langid.LangIdEnhancementEngine > >> > > > org.apache.stanbol.enhancer.engines.opennlp.impl.NamedEntityExtractionEnhanc > >> ementEngine > >> > > > org.apache.stanbol.enhancer.engines.entitytagging.impl.NamedEntityTaggingEng > >> ine > >> > > > org.apache.stanbol.enhancer.engines.entitytagging.impl.NamedEntityTaggingEng > >> ine > >> > >> Maybe this could tell you something? > >> > > > > This are exactly the 5 engines that are expected to run with the default > > configuration. > > Based on this the Stanbol Enhnacer should just work fine. > > > > After looking at the the text you enhanced I noticed however that is does > > not mention > > any named entities such as Persons, Organizations and Places. So I > checked > > it with > > my local Stanbol version and was also not any detected entities. > > > > So to check if Stanbol works as expected you should try to use an other > text > > the > > mentions some Named Entities such as > > > > "John Smith works for the Apple Inc. in Cupertino, California." > > > > > > If you want to search also for entities like "Bank", "Blog", "Consumer", > > "Telephone" . > > you need to also configure a KeywordLinkingEngine for dbpedia. Part B or > [3] > > provides > > more information on how to do that. > > > > But let me mention that the KeywordLinkingEngine is more useful if used > in > > combination > > with an own domain specific thesaurus rather than a global data set like > > dbpedia. When > > used with dbpedia you will also get a lot of false positives. > > > > best > > Rupert > > > > [3] http://incubator.apache.org/stanbol/docs/trunk/customvocabulary.html > > > >
