Hi Rupert,

I configured Stanbol, and I thought everything was all right because I could access Stanbol at http://localhost:8080. However, I noticed that during startup I'm getting this error:

[WARNING] failed org.mortbay.jetty.nio.SelectChannelConnector$1@29978933: java.io.IOException: Unable to establish loopback connection
[WARNING] failed [email protected]:8080: java.io.IOException: Unable to establish loopback connection
[WARNING] failed Server@62d844a9: java.io.IOException: Unable to establish loopback connection
[ERROR] Exception while initializing Jetty.
java.io.IOException: Unable to establish loopback connection
        at sun.nio.ch.PipeImpl$Initializer.run(Unknown Source)
        at java.security.AccessController.doPrivileged(Native Method)
        at sun.nio.ch.PipeImpl.<init>(Unknown Source)
        at sun.nio.ch.SelectorProviderImpl.openPipe(Unknown Source)
        at java.nio.channels.Pipe.open(Unknown Source)
        at sun.nio.ch.WindowsSelectorImpl.<init>(Unknown Source)
        at sun.nio.ch.WindowsSelectorProvider.openSelector(Unknown Source)
        at java.nio.channels.Selector.open(Unknown Source)
        at org.mortbay.io.nio.SelectorManager$SelectSet.<init>(SelectorManager.java:312)
        at org.mortbay.io.nio.SelectorManager.doStart(SelectorManager.java:223)
        at org.mortbay.component.AbstractLifeCycle.start(AbstractLifeCycle.java:50)
        at org.mortbay.jetty.nio.SelectChannelConnector.doStart(SelectChannelConnector.java:314)
        at org.mortbay.component.AbstractLifeCycle.start(AbstractLifeCycle.java:50)
        at org.mortbay.jetty.Server.doStart(Server.java:235)
        at org.mortbay.component.AbstractLifeCycle.start(AbstractLifeCycle.java:50)
        at org.apache.felix.http.jetty.internal.JettyService.initializeJetty(JettyService.java:164)
        at org.apache.felix.http.jetty.internal.JettyService.startJetty(JettyService.java:115)
        at org.apache.felix.http.jetty.internal.JettyService.run(JettyService.java:290)
        at java.lang.Thread.run(Unknown Source)
Caused by: java.nio.channels.ClosedByInterruptException
        at java.nio.channels.spi.AbstractInterruptibleChannel.end(Unknown Source)
        at sun.nio.ch.SocketChannelImpl.connect(Unknown Source)
        at java.nio.channels.SocketChannel.open(Unknown Source)
        ... 19 more

There is another thing.
When I try to annotate text from my application or through the web interface, I get something like this:

<rdf:RDF
    xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#"
    xmlns:j.0="http://www.semanticdesktop.org/ontologies/2007/01/19/nie#"
    xmlns:j.1="http://purl.org/dc/terms/"
    xmlns:j.2="http://www.semanticdesktop.org/ontologies/2007/03/22/nfo#"
    xmlns:j.3="http://fise.iks-project.eu/ontology/" >
  <rdf:Description rdf:about="urn:enhancement-39c09311-3095-fbb1-0dfe-551f6fba2baa">
    <rdf:type rdf:resource="http://fise.iks-project.eu/ontology/Enhancement"/>
    <j.3:extracted-from rdf:resource="urn:content-item-sha1-322650339df64c4e5acd17a81af29bd8fed3ba91"/>
    <j.1:created rdf:datatype="http://www.w3.org/2001/XMLSchema#dateTime">2012-01-11T18:44:11.271Z</j.1:created>
    <j.1:creator rdf:datatype="http://www.w3.org/2001/XMLSchema#string">org.apache.stanbol.enhancer.engines.metaxa.MetaxaEngine</j.1:creator>
    <rdf:type rdf:resource="http://fise.iks-project.eu/ontology/TextAnnotation"/>
    <j.3:confidence rdf:datatype="http://www.w3.org/2001/XMLSchema#double">1.0</j.3:confidence>
  </rdf:Description>
  <rdf:Description rdf:about="urn:enhancement-9e659b3e-8978-7191-eb8b-fa7030c2ff68">
    <j.1:language>en</j.1:language>
    <rdf:type rdf:resource="http://fise.iks-project.eu/ontology/Enhancement"/>
    <j.3:extracted-from rdf:resource="urn:content-item-sha1-322650339df64c4e5acd17a81af29bd8fed3ba91"/>
    <j.1:created rdf:datatype="http://www.w3.org/2001/XMLSchema#dateTime">2012-01-11T18:44:11.278Z</j.1:created>
    <j.1:creator rdf:datatype="http://www.w3.org/2001/XMLSchema#string">org.apache.stanbol.enhancer.engines.langid.LangIdEnhancementEngine</j.1:creator>
    <rdf:type rdf:resource="http://fise.iks-project.eu/ontology/TextAnnotation"/>
  </rdf:Description>
  <rdf:Description rdf:about="urn:content-item-sha1-322650339df64c4e5acd17a81af29bd8fed3ba91">
    <rdf:type rdf:resource="http://www.semanticdesktop.org/ontologies/2007/03/22/nfo#PlainTextDocument"/>
    <j.0:plainTextContent>The Web's children became parents.
They use tools which can limit the access and the spreading of the information by their children. So, the parents can see at any time the web's logs of their children but they also have a net which is going to filter their "private" identity before it is broadcasted on the network. For example, a third-part trust entity, along with their mobile telephone provider, the post office and the bank, will possess the consumer's identity so as to mask the address of delivery and the payment of this consumer. A public identity also exists to spread a resume (CV), a blog or an avatar for example but the data remain the property of the owner of the server who hosts this data. So, the mobile telephone provider offers a personal server who will contain one public zone who will automatically be copied on the network after every modification. If I want that my resume is not any longer on the network, I just have to erase it of my public zone from my server. So, the mobile telephone provider creates a controllable silo of information for every public profile.</j.0:plainTextContent>
  </rdf:Description>
</rdf:RDF>

I am not sure that this is the content I should get. Please help :)

Best,
Srecko

-----Original Message-----
From: Rupert Westenthaler [mailto:[email protected]]
Sent: Tuesday, January 10, 2012 15:33
To: srecko joksimovic
Cc: [email protected]
Subject: Re: Annotating using DBPedia ontology

Hi Srecko

>
> curl -v -X POST -H "Content-Type: application/rdf+xml" --data "@acm-ccs_proton.owl" http://localhost:8080/entityhub/entity
>

No, I would not suggest uploading the dbpedia dataset by using POST to the entityhub. This is fine for small and medium-sized datasets, but will not work for dbpedia.

Stanbol already comes with a small sample set of DBPedia. This is also used for enhancing documents with the default configuration.
This sample dataset contains the 43k DBPedia.org entities with the most incoming links, together with some frequently used properties: labels in about 10 languages, the English comments, types, redirects stored as rdfs:seeAlso, lat/long, populations, birth/death dates, home pages, and category assignments stored in dc-terms:subject.

You can easily upgrade this index to a bigger version by downloading the dbpedia.solrindex.zip file from [1] and copying it into the /sling/datafiles folder within the directory where your Stanbol server is running. After some minutes (the time your computer needs to extract a file of ~3 GByte) the bigger index will replace the sample set included in the launcher.

If you need additional fields, languages, etc., you can also create your own index by using the indexing tool for dbpedia [2]. See the README.md file for instructions.

best
Rupert

[1] http://dev.iks-project.eu/downloads/stanbol-indices/dbpedia-3.7/
[2] https://svn.apache.org/repos/asf/incubator/stanbol/trunk/entityhub/indexing/dbpedia/

On 10.01.2012, at 14:01, srecko joksimovic wrote:

> Hi,
>
> Until now I used my ontology when I wanted to annotate document (or text). Now I would like to use DBPedia ontology. Do I have to download ontology and configure Stanbol like I did before, using
>
> curl -v -X POST -H "Content-Type: application/rdf+xml" --data "@acm-ccs_proton.owl" http://localhost:8080/entityhub/entity
>
> or there is another procedure? Does Stanbol use DBPedia ontology by default, or I have to configure something similar like when I use another ontology?
>
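
For reference, the copy step of the index upgrade described in Rupert's reply could be sketched roughly as follows. This is only an illustration under assumptions: the function name install_index and its two arguments are made up here, and the archive is assumed to have been downloaded from [1] already. Only the file name dbpedia.solrindex.zip and the sling/datafiles folder come from the thread.

```shell
# Hypothetical helper (not part of Stanbol): copy a downloaded index archive
# into the sling/datafiles folder of a Stanbol working directory.
#   $1 = path to the downloaded archive, e.g. dbpedia.solrindex.zip
#   $2 = directory the Stanbol server is running in (an assumption you supply)
install_index() {
  zip_path="$1"
  stanbol_dir="$2"
  datafiles="$stanbol_dir/sling/datafiles"

  # Refuse to continue if the archive is missing.
  if [ ! -f "$zip_path" ]; then
    echo "index archive not found: $zip_path" >&2
    return 1
  fi

  # Create the folder if needed, then copy the archive into it.
  mkdir -p "$datafiles" && cp "$zip_path" "$datafiles/"
}
```

It would be called as, for example, install_index ./dbpedia.solrindex.zip /path/to/stanbol, where /path/to/stanbol is a placeholder for wherever the launcher was started. According to the reply, the server then needs some minutes to extract the archive before the bigger index replaces the sample set.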
