Hi

On Wed, Jun 19, 2013 at 9:20 AM, Dileepa Jayakody
<dileepajayak...@gmail.com> wrote:
> Hi All,
>
> I'm trying out entityhub indexing tool to configure a site for a sample
> foaf dataset. My data set (sampleNquads.nx) is in n-quad format. Actually
>  it is a set of links to foaf files from various sources in nquad format.
>
> eg:
> <http://www.agfa.com/> <http://www.agfa.com/global/en/main/index.jsp> .
> *<http://sebastian.tramp.name/> <http://sebastian.tramp.name/index.rdf> .*
> <http://gitorious.com/~tobyink> <http://gitorious.org/~tobyink> .
>

I am not completely sure what you are mean by that.

Generally: Links to RDF files are not supported by the Indexing Tool.
You will need to download the RDF files to the
"indexing/resources/rdfdata" directory.

Quad Formats are in principle supported by the Indexing Tool. However
node that only SPO are used and the Context is dropped during the
import.


For debugging the indexing process:

  * the Indexing Tool logs the number of indexed Entities. You should
check this value
  * the IDs off all indexed entities are also stored in
"indexing/destination/indexed-entities-ids.zip". After installing the
index to Stanbol you can use those IDs to retrieve the available data
by using requests like "curl -H "Accept: text/turtle"
"http://localhost:8080/entityhub/site/{site-name}/entity?id={entity-id}";

> I followed the instructions here [1] and in the ReadMe.md,
> indexing.properties files of the tool and created a site {datahub} for my
> data accessible at : http://localhost:8080/entityhub/site/datahub/
>
> However when I try out sample requests to find entities in the site I get
> no results.
> I'm trying to find the entity with *name=Sebastian** which is actually in
> the sample dataset used above but I get an empty results set. Can anyone
> please help me understand what I've done wrong here? Basically I have
> followed the steps in init, index executions of the tool.
>
> Is it because my dataset is only a set of external links to foaf files?
> Do I need to manually download the foaf files to indexing/resources/rdfdata
> directory?
>
> eg :
>
> request: curl -X POST -d "name=Sebastian*"
> http://localhost:8080/entityhub/site/datahub/find
>
> result :
> {
>     "query": {
>         "selected": [
>             "http:\/\/stanbol.apache.org\/ontology\/entityhub\/query#score",
>             "http:\/\/www.w3.org\/2000\/01\/rdf-schema#label"
>         ],
>         "constraints": [{
>             "type": "text",
>             "patternType": "wildcard",
>             "text": "SSebastian Tramp",
>             "field": "http:\/\/www.w3.org\/2000\/01\/rdf-schema#label"
>         }],
>         "limit": 5,
>         "offset": 0
>     },
>     "results": []
> }
>

For queries like that you need to make sure that your entities do have
values for "rdf:label". AFAIK the default
"indexing/config/mapping.txt" configuration does copy the foaf:name
value to rdfs:label, but if you do specifically work with FOAF data
you should preferable query for "foaf:name".

best
Rupert

>
> Your help is much appreciated here.
> Thanks,
> Dileepa
>
>
> [1] http://stanbol.apache.org/docs/trunk/customvocabulary.html



--
| Rupert Westenthaler             rupert.westentha...@gmail.com
| Bodenlehenstraße 11                             ++43-699-11108907
| A-5500 Bischofshofen

Reply via email to