On 01/09/13 00:02, Brad Moran wrote:
sorry the file type should be saved as .owl

I see no data. If you had an attachment, then they don't get through to the mailing list.

Would it be possible to create a complete, minimal example of your setup? A small amount of data that shows the situation. This description is quite long - is it all needed or can you see the same issues in a smaller configuration?

        Andy



On Sat, Aug 31, 2013 at 7:00 PM, Brad Moran <[email protected]
<mailto:[email protected]>> wrote:

    Hi,
    I am currently having a problem getting the exact results I want
    from my text queries. I attached one example of my rdf that I begin
    with. Then I run tdbloader and successfully create an index using
    this assembler file with jena.textindexer:

    @prefix :        <http://localhost/jena_example/#> .
    @prefix rdf:     <http://www.w3.org/1999/02/22-rdf-syntax-ns#> .
    @prefix rdfs:    <http://www.w3.org/2000/01/rdf-schema#> .
    @prefix tdb:     <http://jena.hpl.hp.com/2008/tdb#> .
    @prefix ja:      <http://jena.hpl.hp.com/2005/11/Assembler#> .
    @prefix text:    <http://jena.apache.org/text#> .
    @prefix mms:     <http://rdf.cdisc.org/mms#> .
    @prefix sdtms:   <http://rdf.cdisc.org/sdtm-1-2/schema#> .
    @prefix sdtmigs: <http://rdf.cdisc.org/sdtmig-3-1-2/schema#> .
    @prefix sends: <http://rdf.cdisc.org/send/schema#> .
    @prefix sendigs: <http://rdf.cdisc.org/send-3.0/schema#> .
    @prefix cts: <http://rdf.cdisc.org/ct/schema#> .

    ## Example of a TDB dataset and text index
    ## Initialize TDB
    [] ja:loadClass "com.hp.hpl.jena.tdb.TDB" .
    tdb:DatasetTDB  rdfs:subClassOf  ja:RDFDataset .
    tdb:GraphTDB    rdfs:subClassOf  ja:Model .

    ## Initialize text query
    [] ja:loadClass       "org.apache.jena.query.text.TextQuery" .
    # A TextDataset is a regular dataset with a text index.
    text:TextDataset      rdfs:subClassOf   ja:RDFDataset .
    # Lucene index
    text:TextIndexLucene  rdfs:subClassOf   text:TextIndex .

    ## ---------------------------------------------------------------
    ## This URI must be fixed - it's used to assemble the text dataset.

    :text_dataset rdf:type     text:TextDataset ;
         text:dataset   <#dataset> ;
         text:index     <#indexLucene> ;
         .

    # A TDB dataset used for RDF storage
    <#dataset> rdf:type      tdb:DatasetTDB ;
         tdb:location "tdb" ;
         # if from command line use: "NetBeansProjects/mdr-older/trunk/tdb"
         .

    # Text index description
    <#indexLucene> a text:TextIndexLucene ;
         text:directory <file:luceneIndexes> ;
         text:entityMap <#entMap> ;
         .

    # Mapping in the index
    # URI stored in field "uri"
    # rdfs:label is mapped to field "text"
    <#entMap> a text:EntityMap ;
         text:entityField      "uri" ;
         text:defaultField     "text" ;
         text:map (
              [ text:field "text" ; text:predicate mms:dataElementName ]
              [ text:field "text" ; text:predicate
    mms:dataElementDescription ]
      [ text:field "text" ; text:predicate mms:dataElementLabel ]
      [ text:field "text" ; text:predicate mms:dataElementType ]
      [ text:field "text" ; text:predicate mms:ordinal ]
      [ text:field "text" ; text:predicate mms:broader ]
              [ text:field "text" ; text:predicate mms:Dataset ]
              [ text:field "text" ; text:predicate mms:contextName ]
              [ text:field "text" ; text:predicate mms:contextLabel ]
              [ text:field "text" ; text:predicate mms:contextDescription ]
    [ text:field "text" ; text:predicate sdtms:dataElementType ]
    [ text:field "text" ; text:predicate sdtms:dataElementRole ]
    [ text:field "text" ; text:predicate sdtms:dataElementCompliance ]
              [ text:field "text" ; text:predicate sdtms:supportedBySDTMIG ]
              [ text:field "text" ; text:predicate sdtms:supportedBySEND ]
    [ text:field "text" ; text:predicate sdtmigs:references ]
              [ text:field "text" ; text:predicate sdtmigs:domainStructure ]
              [ text:field "text" ; text:predicate sdtmigs:domainCode ]
              [ text:field "text" ; text:predicate
    sdtmigs:controlledTermsOrFormat ]
              [ text:field "text" ; text:predicate
    sends:dataElementCompliance ]
              [ text:field "text" ; text:predicate sends:dataElementRole ]
              [ text:field "text" ; text:predicate sendigs:domainStructure ]
              [ text:field "text" ; text:predicate sendigs:domainCode ]
              [ text:field "text" ; text:predicate
    sendigs:controlledTermsOrFormat ]
              [ text:field "text" ; text:predicate cts:cdiscDefinition]
              [ text:field "text" ; text:predicate cts:nciPreferredTerm]
              [ text:field "text" ; text:predicate cts:nciCode]
              [ text:field "text" ; text:predicate cts:cdiscSynonyms]
              [ text:field "text" ; text:predicate cts:cdiscSubmissionValue]
              [ text:field "text" ; text:predicate cts:codelistName]
              [ text:field "text" ; text:predicate cts:isExtensibleCodelist]
              ) .


    I then try to run queries against this dataset, as an example say I
    want to search "AE" then I would expect every dataElement within the
    AE domain to be returned. However, I cannot get the desired result.
    If I search:

    PREFIX : <http://localhost/jena_example/#> PREFIX text:
    <http://jena.apache.org/text#> PREFIX mms:
    <http://rdf.cdisc.org/mms#> SELECT * {?s text:query
    (mms:dataElementName 'AE')}

    I get:

    <http://rdf.cdisc.org/sdtmig-3-1-2/std#Column.AE.DOMAIN>
    <http://rdf.cdisc.org/sdtmig-3-1-2/std#Table.AE>
    <http://rdf.cdisc.org/sdtmig-3-1-2/std#Column.FA.FACAT>

    when I would expect to get:

    <http://rdf.cdisc.org/sdtmig-3-1-2/std#Column.AE.AERELNST>
    <http://rdf.cdisc.org/sdtmig-3-1-2/std#Column.AE.AEENDY>
    <http://rdf.cdisc.org/sdtmig-3-1-2/std#Column.AE.AEMODIFY>
    <http://rdf.cdisc.org/sdtmig-3-1-2/std#Column.AE.AETOXGR>
    <http://rdf.cdisc.org/sdtmig-3-1-2/std#Column.AE.AEREFID>
    <http://rdf.cdisc.org/sdtmig-3-1-2/std#Column.AE.AESCAT>
    <http://rdf.cdisc.org/sdtmig-3-1-2/std#Column.AE.AESEQ>
    <http://rdf.cdisc.org/sdtmig-3-1-2/std#Column.AE.AESMIE>
    (And the rest of the .AE dataElements just listed a few here)

    I also tried playing with this query a lot, but could not get the
    desired result for example I tried the other form of query as well:

    PREFIX : <http://localhost/jena_example/#> PREFIX text:
    <http://jena.apache.org/text#> PREFIX mms:
    <http://rdf.cdisc.org/mms#> SELECT * {?subject mms:contextName ?o .
    ?s text:query (mms:contextName 'SE')}


    I am not sure whether the problem is a result of my query being
    formed incorrectly, or whether the problem could be in my assembler
    file that creates the index (is there a better/more complete way to
    create an index for this rdf model?). Any suggestions would help,
    like I mentioned in the beginning one of the rdf files from tdb is
    attached. Thanks.



Reply via email to