[
https://issues.apache.org/jira/browse/JENA-1930?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Andy Seaborne closed JENA-1930.
-------------------------------
Resolution: Feedback Received
Thanks for the feedback!
> How to load data and then start a fuseki server with that dataset and use it?
> -----------------------------------------------------------------------------
>
> Key: JENA-1930
> URL: https://issues.apache.org/jira/browse/JENA-1930
> Project: Apache Jena
> Issue Type: Question
> Affects Versions: Jena 3.14.0, Jena 3.15.0
> Reporter: Wolfgang Fahl
> Priority: Minor
> Attachments: fusekiscreenshot2020-06-26.png
>
>
> With my (unfortunately not successful) tdb2.tdbloader experience for wikidata
> I modified my script to load a smaller dataset like shown below (gnd2jena).
> The output log seems to be ok.
> {code:java}
> 15:01:25 INFO loader :: Add: 9,500,000 authorities-kongress_lds.ttl (Batch:
> 28,457 / Avg: 33,975)
> 15:01:29 INFO loader :: Finished: authorities-kongress_lds.ttl: 9,556,155
> tuples in 282.97s (Avg: 33,771)
> 15:02:06 INFO loader :: Finish - index SPO
> 15:02:12 INFO loader :: Finish - index POS
> 15:02:12 INFO loader :: Finish - index OSP
> 15:02:12 INFO loader :: Time = 326.504 seconds : Triples = 9,556,155 : Rate =
> 29,268 /s{code}
> Now I am trying to run a fuseki server using the data like this:
> {code:java}
> #!/bin/bash
> # WF 2020-06-25
> # Jena Fuseki server installation
> # see https://jena.apache.org/documentation/fuseki2/fuseki-run.html
> version=3.14.0
> fuseki=apache-jena-fuseki-$version
> if [ ! -d $fuseki ]
> then
> if [ ! -f $fuseki.tar.gz ]
> then
> wget http://archive.apache.org/dist/jena/binaries/$fuseki.tar.gz
> else
> echo $fuseki.tar.gz already downloaded
> fi
> echo "unpacking $fuseki.tar.gz"
> tar xvfz $fuseki.tar.gz
> else
> echo $fuseki already downloaded and unpacked
> fi
> cd $fuseki
> java -jar fuseki-server.jar --tdb2 --loc=../data /gnd{code}
> Please note that I used version 3.14.0 here given that there where reports
> about 3.15.0 needing some kind of patching that i hoped to avoid by using the
> previous version assuming that the tdbstore would still be compatible or a
> message would show if not.
> At the servers port 3030 a user interface shows up showing that the server
> status is ok.
> !fusekiscreenshot2020-06-26.png!
> When clicking "Manage datasets" i get two options:
> * existing datasets
> * add new dataset
> None of the two buttons shows and effect. Clicking does not show any visible
> reaction. I would have expected I would be able to work with the imported
> dataset immediately (not even knowing whether you'd call the import a dataset
> ...)
> I already found some hint that I'd have to change the config.ttl manually to
> get the desired effect. I find it quite confusing that the web UI does not
> give any hints on this. I placed this as a question - not knowing whether
> this would end up as a feature request or bug ...
> *gnd2jena*
>
> {code:java}
> #!/bin/bash
> # WF 2020-05-10
> # global settings
> jena=apache-jena-3.15.0
> tgz=$jena.tar.gz
> jenaurl=http://mirror.easyname.ch/apache/jena/binaries/$tgz
> base=/hd/luxio/gnd
> data=$base/data
> tdbloader=$jena/bin/tdb2.tdbloader
> getjena() {
> # download
> if [ ! -f $tgz ]
> then
> echo "downloading $tgz from $jenaurl"
> wget $jenaurl
> else
> echo "$tgz already downloaded"
> fi
> # unpack
> if [ ! -d $jena ]
> then
> echo "unpacking $jena from $tgz"
> tar xvzf $tgz
> else
> echo "$jena already unpacked"
> fi
> # create data directory
> if [ ! -d $data ]
> then
> echo "creating $data directory"
> mkdir -p $data
> else
> echo "$data directory already created"
> fi
> }
> #
> # show the given timestamp
> #
> timestamp() {
> local msg="$1"
> local ts=$(date -u +"%Y-%m-%dT%H:%M:%SZ")
> echo "$msg at $ts"
> }
> #
> # load data for the given data dir and input
> #
> loaddata() {
> local data="$1"
> local input="$2"
> timestamp "start loading $input to $data"
> $tdbloader --loader=parallel --loc "$data" "$input" > tdb2-$phase-out.log 2>
> tdb2-$phase-err.log
> timestamp "finished loading $input to $data"
> }
> getjena
> export TMPDIR=$base/tmp
> if [ ! -d $TMPDIR ]
> then
> echo "creating temporary directory $TMPDIR"
> mkdir $TMPDIR
> else
> echo "using temporary directory $TMPDIR"
> fi
> if [ ! -f authorities-kongress_lds.ttl ]
> then
> wget https://data.dnb.de/opendata/authorities-kongress_lds.ttl.gz
> gunzip authorities-kongress_lds.ttl.gz
> fi
> loaddata $data authorities-kongress_lds.ttl
> {code}
>
--
This message was sent by Atlassian Jira
(v8.3.4#803005)