Right, I'm suggesting that perhaps it should at least try to parse the file when it ends in .xml, as well. I'm not addressing the question of file size.
--- A. Soroka The University of Virginia Library > On Mar 24, 2016, at 10:16 AM, Mikael Pesonen <[email protected]> > wrote: > > > Hi, > > s-put succeeds with smaller file when .xml renamed to .rdf. Tested with a > subset of ~1 million triplets. Entire file is ~100 million triplets. > > Br, > Mikael > > > On 24.3.2016 15:41, A. Soroka wrote: >> I seem to remember that the list has received a question like this before. >> Perhaps s-put should try to parse *.xml files as RDF/XML, and only fail if >> that can't be done? >> >> --- >> A. Soroka >> The University of Virginia Library >> >>> On Mar 24, 2016, at 7:41 AM, Mikael Pesonen <[email protected]> >>> wrote: >>> >>> >>> Hi Osma! >>> >>> Well that was an easy solution that worked. Thanks! >>> >>> Mikael >>> >>> >>> On 24.3.2016 13:25, Osma Suominen wrote: >>>> Hi Mikael! >>>> >>>> Try renaming the file to .rdf instead of .xml. It's likely that s-put >>>> doesn't recognize the file extension .xml - after all, it could be any >>>> kind of XML, not just RDF/XML. >>>> >>>> -Osma >>>> >>>> On 24/03/16 11:31, Mikael Pesonen wrote: >>>>> Hi, >>>>> >>>>> sorry for missing info. So I'm trying to: >>>>> >>>>> /apache-jena-fuseki-2.3.1$ bin/s-put http://localhost:3030/ds/data >>>>> http://www.lingsoft.fi/geonames/ ./tmp.xml >>>>> >>>>> >>>>> tmp.xml is a geonames entry: >>>>> >>>>> <?xml version="1.0" encoding="UTF-8" standalone="no"?> >>>>> <rdf:RDF xmlns:cc="http://creativecommons.org/ns#" >>>>> xmlns:dcterms="http://purl.org/dc/terms/" >>>>> xmlns:foaf="http://xmlns.com/foaf/0.1/" >>>>> xmlns:gn="http://www.geonames.org/ontology#" >>>>> xmlns:owl="http://www.w3.org/2002/07/owl#" >>>>> xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" >>>>> xmlns:rdfs="http://www.w3.org/2000/01/rdf-schema#" >>>>> xmlns:wgs84_pos="http://www.w3.org/2003/01/geo/wgs84_pos#"> >>>>> <gn:Feature rdf:about="http://sws.geonames.org/3/"><rdfs:isDefinedBy >>>>> rdf:resource="http://sws.geonames.org/3/about.rdf"/> >>>>> <gn:name>Zamīn Sūkhteh</gn:name><gn:alternateName xml:lang="fa">زمين >>>>> سوخته</gn:alternateName> >>>>> <gn:alternateName xml:lang="fa">Zamīn >>>>> Sūkhteh</gn:alternateName><gn:featureClass >>>>> rdf:resource="http://www.geonames.org/ontology#S"/> >>>>> <gn:featureCode rdf:resource="http://www.geonames.org/ontology#S.CRRL"/> >>>>> <gn:countryCode>IR</gn:countryCode> >>>>> <wgs84_pos:lat>32.45831</wgs84_pos:lat> >>>>> <wgs84_pos:long>48.96335</wgs84_pos:long> >>>>> <gn:parentFeature rdf:resource="http://sws.geonames.org/127082/"/> >>>>> <gn:parentCountry rdf:resource="http://sws.geonames.org/130758/"/> >>>>> <gn:parentADM1 rdf:resource="http://sws.geonames.org/127082/"/> >>>>> <gn:nearbyFeatures rdf:resource="http://sws.geonames.org/3/nearby.rdf"/> >>>>> <gn:locationMap >>>>> rdf:resource="http://www.geonames.org/3/zamin-sukhteh.html"/> >>>>> </gn:Feature> >>>>> </rdf:RDF> >>>>> >>>>> >>>>> And error comes from Ruby: >>>>> >>>>> /usr/lib/ruby/1.9.1/net/http.rb:1436:in `block in >>>>> initialize_http_header': undefined method `strip' for nil:NilClass >>>>> (NoMethodError) >>>>> from /usr/lib/ruby/1.9.1/net/http.rb:1434:in `each' >>>>> from /usr/lib/ruby/1.9.1/net/http.rb:1434:in >>>>> `initialize_http_header' >>>>> from bin/s-put:205:in `send_body' >>>>> from bin/s-put:164:in `PUT' >>>>> from bin/s-put:424:in `cmd_soh' >>>>> from bin/s-put:703:in `<main>' >>>>> >>>>> >>>>> XML looks like valid, but is s-put missing some info on what that XML is? >>>>> >>>>> Br, >>>>> Mikael >>>>> >>>>> >>>>> >>>>> >>>>> >>>>> On 18.3.2016 13:32, Andy Seaborne wrote: >>>>>>> No idea? I need to update data to either running database or make >>>>>>> a new db. >>>>>> (to a message 3 days ago ...) >>>>>> >>>>>> "this does not work" is a bit minimal. >>>>>> >>>>>> What does work? Other s-* commands? Other files? >>>>>> >>>>>> I'd guess that ".xml" is not recognized as RDF. It's not the right >>>>>> file extension. The MIME type must be for the request. There's some >>>>>> kind of determination in the soh script. >>>>>> >>>>>>> When trying to start another server to port 3031 server complains >>>>>>> >>>>>>> org.apache.jena.tdb.TDBException: Can't open database at location >>>>>>> /home/text/tools/apache-jena-fuseki-2.3.1/run/system/ >>>>>> You can't have two servers running on the same TDB files at the same >>>>>> time. (For that matter, you can't do that with MySQL either - you need >>>>>> a server process to mediate requests). >>>>>> >>>>>> Andy >>>>>> >>>>>>> Mikael >>>>>>> >>>>>>> On 15.3.2016 13:40, Mikael Pesonen wrote: >>>>>>>> Hi, >>>>>>>> >>>>>>>> okay thats good to know. I tried with s-put >>>>>>>> >>>>>>>> apache-jena-fuseki-2.3.1$ bin/s-put http://localhost:3030/ds/data >>>>>>>> http://www.lingsoft.fi/geonames/ tmp.xml >>>>>>>> >>>>>>>> /usr/lib/ruby/1.9.1/net/http.rb:1436:in `block in >>>>>>>> initialize_http_header': undefined method `strip' for nil:NilClass >>>>>>>> (NoMethodError) >>>>>>>> from /usr/lib/ruby/1.9.1/net/http.rb:1434:in `each' >>>>>>>> from /usr/lib/ruby/1.9.1/net/http.rb:1434:in >>>>>>>> `initialize_http_header' >>>>>>>> from bin/s-put:205:in `send_body' >>>>>>>> from bin/s-put:164:in `PUT' >>>>>>>> from bin/s-put:424:in `cmd_soh' >>>>>>>> from bin/s-put:703:in `<main>' >>>>>>>> >>>>>>>> tmp.xml contains one entry from Geonames dump: >>>>>>>> >>>>>>>> <?xml version="1.0" encoding="UTF-8" standalone="no"?><rdf:RDF >>>>>>>> xmlns:cc="http://creativecommons.org/ns#" >>>>>>>> xmlns:dcterms="http://purl.org/dc/terms/" >>>>>>>> xmlns:foaf="http://xmlns.com/foaf/0.1/" >>>>>>>> xmlns:gn="http://www.geonames.org/ontology#" >>>>>>>> xmlns:owl="http://www.w3.org/2002/07/owl#" >>>>>>>> xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" >>>>>>>> xmlns:rdfs="http://www.w3.org/2000/01/rdf-schema#" >>>>>>>> xmlns:wgs84_pos="http://www.w3.org/2003/01/geo/wgs84_pos#"><gn:Feature >>>>>>>> rdf:about="http://sws.geonames.org/3/"><rdfs:isDefinedBy >>>>>>>> rdf:resource="http://sws.geonames.org/3/about.rdf"/><gn:name>Zamīn >>>>>>>> Sūkhteh</gn:name><gn:alternateName xml:lang="fa">زمين >>>>>>>> سوخته</gn:alternateName><gn:alternateName xml:lang="fa">Zamīn >>>>>>>> Sūkhteh</gn:alternateName><gn:featureClass >>>>>>>> rdf:resource="http://www.geonames.org/ontology#S"/><gn:featureCode >>>>>>>> rdf:resource="http://www.geonames.org/ontology#S.CRRL"/><gn:countryCode>IR</gn:countryCode><wgs84_pos:lat>32.45831</wgs84_pos:lat><wgs84_pos:long>48.96335</wgs84_pos:long><gn:parentFeature >>>>>>>> >>>>>>>> rdf:resource="http://sws.geonames.org/127082/"/><gn:parentCountry >>>>>>>> rdf:resource="http://sws.geonames.org/130758/"/><gn:parentADM1 >>>>>>>> rdf:resource="http://sws.geonames.org/127082/"/><gn:nearbyFeatures >>>>>>>> rdf:resource="http://sws.geonames.org/3/nearby.rdf"/><gn:locationMap >>>>>>>> rdf:resource="http://www.geonames.org/3/zamin-sukhteh.html"/></gn:Feature></rdf:RDF> >>>>>>>> >>>>>>>> >>>>>>>> >>>>>>>> Br, >>>>>>>> Mikael >>>>>>>> >>>>>>>> >>>>>>>> On 15.3.2016 13:21, Andy Seaborne wrote: >>>>>>>>> On 15/03/16 10:40, Mikael Pesonen wrote: >>>>>>>>>> Hi, >>>>>>>>>> >>>>>>>>>> is it possible to add content to graph from RDF XML with command line >>>>>>>>>> tools? s-put requires SPARQL and tdbloader says >>>>>>>>>> >>>>>>>>>> org.apache.jena.tdb.TDBException: Can't open database at location >>>>>>>>>> /home/text/tools/apache-jena-3.0.1/DB/ as it is already locked by the >>>>>>>>>> process with PID 7672. TDB databases do not permit concurrent usage >>>>>>>>>> across JVMs so in order to prevent possible data corruption you >>>>>>>>>> cannot >>>>>>>>>> open this location from the JVM that does not own the lock for the >>>>>>>>>> dataset >>>>>>>>>> >>>>>>>>>> Br, >>>>>>>>>> Mikael >>>>>>>>>> >>>>>>>>> >>>>>>>>> Yes - use s-put or s-post. >>>>>>>>> >>>>>>>>> These are the SPARQL Graph Store Protocol - no query language, no >>>>>>>>> update language. >>>>>>>>> >>>>>>>>> All they do is HTTP PUT or POST to the right graph name and the right >>>>>>>>> content type. >>>>>>>>> >>>>>>>>> You can PUT and POST to the dataset itself as well using curl or wget >>>>>>>>> or any HTTP tool. You need to set the Content-type header. >>>>>>>>> >>>>>>>>> Andy >>>>>>>>> >>>> >>> -- >>> www.lingsoft.fi >>> >>> Speech Applications - Language Management - Translation - Reader's and >>> Writer's Tools - Text Tools - E-books and M-books >>> >>> Mikael Pesonen >>> System Engineer >>> >>> e-mail: [email protected] >>> Tel. +358 2 279 3300 >>> >>> Time zone: GMT+2 >>> >>> Helsinki Office >>> Eteläranta 10 >>> FI-00130 Helsinki >>> FINLAND >>> >>> Turku Office >>> Linnankatu 10 A >>> FI-20100 Turku >>> FINLAND >>> > > -- > www.lingsoft.fi > > Speech Applications - Language Management - Translation - Reader's and > Writer's Tools - Text Tools - E-books and M-books > > Mikael Pesonen > System Engineer > > e-mail: [email protected] > Tel. +358 2 279 3300 > > Time zone: GMT+2 > > Helsinki Office > Eteläranta 10 > FI-00130 Helsinki > FINLAND > > Turku Office > Linnankatu 10 A > FI-20100 Turku > FINLAND >
