Hi Hugh, Omid, all,
Hugh: You beat me to the punch there, I was typing my reply of how I
finally got the datasets loading (yes, i had to edit the isql port
numbers from 1118 in your .sh files to 1111) when I got your mail.
Thanks so much! I compiled wget from the http://www.gnu.org/software/wget/
and that should work the loading is still chugging along, and
should be finished in a hour or so.
I have a few more questions for Hugh, and everyone. I know I'm being a
pain, but please bear with me while I work this out.:
1)
load_dbpedia.sh had this line:
exec="ttlp_mt (file_to_string_output ('$f'), '', 'http://
dbpedia.org');" > temp.res
as well as this one:
exec="sparql select count(*) where { ?s <http://dbpedia.org/property/wordnet_type
> ?o};" > temp.res
Should I be changing the graph URI to something else? How do I do
that? I am still stumped by the massiveness of Virtuoso, and I'm not
sure how to access the dbpedia datasets once they are loaded. I know I
can call sparql queries via http://localhost:8890/sparql, but should I
be concerned with defining the name of the graph uri?
Will you also be able to give some really quick examples on how to
call sparql queries on the local dataset? I expect them to run the
same as dbpedia.org's public query page (http://dbpedia.org/sparql),
but just in case.
2)
If I am importing just shortabstracts_en, articles_label_en and
image_en, does the census processing portion of post_install.sh still
apply to my case?
3)
Eventually I intend to call the sparql queries via php5 or java. Is
there a quick way to do it from the Virtuoso Open Source server, or
should I look at alternatives like Perl? My main application is a
local Java applet that will send out the queries, and read back the
results for further processing.
Thank you.
Omid, to your response: There were a number of main issues I
encountered, and I still have problems and questions myself, but I can
provide the following points. I'm still waiting for the datasets to
finish loading, and in time, when I have everything running smoothly I
will post a more concise list here:
1) Read Virtuoso's README file, especially re: the required libraries
before you start configuring and building the binaries. I compiled it
on Leopard (10.5.2), and it works. Took a long time though, about 1.5
hours to get it all compiled and installed. Stop Leopard's Apache
server (if it's running) if you are concerned in it conflicting with
the build, but it should be ok to leave it on Virtuoso sets itself
up at localhost:8890 by default.
2) To set up the Virtuoso server, you can copy out the virtuoso.ini
file (located in /yourvirtuosoinstallprefix/var/lib/virtuoso/db) to a
location where you want your database stored, rename that to
dbpedia.ini and then type "sudo virtuoso-t -c dbpedia.ini -f
" (excluding quotes; I also omitted the & at the end as I wanted to
see in Terminal the status messages of the server while it starts up).
3) Unpack the .sh script files Hugh kindly provided. You will have to
go into the .sh files to modify the port number. For me, Virtuoso set
the default port of 1111 and so I had to update that in the .sh files
before I could execute them successfully.
4) And check the points Hugh made below.
Cheers,
Andrew
On Mar 4, 2008, at 6:51 PM, Hugh Williams wrote:
Hi Andrew,
You need to make the following changes:
As you probably found the sparql_demo_dav.vad and conductor_dav.vad
packages need to be copied to the /usr/local/virtuoso-opensource/
share/virtuoso/vad folder this being where the dbpedia.ini file
seraches for VAD packages by default.
You also need to edit of the dbpedia script files (pre_install.sh,
load_dbpedia.sh, post_install.sh), as I have found they reference a
default port of 1118, and set this to be the SQL port your Virtuoso
server is running to enable it to be contact on the correct port for
loading the dbpedia datasets.
Edit the load_dbpedia.sh script which references the server to start
as virtuoso-iodbc-t (virtuoso binary with iODBC support) and change
this to virtuoso-t, the default open source binary name.
The post_install.sh script uses wget which is no longer shipped on
Mac OS X 10.4+, with curl shipped in preference. I suspect curl
would work if the script is amedned, although I download a copy of
wget from the following URL and added to my /usr/bin folder:
http://krell.cellsandbytes.net/wp-content/uploads/wget_1.10.2.tar.gz
With these 4 changes I was then able to load one of the Dbpedia
datasets successful as seen below, so if you place all the Dbpedia
datasets into your data directory it should then load them
successfully also:
Universal Data Access & Data Integration Technology Providers
-------------------------------------------------------------------------
This SF.net email is sponsored by: Microsoft
Defy all challenges. Microsoft(R) Visual Studio 2008.
http://clk.atdmt.com/MRT/go/vse0120000070mrt/direct/01/
_______________________________________________
Dbpedia-discussion mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/dbpedia-discussion