Hi Hugh, Omid, all,

Hugh: You beat me to the punch there, I was typing my reply of how I finally got the datasets loading (yes, i had to edit the isql port numbers from 1118 in your .sh files to 1111) when I got your mail. Thanks so much! I compiled wget from the http://www.gnu.org/software/wget/ and that should work – the loading is still chugging along, and should be finished in a hour or so.

I have a few more questions for Hugh, and everyone. I know I'm being a pain, but please bear with me while I work this out.:

1)
load_dbpedia.sh had this line:

exec="ttlp_mt (file_to_string_output ('$f'), '', 'http:// dbpedia.org');" > temp.res

as well as this one:

exec="sparql select count(*) where { ?s <http://dbpedia.org/property/wordnet_type > ?o};" > temp.res

Should I be changing the graph URI to something else? How do I do that? I am still stumped by the massiveness of Virtuoso, and I'm not sure how to access the dbpedia datasets once they are loaded. I know I can call sparql queries via http://localhost:8890/sparql, but should I be concerned with defining the name of the graph uri?

Will you also be able to give some really quick examples on how to call sparql queries on the local dataset? I expect them to run the same as dbpedia.org's public query page (http://dbpedia.org/sparql), but just in case.


2)
If I am importing just shortabstracts_en, articles_label_en and image_en, does the census processing portion of post_install.sh still apply to my case?


3)
Eventually I intend to call the sparql queries via php5 or java. Is there a quick way to do it from the Virtuoso Open Source server, or should I look at alternatives like Perl? My main application is a local Java applet that will send out the queries, and read back the results for further processing.



Thank you.



Omid, to your response: There were a number of main issues I encountered, and I still have problems and questions myself, but I can provide the following points. I'm still waiting for the datasets to finish loading, and in time, when I have everything running smoothly I will post a more concise list here:

1) Read Virtuoso's README file, especially re: the required libraries before you start configuring and building the binaries. I compiled it on Leopard (10.5.2), and it works. Took a long time though, about 1.5 hours to get it all compiled and installed. Stop Leopard's Apache server (if it's running) if you are concerned in it conflicting with the build, but it should be ok to leave it on – Virtuoso sets itself up at localhost:8890 by default.

2) To set up the Virtuoso server, you can copy out the virtuoso.ini file (located in /yourvirtuosoinstallprefix/var/lib/virtuoso/db) to a location where you want your database stored, rename that to dbpedia.ini and then type "sudo virtuoso-t -c dbpedia.ini -f " (excluding quotes; I also omitted the & at the end as I wanted to see in Terminal the status messages of the server while it starts up).

3) Unpack the .sh script files Hugh kindly provided. You will have to go into the .sh files to modify the port number. For me, Virtuoso set the default port of 1111 and so I had to update that in the .sh files before I could execute them successfully.

4) And check the points Hugh made below.


Cheers,
Andrew


On Mar 4, 2008, at 6:51 PM, Hugh Williams wrote:
Hi Andrew,

You  need to make the following changes:

As you probably found the sparql_demo_dav.vad and conductor_dav.vad packages need to be copied to the /usr/local/virtuoso-opensource/ share/virtuoso/vad folder this being where the dbpedia.ini file seraches for VAD packages by default. You also need to edit of the dbpedia script files (pre_install.sh, load_dbpedia.sh, post_install.sh), as I have found they reference a default port of 1118, and set this to be the SQL port your Virtuoso server is running to enable it to be contact on the correct port for loading the dbpedia datasets. Edit the load_dbpedia.sh script which references the server to start as virtuoso-iodbc-t (virtuoso binary with iODBC support) and change this to virtuoso-t, the default open source binary name. The post_install.sh script uses “wget” which is no longer shipped on Mac OS X 10.4+, with “curl” shipped in preference. I suspect “curl “ would work if the script is amedned, although I download a copy of “wget” from the following URL and added to my /usr/bin folder:

            http://krell.cellsandbytes.net/wp-content/uploads/wget_1.10.2.tar.gz

With these 4 changes I was then able to load one of the Dbpedia datasets successful as seen below, so if you place all the Dbpedia datasets into your ‘data’ directory it should then load them successfully also:

Universal Data Access & Data Integration Technology Providers

-------------------------------------------------------------------------
This SF.net email is sponsored by: Microsoft
Defy all challenges. Microsoft(R) Visual Studio 2008.
http://clk.atdmt.com/MRT/go/vse0120000070mrt/direct/01/
_______________________________________________
Dbpedia-discussion mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/dbpedia-discussion

Reply via email to