Re: DB2, RDF, and Jena?

Robert Vesse Tue, 03 Apr 2012 15:12:10 -0700

I think the Cray uRiKA comment seems to be a random aside

We don't run TDB anyway, we have a completely custom database and query engine. 
 Fuseki is used purely as a way to wrap the database as a SPARQL endpoint.


I would want to see IBM's numbers for LUBM and SP2B before trying to make any 
real comparisons

Rob Vesse -- YarcData.com<http://YarcData.com> -- A Division of Cray Inc
Software Engineer, Bay Area
m: 925.960.3941  |  o: 925.264.4729 | @: 
rve...@yarcdata.com<mailto:rve...@yarcdata.com>  |  Skype: rvesse
6210 Stoneridge Mall Rd  |  Suite 120  | Pleasanton CA, 94588

[cid:BC8A5E0A-0B0A-43B5-87B5-93039573F2CD@americas.cray.com]

On Apr 3, 2012, at 2:59 PM, Marco Neumann wrote:

So Timothy Prickett Morgan claims that DB2 outperforms TDB in a Cray
Urika setup by factor 3.5 on some unspecified SPARQL 1.0 query?


On Tue, Apr 3, 2012 at 5:48 PM, Andy Seaborne 
<a...@apache.org<mailto:a...@apache.org>> wrote:
http://www.theregister.co.uk/2012/04/03/ibm_db2_10_infosphere_10/

------------------------------------------
The Apache Jena project is a Java framework for building semantic web
applications based on graph data, and Apache Fuseki is the SPARQL server
that processes the SPARQL queries and spits out the relationships so they
can be visualized in some fashion. (Cray's new Urika system, announced in
March, runs this Apache graph analysis stack on top of a massively
multithreaded server.)

Just like they imported objects and XML into the DB2 database so they could
be indexed and processed natively, IBM is now bringing in the RDF format so
that graph triples can be stored natively.

As IBM explains it – not strictly grammatically, to some English majors – a
triple has a noun, a verb, and a predicate, such as Tim (noun) has won
(verb) the MegaMillions lottery (predicate). You can then query all aspects
of a set of triples to see who else has won MegaMillions – a short list, in
this case.

In tests among DB2 10.1 early adopters, applications that used these graph
triples ran about 3.5 times faster on DB2 than on the Jena TDB data store
(short for triple database, presumably) with SPARQL 1.0 hitting it for
queries.
------------------------------------------

(Dear Reg, TDB does not stand for "triple database")

       Andy



--


---
Marco Neumann
KONA

Join us at SemTech Biz in San Francisco June 3-7 2012 and save 15%
with the lotico community discount code 'STMN'
http://www.lotico.com/evt/SemTechSF2012/

Re: DB2, RDF, and Jena?

Reply via email to