First pass results for parsing from a file to a null sink, no tuning or profiling. Jena java level Triple objects and all nodes are created.

RIOT (128K IO buffer)
bsbm-25m.nt.gz : 127,082 Triples per second (TPS)
bsbm-25m.nt:     133,104 TPS

RDF Thrift (32K IO buffer)
bsbm-25m.rt:     357,101 TPS  x2.8
bsbm-25m.rt.gz:  390,578 TPS  x2.9

RDF Thrift (128K IO buffer)
bsbm-25m.rt:     409,788 TPS  x3.2
bsbm-25m.rt.gz:  389,969 TPS  x2.9

and best
gzip -d bsbm-25m.rt.gz | thrift2rdf (128K IO buffer)
  490,138 TPS

File sizes:
bsbm-25m.nt:     6,505,289,318 bytes (6.1G)
bsbm-25m.nt.gz:    691,429,780 bytes (660M)

bsbm-25m.rt:     6,684,543,995 bytes (6.3G)
bsbm-25m.rt.gz:    700,639,242 bytes (669M)

        Andy

Reply via email to