Re: Is StreamRDFWriter.write() thread-safe?

Zak Mc Kracken Sun, 26 Nov 2017 06:16:32 -0800

Hi Andy,

thank you for your reply. Good to know. My use case is an RDF exporterthat takes data from a relatively slow data source (like a DBMS). Inorder to speed things up, it has multiple threads reading data,converting it to RDF and then sending generated RDF to their own JenaModel (one per thread). At the end, they stream the model to a commonsink/stream, such as a file.

Actually I'm designing this with some flexibility: one can chose to passa java.util.function.Consumer<Model> to the exporter, that is, anhandler that does something with a thread model, once it is ready.That's because, I want to reuse the upstream processing for either anRDF file exporter, or a Neo4J uploader (which should be able to manageconcurrent writings at a finer grain level), or, in general, some otherkind of converter.

That said, I'm OK with making the file writing part synchronized andhence non really parallel, my question was to understand it better howJena works with this.


Best,
Marco.

On 26/11/2017 11:14, Andy Seaborne wrote:

If the output stream is shared, then no.  It's buffered internally.
So at small scale, it'll look safe because the whole output is onebuffer or the order was OK. But beyond that, the buffered flusheswill be interleaved and buffer boundaries are based on characters, notlogical unit of the RDF output.
Parallel writing to a shared OutputStream is a bad idea.

What's the use case you have for a shared output stream?

    Andy

Re: Is StreamRDFWriter.write() thread-safe?

Reply via email to