Files in a tar are in RDF/XML or Turtle.

Yasunori

2019/08/07 3:11、ajs6f <[email protected]>のメール:

In what format are these RDF files?

ajs6f

> On Aug 6, 2019, at 10:05 AM, Yasunori Yamamoto <[email protected]> wrote:
>
> Hello, I'm trying to learn how to parse RDF data archived in a tar.gz
> file (e.g., rdfdatasets.tar.gz that contains a set of RDF data files)
> within my Java program.
> The following code does work properly, but it is inefficient because
> the process reads and loads the entire RDF data in an entry of the
> given tar.gz file into a main memory before parsing.
> So, could you please let me know a better way to save a memory space ?
>
> TarArchiveInputStream tarInput = new TarArchiveInputStream(new
> GzipCompressorInputStream(new FileInputStream(filename)));
> TarArchiveEntry currentEntry;
> PipedRDFIterator<Triple> iter = new
> PipedRDFIterator<Triple>(buffersize, false, pollTimeout, maxPolls);
> final PipedRDFStream<Triple> inputStream = new PipedTriplesStream(iter);
>
> while ((currentEntry = tarInput.getNextTarEntry()) != null) {
> String currentFile = currentEntry.getName();
> Lang lang = RDFLanguages.filenameToLang(currentFile);
> parser_object = RDFParserBuilder
>   .create()
>   .errorHandler(ErrorHandlerFactory.errorHandlerDetailed())
>   .source(new StringReader(CharStreams.toString(new
> InputStreamReader(tarInput))))
>   .checking(checking)
>   .lang(lang)
>   .build();
> parser_object.parse(inputStream);
> }
> tarInput.close();
>
> Sincerely yours,
> Yasunori Yamamoto

Reply via email to