Re: [Hdf-forum] Efficient serialization of HDF5 data

Андрей Парамонов Tue, 05 Dec 2017 06:13:46 -0800

Hello Michaël!

04.12.2017 21:23, Michaël Melchiore wrote:

I am building an application which operates on NetCDF data using Big Data technologies.
My design aims at avoiding unnecessarily writing data to disk. Instead, I want to operate as much as possible in memory. The challenge is data (de)serialization for distributed communications between computing nodes.
Since NetCDF4 and HDF5 already provide a portable data format, a simple and efficient design would simply access and then exchange the raw binary data over the network.
Currently, I fail to access this buffer without creating files. I am investigating the use of the Apache Commons VFS RAM file system to trick NetCDF into working in memory.
But a suggestion on the NetCDF Java mailing list (see ticket MQO-415619) was to build an alternative to the core driver. I feel this is the more desirable course of action, as it is about improving the existing solutions instead of working around their limitations.
Do you think this approach is feasible? Any starting pointers would be appreciated!

I am probably not a distinguished expert in HDF5, but I would venture to suggest that you check

https://www.hdfgroup.org/downloads/spark-connector/

It would be great if you could share your experience, and whether the Spark connector helped you implement in-memory processing.
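
Regarding the core driver mentioned in your message: below is a rough sketch of what exchanging the raw HDF5 bytes without touching disk could look like. Please note the assumptions: it is Python with h5py and NumPy rather than NetCDF Java, it needs an h5py version that accepts file-like objects (2.9 or later, as far as I know), and the names serialize, deserialize, "in_memory.h5" and "temperature" are made up for illustration. It only shows the idea, not a tested design for your application.

```python
import io

import h5py
import numpy as np


def serialize(name, array):
    """Build an HDF5 file purely in memory and return its raw bytes."""
    # driver="core" keeps the whole file in RAM; backing_store=False means
    # nothing is written to disk on close. The file name is only a label.
    with h5py.File("in_memory.h5", "w", driver="core", backing_store=False) as f:
        f.create_dataset(name, data=array)
        f.flush()  # make sure the in-memory image is complete before copying
        return f.id.get_file_image()


def deserialize(image, name):
    """Open HDF5 bytes received from another node without creating a file."""
    with h5py.File(io.BytesIO(image), "r") as f:
        return f[name][...]  # read the full dataset into a NumPy array


original = np.arange(12, dtype="f8").reshape(3, 4)
payload = serialize("temperature", original)   # bytes, ready to send
restored = deserialize(payload, "temperature")  # on the receiving node
```

Here payload is an ordinary bytes object holding a complete HDF5 file image, so it can be sent over whatever transport the computing nodes already use. Whether the NetCDF Java layer can be pointed at such a buffer is a separate question which I cannot answer.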


Best wishes,
Andrey Paramonov


