Hi Konrad,

h5dump with the -p option may give you what you want. Combining it with -H will 
remove the data from the output, and you can add -d to limit the output to a 
specific dataset or -g for a specific group. The output for a dataset looks like this:

         DATASET "BeamLatitude" {
            DATATYPE  H5T_IEEE_F32BE
            DATASPACE  SIMPLE { ( 24, 96, 5 ) / ( H5S_UNLIMITED, H5S_UNLIMITED, H5S_UNLIMITED ) }
            STORAGE_LAYOUT {
               CHUNKED ( 12, 48, 5 )
               SIZE 46080
            }
            FILTERS {
               NONE
            }
            FILLVALUE {
               FILL_TIME H5D_FILL_TIME_IFSET
               VALUE  -999.3
            }
            ALLOCATION_TIME {
               H5D_ALLOC_TIME_INCR
            }
         }

SIZE is the storage size of the dataset in bytes.  If the dataset is 
compressed, the storage size is the size of the compressed data.
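
To pick out the big datasets from a file with hundreds of them, the SIZE lines 
can be collected with a small script. The following is only a rough sketch in 
Python, not an HDF5 tool: the names dataset_sizes and dump_header are made up 
for illustration, and the parser assumes the h5dump -p -H output has the shape 
shown above (a DATASET "name" line followed by a SIZE line inside 
STORAGE_LAYOUT). It reports only the dataset name as printed by h5dump, not the 
full group path.

```python
import re
import subprocess
import sys

# Patterns assume the `h5dump -p -H` layout shown above (an assumption).
DATASET_RE = re.compile(r'^\s*DATASET\s+"([^"]+)"\s*\{')
SIZE_RE = re.compile(r'^\s*SIZE\s+(\d+)')


def dataset_sizes(dump_text):
    """Return (dataset_name, storage_size_in_bytes) pairs, largest first."""
    sizes = []
    current = None  # name of the dataset whose SIZE line we are waiting for
    for line in dump_text.splitlines():
        match = DATASET_RE.match(line)
        if match:
            current = match.group(1)
            continue
        match = SIZE_RE.match(line)
        if match and current is not None:
            sizes.append((current, int(match.group(1))))
            current = None  # take only the first SIZE line per dataset
    return sorted(sizes, key=lambda pair: pair[1], reverse=True)


def dump_header(path):
    """Run `h5dump -p -H path` and return its text (h5dump must be on PATH)."""
    result = subprocess.run(
        ["h5dump", "-p", "-H", path],
        check=True, capture_output=True, text=True,
    )
    return result.stdout


if __name__ == "__main__" and len(sys.argv) > 1:
    for name, size in dataset_sizes(dump_header(sys.argv[1])):
        print(f"{size:>15d}  {name}")
```

Saved as, say, sizes.py (a hypothetical name), it would be run as 
"python sizes.py myfile.h5" and print one line per dataset, biggest first.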

Larry

-----Original Message-----
From: Hdf-forum [mailto:[email protected]] On Behalf Of 
Konrad Hinsen
Sent: Tuesday, November 05, 2013 8:31 AM
To: [email protected]
Subject: [Hdf-forum] Finding the size of the datasets in a file

Hi everyone,

I just spent some time looking for a command-line tool that shows the size 
occupied by each dataset in a file. I didn't find anything. The most promising 
candidates were h5stat, h5ls, and h5dump, but it seems that none of them can 
provide the information I am looking for.

Is there perhaps a third-party tool for that purpose?

I realize that "size" can be defined in lots of ways, but I don't really care 
about the details. I have lots of files that each contain hundreds of datasets, 
of which most are small but a few are very big.
I am looking for a simple way to identify the big ones. My ideal definition of 
size is "how much smaller would the file be if dataset X were not in there".

Konrad.
--
---------------------------------------------------------------------
Konrad Hinsen
Centre de Biophysique Moléculaire, CNRS Orléans Synchrotron Soleil - Division 
Expériences Saint Aubin - BP 48
91192 Gif sur Yvette Cedex, France
Tel. +33-1 69 35 97 15
E-Mail: research AT khinsen DOT fastmail DOT net 
http://dirac.cnrs-orleans.fr/~hinsen/
ORCID: http://orcid.org/0000-0003-0330-9428
Twitter: @khinsen
---------------------------------------------------------------------

_______________________________________________
Hdf-forum is for HDF software users discussion.
[email protected]
http://mail.lists.hdfgroup.org/mailman/listinfo/hdf-forum_lists.hdfgroup.org
