Hello,

I have a question about what HDF5 files contain besides the data itself.
Specifically, I have the following example:

>>> import tables
>>> import numpy
>>> # Define simple table
>>> table1 = numpy.array([[(0,0,0), (1,0,0)], [(0,1,0), (0,0,1)]], {'names':
... ('r','g','b'), 'formats': ('f4', 'f4', 'f4')})
>>> # Write table to a file
>>> h5file = tables.openFile("/tmp/file1.h5", "w")
>>> h5file.createTable("/", "table1", description = table1)
>>> h5file.close()
>>> # Write same table to another file
>>> h5file = tables.openFile("/tmp/file2.h5", "w")
>>> h5file.createTable("/", "table1", description = table1)
>>> h5file.close()

When I diff the two files, /tmp/file1.h5 and /tmp/file2.h5, they differ:

$ diff /tmp/file1.h5 /tmp/file2.h5
Binary files /tmp/file1.h5 and /tmp/file2.h5 differ

Presumably the file contains some metadata (perhaps a timestamp?) that
causes the difference. Is there a way to prevent this, or some
configuration parameter to control it? I want to be able to use a hash
of the file to detect a change in the data, but this will not work if
the file changes independently of the data.
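
One possible workaround is to hash the table contents rather than the
file bytes. The following is a minimal sketch, not a PyTables feature:
data_digest is a hypothetical helper name, and it assumes the rows are
available as a NumPy structured array (for example, as returned by
h5file.root.table1.read()). Here the array is built directly so the
example runs with NumPy alone.

```python
import hashlib

import numpy


def data_digest(array):
    """Return a SHA-256 hex digest of an array's dtype, shape, and contents.

    Hypothetical helper: only the data is hashed, so file-level metadata
    cannot affect the result.
    """
    array = numpy.ascontiguousarray(array)
    digest = hashlib.sha256()
    # Include the field layout and shape so schema changes are detected too.
    digest.update(str(array.dtype.descr).encode("utf-8"))
    digest.update(str(array.shape).encode("utf-8"))
    # Raw bytes of the records themselves.
    digest.update(array.tobytes())
    return digest.hexdigest()


# Same field layout as in the example above, flattened to one dimension.
dtype = {'names': ('r', 'g', 'b'), 'formats': ('f4', 'f4', 'f4')}
table1 = numpy.array([(0, 0, 0), (1, 0, 0), (0, 1, 0), (0, 0, 1)], dtype)

same = table1.copy()       # identical data, different object
changed = table1.copy()
changed['r'][0] = 0.5      # modify a single value

print(data_digest(table1) == data_digest(same))     # digests of equal data
print(data_digest(table1) == data_digest(changed))  # digests after a change
```

With this approach, two files holding the same rows would give the same
digest regardless of how the containers differ byte for byte, at the
cost of reading the table into memory to compute it.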

Matt
