Hi there,

I have a large HDF5file (~19 Gig size) with which I apparently having
some data retrieval problems.   It's hard to pinpoint exactly what the
problem is, but it seems to be with the value in a column which is
indexed.  When I run the script and explicitly search with the record
ID (qcid) 1035139, there are data retrieved and all is well.  As soon
as I run this for 300,000 such IDs, the following error occurs,
sometimes at different places.  If it was caused by a corrupt index or
something similar, I would have expected the error to be reproducible
specifically with that ID, but this is not the case.

I am currently using the open source version, but we have recently
applied for a site-license for the Pro version: I am mentioning this
in case this error has something to do with the improved indexing that
comes with the 'Pro' version.

Any help / ideas / tips would be greatly appreciated: The particular
part of code causing the problem is here, followed by the error;

 qdata = [x['qdat'] for x in table.where('(qcid == %r)' %key)]

Exception tables.exceptions.HDF5ExtError: HDF5ExtError('Problems
reading the array data.',) in
'tables.indexesExtension.IndexArray._g_readSortedSlice' ignored
Traceback (most recent call last):
  File "pytable_trial_READdata.py", line 404, in <module>
    runner.main()
  File "pytable_trial_READdata.py", line 142, in main
    clusterassessed = self.getOphs(clusterassessed,table)
  File "pytable_trial_READdata.py", line 233, in mapOrphans

   qdata = [x['qdat'] for x in table.where('(qcid == %r)' %key)]
  File "tableExtension.pyx", line 862, in tables.tableExtension.Row.__next__
  File "tableExtension.pyx", line 946, in
tables.tableExtension.Row.__next__indexed
  File "lrucacheExtension.pyx", line 332, in
tables.lrucacheExtension.ObjectCache.removeslot_
KeyError: ((('qcid', ('eq',), ('1035139',)),), 'e0', (0L, 140242666L, 1L))
Closing remaining open files: isphfile...HDF5-DIAG: Error detected in
HDF5 library version: 1.6.7-rc2 thread 0.  Back trace follows.
  #000: \hdf\hdf5-16\vnet\release-testing\std\src\H5D.c line 1224 in
H5Dclose(): not a dataset
    major(01): Function arguments
    minor(03): Inappropriate type
  #001: \hdf\hdf5-16\vnet\release-testing\std\src\H5I.c line 1258 in
H5I_find_id(): invalid ID
    major(07): Atom layer
    minor(32): Unable to find atom information (already closed?)
HDF5-DIAG: Error detected in HDF5 library version: 1.6.7-rc2 thread 0.
 Back trace follows.
  #000: \hdf\hdf5-16\vnet\release-testing\std\src\H5D.c line 1224 in
H5Dclose(): not a dataset
    major(01): Function arguments
    minor(03): Inappropriate type
  #001: \hdf\hdf5-16\vnet\release-testing\std\src\H5I.c line 1258 in
H5I_find_id(): invalid ID
    major(07): Atom layer
    minor(32): Unable to find atom information (already closed?)

thanks a lot
Toby

------------------------------------------------------------------------------
This SF.net email is sponsored by:
SourcForge Community
SourceForge wants to tell your story.
http://p.sf.net/sfu/sf-spreadtheword
_______________________________________________
Pytables-users mailing list
Pytables-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/pytables-users

Reply via email to