Hi John, thank you for all the work that you put into the FastBit library, it allows us to achieve great results!
I've bumped into a little bug which might be very hard to reproduce or identify. I'm using two thread to reorder and index data that are already stored on disk. It was ok for a little while, but then it stuck in deadlock. Here are gdb traces from both threads, unfortunately without debugging symbols, so that the specific files and lines are unknown. We are currently using the SVN version 532. (gdb) bt #0 0x00007f8983463054 in __lll_lock_wait () from /lib64/libpthread.so.0 #1 0x00007f898345e388 in _L_lock_854 () from /lib64/libpthread.so.0 #2 0x00007f898345e257 in pthread_mutex_lock () from /lib64/libpthread.so.0 #3 0x00007f898271e074 in ibis::fileManager::storage::storage(unsigned long) () from /usr/lib64/libfastbit.so.0 #4 0x00007f898271eb16 in ibis::fileManager::storage::enlarge(unsigned long) () from /usr/lib64/libfastbit.so.0 #5 0x00007f898272214f in ibis::fileManager::roFile::doRead(char const*) () from /usr/lib64/libfastbit.so.0 #6 0x00007f8982723b4b in ibis::fileManager::getFile(char const*, ibis::fileManager::storage**, ibis::fileManager::ACCESS_PREFERENCE) () from /usr/lib64/libfastbit.so.0 #7 0x00007f898273406a in int ibis::fileManager::getFile<unsigned short>(char const*, ibis::array_t<unsigned short>&, ibis::fileManager::ACCESS_PREFERENCE) () from /usr/lib64/libfastbit.so.0 #8 0x00007f8981f9f4a5 in ibis::column::actualMinMax(char const*, ibis::bitvector const&, double&, double&) const () from /usr/lib64/libfastbit.so.0 #9 0x00007f8981fa3546 in ibis::column::computeMinMax() () from /usr/lib64/libfastbit.so.0 #10 0x00007f89827beae6 in ibis::part::gatherSortKeys(ibis::array_t<char const*>&) () from /usr/lib64/libfastbit.so.0 #11 0x00007f89827bfc56 in ibis::part::reorder() () from /usr/lib64/libfastbit.so.0 #12 0x00007f8982c7e2af in reorder_index(void*) () from /usr/share/ipfixcol/plugins/ipfixcol-fastbit-output.so #13 0x00007f898345c851 in start_thread () from /lib64/libpthread.so.0 #14 0x00007f89831aa6dd in next_line () from /lib64/libc.so.6 #15 0x0000000000000000 in ?? () (gdb) (gdb) bt #0 0x00007f8983463054 in __lll_lock_wait () from /lib64/libpthread.so.0 #1 0x00007f898345e388 in _L_lock_854 () from /lib64/libpthread.so.0 #2 0x00007f898345e257 in pthread_mutex_lock () from /lib64/libpthread.so.0 #3 0x00007f898175a6aa in ibis::util::mutexLock::mutexLock(pthread_mutex_t*, char const*) () from /usr/lib64/libfastbit.so.0 #4 0x00007f89827177d4 in ibis::fileManager::removeCleaner(ibis::fileManager::cleaner const*) () from /usr/lib64/libfastbit.so.0 #5 0x00007f8981735952 in ibis::part::~part() () from /usr/lib64/libfastbit.so.0 #6 0x00007f8981735c29 in ibis::part::~part() () from /usr/lib64/libfastbit.so.0 #7 0x00007f8982c7e2cd in reorder_index(void*) () from /usr/share/ipfixcol/plugins/ipfixcol-fastbit-output.so #8 0x00007f898345c851 in start_thread () from /lib64/libpthread.so.0 #9 0x00007f89831aa6dd in next_line () from /lib64/libc.so.6 #10 0x0000000000000000 in ?? () (gdb) Do you have any idea what might be going on? With regards, Petr Velan _______________________________________________ FastBit-users mailing list [email protected] https://hpcrdm.lbl.gov/cgi-bin/mailman/listinfo/fastbit-users
