On Wed, Sep 22, 2010 at 11:06 AM, Rasko Leinonen <[email protected]> wrote:
> Hi,
>
> Each document is directly associated with one organism (e.g. mouse [mus
> musculus]).
>
> There are ~ 500,000 distinct organisms.
> There are ~ 2,000,000,000 documents.
>
> The organisms are organised into a k-ary tree, where k is ~ 1000. I.e.
> starting from the common root each node of the tree can have up to ~ 1000
> children.
>
> There are < 1,000,000 nodes in the tree.
> The path from root to node is typically ~ 30 nodes deep.

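For reference, the structure described in the quoted message can be sketched as a parent-pointer tree plus a document-to-organism map. This is only a minimal illustration in Python under my own assumptions: the class name `Taxonomy`, its methods, and the sample node names and document ids are all hypothetical and come from neither FastBit nor the original post.

```python
class Taxonomy:
    """Parent-pointer tree of organisms (hypothetical helper, not a FastBit API)."""

    def __init__(self):
        self.parent = {}    # node -> parent node (the root maps to None)
        self.children = {}  # node -> list of child nodes

    def add(self, node, parent=None):
        """Register a node under the given parent (None for the root)."""
        self.parent[node] = parent
        self.children.setdefault(node, [])
        if parent is not None:
            self.children.setdefault(parent, []).append(node)

    def path_to_root(self, node):
        """Return the nodes from `node` up to the root, inclusive."""
        path = []
        while node is not None:
            path.append(node)
            node = self.parent[node]
        return path

    def subtree(self, node):
        """Return `node` and all of its descendants (iterative depth-first walk)."""
        out, stack = [], [node]
        while stack:
            cur = stack.pop()
            out.append(cur)
            stack.extend(self.children.get(cur, []))
        return out


# Tiny made-up example; the real tree has < 1,000,000 nodes.
tax = Taxonomy()
tax.add("root")
tax.add("Mammalia", "root")
tax.add("Mus", "Mammalia")
tax.add("Mus musculus", "Mus")

# Each document is directly associated with exactly one organism.
doc_organism = {"doc1": "Mus musculus", "doc2": "Mus musculus", "doc3": "Mammalia"}

# Documents attached anywhere at or below a given node.
under_mus = set(tax.subtree("Mus"))
docs_under_mus = sorted(d for d, o in doc_organism.items() if o in under_mus)
```

With paths only ~ 30 nodes deep, walking from a node to the root is cheap; the fan-out of ~ 1000 children per node is what makes subtree collection the expensive direction.
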
Hi!

It's a bit off topic, but have you tried Neo4j for your graph?

-- 
Laurent "ker2x" Laborde
Sysadmin & DBA at http://www.over-blog.com/

_______________________________________________
FastBit-users mailing list
[email protected]
https://hpcrdm.lbl.gov/cgi-bin/mailman/listinfo/fastbit-users
