[Bitcoin-development] Full Clients in the future - Blockchain management

Alan Reiner Sat, 02 Jun 2012 08:42:07 -0700

Devs,

I have decided to upgrade Armory's blockchain utilities, partly out ofnecessity due to a poor code decision I made before I even decided I wasmaking a client. In an effort to avoid such mistakes again, I want todo it "right" this time around, and realize that this is a gooddiscussion for all the devs that will have to deal with this eventually...

The part I'm having difficulty with, is the idea that in a few yearsfrom now, it just may not be feasible to hold transactionsfile-/pointers/ in RAM, because even that would overwhelm standard RAMsizes. Without any degree of blockchain compression, I see that themost general, scalable solution is probably a complicated one.

On the other hand, where this fails may be where we have alreadypredicted that the network will have to split into "super-nodes" and"lite nodes." In which case, this discussion is still a good one, butjust directed more towards the super-nodes. But, there may still be apoint at which super-nodes don't have enough RAM to hold this data...

(1) As for how small you can get the data: my original idea was thatthe entire blockchain is stored on disk as blkXXXX.dat files. I storeall transactions as 10-byte "file-references." 10 bytes would be


    -- X in blkX.dat (2 bytes)
    -- Tx start byte (4 bytes)
    -- Tx size bytes (4 bytes)

The file-refs would be stored in a multimap indexed by the first 6 bytesof the tx-hash. In this way, when I search the multimap, I potentiallyget a list of file-refs, and I might have to retrieve a couple of txfrom disk before finding the right one, but it would be a good trade-offcompared to storing all 32 bytes (that's assuming that multimap nodesdon't have too much overhead).

But even with this, if there are 1,000,000,000 transactions in theblockchain, each node is probably 48 bytes (16 bytes + map/containeroverhead), then you're talking about 48 GB to track all the data inRAM. mmap() may help here, but I'm not sure it's the right solution

(2) What other ways are there, besides some kind of blockchaincompression, to maintain a multi-terabyte blockchain, assuming thatstoring references to each tx would overwhelm available RAM? Maybethat assumption isn't necessary, but I think it prepares for the worst.

Or maybe I'm too narrow in my focus. How do other people envision thiswill be handled in the future. I've heard so many vague notions of"well we could do /this/ or /that/, or it wouldn't be hard to do /that/"but I haven't heard any serious proposals for it. And while I believethat blockchain compression will become ubiquitous in the future, noteveryone believes that, and there will undoubtedly be users/devs that/want/ to maintain everything under all circumstances.


-Alan

------------------------------------------------------------------------------
Live Security Virtual Conference
Exclusive live event will cover all the ways today's security and 
threat landscape has changed and how IT managers can respond. Discussions 
will include endpoint security, mobile security and the latest in malware 
threats. http://www.accelacomm.com/jaw/sfrnl04242012/114/50122263/

_______________________________________________
Bitcoin-development mailing list
Bitcoin-development@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/bitcoin-development

[Bitcoin-development] Full Clients in the future - Blockchain management

Reply via email to