Extreme sparseness in couch files
---------------------------------
Key: COUCHDB-220
URL: https://issues.apache.org/jira/browse/COUCHDB-220
Project: CouchDB
Issue Type: Bug
Components: Database Core
Affects Versions: 0.9
Environment: ubuntu 8.10 64-bit, ext3
Reporter: Robert Newson
When adding ten thousand documents, each with a small attachment, the
discrepancy between reported file size and actual file size becomes huge;
ls -lh shard0.couch
698M 2009-01-23 13:42 shard0.couch
du -sh shard0.couch
57M shard0.couch
On filesystems that do not support write holes, this will cause an order of
magnitude more I/O.
I think it was introduced by the streaming attachment patch as each attachment
is followed by huge swathes of zeroes when viewed with 'hd -v'.
Compacting this database reduced it to 7.8mb, indicating other sparseness
besides attachments.
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.