Thomas Mueller created OAK-2466:
-----------------------------------

             Summary: DataStoreBlobStore: chunk ids should not contain the size
                 Key: OAK-2466
                 URL: https://issues.apache.org/jira/browse/OAK-2466
             Project: Jackrabbit Oak
          Issue Type: Improvement
            Reporter: Thomas Mueller
            Assignee: Thomas Mueller


The blob store garbage collection (data store garbage collection) uses the 
chunk ids to identify binaries to be deleted. The blob ids contain the size now 
(<contentHash>#<size>), and the blob id is currently equal to the chunk id.

It would be more efficient to _not_ use the size, and instead just use the 
content hash, for the chunk ids. That way, enumerating the entries that are in 
the store is potentially faster. Also, it allows us to change the blob id in 
the future, for example add more information to it (for example the creation 
time, or the first few bytes of the content) if we ever want to.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to