Thomas Mueller created OAK-2466:
-----------------------------------
Summary: DataStoreBlobStore: chunk ids should not contain the size
Key: OAK-2466
URL: https://issues.apache.org/jira/browse/OAK-2466
Project: Jackrabbit Oak
Issue Type: Improvement
Reporter: Thomas Mueller
Assignee: Thomas Mueller
The blob store garbage collection (data store garbage collection) uses the
chunk ids to identify binaries to be deleted. The blob ids contain the size now
(<contentHash>#<size>), and the blob id is currently equal to the chunk id.
It would be more efficient to _not_ use the size, and instead just use the
content hash, for the chunk ids. That way, enumerating the entries that are in
the store is potentially faster. Also, it allows us to change the blob id in
the future, for example add more information to it (for example the creation
time, or the first few bytes of the content) if we ever want to.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)