so basically, each commit has a tree preceding it. the tree is composed of git objects that can be listed with `git ls-tree
the format i currently have ----- another possible issue with the current adapter thing is that um the tokenizer uses raw spaces. usually these models replace spaces with a special token. i dont' know much about it.
