[Rpm-maint] RFC: file verification API

Panu Matilainen Thu, 21 Feb 2008 02:14:00 -0800

One of the first items on the rpm.org TODO list last May was an API for package verification; it might be time to actually do something about it...

There is already an API call to do file verification: rpmVerifyFile(). The problem with that is that you only get the bits of what differs, but what if you want to look at the actual values? E.g. "/bin/foo owner mismatch, should be root but is joedoe" type of reporting. Sure, it's just a stat() away, but rpmVerifyFile() already called it, so it's kinda silly to have to do it again... and for file checksumming it's not so simple, as rpm does prelink undo automatically (and in any case it's not exactly a cheap operation). So basically we'd like some way of storing the results of what was read from disk on verification.
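To make the "just a stat() away" point concrete, here is a rough sketch of the second look at the disk a caller needs today for value-level reporting. It uses only POSIX lstat() and getpwuid(); the helper names disk_owner() and check_owner() are made up for illustration and are not part of rpm.

```c
#define _POSIX_C_SOURCE 200809L
#include <assert.h>
#include <pwd.h>
#include <stdio.h>
#include <string.h>
#include <sys/stat.h>
#include <sys/types.h>

/* Hypothetical helper (not an rpm API): look up the on-disk owner of path.
 * Returns 0 and fills buf on success, -1 if lstat() fails. */
int disk_owner(const char *path, char *buf, size_t buflen)
{
    struct stat st;
    struct passwd *pw;

    assert(path != NULL && buf != NULL && buflen > 0);
    if (lstat(path, &st) != 0)
        return -1;

    pw = getpwuid(st.st_uid);
    if (pw != NULL)
        snprintf(buf, buflen, "%s", pw->pw_name);
    else /* uid has no passwd entry, fall back to the number */
        snprintf(buf, buflen, "%lu", (unsigned long) st.st_uid);
    return 0;
}

/* Hypothetical helper: returns 1 on owner mismatch (and prints a report),
 * 0 on match, -1 if the file could not be examined. */
int check_owner(const char *path, const char *expected,
                char *actual, size_t actuallen)
{
    if (disk_owner(path, actual, actuallen) != 0)
        return -1;
    if (strcmp(expected, actual) == 0)
        return 0;
    printf("%s owner mismatch, should be %s but is %s\n",
           path, expected, actual);
    return 1;
}
```

This repeats an lstat() that verification has already done, which is exactly the duplication described above; for checksums the equivalent second pass would be far more expensive.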

The idea I've been playing with is to use the rpmfi structure for that: rpmfi already has all the necessary methods for retrieving file ownership, timestamps etc, so we wouldn't have to invent and implement yet another "object" with methods for storing and retrieving the bits and usernames etc.

Yesterday I got around to experimenting a bit with it, to the point of a "proof-of-concept" implementation. By using rpmfi for storage of from-disk data, verification now looks like this:

---

rpmfi fi = rpmfiNew(ts, hdr, RPMTAG_BASENAMES, 1);
rpmfi diskfi = rpmfiNewFromDisk(ts, hdr, RPMTAG_BASENAMES, verifyflags);

while ((rpmfiNext(fi) >= 0) && (rpmfiNext(diskfi) >= 0)) {
    if (rpmfiFMtime(fi) != rpmfiFMtime(diskfi))
        verifyResult |= RPMVERIFY_MTIME;
    ... /* other checks */
}
rpmfiFree(fi);
rpmfiFree(diskfi);

---

Implementing custom verification procedures that print out actual value differences or whatever is pretty trivial this way, as is doing a rpmVerifyFile() type operation that just raises verify-failed bits (like the above does) if you don't care about the actual values.
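As a rough illustration of such a custom procedure, the sketch below compares a header-side record with a disk-side record, raises bits and optionally prints the actual values. To keep it standalone it does not use rpm at all: struct file_rec stands in for whatever the two rpmfi iterators would hand back, and the VFY_* bits are invented here rather than being rpm's RPMVERIFY_* values.

```c
#include <assert.h>
#include <stdio.h>
#include <string.h>
#include <time.h>

/* Hypothetical stand-in for the per-file values an rpmfi iterator yields. */
struct file_rec {
    const char *name;
    const char *user;
    time_t mtime;
};

/* Hypothetical flag bits; rpm's real RPMVERIFY_* values are not used here. */
enum {
    VFY_MTIME = 1 << 0,
    VFY_USER  = 1 << 1
};

/* Compare header values against disk values. Each difference is printed
 * to out (skipped if out is NULL); the failed-check bits are returned. */
int compare_recs(const struct file_rec *hdr, const struct file_rec *disk,
                 FILE *out)
{
    int result = 0;

    assert(hdr != NULL && disk != NULL);
    if (hdr->mtime != disk->mtime) {
        result |= VFY_MTIME;
        if (out)
            fprintf(out, "%s mtime mismatch, should be %ld but is %ld\n",
                    hdr->name, (long) hdr->mtime, (long) disk->mtime);
    }
    if (strcmp(hdr->user, disk->user) != 0) {
        result |= VFY_USER;
        if (out)
            fprintf(out, "%s owner mismatch, should be %s but is %s\n",
                    hdr->name, hdr->user, disk->user);
    }
    return result;
}
```

Passing NULL for out gives the bits-only behaviour, while passing stdout gives the "should be root but is joedoe" style of report from the same comparison.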

The not-so-nice thing with this approach is that there's no way to verify individual files: rpmfiNewFromDisk() works on a header at a time. Of course you can ignore the files you don't care about when iterating over them, but you'll pay the penalty of md5summing everything. How big a deal is that in reality... dunno, from the cli rpm has always only supported header-at-a-time verification. The other issue is that since the reading from disk and the comparison are detached, things like lstat failure reasons are lost (whereas currently you get "permission denied" and such), but that could probably be dealt with by just adding an extra entry to rpmfi to store errnos (which would be empty for a "normal" rpmfi).
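The errno idea could look roughly like the following. This is only a sketch of the principle using plain lstat(); struct disk_entry and read_disk_entries() are hypothetical names and say nothing about how the extra rpmfi entry would actually be laid out.

```c
#define _POSIX_C_SOURCE 200809L
#include <assert.h>
#include <errno.h>
#include <stddef.h>
#include <sys/stat.h>

/* Hypothetical per-file slot: the stat result plus the errno from reading it. */
struct disk_entry {
    struct stat st;
    int err;            /* 0 if lstat() succeeded, else the saved errno */
};

/* lstat() each path and record failures instead of dropping them.
 * Returns the number of entries that could not be read. */
size_t read_disk_entries(const char *const *paths, size_t n,
                         struct disk_entry *out)
{
    size_t failed = 0;

    assert(paths != NULL && out != NULL);
    for (size_t i = 0; i < n; i++) {
        if (lstat(paths[i], &out[i].st) == 0) {
            out[i].err = 0;
        } else {
            out[i].err = errno;     /* save before anything can clobber it */
            failed++;
        }
    }
    return failed;
}
```

At comparison time, strerror() on the saved value would give back the "permission denied" style of message even though the read happened earlier.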

Comments? Any gaping showstopper holes in the idea that I'm too blind to see? For the curious, the draft implementation of rpmfiNewFromDisk() is here: http://laiskiainen.org/rpm/patches/rpm-rpmfi-from-disk-1.patch

The alternative to hijacking rpmfi (which seems very natural for the purpose) would be implementing some other means of storing and accessing the verification results, a fair bit of mostly tedious work most likely.

Then there's of course the other parts of package verification: dependencies and verify-scripts, those would need some sort of API too I suppose... ideas welcome.


        - Panu -
_______________________________________________
Rpm-maint mailing list
Rpm-maint@lists.rpm.org
https://lists.rpm.org/mailman/listinfo/rpm-maint
