[pkg-discuss] Single and Multiple Origin Metadata Consistency

Shawn Walker Tue, 13 Oct 2009 20:47:41 -0700

Greetings,

As part of the catalog v1 work that johansen and I are doing, we'vediscovered that there are numerous cases where consistency issues withthe catalog data can cause problems when refreshing publisher metadata.

The currently known cases, along with our current solution (if oneexists) for them are listed below, grouped by origin scenario. All ofthese cases are for v1 catalog incremental update.

What has become clear in enumerating these cases is that we may needdifferent behaviour based on whether one or multiple origins are presentfor a publisher.

The purpose of this inquiry is to determine whether the proposed orcurrent behaviours noted below are acceptable, and whether we shouldemit additional warnings for certain error cases or alter the behaviour.

Please note that only exceptional error cases are detailed here, andthat the client doesn't actually support multiple origins yet, so thosecases do not have to be for the forthcoming catalog v1 changeset, but doneed to be addressed very soon.


Overview
==============================
In short, how do we deal with these single origin cases:
* (good) -> (older)
* (good) -> (rebuilt_newer)
* (good) -> (different)
* (good) -> (different_malicious)
* (good) -> (malicious)

...and how do we deal with these multiple origin cases:
* (good, good) -> (good, older)
* (good, good) -> (good, rebuilt_newer)
* (good, good) -> (good, different)
* (good, good) -> (good, different_malicious)
* (good, good) -> (malicious)

...and the reverse of all the cases above. Of course, there are furthercombinations possible, but they should just be variations on the above.

It is important to note that the catalog v1 incremental update mechanismdepends on the use of the timestamps in catalog retrieval which arebased upon publication occurring on one host, with the data thenreplicated to other origins. While all of the timestamp data is in UTC,there can still be unexpected variations between hosts.

This is important because timestamps are used to determine the order ofupdates. The host that is the publication source must ensure that thetime on each update is after the previous update. If multiple originshave updates from different sources with different timestamps, it'spossible to introduce inconsistency into the update process.

Another assumption is that multiple origins for the same publisher donot accept publication -- they must be read-only or a variation thereof.That is, all origins for a publisher are expected to contain the sameset of package data (barring synchronisation issues).



Single Origin
==============================
These cases assume that a publisher only has a single origin such as this:

publisher: example.com
origin: http://pkg.example.com/repository

Case 1
------------------------------
Scenario:

pkg.example.com's repository server has had catastrophic disk failure,and has restored an older version of the repository from backup.


Current Refresh Behaviour:

Since last_modified is older, but the creation date of the catalogmatches the last retrieved one, the client will abort the incrementalupdate and silently perform a full retrieval instead.


Case 2
------------------------------
Scenario:

pkg.example.com's repository server has had catastrophic disk failure,but did not have a backup to restore. The repository was rebuilt usinga copy of the package data, which means there is a completely newcatalog in place.


Current Refresh Behaviour:

The client will attempt an incremental update because the rebuiltcatalog is newer than the last one it retrieved. However, it willdetect that the creation date (and time) of the new catalog does notmatch that of the old catalog. So, it will abort the incremental updateand silently perform a full retrieval instead.


Case 3
------------------------------
Scenario:

pkg.example.com's package repository is completely rebuilt every night(similar to the ON nightly repository we have). This means a newcatalog is put into place each time.


Current Refresh Behaviour:

The client will attempt an incremental update. However, when it detectsthat the creation date and time of the new catalog do not match, it willabort and silently perform a full retrieval instead.


Case 4
------------------------------
Scenario:

User publishes copies of the packages in pkg.example.com's repository totheir own repository, and executes set-publisher -O http://localhostexample.com.


Current Refresh Behaviour:

The client will attempt an incremental update. However, when it detectsthat the creation date and time of the new catalog do not match, it willabort and silently perform a full retrieval instead.


Case 5
------------------------------
Scenario:

Malicious user has redirected the client's requests to pkg.example.com'srepository to their own evil source via <insert nefarious plan here>,which has been built from scratch, so has a different catalog thanpkg.example.com's repository.


Current Refresh Behaviour:

The client will attempt an incremental update. However, when it detectsthat the creation date and time of the new catalog do not match, it willabort and silently perform a full retrieval instead.


Case 6
------------------------------
Scenario:

Malicious user has redirected the client's requests to pkg.example.com'srepository to their own evil source via <insert nefarious plan here>.However, they used a copy of pkg.example.com's repository and then addedtheir new, modified versions of packages.


Current Refresh Behaviour:

The client will silently incrementally update, unaware that the sourceof the catalog data has changed.


Case 7
------------------------------
Scenario:

Malicious user had redirected the client's requests to pkg.example.com'srepository to their own evil source via <insert nefarious plan here>.However, they used a copy of pkg.example.com's repository and then addedtheir new, modified versions of packages. Client user discovers this,and fixes the problem, but client currently has copy of the malicioususer's repository data.


Outstanding Issues
------------------------------

The client is relying on creation date and time (which is accurate tothe micro-second level with six-digits of precision). Is this aconcern? Or is there a point where we say "good enough".

Even then, is there anyway to protect from the malicious user scenariosabove? It seems like signing the catalog is the only way to deal withthis. But that only helps the network repository case, and not theon-disk case where manifest signing is the only thing we can depend on.



Multiple Origins
==============================
These cases assume that a publisher only has multiple origins such as this:

publisher: example.com

origins: http://pkg.example.com/repository,http://pkg.example.net/repository


Case 1
------------------------------
The example.net repository is an older copy of the example.com.

Current Refresh Behaviour:

Since last_modified is older, but the creation date of the catalogmatches the last retrieved one, the client will abort the incrementalupdate and perform a full retrieval.


Case 2
------------------------------

The example.net repository is a copy of the example.com repository, butits catalog data is older.


Current Refresh Behaviour:

The client will attempt an incremental update. However, when it detectsthat the creation date and time of the new catalog do not match, it willabort and perform a full retrieval instead.


Case 3
------------------------------
Scenario:

pkg.example.com's package repository is completely rebuilt every night(similar to the ON nightly repository we have). This means a newcatalog is put into place each time. However, pkg.example.net is a copyof this repository that has to be synchronized, and so its contentsdon't always exactly match.


Current Refresh Behaviour:

The client when contacting pkg.example.net for an incremental update,will silently do nothing thinking that no updates are available.


Case 4
------------------------------
Scenario:

One of the origins for pkg.example.com's repository has been compromisedby a malicious user via <insert nefarious plan here>, which has beenbuilt from scratch, so has a different catalog than pkg.example.com'srepository.


Current Refresh Behaviour:

The client will attempt an incremental update. However, when it detectsthat the creation date and time of the new catalog do not match, it willabort and silently perform a full retrieval instead.


Case 5
------------------------------
Scenario:

One of the origins for pkg.example.com's repository has been compromisedby a malicious user via <insert nefarious plan here>. However, theyused a copy of pkg.example.com's repository and then added their new,modified versions of packages.


Current Refresh Behaviour:

The client will silently incrementally update, unaware that the sourceof the catalog data has changed.


Case 6
------------------------------
Scenario:

One of pkg.exmaple.com's origins was compromised, corrupted, orcontained older data for some period of time.


Outstanding Issues
-------------------------------

Silently performing a full retrieval for the multiple origin cases isn'tlikely the right answer here. Instead, trying another origin seems theright thing to do.

However, how does the client know which origin is authoritative?Specifically, it seems like the client would have to contact everyorigin and then pick the newest source with matching identityinformation (creation date) with the assumption that was theauthoritative one.


Cheers,
--
Shawn Walker
_______________________________________________
pkg-discuss mailing list
[email protected]
http://mail.opensolaris.org/mailman/listinfo/pkg-discuss

[pkg-discuss] Single and Multiple Origin Metadata Consistency

Reply via email to