On Sun, 13 Jan 2019 at 12:47, Nico Kadel-Garcia <nka...@gmail.com> wrote:

> On Sun, Jan 13, 2019 at 11:54 AM Stephen John Smoogen <smo...@gmail.com>
> wrote:
> >
> >
> >
> > On Sat, 12 Jan 2019 at 22:25, Nico Kadel-Garcia <nka...@gmail.com>
> wrote:
> >>
> >> On Fri, Jan 11, 2019 at 4:37 PM Roberto Ragusa <m...@robertoragusa.it>
> wrote:
> >> >
> >> > On 1/8/19 4:22 PM, Lennart Poettering wrote:
> >> >
> >> > > If all you want to do is count, then it should be entirely
> sufficient
> >> > > to do it like this:
> >> > >
> >> > >    GET
> /metalink?repo=fedora-28&arch=x86_64&edition=<blah>&countme=1 HTTP/1.1
> >> > >
> >> > > the first time within each one-week window and a simple
> >> > >
> >> > >    GET /metalink?repo=fedora-28&arch=x86_64&edition=<blah> HTTP/1.1
> >> > >
> >> > > all other times.
> >> >
> >> > As an additional improvement, is it really needed to count every
> machine?
> >> > We can subsample a lot, and only let some specific machines to show
> >> > up for counting.
> >>
> >> The difficulty is not the counting. Requiring safe counting and
> >> aggregation by the server is a requirement that no server or
> >> intermediate server or proxy needs to follow, and would require
> >> configuration or filtering control of a server that is outside of
> >> client hands. It's not legally or technologically mandated. The great
> >> use fo r the data is tracking hosts, metadata that is saleable and
> >> likely to help provide a new form of tracking information.
> >>
> >> Writing this into the dnf behavior is typical, but i't's not
> >> beneficial to the clients. It's beneficial to the mirrors, who are
> >> likely to sell the data. While it may be that infamous problem, a
> >> "Simple Matter Of Programming(tm)" to sanitize the data, there are
> >> strong motivations to collect it and sell it, and I'd expect various
> >> mirrors to start doing so within moments of the activation of the
> >> feature.
> >
> >
> > 1. The mirrors do not see this.
>
> If it's not available to the mirrors, then anyone who hardcodes a
> mirror's URL into the local "baseurl" settings is not going to be
> counted this way, and we're back at the "we don't know how many
> clients there are" problem. If only the "mirrorlist" hosts see the
> UUID, "countme" or any other identical client ID.
>
>
Since you seem to have avoided reading the emails where this was detailed,
here is the simplest version of the countme proposal. [Please see Lennarts
email and replies for the non shortened version.]

Once a time period (day, week, month), an update would just add a countme=1
to it.

There is no more client id. There is no data other than that. We would just
count all the countme=1 and get an idea of what was going on. It isn't an
exact number but it puts some amount of solid-ness in the fuzzy cloud. The
more complicated version which mattdm is wanting is that countme gets
incremented by the week after install. Nothing else. No data from the
/etc/machine-id, no data from /var/yum/uuid etc.




> > 2. We aren't talking about UUIDs anymore and just a countme variable
> being sent periodically. If a countme is going to be too much data to send,
> then clients are probably already sending way too much data already.
>
> Then can we change the title of the thread?
>
>
Nico, you know this better than me. This is email not a forum. People can
rename threads but depending on the email software it will just look like a
completely different thread. I think a rename has been done, but people
keep responding on this thread.


>

-- 
Stephen J Smoogen.
_______________________________________________
devel mailing list -- devel@lists.fedoraproject.org
To unsubscribe send an email to devel-le...@lists.fedoraproject.org
Fedora Code of Conduct: https://getfedora.org/code-of-conduct.html
List Guidelines: https://fedoraproject.org/wiki/Mailing_list_guidelines
List Archives: 
https://lists.fedoraproject.org/archives/list/devel@lists.fedoraproject.org

Reply via email to