Hi,

I am trying to use the remapped counter feature
on Intel Core. It does not work as expected.

Well, in fact, it segfaults immediately, but that's because
you have not enabled PCE in cr4. But assume you have
for a moment. Them there are still several problems.

1/ hdr->index does not work with fixed counters

    On Intel, RDPMC supports fixed counters, however, you
    have to add an offset 0x40000000 to the index (see Intel
    Vol 2b RDPMC description).

    From what I can see in:

    static int perf_counter_index(struct perf_counter *counter)
    {
        if (counter->state != PERF_COUNTER_STATE_ACTIVE)
                return 0;

        return counter->hw.idx + 1 - PERF_COUNTER_INDEX_OFFSET;
    }

    It  cannot be made to work for both generic and fixed counters.
    I think this function has to go in arch-specific code or become
    more complicated.


2/ rdpmc() + offset returns bogus counts

    Suppose you are using a generic counter, RDPMC returns the live
    40-bit value of the counter. Then, my understanding is that you are
    supposed to addr hdr->offset to that value to reconstruct the full
    64-bit value.

    It appears this is not quite working. Here is small example counting
    PERF_COUNT_HW_BRANCH_INSTRUCTIONS.

   RDPMC RAW hdr->offset
                34a1 offset=0x0
                3bac offset=0x0
                40e4 offset=0x0
                461c offset=0x0
                4b54 offset=0x0
                508c offset=0x0
                55c4 offset=0x0
                5afc offset=0x0
                6034 offset=0x0
                656c offset=0x0
                6aa4 offset=0x0
                6fdc offset=0x0
                7514 offset=0x0
                7a4c offset=0x0
                7f84 offset=0x0
                84bc offset=0x0
                89f4 offset=0x0
                8f2c offset=0x0
                9464 offset=0x0
                999c offset=0x0
                9ed4 offset=0x0
                ac20 offset=0x0
          ff8000bbe9 offset=0x7fffffff
          ff8000bf15 offset=0x7fffffff
          ff8000c241 offset=0x7fffffff
          ff8000c56d offset=0x7fffffff
          ff8000c899 offset=0x7fffffff
          ff8000cbc5 offset=0x7fffffff
          ff8000cef1 offset=0x7fffffff
          ff8000d21d offset=0x7fffffff
          ff8000d549 offset=0x7fffffff

It appears at some point, the live value of the counter is changed and suddenly
you set bit 31 (bit32-39 are sign extension). Don't think it is because of PMU
interrupt, I suspect something wrong during context switching.

The core of my loop is as follows:

        for(;;) {
                for (i=0; i < num; i++) {
                        uint64_t offset = 0;
                        uint64_t values[3];
                        do {
                                seq = hdr->lock;
                                barrier();
                                if (hdr->index > 0) {
                                        values[0] = rdpmc(hdr->index-1);
                                        offset = hdr->offset;
                                        //values[0] += offset;
                                        values[1] = hdr->time_enabled;
                                        values[2] = hdr->time_running;
                                } else {
                                        ret = read(fd, values, sizeof(values));
                                        break;
                                }
                                barrier();
                        } while (hdr->lock != seq);

                        if (values[2])
                                values[0]=
(uint64_t)((double)values[0] * values[1]/values[2]);

                        printf("%20"PRIx64" offset=0x%"PRIx64"\n",
                                values[0],
                                offset);
                }
                fib(10);
        }

------------------------------------------------------------------------------
Let Crystal Reports handle the reporting - Free Crystal Reports 2008 30-Day 
trial. Simplify your report design, integration and deployment - and focus on 
what you do best, core application coding. Discover what's new with 
Crystal Reports now.  http://p.sf.net/sfu/bobj-july
_______________________________________________
perfmon2-devel mailing list
perfmon2-devel@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/perfmon2-devel

Reply via email to