Re: [beagleboard] PRU - Can't read data up to 2.5 MHz

Gerhard Hoffmann Wed, 28 Nov 2018 05:43:19 -0800


Am 28.11.18 um 12:08 schrieb [email protected]:
...


|state[0] = ((__R31&sclk) == sclk) ?  true :  false;|
|state[0] = (__R31&sclk) == sclk;
|
|
|
|should do the same thing, but I would expect the compiler to optimize|
|that away. Unrolling the loops and inlining should help, also.
|

This is how I do the read:

Remember that I read now the SPI data into a CPLD and fetch
them bytewise.

I switch the 2 byte address lines to the CPLD and then have to wait
7 ns for propagation through the CPLD and some time more until
the ringing at the P8/P9 connector has calmed down. So I must
wait, say 4 Instructions à 5 nsec before I really get the data.
That is done with some volatile reads. I had the impression that
the number of instructions and the delay did not always scale 1:1,
so it took some pruning with the oscilloscope until I was satisfied.

The canonical solution for your problem is probably to use the
hardware SPI interface with the PRU, which should work to 48 MBit/s.
I could not make that work, and in the end I wanted 100 MBit/s anyway.

cheers,
Gerhard

------------------------------------------------------


// data avail is either (not busy) or (not drl). It is high active.
// The CPLD takes a little more than 32 Clocks at 100 MHz
// to get the 32 bits. Then we can read them out, bytewise, and
// we select the byte using 2 port bits as address.
// It is probably harmless if that extends slightly into the next
// conversion since the read activity is decoupled from the ADC core
// Reading the CPLD does not toggle ADC pins.
//
// inline saves 20 nsec of procedure overhead.

inline void wait_data_avail(void){

while ( __R31 & (1 << DAT_AVAIL)) {}; // wait for the hightime of p9.26 = data_avail

    while  (!(__R31 & (1 << DAT_AVAIL))){};    // wait for the low time

// Now we are at the start of the high time. The ADC transactionwindow opens.

    // next 320 ns we will read the data into the CPLD or program the ADC
}


// read 4 bytes from the CPLD, mask them, shift them & convert to one int.

// I must read at least 3 times that the results are right ( for addresssetup time)// removing a single read makes it 60 nsec faster, 15 nsec per read.Should be 5 nsec???

// reading 3 times takes 40 nsec per bit. That should be enough.

// reading 4 times takes 60 nsec per bit. Reading __R31 takes abt. 20ns. :-(// Von der steigenden Flanke von data_available am P9 bis zum returndauert 725 nsec.

// kill 320 nsec, the time the CPLD needs to fill the shift register
// Once through the empty loop costs 5 nsec.
// for( retval=60; retval;  retval--){};

// In the mean time I have changed the CPLD so that it tells when Iimmediately// can fetch the data, so I gain 350 nsec that were spent with busywaiting previously.

// Now I should be able to process 3 channels.
// Using the scope is essential to see where time is lost.

inline int read_adc(void){

    int retval;

// Without volatile this runs 3 times as fast, even though __R31 isvolatile

    // The compiler seems to assume incorrectly that reading __R31 has no
    // side effects. But it has. It spends time and data might change.
    //
    // maybe we could do the merging of the result in the setup time
    // but when the compiler re-arranges instructions that might fail.

    volatile unsigned int byte0, byte1, byte2, byte3;

    wait_data_avail();

    // from here to parking the address at return it takes 350 nsec.

    __R30   &= ~(3 << QSEL);    // address 0
    byte0    = __R31;            // address setup time for byte 0
    byte0    = __R31;
//    byte0    = __R31;
    byte0    = __R31;

    __R30   |= (1 << QSEL);        // address 1
    byte1    = __R31;
    byte1    = __R31;
//    byte1    = __R31;
    byte1    = __R31;

    __R30   &= ~(3 << QSEL);    // address 2,   remove old bit field
    __R30   |=  (2 << QSEL);    // insert new bit field
    byte2   = __R31;
    byte2   = __R31;
//    byte2   = __R31;
    byte2   = __R31;

    __R30   |= (1<< QSEL);        // increment to address 3
    byte3   = __R31;            //
    byte3   = __R31;
//    byte3   = __R31;
    byte3   = __R31;            // get the last byte

    retval  = ((byte0 & 0xff)      )
            | ((byte1 & 0xff) << 8 )
            | ((byte2 & 0xff) << 16)
            | ((byte3 & 0xff) << 24);

    __R30   &= ~(3 << QSEL);    // park address at 0, may be removed.

// but makes it easy to spot the actionon the scope.

    return  retval;
}

------------------------------------------------------

I have to read data from an SPI master device, which sends the clockat 10 MHz. Since the SPI kernel driver only allows to the beagle boneto working as SPI Master I had to implement this functionality using aPRU.
From what I've read throughout the internet the PRU processing rate is200 MHz, so I thought I could easily read data at 10 MHz. Oddly, ithappens that with transmission rates up to 2.5 MHz I am being unableto catch all the rising edges on the clock pin.


--
For more options, visit http://beagleboard.org/discuss

---You received this message because you are subscribed to the Google Groups "BeagleBoard" group.

To unsubscribe from this group and stop receiving emails from it, send an email 
to [email protected].
To view this discussion on the web visit 
https://groups.google.com/d/msgid/beagleboard/1f01cc2f-444c-a423-0a47-5acc4d45855f%40hoffmann-hochfrequenz.de.
For more options, visit https://groups.google.com/d/optout.

Re: [beagleboard] PRU - Can't read data up to 2.5 MHz

Reply via email to