Re: [PATCH] spi: qup: Fix incorrect block transfers

2014-09-24 Thread Mark Brown
On Tue, Sep 23, 2014 at 02:26:13PM -0500, Andy Gross wrote:
> On Tue, Sep 23, 2014 at 12:24:27PM +0300, Ivan T. Ivanov wrote:
> > On Sun, 2014-09-21 at 23:27 -0500, Andy Gross wrote:

> > > This patch fixes a number of errors with the QUP block transfer mode.  
> > > Errors
> > > manifested themselves as input underruns, output overruns, and timed out
> > > transactions.

> > At what speeds are you seeing those errors?

> We've tried 25MHz and 50MHz.  Both fail in the same way.  Keep in mind this is
> definitely a timing / race issue and it probably also dependent on the latency
> of the attached device.  I cannot reproduce this at all on my IPQ8064 based
> board, but others can.

With SPI everything is entirely clocked from the master - the attached
device can't cause any of those issues unless there's an electrical
problem that *really* upsets the master.


signature.asc
Description: Digital signature


Re: [PATCH] spi: qup: Fix incorrect block transfers

2014-09-24 Thread Mark Brown
On Tue, Sep 23, 2014 at 02:26:13PM -0500, Andy Gross wrote:
 On Tue, Sep 23, 2014 at 12:24:27PM +0300, Ivan T. Ivanov wrote:
  On Sun, 2014-09-21 at 23:27 -0500, Andy Gross wrote:

   This patch fixes a number of errors with the QUP block transfer mode.  
   Errors
   manifested themselves as input underruns, output overruns, and timed out
   transactions.

  At what speeds are you seeing those errors?

 We've tried 25MHz and 50MHz.  Both fail in the same way.  Keep in mind this is
 definitely a timing / race issue and it probably also dependent on the latency
 of the attached device.  I cannot reproduce this at all on my IPQ8064 based
 board, but others can.

With SPI everything is entirely clocked from the master - the attached
device can't cause any of those issues unless there's an electrical
problem that *really* upsets the master.


signature.asc
Description: Digital signature


Re: [PATCH] spi: qup: Fix incorrect block transfers

2014-09-23 Thread Andy Gross
On Tue, Sep 23, 2014 at 12:24:27PM +0300, Ivan T. Ivanov wrote:
> 
> Hi Andy,
> 
> On Sun, 2014-09-21 at 23:27 -0500, Andy Gross wrote:
> > This patch fixes a number of errors with the QUP block transfer mode.  
> > Errors
> > manifested themselves as input underruns, output overruns, and timed out
> > transactions.
> 
> At what speeds are you seeing those errors?

We've tried 25MHz and 50MHz.  Both fail in the same way.  Keep in mind this is
definitely a timing / race issue and it probably also dependent on the latency
of the attached device.  I cannot reproduce this at all on my IPQ8064 based
board, but others can.

This problem manifested itself while using spidev and a usermode flash
programming application (flashrom).

> 
> > 
> > The block mode does not require the priming that occurs in FIFO mode.  At 
> > the
> > moment that the QUP is placed into the RUN state, the QUP may immediately 
> > raise
> > an interrupt if the request is a write.  Therefore, there is no need to 
> > prime
> > the pump.
> > 
> > In addition, the block transfers require that whole blocks of data are
> > read/written at a time.  The last block of data that completes a 
> > transaction may
> > contain less than a full blocks worth of data.
> 
> Does this mean that block transfer will start only if the required
> bytes from block is written into buffer?

No, a better way of putting this is that immediately on setting RUN state,
you'll get a service interrupt to fill the FIFO in block mode.  So there is no
need to prime the FIFO from the non-isr context due to this behavior.



> 
> > +static void qup_fill_read_buffer(struct spi_qup *controller,
> > +   struct spi_transfer *xfer, u32 data)
> 
> Please, could prefix this whit spi_ to be consistent with the
> rest of the code.

Good point.  I need to be consistent.

> >  {
> > u8 *rx_buf = xfer->rx_buf;
> > -   u32 word, state;
> > -   int idx, shift, w_size;
> > -
> > -   w_size = controller->w_size;
> > -
> > -   while (controller->rx_bytes < xfer->len) {
> > -
> > -   state = readl_relaxed(controller->base + QUP_OPERATIONAL);
> > -   if (0 == (state & QUP_OP_IN_FIFO_NOT_EMPTY))
> > -   break;
> > +   int idx, shift;
> > +   int read_len = min_t(int, xfer->len - controller->rx_bytes,
> > +   controller->w_size);
> 
> You should not need this check here. xfer->len is multiple of 
> controller->w_size
> and you always read one word at time.

Ah I missed the __spi_validate where this is done.  I'll remove this.  Good
catch.



> > const u8 *tx_buf = xfer->tx_buf;
> > -   u32 word, state, data;
> > -   int idx, w_size;
> > +   u32 val;
> > +   int idx;
> > +   int write_len = min_t(int, xfer->len - controller->tx_bytes,
> > +   controller->w_size);
> >  
> 
> Same here.

Agreed.



> > -   word = 0;
> > -   for (idx = 0; idx < w_size; idx++, controller->tx_bytes++) {
> > +static void spi_qup_service_block(struct spi_qup *controller,
> > +   struct spi_transfer *xfer, bool is_read)
> > +{
> 
> Please, could you split this function to read and write, so we can use:
> 
> spi_qup_fifo_read() and spi_qup_fifo_write() in FIFO modes and 
> spi_qup_block_read() and spi_qup_block_write() for BLOCK mode.

Well I had it collapsed and the functions are identical except for the
read/write specific pieces, which amount to 2 lines.  I can resplit it out.  It
makes it symmetric.

> 
> > +   u32 data, words_per_blk, num_words, ack_flag, op_flag;
> > +   int i;
> > +
> > +   if (is_read) {
> > +   op_flag = QUP_OP_IN_BLOCK_READ_REQ;
> > +   ack_flag = QUP_OP_IN_SERVICE_FLAG;
> > +   num_words = DIV_ROUND_UP(xfer->len - controller->rx_bytes,
> > +   controller->w_size);
> 
> Same here and below.

Agreed.

> > +   words_per_blk = controller->in_blk_sz >> 2;
> > +   } else {
> > +   op_flag = QUP_OP_OUT_BLOCK_WRITE_REQ;
> > +   ack_flag = QUP_OP_OUT_SERVICE_FLAG;
> > +   num_words = DIV_ROUND_UP(xfer->len - controller->tx_bytes,
> > +   controller->w_size);
> > +   words_per_blk = controller->out_blk_sz >> 2;
> > +   }

-- 
sent by an employee of the Qualcomm Innovation Center, Inc.
The Qualcomm Innovation Center, Inc. is a member of the Code Aurora Forum,
hosted by The Linux Foundation
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majord...@vger.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/


Re: [PATCH] spi: qup: Fix incorrect block transfers

2014-09-23 Thread Ivan T. Ivanov

Hi Andy,

On Sun, 2014-09-21 at 23:27 -0500, Andy Gross wrote:
> This patch fixes a number of errors with the QUP block transfer mode.  Errors
> manifested themselves as input underruns, output overruns, and timed out
> transactions.

At what speeds are you seeing those errors?

> 
> The block mode does not require the priming that occurs in FIFO mode.  At the
> moment that the QUP is placed into the RUN state, the QUP may immediately 
> raise
> an interrupt if the request is a write.  Therefore, there is no need to prime
> the pump.
> 
> In addition, the block transfers require that whole blocks of data are
> read/written at a time.  The last block of data that completes a transaction 
> may
> contain less than a full blocks worth of data.

Does this mean that block transfer will start only if the required
bytes from block is written into buffer?

> 
> Each block of data results in an input/output service interrupt accompanied 
> with
> a input/output block flag set.  Additional block reads/writes require clearing
> of the service flag.  It is ok to check for additional blocks of data in the
> ISR, but you have to ack every block you transfer.  Imbalanced acks result in
> early return from complete transactions with pending interrupts that still 
> have
> to be ack'd.  The next transaction can be affected by these interrupts.
> 
> Signed-off-by: Andy Gross 
> ---
>  drivers/spi/spi-qup.c |  194 
> +++--
>  1 file changed, 141 insertions(+), 53 deletions(-)
> 



> +static void qup_fill_read_buffer(struct spi_qup *controller,
> + struct spi_transfer *xfer, u32 data)

Please, could prefix this whit spi_ to be consistent with the
rest of the code.

>  {
>   u8 *rx_buf = xfer->rx_buf;
> - u32 word, state;
> - int idx, shift, w_size;
> -
> - w_size = controller->w_size;
> -
> - while (controller->rx_bytes < xfer->len) {
> -
> - state = readl_relaxed(controller->base + QUP_OPERATIONAL);
> - if (0 == (state & QUP_OP_IN_FIFO_NOT_EMPTY))
> - break;
> + int idx, shift;
> + int read_len = min_t(int, xfer->len - controller->rx_bytes,
> + controller->w_size);

You should not need this check here. xfer->len is multiple of controller->w_size
and you always read one word at time.

>  
> - word = readl_relaxed(controller->base + QUP_INPUT_FIFO);
> -
> - if (!rx_buf) {
> - controller->rx_bytes += w_size;
> - continue;
> - }
> -
> - for (idx = 0; idx < w_size; idx++, controller->rx_bytes++) {
> + if (rx_buf)
> + for (idx = 0; idx < read_len; idx++) {
>   /*
>* The data format depends on bytes per SPI word:
>*  4 bytes: 0x12345678
> @@ -229,40 +218,129 @@ static void spi_qup_fifo_read(struct spi_qup 
> *controller,
>*  1 byte : 0x0012
>*/
>   shift = BITS_PER_BYTE;
> - shift *= (w_size - idx - 1);
> - rx_buf[controller->rx_bytes] = word >> shift;
> + shift *= (controller->w_size - idx - 1);
> + rx_buf[controller->rx_bytes + idx] = data >> shift;
>   }
> - }
> +
> + controller->rx_bytes += read_len;
>  }
>  
> -static void spi_qup_fifo_write(struct spi_qup *controller,
> - struct spi_transfer *xfer)
> +static void qup_prepare_write_data(struct spi_qup *controller,
> + struct spi_transfer *xfer, u32 *data)
>  {
>   const u8 *tx_buf = xfer->tx_buf;
> - u32 word, state, data;
> - int idx, w_size;
> + u32 val;
> + int idx;
> + int write_len = min_t(int, xfer->len - controller->tx_bytes,
> + controller->w_size);
>  

Same here.

> - w_size = controller->w_size;
> + *data = 0;
>  
> - while (controller->tx_bytes < xfer->len) {
> + if (tx_buf)
> + for (idx = 0; idx < write_len; idx++) {
> + val = tx_buf[controller->tx_bytes + idx];
> + *data |= val << (BITS_PER_BYTE * (3 - idx));
> + }
>  
> - state = readl_relaxed(controller->base + QUP_OPERATIONAL);
> - if (state & QUP_OP_OUT_FIFO_FULL)
> - break;
> + controller->tx_bytes += write_len;
> +}
>  
> - word = 0;
> - for (idx = 0; idx < w_size; idx++, controller->tx_bytes++) {
> +static void spi_qup_service_block(struct spi_qup *controller,
> + struct spi_transfer *xfer, bool is_read)
> +{

Please, could you split this function to read and write, so we can use:

spi_qup_fifo_read() and spi_qup_fifo_write() in FIFO modes and 
spi_qup_block_read() and spi_qup_block_write() for BLOCK mode.

> + u32 data, words_per_blk, num_words, ack_flag, op_flag;
> + int i;
> +
> 

Re: [PATCH] spi: qup: Fix incorrect block transfers

2014-09-23 Thread Ivan T. Ivanov

Hi Andy,

On Sun, 2014-09-21 at 23:27 -0500, Andy Gross wrote:
 This patch fixes a number of errors with the QUP block transfer mode.  Errors
 manifested themselves as input underruns, output overruns, and timed out
 transactions.

At what speeds are you seeing those errors?

 
 The block mode does not require the priming that occurs in FIFO mode.  At the
 moment that the QUP is placed into the RUN state, the QUP may immediately 
 raise
 an interrupt if the request is a write.  Therefore, there is no need to prime
 the pump.
 
 In addition, the block transfers require that whole blocks of data are
 read/written at a time.  The last block of data that completes a transaction 
 may
 contain less than a full blocks worth of data.

Does this mean that block transfer will start only if the required
bytes from block is written into buffer?

 
 Each block of data results in an input/output service interrupt accompanied 
 with
 a input/output block flag set.  Additional block reads/writes require clearing
 of the service flag.  It is ok to check for additional blocks of data in the
 ISR, but you have to ack every block you transfer.  Imbalanced acks result in
 early return from complete transactions with pending interrupts that still 
 have
 to be ack'd.  The next transaction can be affected by these interrupts.
 
 Signed-off-by: Andy Gross agr...@codeaurora.org
 ---
  drivers/spi/spi-qup.c |  194 
 +++--
  1 file changed, 141 insertions(+), 53 deletions(-)
 

snip

 +static void qup_fill_read_buffer(struct spi_qup *controller,
 + struct spi_transfer *xfer, u32 data)

Please, could prefix this whit spi_ to be consistent with the
rest of the code.

  {
   u8 *rx_buf = xfer-rx_buf;
 - u32 word, state;
 - int idx, shift, w_size;
 -
 - w_size = controller-w_size;
 -
 - while (controller-rx_bytes  xfer-len) {
 -
 - state = readl_relaxed(controller-base + QUP_OPERATIONAL);
 - if (0 == (state  QUP_OP_IN_FIFO_NOT_EMPTY))
 - break;
 + int idx, shift;
 + int read_len = min_t(int, xfer-len - controller-rx_bytes,
 + controller-w_size);

You should not need this check here. xfer-len is multiple of controller-w_size
and you always read one word at time.

  
 - word = readl_relaxed(controller-base + QUP_INPUT_FIFO);
 -
 - if (!rx_buf) {
 - controller-rx_bytes += w_size;
 - continue;
 - }
 -
 - for (idx = 0; idx  w_size; idx++, controller-rx_bytes++) {
 + if (rx_buf)
 + for (idx = 0; idx  read_len; idx++) {
   /*
* The data format depends on bytes per SPI word:
*  4 bytes: 0x12345678
 @@ -229,40 +218,129 @@ static void spi_qup_fifo_read(struct spi_qup 
 *controller,
*  1 byte : 0x0012
*/
   shift = BITS_PER_BYTE;
 - shift *= (w_size - idx - 1);
 - rx_buf[controller-rx_bytes] = word  shift;
 + shift *= (controller-w_size - idx - 1);
 + rx_buf[controller-rx_bytes + idx] = data  shift;
   }
 - }
 +
 + controller-rx_bytes += read_len;
  }
  
 -static void spi_qup_fifo_write(struct spi_qup *controller,
 - struct spi_transfer *xfer)
 +static void qup_prepare_write_data(struct spi_qup *controller,
 + struct spi_transfer *xfer, u32 *data)
  {
   const u8 *tx_buf = xfer-tx_buf;
 - u32 word, state, data;
 - int idx, w_size;
 + u32 val;
 + int idx;
 + int write_len = min_t(int, xfer-len - controller-tx_bytes,
 + controller-w_size);
  

Same here.

 - w_size = controller-w_size;
 + *data = 0;
  
 - while (controller-tx_bytes  xfer-len) {
 + if (tx_buf)
 + for (idx = 0; idx  write_len; idx++) {
 + val = tx_buf[controller-tx_bytes + idx];
 + *data |= val  (BITS_PER_BYTE * (3 - idx));
 + }
  
 - state = readl_relaxed(controller-base + QUP_OPERATIONAL);
 - if (state  QUP_OP_OUT_FIFO_FULL)
 - break;
 + controller-tx_bytes += write_len;
 +}
  
 - word = 0;
 - for (idx = 0; idx  w_size; idx++, controller-tx_bytes++) {
 +static void spi_qup_service_block(struct spi_qup *controller,
 + struct spi_transfer *xfer, bool is_read)
 +{

Please, could you split this function to read and write, so we can use:

spi_qup_fifo_read() and spi_qup_fifo_write() in FIFO modes and 
spi_qup_block_read() and spi_qup_block_write() for BLOCK mode.

 + u32 data, words_per_blk, num_words, ack_flag, op_flag;
 + int i;
 +
 + if (is_read) {
 + op_flag = QUP_OP_IN_BLOCK_READ_REQ;
 + ack_flag = QUP_OP_IN_SERVICE_FLAG;
 +   

Re: [PATCH] spi: qup: Fix incorrect block transfers

2014-09-23 Thread Andy Gross
On Tue, Sep 23, 2014 at 12:24:27PM +0300, Ivan T. Ivanov wrote:
 
 Hi Andy,
 
 On Sun, 2014-09-21 at 23:27 -0500, Andy Gross wrote:
  This patch fixes a number of errors with the QUP block transfer mode.  
  Errors
  manifested themselves as input underruns, output overruns, and timed out
  transactions.
 
 At what speeds are you seeing those errors?

We've tried 25MHz and 50MHz.  Both fail in the same way.  Keep in mind this is
definitely a timing / race issue and it probably also dependent on the latency
of the attached device.  I cannot reproduce this at all on my IPQ8064 based
board, but others can.

This problem manifested itself while using spidev and a usermode flash
programming application (flashrom).

 
  
  The block mode does not require the priming that occurs in FIFO mode.  At 
  the
  moment that the QUP is placed into the RUN state, the QUP may immediately 
  raise
  an interrupt if the request is a write.  Therefore, there is no need to 
  prime
  the pump.
  
  In addition, the block transfers require that whole blocks of data are
  read/written at a time.  The last block of data that completes a 
  transaction may
  contain less than a full blocks worth of data.
 
 Does this mean that block transfer will start only if the required
 bytes from block is written into buffer?

No, a better way of putting this is that immediately on setting RUN state,
you'll get a service interrupt to fill the FIFO in block mode.  So there is no
need to prime the FIFO from the non-isr context due to this behavior.

snip

 
  +static void qup_fill_read_buffer(struct spi_qup *controller,
  +   struct spi_transfer *xfer, u32 data)
 
 Please, could prefix this whit spi_ to be consistent with the
 rest of the code.

Good point.  I need to be consistent.

   {
  u8 *rx_buf = xfer-rx_buf;
  -   u32 word, state;
  -   int idx, shift, w_size;
  -
  -   w_size = controller-w_size;
  -
  -   while (controller-rx_bytes  xfer-len) {
  -
  -   state = readl_relaxed(controller-base + QUP_OPERATIONAL);
  -   if (0 == (state  QUP_OP_IN_FIFO_NOT_EMPTY))
  -   break;
  +   int idx, shift;
  +   int read_len = min_t(int, xfer-len - controller-rx_bytes,
  +   controller-w_size);
 
 You should not need this check here. xfer-len is multiple of 
 controller-w_size
 and you always read one word at time.

Ah I missed the __spi_validate where this is done.  I'll remove this.  Good
catch.

snip

  const u8 *tx_buf = xfer-tx_buf;
  -   u32 word, state, data;
  -   int idx, w_size;
  +   u32 val;
  +   int idx;
  +   int write_len = min_t(int, xfer-len - controller-tx_bytes,
  +   controller-w_size);
   
 
 Same here.

Agreed.

snip

  -   word = 0;
  -   for (idx = 0; idx  w_size; idx++, controller-tx_bytes++) {
  +static void spi_qup_service_block(struct spi_qup *controller,
  +   struct spi_transfer *xfer, bool is_read)
  +{
 
 Please, could you split this function to read and write, so we can use:
 
 spi_qup_fifo_read() and spi_qup_fifo_write() in FIFO modes and 
 spi_qup_block_read() and spi_qup_block_write() for BLOCK mode.

Well I had it collapsed and the functions are identical except for the
read/write specific pieces, which amount to 2 lines.  I can resplit it out.  It
makes it symmetric.

 
  +   u32 data, words_per_blk, num_words, ack_flag, op_flag;
  +   int i;
  +
  +   if (is_read) {
  +   op_flag = QUP_OP_IN_BLOCK_READ_REQ;
  +   ack_flag = QUP_OP_IN_SERVICE_FLAG;
  +   num_words = DIV_ROUND_UP(xfer-len - controller-rx_bytes,
  +   controller-w_size);
 
 Same here and below.

Agreed.

  +   words_per_blk = controller-in_blk_sz  2;
  +   } else {
  +   op_flag = QUP_OP_OUT_BLOCK_WRITE_REQ;
  +   ack_flag = QUP_OP_OUT_SERVICE_FLAG;
  +   num_words = DIV_ROUND_UP(xfer-len - controller-tx_bytes,
  +   controller-w_size);
  +   words_per_blk = controller-out_blk_sz  2;
  +   }

-- 
sent by an employee of the Qualcomm Innovation Center, Inc.
The Qualcomm Innovation Center, Inc. is a member of the Code Aurora Forum,
hosted by The Linux Foundation
--
To unsubscribe from this list: send the line unsubscribe linux-kernel in
the body of a message to majord...@vger.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/


[PATCH] spi: qup: Fix incorrect block transfers

2014-09-21 Thread Andy Gross
This patch fixes a number of errors with the QUP block transfer mode.  Errors
manifested themselves as input underruns, output overruns, and timed out
transactions.

The block mode does not require the priming that occurs in FIFO mode.  At the
moment that the QUP is placed into the RUN state, the QUP may immediately raise
an interrupt if the request is a write.  Therefore, there is no need to prime
the pump.

In addition, the block transfers require that whole blocks of data are
read/written at a time.  The last block of data that completes a transaction may
contain less than a full blocks worth of data.

Each block of data results in an input/output service interrupt accompanied with
a input/output block flag set.  Additional block reads/writes require clearing
of the service flag.  It is ok to check for additional blocks of data in the
ISR, but you have to ack every block you transfer.  Imbalanced acks result in
early return from complete transactions with pending interrupts that still have
to be ack'd.  The next transaction can be affected by these interrupts.

Signed-off-by: Andy Gross 
---
 drivers/spi/spi-qup.c |  194 +++--
 1 file changed, 141 insertions(+), 53 deletions(-)

diff --git a/drivers/spi/spi-qup.c b/drivers/spi/spi-qup.c
index 9f83d29..9c4c745 100644
--- a/drivers/spi/spi-qup.c
+++ b/drivers/spi/spi-qup.c
@@ -80,6 +80,8 @@
 #define QUP_IO_M_MODE_BAM  3
 
 /* QUP_OPERATIONAL fields */
+#define QUP_OP_IN_BLOCK_READ_REQ   BIT(13)
+#define QUP_OP_OUT_BLOCK_WRITE_REQ BIT(12)
 #define QUP_OP_MAX_INPUT_DONE_FLAG BIT(11)
 #define QUP_OP_MAX_OUTPUT_DONE_FLAGBIT(10)
 #define QUP_OP_IN_SERVICE_FLAG BIT(9)
@@ -143,6 +145,7 @@ struct spi_qup {
int tx_bytes;
int rx_bytes;
int qup_v1;
+   int mode;
 };
 
 
@@ -198,30 +201,16 @@ static int spi_qup_set_state(struct spi_qup *controller, 
u32 state)
return 0;
 }
 
-
-static void spi_qup_fifo_read(struct spi_qup *controller,
-   struct spi_transfer *xfer)
+static void qup_fill_read_buffer(struct spi_qup *controller,
+   struct spi_transfer *xfer, u32 data)
 {
u8 *rx_buf = xfer->rx_buf;
-   u32 word, state;
-   int idx, shift, w_size;
-
-   w_size = controller->w_size;
-
-   while (controller->rx_bytes < xfer->len) {
-
-   state = readl_relaxed(controller->base + QUP_OPERATIONAL);
-   if (0 == (state & QUP_OP_IN_FIFO_NOT_EMPTY))
-   break;
+   int idx, shift;
+   int read_len = min_t(int, xfer->len - controller->rx_bytes,
+   controller->w_size);
 
-   word = readl_relaxed(controller->base + QUP_INPUT_FIFO);
-
-   if (!rx_buf) {
-   controller->rx_bytes += w_size;
-   continue;
-   }
-
-   for (idx = 0; idx < w_size; idx++, controller->rx_bytes++) {
+   if (rx_buf)
+   for (idx = 0; idx < read_len; idx++) {
/*
 * The data format depends on bytes per SPI word:
 *  4 bytes: 0x12345678
@@ -229,40 +218,129 @@ static void spi_qup_fifo_read(struct spi_qup *controller,
 *  1 byte : 0x0012
 */
shift = BITS_PER_BYTE;
-   shift *= (w_size - idx - 1);
-   rx_buf[controller->rx_bytes] = word >> shift;
+   shift *= (controller->w_size - idx - 1);
+   rx_buf[controller->rx_bytes + idx] = data >> shift;
}
-   }
+
+   controller->rx_bytes += read_len;
 }
 
-static void spi_qup_fifo_write(struct spi_qup *controller,
-   struct spi_transfer *xfer)
+static void qup_prepare_write_data(struct spi_qup *controller,
+   struct spi_transfer *xfer, u32 *data)
 {
const u8 *tx_buf = xfer->tx_buf;
-   u32 word, state, data;
-   int idx, w_size;
+   u32 val;
+   int idx;
+   int write_len = min_t(int, xfer->len - controller->tx_bytes,
+   controller->w_size);
 
-   w_size = controller->w_size;
+   *data = 0;
 
-   while (controller->tx_bytes < xfer->len) {
+   if (tx_buf)
+   for (idx = 0; idx < write_len; idx++) {
+   val = tx_buf[controller->tx_bytes + idx];
+   *data |= val << (BITS_PER_BYTE * (3 - idx));
+   }
 
-   state = readl_relaxed(controller->base + QUP_OPERATIONAL);
-   if (state & QUP_OP_OUT_FIFO_FULL)
-   break;
+   controller->tx_bytes += write_len;
+}
 
-   word = 0;
-   for (idx = 0; idx < w_size; idx++, controller->tx_bytes++) {
+static void 

[PATCH] spi: qup: Fix incorrect block transfers

2014-09-21 Thread Andy Gross
This patch fixes a number of errors with the QUP block transfer mode.  Errors
manifested themselves as input underruns, output overruns, and timed out
transactions.

The block mode does not require the priming that occurs in FIFO mode.  At the
moment that the QUP is placed into the RUN state, the QUP may immediately raise
an interrupt if the request is a write.  Therefore, there is no need to prime
the pump.

In addition, the block transfers require that whole blocks of data are
read/written at a time.  The last block of data that completes a transaction may
contain less than a full blocks worth of data.

Each block of data results in an input/output service interrupt accompanied with
a input/output block flag set.  Additional block reads/writes require clearing
of the service flag.  It is ok to check for additional blocks of data in the
ISR, but you have to ack every block you transfer.  Imbalanced acks result in
early return from complete transactions with pending interrupts that still have
to be ack'd.  The next transaction can be affected by these interrupts.

Signed-off-by: Andy Gross agr...@codeaurora.org
---
 drivers/spi/spi-qup.c |  194 +++--
 1 file changed, 141 insertions(+), 53 deletions(-)

diff --git a/drivers/spi/spi-qup.c b/drivers/spi/spi-qup.c
index 9f83d29..9c4c745 100644
--- a/drivers/spi/spi-qup.c
+++ b/drivers/spi/spi-qup.c
@@ -80,6 +80,8 @@
 #define QUP_IO_M_MODE_BAM  3
 
 /* QUP_OPERATIONAL fields */
+#define QUP_OP_IN_BLOCK_READ_REQ   BIT(13)
+#define QUP_OP_OUT_BLOCK_WRITE_REQ BIT(12)
 #define QUP_OP_MAX_INPUT_DONE_FLAG BIT(11)
 #define QUP_OP_MAX_OUTPUT_DONE_FLAGBIT(10)
 #define QUP_OP_IN_SERVICE_FLAG BIT(9)
@@ -143,6 +145,7 @@ struct spi_qup {
int tx_bytes;
int rx_bytes;
int qup_v1;
+   int mode;
 };
 
 
@@ -198,30 +201,16 @@ static int spi_qup_set_state(struct spi_qup *controller, 
u32 state)
return 0;
 }
 
-
-static void spi_qup_fifo_read(struct spi_qup *controller,
-   struct spi_transfer *xfer)
+static void qup_fill_read_buffer(struct spi_qup *controller,
+   struct spi_transfer *xfer, u32 data)
 {
u8 *rx_buf = xfer-rx_buf;
-   u32 word, state;
-   int idx, shift, w_size;
-
-   w_size = controller-w_size;
-
-   while (controller-rx_bytes  xfer-len) {
-
-   state = readl_relaxed(controller-base + QUP_OPERATIONAL);
-   if (0 == (state  QUP_OP_IN_FIFO_NOT_EMPTY))
-   break;
+   int idx, shift;
+   int read_len = min_t(int, xfer-len - controller-rx_bytes,
+   controller-w_size);
 
-   word = readl_relaxed(controller-base + QUP_INPUT_FIFO);
-
-   if (!rx_buf) {
-   controller-rx_bytes += w_size;
-   continue;
-   }
-
-   for (idx = 0; idx  w_size; idx++, controller-rx_bytes++) {
+   if (rx_buf)
+   for (idx = 0; idx  read_len; idx++) {
/*
 * The data format depends on bytes per SPI word:
 *  4 bytes: 0x12345678
@@ -229,40 +218,129 @@ static void spi_qup_fifo_read(struct spi_qup *controller,
 *  1 byte : 0x0012
 */
shift = BITS_PER_BYTE;
-   shift *= (w_size - idx - 1);
-   rx_buf[controller-rx_bytes] = word  shift;
+   shift *= (controller-w_size - idx - 1);
+   rx_buf[controller-rx_bytes + idx] = data  shift;
}
-   }
+
+   controller-rx_bytes += read_len;
 }
 
-static void spi_qup_fifo_write(struct spi_qup *controller,
-   struct spi_transfer *xfer)
+static void qup_prepare_write_data(struct spi_qup *controller,
+   struct spi_transfer *xfer, u32 *data)
 {
const u8 *tx_buf = xfer-tx_buf;
-   u32 word, state, data;
-   int idx, w_size;
+   u32 val;
+   int idx;
+   int write_len = min_t(int, xfer-len - controller-tx_bytes,
+   controller-w_size);
 
-   w_size = controller-w_size;
+   *data = 0;
 
-   while (controller-tx_bytes  xfer-len) {
+   if (tx_buf)
+   for (idx = 0; idx  write_len; idx++) {
+   val = tx_buf[controller-tx_bytes + idx];
+   *data |= val  (BITS_PER_BYTE * (3 - idx));
+   }
 
-   state = readl_relaxed(controller-base + QUP_OPERATIONAL);
-   if (state  QUP_OP_OUT_FIFO_FULL)
-   break;
+   controller-tx_bytes += write_len;
+}
 
-   word = 0;
-   for (idx = 0; idx  w_size; idx++, controller-tx_bytes++) {
+static void spi_qup_service_block(struct spi_qup