Self-review

On 04/09/2015 09:38 PM, Peter Hurley wrote:
> A read() from a pty master may mistakenly indicate EOF (errno == -EIO)
> after the pty slave has closed, even though input data remains to be read.
> For example,
> 
>        pty slave       |        input worker        |    pty master
>                        |                            |
>                        |                            |   n_tty_read()
> pty_write()            |                            |     input avail? no
>   add data             |                            |     sleep
>   schedule worker  --->|                            |     .
>                        |---> flush_to_ldisc()       |     .
> pty_close()            |       fill read buffer     |     .
>   wait for worker      |       wakeup reader    --->|     .
>                        |       read buffer full?    |---> input avail ? yes
>                        |<---   yes - exit worker    |     copy 4096 bytes to 
> user
>   TTY_OTHER_CLOSED <---|                            |<--- kick worker
>                        |                            |
> 
>                               **** New read() before worker starts ****
> 
>                        |                            |   n_tty_read()
>                        |                            |     input avail? no
>                        |                            |     TTY_OTHER_CLOSED? 
> yes
>                        |                            |     return -EIO
> 
> Several conditions are required to trigger this race:
> 1. the ldisc read buffer must become full so the input worker exits
> 2. the read() count parameter must be >= 4096 so the ldisc read buffer
>    is empty
> 3. the subsequent read() occurs before the kicked worker has processed
>    more input
> 
> However, the underlying cause of the race is that data is pipelined, while
> tty state is not; ie., data already written by the pty slave end is not
> yet visible to the pty master end, but state changes by the pty slave end
> are visible to the pty master end immediately.
> 
> Pipeline the TTY_OTHER_CLOSED state through input worker to the reader.
> 1. Introduce TTY_OTHER_DONE which is set by the input worker when
>    TTY_OTHER_CLOSED is set and either the input buffers are flushed or
>    input processing has completed. Readers/polls are woken when
>    TTY_OTHER_DONE is set.
> 2. Reader/poll checks TTY_OTHER_DONE instead of TTY_OTHER_CLOSED.
> 3. A new input worker is started from pty_close() after setting
>    TTY_OTHER_CLOSED, which ensures the TTY_OTHER_DONE state will be
>    set if the last input worker is already finished (or just about to
>    exit).
> 
> Remove tty_flush_to_ldisc(); no in-tree callers.
> 
> Fixes: 52bce7f8d4fc ("pty, n_tty: Simplify input processing on final close")
> Bugzilla: https://bugzilla.kernel.org/show_bug.cgi?id=96311
> BugLink: http://bugs.launchpad.net/bugs/1429756
> Cc: <[email protected]> # 3.19+
> Reported-by: Andy Whitcroft <[email protected]>
> Reported-by: H.J. Lu <[email protected]>
> Signed-off-by: Peter Hurley <[email protected]>
> ---
> 
> v2: Clear TTY_OTHER_DONE on pty re-open
> 
>  Documentation/serial/tty.txt |  3 +++
>  drivers/tty/n_hdlc.c         |  4 ++--
>  drivers/tty/n_tty.c          |  4 ++--
>  drivers/tty/pty.c            |  4 ++--
>  drivers/tty/tty_buffer.c     | 25 +++++++++++--------------
>  include/linux/tty.h          |  2 +-
>  6 files changed, 21 insertions(+), 21 deletions(-)
> 
> diff --git a/Documentation/serial/tty.txt b/Documentation/serial/tty.txt
> index 1e52d67..dbe6623 100644
> --- a/Documentation/serial/tty.txt
> +++ b/Documentation/serial/tty.txt
> @@ -198,6 +198,9 @@ TTY_IO_ERROR              If set, causes all subsequent 
> userspace read/write
>  
>  TTY_OTHER_CLOSED     Device is a pty and the other side has closed.
>  
> +TTY_OTHER_DONE               Device is a pty and the other side has closed 
> and
> +                     all pending input processing has been completed.
> +
>  TTY_NO_WRITE_SPLIT   Prevent driver from splitting up writes into
>                       smaller chunks.
>  
> diff --git a/drivers/tty/n_hdlc.c b/drivers/tty/n_hdlc.c
> index 644ddb8..bbc4ce6 100644
> --- a/drivers/tty/n_hdlc.c
> +++ b/drivers/tty/n_hdlc.c
> @@ -600,7 +600,7 @@ static ssize_t n_hdlc_tty_read(struct tty_struct *tty, 
> struct file *file,
>       add_wait_queue(&tty->read_wait, &wait);
>  
>       for (;;) {
> -             if (test_bit(TTY_OTHER_CLOSED, &tty->flags)) {
> +             if (test_bit(TTY_OTHER_DONE, &tty->flags)) {
>                       ret = -EIO;
>                       break;
>               }
> @@ -828,7 +828,7 @@ static unsigned int n_hdlc_tty_poll(struct tty_struct 
> *tty, struct file *filp,
>               /* set bits for operations that won't block */
>               if (n_hdlc->rx_buf_list.head)
>                       mask |= POLLIN | POLLRDNORM;    /* readable */
> -             if (test_bit(TTY_OTHER_CLOSED, &tty->flags))
> +             if (test_bit(TTY_OTHER_DONE, &tty->flags))
>                       mask |= POLLHUP;
>               if (tty_hung_up_p(filp))
>                       mask |= POLLHUP;
> diff --git a/drivers/tty/n_tty.c b/drivers/tty/n_tty.c
> index 54da8f4..522de6d 100644
> --- a/drivers/tty/n_tty.c
> +++ b/drivers/tty/n_tty.c
> @@ -2233,7 +2233,7 @@ static ssize_t n_tty_read(struct tty_struct *tty, 
> struct file *file,
>                       ldata->minimum_to_wake = (minimum - (b - buf));
>  
>               if (!input_available_p(tty, 0)) {
> -                     if (test_bit(TTY_OTHER_CLOSED, &tty->flags)) {
> +                     if (test_bit(TTY_OTHER_DONE, &tty->flags)) {

A very small race window is open here, where a stale head index could
be read and yet still TTY_OTHER_DONE could be observed. The "input done"
state needs to be snapshot before testing input_available_p() to close
this race window.

>                               retval = -EIO;
>                               break;
>                       }
> @@ -2444,7 +2444,7 @@ static unsigned int n_tty_poll(struct tty_struct *tty, 
> struct file *file,
>               mask |= POLLIN | POLLRDNORM;
>       if (tty->packet && tty->link->ctrl_status)
>               mask |= POLLPRI | POLLIN | POLLRDNORM;
> -     if (test_bit(TTY_OTHER_CLOSED, &tty->flags))
> +     if (test_bit(TTY_OTHER_DONE, &tty->flags))

same here.

>               mask |= POLLHUP;
>       if (tty_hung_up_p(file))
>               mask |= POLLHUP;
> diff --git a/drivers/tty/pty.c b/drivers/tty/pty.c
> index 6fffb53..439864b 100644
> --- a/drivers/tty/pty.c
> +++ b/drivers/tty/pty.c
> @@ -59,9 +59,8 @@ static void pty_close(struct tty_struct *tty, struct file 
> *filp)
>       /* Review - krefs on tty_link ?? */
>       if (!tty->link)
>               return;
> -     tty_flush_to_ldisc(tty->link);
>       set_bit(TTY_OTHER_CLOSED, &tty->link->flags);
> -     wake_up_interruptible(&tty->link->read_wait);
> +     tty_flip_buffer_push(tty->link->port);
>       wake_up_interruptible(&tty->link->write_wait);
>       if (tty->driver->subtype == PTY_TYPE_MASTER) {
>               set_bit(TTY_OTHER_CLOSED, &tty->flags);
> @@ -250,6 +249,7 @@ static int pty_open(struct tty_struct *tty, struct file 
> *filp)
>  
>       clear_bit(TTY_IO_ERROR, &tty->flags);

>       clear_bit(TTY_OTHER_CLOSED, &tty->link->flags);
> +     clear_bit(TTY_OTHER_DONE, &tty->link->flags);

This state transition needs to be atomic.

>       set_bit(TTY_THROTTLED, &tty->flags);
>       return 0;
>  
> diff --git a/drivers/tty/tty_buffer.c b/drivers/tty/tty_buffer.c
> index 7566164..642dcd0 100644
> --- a/drivers/tty/tty_buffer.c
> +++ b/drivers/tty/tty_buffer.c
> @@ -229,6 +229,11 @@ void tty_buffer_flush(struct tty_struct *tty, struct 
> tty_ldisc *ld)
>       if (ld && ld->ops->flush_buffer)
>               ld->ops->flush_buffer(tty);
>  
> +     if (test_bit(TTY_OTHER_CLOSED, &tty->flags)) {
> +             set_bit(TTY_OTHER_DONE, &tty->flags);
> +             wake_up_interruptible(&tty->read_wait);
> +     }
> +
>       atomic_dec(&buf->priority);
>       mutex_unlock(&buf->lock);
>  }
> @@ -471,8 +476,13 @@ static void flush_to_ldisc(struct work_struct *work)
>               smp_rmb();
>               count = head->commit - head->read;
>               if (!count) {
> -                     if (next == NULL)
> +                     if (next == NULL) {
> +                             if (test_bit(TTY_OTHER_CLOSED, &tty->flags)) {
> +                                     set_bit(TTY_OTHER_DONE, &tty->flags);

This is racy with clearing TTY_OTHER_DONE; the state transition from
TTY_OTHER_CLOSED => TTY_OTHER_DONE needs to be atomic (as does the
state transition in pty_open() from TTY_OTHER_CLOSED => !TTY_OTHER_DONE).

> +                                     wake_up_interruptible(&tty->read_wait);
> +                             }
>                               break;
> +                     }
>                       buf->head = next;
>                       tty_buffer_free(port, head);
>                       continue;
> @@ -489,19 +499,6 @@ static void flush_to_ldisc(struct work_struct *work)
>  }
>  
>  /**
> - *   tty_flush_to_ldisc
> - *   @tty: tty to push
> - *
> - *   Push the terminal flip buffers to the line discipline.
> - *
> - *   Must not be called from IRQ context.
> - */
> -void tty_flush_to_ldisc(struct tty_struct *tty)
> -{
> -     flush_work(&tty->port->buf.work);
> -}
> -
> -/**
>   *   tty_flip_buffer_push    -       terminal
>   *   @port: tty port to push
>   *
> diff --git a/include/linux/tty.h b/include/linux/tty.h
> index f9fbdf1..0f29f31 100644
> --- a/include/linux/tty.h
> +++ b/include/linux/tty.h
> @@ -339,6 +339,7 @@ struct tty_file_private {
>  #define TTY_EXCLUSIVE                3       /* Exclusive open mode */
>  #define TTY_DEBUG            4       /* Debugging */
>  #define TTY_DO_WRITE_WAKEUP  5       /* Call write_wakeup after queuing new 
> */
> +#define TTY_OTHER_DONE               6       /* Closed pty has completed 
> input processing */
>  #define TTY_LDISC_OPEN               11      /* Line discipline is open */
>  #define TTY_PTY_LOCK                 16      /* pty private */
>  #define TTY_NO_WRITE_SPLIT   17      /* Preserve write boundaries to driver 
> */
> @@ -462,7 +463,6 @@ extern int tty_hung_up_p(struct file *filp);
>  extern void do_SAK(struct tty_struct *tty);
>  extern void __do_SAK(struct tty_struct *tty);
>  extern void no_tty(void);
> -extern void tty_flush_to_ldisc(struct tty_struct *tty);
>  extern void tty_buffer_free_all(struct tty_port *port);
>  extern void tty_buffer_flush(struct tty_struct *tty, struct tty_ldisc *ld);
>  extern void tty_buffer_init(struct tty_port *port);
> 

--
To unsubscribe from this list: send the line "unsubscribe stable" in
the body of a message to [email protected]
More majordomo info at  http://vger.kernel.org/majordomo-info.html

Reply via email to