On 04/07/2015 08:40 AM, Ilya Dryomov wrote:
> This reverts commit 89baaa570ab0b476db09408d209578cfed700e9f.
> 
> Dirty page throttling should be sufficient for us in the general case
> so there is no need to use __GFP_MEMALLOC - it would be needed only in
> the swap-over-rbd case, which we currently don't support.  (It would
> probably take approximately the commit that is being reverted to add
> that support, but we would also need the "swap" option to distinguish
> from the general case and make sure swap ceph_client-s aren't shared
> with anything else.)  See ceph-devel threads [1] and [2] for the
> details of why enabling pfmemalloc reserves for all cases is a bad
> thing.
> 
> On top of potential system lockups related to drained emergency
> reserves, this turned out to cause ceph lockups in case peers are on
> the same host and communicating via loopback due to sk_filter()
> dropping pfmemalloc skbs on the receiving side because the receiving
> loopback socket is not tagged with SOCK_MEMALLOC.
> 
> [1] "SOCK_MEMALLOC vs loopback"
>     http://www.spinics.net/lists/ceph-devel/msg22998.html
> [2] "[PATCH] libceph: don't set memalloc flags in loopback case"
>     http://www.spinics.net/lists/ceph-devel/msg23392.html
> 
> Conflicts:
>       net/ceph/messenger.c [ context: tcp_nodelay option ]
> 
> Cc: Mike Christie <micha...@cs.wisc.edu>
> Cc: Mel Gorman <mgor...@suse.de>
> Cc: Sage Weil <s...@redhat.com>
> Cc: sta...@vger.kernel.org # 3.18+, needs backporting
> Signed-off-by: Ilya Dryomov <idryo...@gmail.com>

Yeah, I misunderstood the memalloc flag use.

Reviewed-by: Mike Christie <micha...@cs.wisc.edu>

--
To unsubscribe from this list: send the line "unsubscribe ceph-devel" in
the body of a message to majord...@vger.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html

Reply via email to