[jira] [Updated] (KUDU-1447) Document recommendation to disable THP

2020-06-01 Thread Grant Henke (Jira)


 [ 
https://issues.apache.org/jira/browse/KUDU-1447?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Grant Henke updated KUDU-1447:
--
Labels: supportability  (was: )

> Document recommendation to disable THP
> --
>
>                 Key: KUDU-1447
>                 URL: https://issues.apache.org/jira/browse/KUDU-1447
>             Project: Kudu
>          Issue Type: Improvement
>          Components: documentation
>    Affects Versions: 0.8.0
>            Reporter: Todd Lipcon
>            Assignee: Todd Lipcon
>            Priority: Major
>              Labels: supportability
>
> While doing a bunch of cluster testing, I finally got to the root of why 
> threads sometimes take several seconds to start up, causing various timeout 
> issues, false elections, etc. It turns out that khugepaged does synchronous 
> page compaction while holding a process's mmap semaphore, and when that's 
> concurrent with lots of IO, it can block for several seconds.
> https://lkml.org/lkml/2011/7/26/103
> To avoid this, we should tell users to set transparent hugepages (THP) to 
> "madvise" or "never". It's not sufficient to just disable defrag, because 
> khugepaged still runs in the background in that case and causes this 
> sporadic issue.
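
For illustration only, here is a minimal Python sketch of how a node could be checked against the recommendation above. It assumes the standard Linux sysfs file /sys/kernel/mm/transparent_hugepage/enabled, where the kernel marks the active mode with square brackets (for example "always madvise [never]"). The path may differ on some distributions, and the helper names parse_thp_mode and thp_mode_is_recommended are hypothetical, not part of Kudu.

```python
import re
from pathlib import Path
from typing import Optional

# Assumed location of the THP mode file; some distributions may use another path.
THP_ENABLED_PATH = Path("/sys/kernel/mm/transparent_hugepage/enabled")

# Modes the issue description suggests recommending to users.
RECOMMENDED_MODES = {"madvise", "never"}


def parse_thp_mode(text: str) -> Optional[str]:
    """Return the active THP mode, i.e. the word the kernel wraps in brackets."""
    match = re.search(r"\[(\w+)\]", text)
    return match.group(1) if match else None


def thp_mode_is_recommended(text: str) -> bool:
    """True if the active mode is 'madvise' or 'never'."""
    return parse_thp_mode(text) in RECOMMENDED_MODES


if __name__ == "__main__":
    if THP_ENABLED_PATH.exists():
        contents = THP_ENABLED_PATH.read_text()
        mode = parse_thp_mode(contents)
        verdict = "ok" if thp_mode_is_recommended(contents) else "consider madvise or never"
        print(f"THP mode: {mode} ({verdict})")
    else:
        print("THP sysfs file not found; this kernel may not support THP")
```

The mode itself is typically changed by writing one of the listed values to the same file as root. Such a change does not persist across reboots unless it is also applied at boot time.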



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Updated] (KUDU-1447) Document recommendation to disable THP

2020-06-01 Thread Grant Henke (Jira)


 [ 
https://issues.apache.org/jira/browse/KUDU-1447?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Grant Henke updated KUDU-1447:
--
Target Version/s:   (was: 1.8.0)



[jira] [Updated] (KUDU-1447) Document recommendation to disable THP

2018-03-08 Thread Grant Henke (JIRA)

 [ 
https://issues.apache.org/jira/browse/KUDU-1447?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Grant Henke updated KUDU-1447:
--
Target Version/s: 1.8.0  (was: 1.7.0)



[jira] [Updated] (KUDU-1447) Document recommendation to disable THP

2018-02-16 Thread Grant Henke (JIRA)

 [ 
https://issues.apache.org/jira/browse/KUDU-1447?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Grant Henke updated KUDU-1447:
--
Target Version/s: 1.7.0  (was: 1.5.0)
