[jira] [Updated] (KUDU-1447) Document recommendation to disable THP
[ https://issues.apache.org/jira/browse/KUDU-1447?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Grant Henke updated KUDU-1447: -- Labels: supportability (was: ) > Document recommendation to disable THP > -- > > Key: KUDU-1447 > URL: https://issues.apache.org/jira/browse/KUDU-1447 > Project: Kudu > Issue Type: Improvement > Components: documentation >Affects Versions: 0.8.0 >Reporter: Todd Lipcon >Assignee: Todd Lipcon >Priority: Major > Labels: supportability > > Doing a bunch of cluster testing, I finally got to the root of why sometimes > threads take several seconds to start up, causing various timeout issues, > false elections, etc. It turns out that khugepaged does synchronous page > compaction while holding a process's mmap semaphore, and when that's > concurrent with lots of IO, can block for several seconds. > https://lkml.org/lkml/2011/7/26/103 > To avoid this, we should tell users to set hugepages to "madvise" or "never" > -- it's not sufficient to just disable defrag, because khugepaged still runs > in the background in that case and causes this sporadic issue. -- This message was sent by Atlassian Jira (v8.3.4#803005)
[jira] [Updated] (KUDU-1447) Document recommendation to disable THP
[ https://issues.apache.org/jira/browse/KUDU-1447?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Grant Henke updated KUDU-1447: -- Target Version/s: (was: 1.8.0) > Document recommendation to disable THP > -- > > Key: KUDU-1447 > URL: https://issues.apache.org/jira/browse/KUDU-1447 > Project: Kudu > Issue Type: Improvement > Components: documentation >Affects Versions: 0.8.0 >Reporter: Todd Lipcon >Assignee: Todd Lipcon >Priority: Major > Labels: supportability > > Doing a bunch of cluster testing, I finally got to the root of why sometimes > threads take several seconds to start up, causing various timeout issues, > false elections, etc. It turns out that khugepaged does synchronous page > compaction while holding a process's mmap semaphore, and when that's > concurrent with lots of IO, can block for several seconds. > https://lkml.org/lkml/2011/7/26/103 > To avoid this, we should tell users to set hugepages to "madvise" or "never" > -- it's not sufficient to just disable defrag, because khugepaged still runs > in the background in that case and causes this sporadic issue. -- This message was sent by Atlassian Jira (v8.3.4#803005)
[jira] [Updated] (KUDU-1447) Document recommendation to disable THP
[ https://issues.apache.org/jira/browse/KUDU-1447?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Grant Henke updated KUDU-1447: -- Target Version/s: 1.8.0 (was: 1.7.0) > Document recommendation to disable THP > -- > > Key: KUDU-1447 > URL: https://issues.apache.org/jira/browse/KUDU-1447 > Project: Kudu > Issue Type: Improvement > Components: documentation >Affects Versions: 0.8.0 >Reporter: Todd Lipcon >Assignee: Todd Lipcon >Priority: Major > > Doing a bunch of cluster testing, I finally got to the root of why sometimes > threads take several seconds to start up, causing various timeout issues, > false elections, etc. It turns out that khugepaged does synchronous page > compaction while holding a process's mmap semaphore, and when that's > concurrent with lots of IO, can block for several seconds. > https://lkml.org/lkml/2011/7/26/103 > To avoid this, we should tell users to set hugepages to "madvise" or "never" > -- it's not sufficient to just disable defrag, because khugepaged still runs > in the background in that case and causes this sporadic issue. -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Updated] (KUDU-1447) Document recommendation to disable THP
[ https://issues.apache.org/jira/browse/KUDU-1447?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Grant Henke updated KUDU-1447: -- Target Version/s: 1.7.0 (was: 1.5.0) > Document recommendation to disable THP > -- > > Key: KUDU-1447 > URL: https://issues.apache.org/jira/browse/KUDU-1447 > Project: Kudu > Issue Type: Improvement > Components: documentation >Affects Versions: 0.8.0 >Reporter: Todd Lipcon >Assignee: Todd Lipcon >Priority: Major > > Doing a bunch of cluster testing, I finally got to the root of why sometimes > threads take several seconds to start up, causing various timeout issues, > false elections, etc. It turns out that khugepaged does synchronous page > compaction while holding a process's mmap semaphore, and when that's > concurrent with lots of IO, can block for several seconds. > https://lkml.org/lkml/2011/7/26/103 > To avoid this, we should tell users to set hugepages to "madvise" or "never" > -- it's not sufficient to just disable defrag, because khugepaged still runs > in the background in that case and causes this sporadic issue. -- This message was sent by Atlassian JIRA (v7.6.3#76005)