[jira] [Commented] (METRON-1445) Update performance tuning guide with more explicit parameter instructions
[ https://issues.apache.org/jira/browse/METRON-1445?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16441321#comment-16441321 ]

ASF GitHub Bot commented on METRON-1445:

Github user asfgit closed the pull request at:

    https://github.com/apache/metron/pull/988

> Update performance tuning guide with more explicit parameter instructions
> -------------------------------------------------------------------------
>
>         Key: METRON-1445
>         URL: https://issues.apache.org/jira/browse/METRON-1445
>     Project: Metron
>  Issue Type: Improvement
>    Reporter: Michael Miklavcic
>    Assignee: Michael Miklavcic
>    Priority: Major

--
This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Commented] (METRON-1445) Update performance tuning guide with more explicit parameter instructions
[ https://issues.apache.org/jira/browse/METRON-1445?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16441252#comment-16441252 ]

ASF GitHub Bot commented on METRON-1445:

Github user mmiklavc commented on a diff in the pull request:

    https://github.com/apache/metron/pull/988#discussion_r182170422

    --- Diff: metron-platform/Performance-tuning-guide.md ---
    @@ -111,15 +138,146 @@ __spout-config.json__
       }
     ```
     
    -These are the spout recommended defaults from Storm and are currently the defaults provided in the Kafka spout itself. In fact, if you find the recommended defaults work fine for you,
    +Above is a snippet for configuring parsers. These are the spout recommended defaults from Storm and are currently the defaults provided in the Kafka spout itself. In fact, if you find the recommended defaults work fine for you,
     then you can omit these settings altogether.
     
    + Where to Find Tuning Properties
    +
    +**Important:** The parser topologies are deployed via a builder pattern that takes parameters from the CLI as set via Ambari. The enrichment and indexing topologies are configured
    +using a Storm Flux file, a configuration properties file, and Ambari. Here is a setting materialization summary for each of the topology types:
    +
    +- Parsers
    +    - Management UI -> parser json config and CLI -> Storm
    +- Enrichment
    +    - Ambari UI -> properties file -> Flux -> Storm
    +- Indexing
    +    - Ambari UI -> properties file -> Flux -> Storm
    +
    +**Parsers**
    +
    +This is a mapping of the various performance tuning properties for parsers and how they are materialized.
    +
    +See more detail on starting parsers [here](https://github.com/apache/metron/blob/master/metron-platform/metron-parsers/README.md#starting-the-parser-topology)
    +
    +| Category | Management UI Property Name | JSON Config File Property Name | CLI Option | Storm Property Name | Notes |
    +|---|---|---|---|---|---|
    +| Storm topology config | Num Workers | n/a | -nw,--num_workers | topology.workers | |
    +| | Num Ackers | n/a | -na,--num_ackers | topology.acker.executors | |
    +| | Storm Config | topology.max.spout.pending | -e,--extra_topology_options , e.g. { "topology.max.spout.pending" : NUM } | topology.max.spout.pending | Put property in JSON format in a file named `storm--config.json` |
    +| Kafka spout | Spout Parallelism | n/a | -sp,--spout_p | n/a | |
    +| | Spout Num Tasks | n/a | -snt,--spout_num_tasks | n/a | |
    +| | Spout Config | spout.pollTimeoutMs | -esc,--extra_kafka_spout_config , e.g. { "spout.pollTimeoutMs" : 200 } | n/a | Put property in JSON format in a file named `spout--config.json` |
    +| | Spout Config | spout.maxUncommittedOffsets | -esc,--extra_kafka_spout_config , e.g. { "spout.maxUncommittedOffsets" : 1000 } | n/a | Put property in JSON format in a file named `spout--config.json` |
    +| | Spout Config | spout.offsetCommitPeriodMs | -esc,--extra_kafka_spout_config , e.g. { "spout.offsetCommitPeriodMs" :
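Editor's sketch: the parser table quoted above maps Management UI fields to CLI options and to JSON files passed through `--extra_topology_options` and `--extra_kafka_spout_config`. A minimal Python illustration of that mapping follows. Several things here are assumptions and not taken from the PR: the helper names `write_json` and `parser_cli_options`, the simplified file names `storm-config.json` and `spout-config.json` (the names in the quoted table are partly elided), the reading that both "extra" options accept a path to a JSON file, and the value `500`, which only stands in for the `NUM` placeholder.

```python
import json
import tempfile
from pathlib import Path


def write_json(path, settings):
    """Write a flat settings dict as JSON and return the path."""
    path.write_text(json.dumps(settings, indent=2))
    return path


def parser_cli_options(workdir, num_workers, num_ackers, spout_parallelism,
                       spout_num_tasks, storm_settings, spout_settings):
    """Build the option list named in the quoted table (hypothetical helper).

    Assumption: the two "extra" options take the path of a JSON file.
    """
    storm_file = write_json(Path(workdir) / "storm-config.json", storm_settings)
    spout_file = write_json(Path(workdir) / "spout-config.json", spout_settings)
    return [
        "--num_workers", str(num_workers),         # topology.workers
        "--num_ackers", str(num_ackers),           # topology.acker.executors
        "--spout_p", str(spout_parallelism),       # Kafka spout parallelism
        "--spout_num_tasks", str(spout_num_tasks),
        "--extra_topology_options", str(storm_file),
        "--extra_kafka_spout_config", str(spout_file),
    ]


if __name__ == "__main__":
    with tempfile.TemporaryDirectory() as workdir:
        options = parser_cli_options(
            workdir, 1, 1, 1, 1,
            {"topology.max.spout.pending": 500},  # 500 is a placeholder for NUM
            {"spout.pollTimeoutMs": 200, "spout.maxUncommittedOffsets": 1000},
        )
        print(" ".join(options))
```

The list is meant to be appended to whatever command starts the parser topology in your installation; the start command and its required connection options are deliberately left out because they are not part of the quoted diff.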
[jira] [Commented] (METRON-1445) Update performance tuning guide with more explicit parameter instructions
[ https://issues.apache.org/jira/browse/METRON-1445?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16441243#comment-16441243 ]

ASF GitHub Bot commented on METRON-1445:

Github user justinleet commented on a diff in the pull request:

    https://github.com/apache/metron/pull/988#discussion_r182168873

    --- Diff: metron-platform/Performance-tuning-guide.md ---
    @@ -111,15 +138,146 @@ __spout-config.json__
       }
     ```
     
    -These are the spout recommended defaults from Storm and are currently the defaults provided in the Kafka spout itself. In fact, if you find the recommended defaults work fine for you,
    +Above is a snippet for configuring parsers. These are the spout recommended defaults from Storm and are currently the defaults provided in the Kafka spout itself. In fact, if you find the recommended defaults work fine for you,
     then you can omit these settings altogether.
     
    + Where to Find Tuning Properties
    +
    +**Important:** The parser topologies are deployed via a builder pattern that takes parameters from the CLI as set via Ambari. The enrichment and indexing topologies are configured
    +using a Storm Flux file, a configuration properties file, and Ambari. Here is a setting materialization summary for each of the topology types:
    +
    +- Parsers
    +    - Management UI -> parser json config and CLI -> Storm
    +- Enrichment
    +    - Ambari UI -> properties file -> Flux -> Storm
    +- Indexing
    +    - Ambari UI -> properties file -> Flux -> Storm
    +
    +**Parsers**
    +
    +This is a mapping of the various performance tuning properties for parsers and how they are materialized.
    +
    +See more detail on starting parsers [here](https://github.com/apache/metron/blob/master/metron-platform/metron-parsers/README.md#starting-the-parser-topology)
    +
    +| Category | Management UI Property Name | JSON Config File Property Name | CLI Option | Storm Property Name | Notes |
    +|---|---|---|---|---|---|
    +| Storm topology config | Num Workers | n/a | -nw,--num_workers | topology.workers | |
    +| | Num Ackers | n/a | -na,--num_ackers | topology.acker.executors | |
    +| | Storm Config | topology.max.spout.pending | -e,--extra_topology_options , e.g. { "topology.max.spout.pending" : NUM } | topology.max.spout.pending | Put property in JSON format in a file named `storm--config.json` |
    +| Kafka spout | Spout Parallelism | n/a | -sp,--spout_p | n/a | |
    +| | Spout Num Tasks | n/a | -snt,--spout_num_tasks | n/a | |
    +| | Spout Config | spout.pollTimeoutMs | -esc,--extra_kafka_spout_config , e.g. { "spout.pollTimeoutMs" : 200 } | n/a | Put property in JSON format in a file named `spout--config.json` |
    +| | Spout Config | spout.maxUncommittedOffsets | -esc,--extra_kafka_spout_config , e.g. { "spout.maxUncommittedOffsets" : 1000 } | n/a | Put property in JSON format in a file named `spout--config.json` |
    +| | Spout Config | spout.offsetCommitPeriodMs | -esc,--extra_kafka_spout_config , e.g. { "spout.offsetCommitPeriodMs" :
[jira] [Commented] (METRON-1445) Update performance tuning guide with more explicit parameter instructions
[ https://issues.apache.org/jira/browse/METRON-1445?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16441186#comment-16441186 ]

ASF GitHub Bot commented on METRON-1445:

Github user nickwallen commented on the issue:

    https://github.com/apache/metron/pull/988

    +1 Thanks
[jira] [Commented] (METRON-1445) Update performance tuning guide with more explicit parameter instructions
[ https://issues.apache.org/jira/browse/METRON-1445?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16441155#comment-16441155 ]

ASF GitHub Bot commented on METRON-1445:

Github user mmiklavc commented on a diff in the pull request:

    https://github.com/apache/metron/pull/988#discussion_r182152983

    --- Diff: metron-platform/metron-common/README.md ---
    @@ -400,3 +401,34 @@ __Side Effects__
     Calling the mark() method multiple times simply resets the start time to the current nano time. Calling log() with a non-existent mark name will log 0 ns elapsed time with a warning indicating that log has been invoked for a mark name that does not exist.
     The class is not thread-safe and makes no attempt at keeping multiple threads from modifying the same markers.
     
    +# Metron Debugging
    +
    +A Python script is provided for gathering information useful in debugging your Metron cluster. Run from the node that has Metron installed on it. All options listed below are required.
    +
    +_Note:_ Be aware that no anonymization/scrubbing is performed on the captured configuration details.
    +
    +```
    +# ./cluster_info.py -h
    --- End diff --

    @nickwallen fixed to reference $METRON_HOME/bin
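Editor's sketch: the context lines of the diff above describe the side effects of `mark()` and `log()`: a repeated `mark()` resets the start time, and `log()` on an unknown mark name reports 0 ns with a warning. The Python class below is only an analogue of that described behavior, written to make the semantics concrete. It is not the Metron class; the name `MarkTimer` and its structure are hypothetical.

```python
import logging
import time


class MarkTimer:
    """Hypothetical analogue of the mark()/log() behavior described above.

    Like the class in the README text, this is not thread-safe.
    """

    def __init__(self, logger=None):
        self._marks = {}
        self._log = logger or logging.getLogger(__name__)

    def mark(self, name):
        # Calling mark() again for the same name simply resets the start time.
        self._marks[name] = time.perf_counter_ns()

    def log(self, name):
        start = self._marks.get(name)
        if start is None:
            # Unknown mark name: warn and report 0 ns elapsed.
            self._log.warning("log() invoked for mark name that does not exist: %s", name)
            elapsed = 0
        else:
            elapsed = time.perf_counter_ns() - start
        self._log.info("mark=%s elapsed_ns=%d", name, elapsed)
        return elapsed


if __name__ == "__main__":
    logging.basicConfig(level=logging.INFO)
    timer = MarkTimer()
    timer.mark("parse")
    timer.mark("parse")      # resets the start time for "parse"
    timer.log("parse")
    timer.log("missing")     # warns and logs 0 ns
```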
[jira] [Commented] (METRON-1445) Update performance tuning guide with more explicit parameter instructions
[ https://issues.apache.org/jira/browse/METRON-1445?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16433494#comment-16433494 ]

ASF GitHub Bot commented on METRON-1445:

GitHub user mmiklavc opened a pull request:

    https://github.com/apache/metron/pull/988

    METRON-1445: Update performance tuning guide with more explicit parameter instructions

    ## Contributor Comments

    https://issues.apache.org/jira/browse/METRON-1445

    The current performance tuning guide can be challenging to follow for those not familiar with the system. This PR updates the guide to provide details about how and where to find tuning parameters, along with details as to how the properties are materialized in a running topology.

    Additionally, a supplement to the tuning guide has been provided to pull Metron cluster debugging info into a tarball. As pertains to the perf guide, it captures the state of Ambari configuration, flux files, properties files, and zk config. This helps users confirm system state as they make tuning changes.

    ## Pull Request Checklist

    ### For all changes:
    - [x] Is there a JIRA ticket associated with this PR? If not one needs to be created at [Metron Jira](https://issues.apache.org/jira/browse/METRON/?selectedTab=com.atlassian.jira.jira-projects-plugin:summary-panel).
    - [x] Does your PR title start with METRON- where is the JIRA number you are trying to resolve? Pay particular attention to the hyphen "-" character.
    - [x] Has your PR been rebased against the latest commit within the target branch (typically master)?

    ### For code changes:
    n/a

    ### For documentation related changes:
    - [ ] Have you ensured that format looks appropriate for the output in which it is rendered by building and verifying the site-book? If not then run the following commands and then verify changes via `site-book/target/site/index.html`:

      ```
      cd site-book
      mvn site
      ```

You can merge this pull request into a Git repository by running:

    $ git pull https://github.com/mmiklavc/metron performance-doc

Alternatively you can review and apply these changes as the patch at:

    https://github.com/apache/metron/pull/988.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

    This closes #988

commit 43120ee6825d52fd5074fa2ab2a8c8f6d8a71dba
Author: Michael Miklavcic
Date:   2018-02-16T19:40:00Z

    Add start of debugging utility. First pass at perf tuning doc updates.

commit f5c39d02c636ed7c7e08fddef8f0d1d9da4052d9
Author: Michael Miklavcic
Date:   2018-04-03T21:54:12Z

    Merge with master

commit 79ddc02767cec5a76b5d0955546bbedf96a4353f
Author: Michael Miklavcic
Date:   2018-04-04T18:08:04Z

    Merge branch 'master' into performance-doc

commit 4f4cf53699330257531df87a50f8c5dee13aca11
Author: Michael Miklavcic
Date:   2018-04-05T15:21:03Z

    Merge branch 'master' into performance-doc

commit fc17e1d1b00809c2c4a5514181221722a117cd5e
Author: Michael Miklavcic
Date:   2018-04-11T06:47:18Z

    Finish tuning guide parameter mappings. Move debug info script from Bash to Python.
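Editor's sketch: the PR description above mentions a utility that pulls configuration state (Ambari configuration, flux files, properties files, zk config) into a tarball. The general idea of bundling a set of captured files can be sketched in Python as follows. This is not the script from the PR; the function name `bundle_files` and the example file names are hypothetical, and, as the PR's own note warns for its script, nothing here scrubs or anonymizes the captured content.

```python
import tarfile
import tempfile
from pathlib import Path


def bundle_files(paths, archive_path):
    """Add each existing file to a gzip tarball; return the archived names.

    Files are stored under their base name only, so two inputs with the
    same base name would collide in this simple sketch.
    """
    archived = []
    with tarfile.open(archive_path, "w:gz") as tar:
        for path in map(Path, paths):
            if path.is_file():
                tar.add(path, arcname=path.name)
                archived.append(path.name)
    return archived


if __name__ == "__main__":
    with tempfile.TemporaryDirectory() as workdir:
        work = Path(workdir)
        # Hypothetical stand-ins for captured flux and properties files.
        flux = work / "remote.yaml"
        props = work / "enrichment.properties"
        flux.write_text("name: example\n")
        props.write_text("example.key=value\n")
        names = bundle_files([flux, props, work / "absent.txt"],
                             work / "cluster-info.tar.gz")
        print(names)
```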