[
https://issues.apache.org/jira/browse/METRON-1717?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16609601#comment-16609601
]
ASF GitHub Bot commented on METRON-1717:
----------------------------------------
Github user merrimanr commented on a diff in the pull request:
https://github.com/apache/metron/pull/1187#discussion_r216416991
--- Diff: metron-analytics/metron-profiler-common/README.md ---
@@ -0,0 +1,386 @@
+<!--
+Licensed to the Apache Software Foundation (ASF) under one
+or more contributor license agreements. See the NOTICE file
+distributed with this work for additional information
+regarding copyright ownership. The ASF licenses this file
+to you under the Apache License, Version 2.0 (the
+"License"); you may not use this file except in compliance
+with the License. You may obtain a copy of the License at
+
+ http://www.apache.org/licenses/LICENSE-2.0
+
+Unless required by applicable law or agreed to in writing, software
+distributed under the License is distributed on an "AS IS" BASIS,
+WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
+See the License for the specific language governing permissions and
+limitations under the License.
+-->
+# Metron Profiler
+
+* [Introduction](#introduction)
+* [Getting Started](#getting-started)
+* [Profiles](#profiles)
+* [Examples](#examples)
+
+## Introduction
+
+The Profiler is a feature extraction mechanism that can generate a profile
describing the behavior of an entity. An entity might be a server, user,
subnet or application. Once a profile has been generated defining what normal
behavior looks-like, models can be built that identify anomalous behavior.
+
+This is achieved by summarizing the telemetry data consumed by Metron over
tumbling windows. A summary statistic is applied to the data received within a
given window. Collecting these values across many windows result in a time
series that is useful for analysis.
+
+Any field contained within a message can be used to generate a profile. A
profile can even be produced by combining fields that originate in different
data sources. A user has considerable power to transform the data used in a
profile by leveraging the Stellar language.
+
+There are three separate ports of the Profiler that share this common code
base.
+* The [Storm Profiler](../metron-profiler-storm/README.md) builds
low-latency profiles over streaming data sets.
+* The [Spark Profiler](../metron-profiler-spark/README.md) backfills
profiles using archived telemetry.
+* The [REPL Profiler](../metron-profiler-repl/README.md) allows profiles
to be tested and debugged within the Stellar REPL.
+
+## Getting Started
+
+1. [Create a profile](../metron-profiler-repl/README.md#getting-started)
using the Stellar REPL. Validate your profile using mock data, then apply real,
live data.
+
+1. [Backfill your
profile](../metron-profiler-spark/README.md#getting-started) using archived
telemetry to see how your profile behaves over time.
+
+1. [Deploy your
profile](../metron-profiler-storm/README.md#getting-started) to Storm to
maintain a low-latency profile over a streaming data set.
+
+1. [Retrieve your profile data](../metron-profiler-client/README.md) using
the Stellar API so that you can build enrichments, alert on abnormalities
--- End diff --
period at the end
> Relocate Storm Profiler Code
> ----------------------------
>
> Key: METRON-1717
> URL: https://issues.apache.org/jira/browse/METRON-1717
> Project: Metron
> Issue Type: Sub-task
> Reporter: Nick Allen
> Assignee: Nick Allen
> Priority: Major
>
> The Storm port of the Profiler currently lives in
> metron-analytics/metron-profiler. This should be moved to
> metron-analytics/metron-profiler-storm. This would mirror the project names
> for the Spark port (metron-profiler-spark) and the REPL port
> (metron-profiler-repl).
> The package name for the Storm port of the Profiler should be changed to
> org.apache.metron.profiler.storm. This would mimic the package name used for
> Spark; org.apache.metron.profiler.spark.
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)