This is an automated email from the ASF dual-hosted git repository. riemer pushed a commit to branch add-manual-data-lake-dimensions-selection in repository https://gitbox.apache.org/repos/asf/streampipes.git
commit 1e2dc8c6f5c5692eeea2413f7e772c34e9c0cfca Author: Dominik Riemer <[email protected]> AuthorDate: Mon Sep 16 13:29:47 2024 +0200 Update documentation --- .../documentation.md | 13 +++++++++++++ .../strings.en | 2 +- 2 files changed, 14 insertions(+), 1 deletion(-) diff --git a/streampipes-extensions/streampipes-sinks-internal-jvm/src/main/resources/org.apache.streampipes.sinks.internal.jvm.datalake/documentation.md b/streampipes-extensions/streampipes-sinks-internal-jvm/src/main/resources/org.apache.streampipes.sinks.internal.jvm.datalake/documentation.md index d87bc9a6e0..ece210c6de 100644 --- a/streampipes-extensions/streampipes-sinks-internal-jvm/src/main/resources/org.apache.streampipes.sinks.internal.jvm.datalake/documentation.md +++ b/streampipes-extensions/streampipes-sinks-internal-jvm/src/main/resources/org.apache.streampipes.sinks.internal.jvm.datalake/documentation.md @@ -70,3 +70,16 @@ The Schema Update Options dictate the behavior when encountering a measurement ( - **Description:** Keeps old event fields in the event schema. - **Strategy:** This follows an append-only strategy, allowing continued work with historic data. - **Consideration:** Old properties may exist for which no new data is generated. + + +### Dimensions + +Select fields which will be marked as dimensions. Dimensions reflect tags in the underlying time-series database. +Dimensions support grouping operations and can be used for fields with a limited set of values, e.g., boolean flags or +fields representing IDs. Dimensions are not a good choice for fields with a high number of different values since they +slow down database queries. + +By default, all fields which are marked as dimensions in the metadata are chosen and can be manually overridden +with this configuration. + +Data types which can be marked as dimensional values are booleans, integer, and strings. diff --git a/streampipes-extensions/streampipes-sinks-internal-jvm/src/main/resources/org.apache.streampipes.sinks.internal.jvm.datalake/strings.en b/streampipes-extensions/streampipes-sinks-internal-jvm/src/main/resources/org.apache.streampipes.sinks.internal.jvm.datalake/strings.en index b6ea6ef6aa..4c23b83d27 100644 --- a/streampipes-extensions/streampipes-sinks-internal-jvm/src/main/resources/org.apache.streampipes.sinks.internal.jvm.datalake/strings.en +++ b/streampipes-extensions/streampipes-sinks-internal-jvm/src/main/resources/org.apache.streampipes.sinks.internal.jvm.datalake/strings.en @@ -27,5 +27,5 @@ timestamp_mapping.description=The value which contains a timestamp schema_update.title=Schema Update schema_update.description=Update existing schemas with the new one or extend the existing schema with new properties -dimensions_selection.key=Dimensions +dimensions_selection.title=Dimensions dimensions_selection.description=Selected fields will be stored as dimensions.
