LakshSingla opened a new issue, #15326:
URL: https://github.com/apache/druid/issues/15326

   Apache Druid 28.0.0 contains over 420 new features, bug fixes, performance 
enhancements, documentation improvements, and additional test coverage from 57 
contributors.
   
   See the [complete set of 
changes](https://github.com/apache/druid/issues?q=is%3Aclosed+milestone%3A28.0+sort%3Aupdated-desc+)
 for additional details, including bug fixes.
   
   Review the [upgrade notes](#upgrade-notes) and [incompatible 
changes](#incompatible-changes) before you upgrade to Druid 28.0.0.
   
   # <a name="28.0.0-important-changes-and-deprecations" 
href="#28.0.0-important-changes-and-deprecations">#</a> Important changes and 
deprecations
   
   In Druid 28.0.0, we have made substantial improvements to querying to make 
the system more ANSI SQL compatible. This includes changes in handling NULL and 
boolean values as well as boolean logic. At the same time, the Apache Calcite 
library has been upgraded to the latest version. While we have documented known 
query behavior changes, please read the [upgrade notes](#upgrade-notes) section 
carefully. Test your applications before rolling out to production, and closely monitor query behavior during the rollout.
   
   ## <a name="28.0.0-important-changes-and-deprecations-sql-compatibility" 
href="#28.0.0-important-changes-and-deprecations-sql-compatibility">#</a> SQL 
compatibility
   
   Druid continues to make SQL query execution more consistent with how 
standard SQL behaves. However, there are feature flags available to restore the 
old behavior if needed.
   
   ### <a 
name="28.0.0-important-changes-and-deprecations-sql-compatibility-three-valued-logic"
 
href="#28.0.0-important-changes-and-deprecations-sql-compatibility-three-valued-logic">#</a>
 Three-valued logic
   
Druid native filters now observe SQL [three-valued logic](https://en.wikipedia.org/wiki/Three-valued_logic#SQL) (`true`, `false`, or `unknown`) instead of Druid's classic two-state logic when the following default settings apply:
   
   * `druid.generic.useThreeValueLogicForNativeFilters = true`
   * `druid.expressions.useStrictBooleans = true`
   * `druid.generic.useDefaultValueForNull = false`
   
   [#15058](https://github.com/apache/druid/pull/15058)
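   As an illustration of the change, under three-valued logic a negative filter no longer matches NULL values. The table and column names below are hypothetical:

   ```sql
   -- Hypothetical table "events" with a nullable "country" column.
   -- Under the old two-valued logic, rows with a NULL country matched this filter.
   -- Under three-valued logic, NULL <> 'US' evaluates to unknown, so they do not.
   SELECT COUNT(*) AS non_us_rows
   FROM events
   WHERE country <> 'US'
   ```

   To keep matching NULL rows, add an explicit `OR country IS NULL` to the filter.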
   
   ### <a 
name="28.0.0-important-changes-and-deprecations-sql-compatibility-strict-booleans"
 
href="#28.0.0-important-changes-and-deprecations-sql-compatibility-strict-booleans">#</a>
 Strict booleans
   
   `druid.expressions.useStrictBooleans` is now enabled by default.
   Druid now handles booleans strictly using `1` (true) or `0` (false).
   Previously, true and false could be represented either as `true` and `false` 
or as `1` and `0`, respectively.
   In addition, Druid now returns a null value for boolean comparisons like 
`true && null`.
   
   If you don't explicitly configure this property in `runtime.properties`, 
clusters now use LONG types for any ingested boolean values and in the output 
of boolean functions for transformations and query time operations.
   For more information, see [SQL compatibility in the upgrade 
notes](#sql-compatibility-1).
   
   [#14734](https://github.com/apache/druid/pull/14734)
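   For example, with strict booleans enabled, boolean-producing expressions evaluate to LONG values, and comparisons involving NULL propagate NULL. A minimal sketch:

   ```sql
   SELECT
     (1 = 1) AS is_true,                   -- 1 under strict booleans
     (1 = 2) AS is_false,                  -- 0 under strict booleans
     ((1 = 1) AND NULL) AS unknown_result  -- NULL rather than false
   ```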
   
   ### <a 
name="28.0.0-important-changes-and-deprecations-sql-compatibility-null-handling"
 
href="#28.0.0-important-changes-and-deprecations-sql-compatibility-null-handling">#</a>
 NULL handling
   
   `druid.generic.useDefaultValueForNull` is now disabled by default.
   Druid now differentiates between empty records and null records.
   Previously, Druid treated null values and empty records interchangeably, storing nulls as default (empty) values.
   For more information, see [SQL compatibility in the upgrade 
notes](#sql-compatibility-1).
   
   [#14792](https://github.com/apache/druid/pull/14792)
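   For example, with `useDefaultValueForNull` disabled, an empty string and NULL are distinct values. The table and column names below are hypothetical:

   ```sql
   -- Matches only rows whose "comment" column is truly NULL;
   -- rows containing an empty string '' no longer match.
   SELECT COUNT(*) AS null_comments
   FROM events
   WHERE "comment" IS NULL
   ```

   To match both cases, filter on `"comment" IS NULL OR "comment" = ''`.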
   
   ## <a 
name="28.0.0-important-changes-and-deprecations-sql-planner-improvements" 
href="#28.0.0-important-changes-and-deprecations-sql-planner-improvements">#</a>
 SQL planner improvements
   
   Druid uses Apache Calcite for SQL planning and optimization. Starting in 
Druid 28.0.0, the Calcite version has been upgraded from 1.21 to 1.35. This 
upgrade brings in many bug fixes in SQL planning from Calcite.
   
   ### <a 
name="28.0.0-important-changes-and-deprecations-sql-planner-improvements-dynamic-parameters"
 
href="#28.0.0-important-changes-and-deprecations-sql-planner-improvements-dynamic-parameters">#</a>
 Dynamic parameters
   
   As part of the Calcite upgrade, the behavior of type inference for dynamic 
parameters has changed. To avoid type inference issues, explicitly 
`CAST` all dynamic parameters as a specific data type in SQL queries. For 
example, use:
   
   ```sql
   SELECT (1 * CAST(? AS DOUBLE)) / 2 AS tmp
   ```
   
   Do not use:
   
   ```sql
   SELECT (1 * ?) / 2 AS tmp
   ```
   
   ## <a 
name="28.0.0-important-changes-and-deprecations-async-query-and-query-from-deep-storage"
 
href="#28.0.0-important-changes-and-deprecations-async-query-and-query-from-deep-storage">#</a>
 Async query and query from deep storage
   
   [Query from deep 
storage](https://druid.apache.org/docs/latest/querying/query-deep-storage/) is 
no longer an experimental feature. When you query from deep storage, more data 
is available for queries without having to scale your Historical services to 
accommodate more data. To benefit from the space saving that query from deep 
storage offers, configure your load rules to unload data from your Historical 
services.
   
   ### <a 
name="28.0.0-important-changes-and-deprecations-async-query-and-query-from-deep-storage-support-for-multiple-result-formats"
 
href="#28.0.0-important-changes-and-deprecations-async-query-and-query-from-deep-storage-support-for-multiple-result-formats">#</a>
 Support for multiple result formats
   
   Query from deep storage now supports multiple result formats.
   Previously, the `/druid/v2/sql/statements/` endpoint only supported results 
in the `object` format. Now, results can be written in any format specified in 
the `resultFormat` parameter.
   For more information on result parameters supported by the Druid SQL API, 
see 
[Responses](https://druid.apache.org/docs/latest/api-reference/sql-api#responses).
   
   [#14571](https://github.com/apache/druid/pull/14571)
   
   ### <a 
name="28.0.0-important-changes-and-deprecations-async-query-and-query-from-deep-storage-broadened-access-for-queries-from-deep-storage"
 
href="#28.0.0-important-changes-and-deprecations-async-query-and-query-from-deep-storage-broadened-access-for-queries-from-deep-storage">#</a>
 Broadened access for queries from deep storage
   
   Users with the `STATE` permission can interact with status APIs for queries 
from deep storage. Previously, only the user who submitted the query could use 
those APIs. This enables the web console to monitor the running status of the 
queries. Users with the `STATE` permission can access the query results.
   
   [#14944](https://github.com/apache/druid/pull/14944)
   
   ## <a 
name="28.0.0-important-changes-and-deprecations-msq-queries-for-realtime-tasks" 
href="#28.0.0-important-changes-and-deprecations-msq-queries-for-realtime-tasks">#</a>
 MSQ queries for realtime tasks
   
   The MSQ task engine can now include realtime segments in query results. To 
do this, set the `includeSegmentSource` context parameter to `REALTIME`.
   
   [#15024](https://github.com/apache/druid/pull/15024)
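   For example, the body of an MSQ query that reads realtime segments might look like the following sketch (the datasource name is hypothetical):

   ```json
   {
     "query": "SELECT COUNT(*) FROM my_streaming_datasource",
     "context": {
       "includeSegmentSource": "REALTIME"
     }
   }
   ```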
   
   ## <a 
name="28.0.0-important-changes-and-deprecations-msq-support-for-union-all-queries"
 
href="#28.0.0-important-changes-and-deprecations-msq-support-for-union-all-queries">#</a>
 MSQ support for UNION ALL queries
   
   You can now use the MSQ task engine to run UNION ALL queries with 
`UnionDataSource`.
   
   [#14981](https://github.com/apache/druid/pull/14981)
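   For example, a UNION ALL query of the following shape (datasource names hypothetical) can now run on the MSQ task engine:

   ```sql
   SELECT channel, page FROM wikipedia_2022
   UNION ALL
   SELECT channel, page FROM wikipedia_2023
   ```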
   
   ## <a 
name="28.0.0-important-changes-and-deprecations-ingest-from-multiple-kafka-topics-to-a-single-datasource"
 
href="#28.0.0-important-changes-and-deprecations-ingest-from-multiple-kafka-topics-to-a-single-datasource">#</a>
 Ingest from multiple Kafka topics to a single datasource
   
   You can now ingest streaming data from multiple Kafka topics to a datasource 
using a single supervisor.
   You configure the topics for the supervisor spec using a regex pattern as 
the value for `topicPattern` in the IO config. If you add new topics to Kafka 
that match the regex, Druid automatically starts ingesting from those new 
topics.
   
   If you enable multi-topic ingestion for a datasource, downgrading will cause 
the Supervisor to fail.
   For more information, see [Stop supervisors that ingest from multiple Kafka 
topics before 
downgrading](#stop-supervisors-that-ingest-from-multiple-kafka-topics-before-downgrading).
   
   [#14424](https://github.com/apache/druid/pull/14424)
   [#14865](https://github.com/apache/druid/pull/14865)
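   For example, a supervisor IO config might specify a regex like the following sketch. The topic pattern and broker address are hypothetical:

   ```json
   "ioConfig": {
     "type": "kafka",
     "topicPattern": "metrics-.*",
     "consumerProperties": {
       "bootstrap.servers": "kafka01.example.com:9092"
     }
   }
   ```

   Any topic created later that matches `metrics-.*` is picked up automatically.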
   
   ## <a 
name="28.0.0-important-changes-and-deprecations-sql-unnest-and-ingestion-flattening"
 
href="#28.0.0-important-changes-and-deprecations-sql-unnest-and-ingestion-flattening">#</a>
 SQL UNNEST and ingestion flattening
   
   The UNNEST function is no longer experimental. For more information, see 
[UNNEST](https://druid.apache.org/docs/latest/querying/sql/#unnest) and [Unnest 
arrays within a 
column](https://druid.apache.org/docs/latest/tutorials/tutorial-unnest-arrays/).
   
   You no longer need to include the context parameter `enableUnnest: true` to 
use UNNEST.
   
   [#14886](https://github.com/apache/druid/pull/14886)
   
   ### <a 
name="28.0.0-important-changes-and-deprecations-sql-unnest-and-ingestion-flattening-recommended-syntax-for-sql-unnest"
 
href="#28.0.0-important-changes-and-deprecations-sql-unnest-and-ingestion-flattening-recommended-syntax-for-sql-unnest">#</a>
 Recommended syntax for SQL UNNEST
   
   The recommended syntax for SQL UNNEST has changed. We recommend using CROSS 
JOIN instead of commas for most queries to prevent issues with precedence. For 
example, use:
   
   ```sql
   SELECT column_alias_name1 FROM datasource CROSS JOIN 
UNNEST(source_expression1) AS table_alias_name1(column_alias_name1) CROSS JOIN 
UNNEST(source_expression2) AS table_alias_name2(column_alias_name2), ...
   ```
   
   Do not use:
   
   ```sql
   SELECT column_alias_name FROM datasource, UNNEST(source_expression1) AS 
table_alias_name1(column_alias_name1), UNNEST(source_expression2) AS 
table_alias_name2(column_alias_name2), ...
   ```
   
   ## <a 
name="28.0.0-important-changes-and-deprecations-window-functions-%28experimental%29"
 
href="#28.0.0-important-changes-and-deprecations-window-functions-%28experimental%29">#</a>
 Window functions (experimental)
   
   You can use [window 
functions](https://druid.apache.org/docs/latest/querying/sql-window-functions) 
in Apache Druid to produce values based upon the relationship of one row within 
a window of rows to the other rows within the same window. A window is a group 
of related rows within a result set. For example, rows with the same value for 
a specific dimension.
   
   Enable window functions in your query with the `enableWindowing: true` 
context parameter.
   
   [#15184](https://github.com/apache/druid/pull/15184)
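   For example, a window function query of the following shape computes a running total within each group. The datasource and column names are hypothetical, and the query must be submitted with `enableWindowing: true` in its context:

   ```sql
   SELECT
     cityName,
     added,
     -- Running total of "added" per city, ordered by time within the window
     SUM(added) OVER (PARTITION BY cityName ORDER BY __time) AS running_total
   FROM wikipedia
   ```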
   
   ## <a 
name="28.0.0-important-changes-and-deprecations-concurrent-append-and-replace-%28experimental%29"
 
href="#28.0.0-important-changes-and-deprecations-concurrent-append-and-replace-%28experimental%29">#</a>
 Concurrent append and replace (experimental)
   
   Druid 28.0.0 adds experimental support for concurrent append and replace.
   This feature allows you to safely replace the existing data in an interval 
of a datasource while new data is being appended to that interval. One of the 
most common applications of this is appending new data to an interval while 
compaction of that interval is already in progress.
   For more information, see [Concurrent append and 
replace](https://druid.apache.org/docs/latest/data-management/automatic-compaction#concurrent-append-and-replace).
   
   Segment locking will be deprecated and removed in favor of concurrent append 
and replace, which is much simpler in design. With concurrent append and replace, 
active realtime ingestion no longer locks compaction jobs out.
   
   ### <a 
name="28.0.0-important-changes-and-deprecations-concurrent-append-and-replace-%28experimental%29-task-locks-for-append-and-replace-batch-ingestion-jobs"
 
href="#28.0.0-important-changes-and-deprecations-concurrent-append-and-replace-%28experimental%29-task-locks-for-append-and-replace-batch-ingestion-jobs">#</a>
 Task locks for append and replace batch ingestion jobs
   
   Append batch ingestion jobs can now share locks. This allows you to run 
multiple append batch ingestion jobs against the same time interval. Replace 
batch ingestion jobs still require an exclusive lock. This means you can run 
multiple append batch ingestion jobs and one replace batch ingestion job for a 
given interval.
   
   [#14407](https://github.com/apache/druid/pull/14407)
   
   ### <a 
name="28.0.0-important-changes-and-deprecations-concurrent-append-and-replace-%28experimental%29-streaming-ingestion-with-concurrent-replace"
 
href="#28.0.0-important-changes-and-deprecations-concurrent-append-and-replace-%28experimental%29-streaming-ingestion-with-concurrent-replace">#</a>
 Streaming ingestion with concurrent replace
   
   Streaming jobs reading from Kafka and Kinesis with `APPEND` locks can now 
ingest concurrently with compaction running with `REPLACE` locks. The segment 
granularity of the streaming job must be equal to or finer than that of the 
concurrent replace job.
   
   [#15039](https://github.com/apache/druid/pull/15039)
   
   # <a name="28.0.0-functional-area-and-related-changes" 
href="#28.0.0-functional-area-and-related-changes">#</a> Functional area and 
related changes
   
   ## <a name="28.0.0-functional-area-and-related-changes-web-console" 
href="#28.0.0-functional-area-and-related-changes-web-console">#</a> Web console
   
   ### <a 
name="28.0.0-functional-area-and-related-changes-web-console-added-ui-support-for-segment-loading-query-context-parameter"
 
href="#28.0.0-functional-area-and-related-changes-web-console-added-ui-support-for-segment-loading-query-context-parameter">#</a>
 Added UI support for segment loading query context parameter
   
   The web console supports the `waitUntilSegmentsLoad` query context parameter.
   
   <img width="1505" alt="UI for waitUntilSegmentsLoad context parameter" src="https://github.com/apache/druid/assets/30999375/43fb8bb7-651b-4bfb-aac8-3f4e6388fc86">
   
   
   [#15110](https://github.com/apache/druid/pull/15110)
   
   ### <a 
name="28.0.0-functional-area-and-related-changes-web-console-added-concurrent-append-and-replace-switches"
 
href="#28.0.0-functional-area-and-related-changes-web-console-added-concurrent-append-and-replace-switches">#</a>
 Added concurrent append and replace switches
   
   The web console includes concurrent append and replace switches.
   
   The following screenshot shows the concurrent append and replace switches in 
the classic batch ingestion wizard:
   <img width="1504" alt="Classic batch ingestion wizard" src="https://github.com/apache/druid/assets/30999375/c07de29b-37fc-4673-925c-7b4904a42885">
   
   The following screenshot shows the concurrent append and replace switches in 
the compaction configuration UI:
   <img width="1504" alt="Compaction configuration UI" src="https://github.com/apache/druid/assets/30999375/8d0da143-ce11-42b5-bdbf-2d46816d623b">
   
   
   [#15114](https://github.com/apache/druid/pull/15114)
   
   ### <a 
name="28.0.0-functional-area-and-related-changes-web-console-added-ui-support-for-ingesting-from-multiple-kafka-topics-to-a-single-datasource"
 
href="#28.0.0-functional-area-and-related-changes-web-console-added-ui-support-for-ingesting-from-multiple-kafka-topics-to-a-single-datasource">#</a>
 Added UI support for ingesting from multiple Kafka topics to a single 
datasource
   
   The web console supports ingesting streaming data from multiple Kafka topics 
to a datasource using a single supervisor.
   
   <img width="307" alt="UI for Kafka multi-topic ingestion" src="https://github.com/apache/druid/assets/30999375/26f40b39-942c-46ab-b5b7-6f58b439439a">
   
   
   [#14833](https://github.com/apache/druid/pull/14833)
   
   ### <a 
name="28.0.0-functional-area-and-related-changes-web-console-other-web-console-improvements"
 
href="#28.0.0-functional-area-and-related-changes-web-console-other-web-console-improvements">#</a>
 Other web console improvements
   
   * You can now copy query results from the web console directly to the 
clipboard [#14889](https://github.com/apache/druid/pull/14889)
   * The web console now shows the execution dialog for `query_controller` 
tasks in the task view instead of the generic raw task details dialog. You can 
still access the raw task details from the ellipsis (...) menu 
[#14930](https://github.com/apache/druid/pull/14930)
   * You can now select a horizontal range in the web console time chart to 
modify the current WHERE clause 
[#14929](https://github.com/apache/druid/pull/14929)
   * You can now set dynamic query parameters in the web console 
[#14921](https://github.com/apache/druid/pull/14921)
   * You can now edit the Coordinator dynamic configuration in the web console 
[#14791](https://github.com/apache/druid/pull/14791)
   * You can now prettify SQL queries and use flatten with a Kafka input format 
[#14906](https://github.com/apache/druid/pull/14906)
   * A warning now appears when a CSV or TSV sample contains newlines that 
Druid does not accept [#14783](https://github.com/apache/druid/pull/14783)
   * You can now select a format when downloading data 
[#14794](https://github.com/apache/druid/pull/14794)
   * Improved the clarity of cluster default rules in the retention dialog 
[#14793](https://github.com/apache/druid/pull/14793)
   * The web console now detects inline queries in the query text and lets you 
run them individually [#14801](https://github.com/apache/druid/pull/14801)
   * You can now reset specific partition offsets for a supervisor 
[#14863](https://github.com/apache/druid/pull/14863)
   
   ## <a name="28.0.0-functional-area-and-related-changes-cluster-stability" 
href="#28.0.0-functional-area-and-related-changes-cluster-stability">#</a> 
Cluster stability
   
   ### <a 
name="28.0.0-functional-area-and-related-changes-cluster-stability-unused-segments"
 
href="#28.0.0-functional-area-and-related-changes-cluster-stability-unused-segments">#</a>
 Unused segments
   
   Druid now stops loading and moving segments as soon as they are marked as 
unused. This prevents Historical processes from spending time on superfluous 
loads of segments that will be unloaded later. You can mark segments as unused 
by a drop rule, overshadowing, or by calling [the Data management 
API](https://druid.apache.org/docs/latest/api-reference/data-management-api).
   
   [#14644](https://github.com/apache/druid/pull/14644)
   
   ## <a name="28.0.0-functional-area-and-related-changes-ingestion" 
href="#28.0.0-functional-area-and-related-changes-ingestion">#</a> Ingestion
   
   ### <a 
name="28.0.0-functional-area-and-related-changes-ingestion-json-and-auto-column-indexer"
 
href="#28.0.0-functional-area-and-related-changes-ingestion-json-and-auto-column-indexer">#</a>
 JSON and auto column indexer
   
   The `json` column type is now equivalent to using `auto` in JSON-based batch 
ingestion dimension specs. Upgrade your ingestion specs to `json` to take 
advantage of the features and functionality of `auto`, including the following:
   
   - Type specializations including ARRAY typed columns
   - Better support for nested arrays of strings, longs, and doubles
   - Smarter index utilization
   
   `json` type columns created with Druid 28.0.0 are not backwards compatible 
with Druid versions older than 26.0.0.
   If you upgrade from one of these versions, you can continue to write nested 
columns in a backwards compatible format (version 4).
   
   For more information, see [Nested column format in the upgrade 
notes](#nested-column-format).
   
   [#14955](https://github.com/apache/druid/pull/14955)
   [#14456](https://github.com/apache/druid/pull/14456)
   
   ### <a 
name="28.0.0-functional-area-and-related-changes-ingestion-ingestion-status" 
href="#28.0.0-functional-area-and-related-changes-ingestion-ingestion-status">#</a>
 Ingestion status
   
   Ingestion reports now include a `segmentLoadStatus` object that provides 
information related to the ingestion, such as duration and total segments.
   
   [#14322](https://github.com/apache/druid/pull/14322)
   
   ### <a 
name="28.0.0-functional-area-and-related-changes-ingestion-sql-based-ingestion" 
href="#28.0.0-functional-area-and-related-changes-ingestion-sql-based-ingestion">#</a>
 SQL-based ingestion
   
   #### <a 
name="28.0.0-functional-area-and-related-changes-ingestion-sql-based-ingestion-ability-to-ingest-array-types"
 
href="#28.0.0-functional-area-and-related-changes-ingestion-sql-based-ingestion-ability-to-ingest-array-types">#</a>
 Ability to ingest ARRAY types
   
   SQL-based ingestion now supports storing ARRAY typed values in [ARRAY typed 
columns](https://druid.apache.org/docs/latest/querying/arrays) as well as 
storing both VARCHAR and numeric typed arrays.
   Previously, the MSQ task engine stored ARRAY typed values as [multi-value 
dimensions](https://druid.apache.org/docs/latest/querying/multi-value-dimensions)
 instead of ARRAY typed columns.
   
   The MSQ task engine now includes the `arrayIngestMode` query context 
parameter, which controls how
   `ARRAY` types are stored in Druid segments.
   Set the `arrayIngestMode` query context parameter to `array` to ingest ARRAY 
types.
   
   In Druid 28.0.0, the default mode for `arrayIngestMode` is `mvd` for 
backwards compatibility, which only supports VARCHAR typed arrays and stores 
them as multi-value dimensions. This default is subject to change in future 
releases.
   
   For information on how to migrate to the new behavior, see the [Ingestion 
options for ARRAY typed columns in the upgrade 
notes](#ingestion-options-for-array-typed-columns).
   For information on inserting, filtering, and grouping behavior for `ARRAY` 
typed columns, see [Array 
columns](https://druid.apache.org/docs/latest/querying/arrays).
   
   [#15093](https://github.com/apache/druid/pull/15093)
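   For example, an SQL-based ingestion query of the following shape (names hypothetical) stores `labels` as an ARRAY typed column when the query context sets `arrayIngestMode` to `array`:

   ```sql
   -- Requires "arrayIngestMode": "array" in the query context.
   INSERT INTO example_datasource
   SELECT
     __time,
     ARRAY[label_a, label_b] AS labels  -- stored as an ARRAY column, not a multi-value dimension
   FROM source_datasource
   PARTITIONED BY DAY
   ```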
   
   #### <a 
name="28.0.0-functional-area-and-related-changes-ingestion-sql-based-ingestion-numeric-array-type-support"
 
href="#28.0.0-functional-area-and-related-changes-ingestion-sql-based-ingestion-numeric-array-type-support">#</a>
 Numeric array type support
   
   Row-based frames and, by extension, the MSQ task engine now support numeric 
array types. This means that all queries consuming or producing arrays work 
with the MSQ task engine. Numeric arrays can also be ingested using SQL-based 
ingestion with MSQ. For example, a query like `SELECT ARRAY[1, 2]` now succeeds, 
producing a numeric array, instead of failing with an unsupported column type 
exception.
   
   [#14900](https://github.com/apache/druid/pull/14900)
   
   #### <a 
name="28.0.0-functional-area-and-related-changes-ingestion-sql-based-ingestion-azure-blob-storage-support"
 
href="#28.0.0-functional-area-and-related-changes-ingestion-sql-based-ingestion-azure-blob-storage-support">#</a>
 Azure Blob Storage support
   
   Added support for Microsoft Azure Blob Storage.
   You can now use fault tolerance and durable storage with Microsoft Azure 
Blob Storage.
   For more information, see [Durable 
storage](https://druid.apache.org/docs/latest/multi-stage-query/reference#durable-storage).
   
   [#14660](https://github.com/apache/druid/pull/14660)
   
   #### <a 
name="28.0.0-functional-area-and-related-changes-ingestion-sql-based-ingestion-other-sql-based-ingestion-improvements"
 
href="#28.0.0-functional-area-and-related-changes-ingestion-sql-based-ingestion-other-sql-based-ingestion-improvements">#</a>
 Other SQL-based ingestion improvements
   
   * Added a new `rowsPerPage` context parameter for the MSQ task engine.
   Use `rowsPerPage` to limit the number of rows per page. For more information 
on context parameters for the MSQ task engine, see [Context 
parameters](https://druid.apache.org/docs/latest/multi-stage-query/reference#context-parameters)
 [#14994](https://github.com/apache/druid/pull/14994)
   * Druid now ignores `ServiceClosedException` on `postCounters` while the 
controller is offline [#14707](https://github.com/apache/druid/pull/14707)
   * Improved error messages related to OVERWRITE keyword 
[#14870](https://github.com/apache/druid/pull/14870)
   
   ### <a 
name="28.0.0-functional-area-and-related-changes-ingestion-streaming-ingestion" 
href="#28.0.0-functional-area-and-related-changes-ingestion-streaming-ingestion">#</a>
 Streaming ingestion
   
   #### <a 
name="28.0.0-functional-area-and-related-changes-ingestion-streaming-ingestion-ability-to-reset-offsets-for-a-supervisor"
 
href="#28.0.0-functional-area-and-related-changes-ingestion-streaming-ingestion-ability-to-reset-offsets-for-a-supervisor">#</a>
 Ability to reset offsets for a supervisor
   
   Added a new API endpoint 
`/druid/indexer/v1/supervisor/:supervisorId/resetOffsets` to reset specific 
partition offsets for a supervisor without resetting the entire set.
   This endpoint clears only the specified offsets in Kafka or sequence numbers 
in Kinesis, prompting the supervisor to resume data reading.
   
   [#14772](https://github.com/apache/druid/pull/14772)
   
   #### <a 
name="28.0.0-functional-area-and-related-changes-ingestion-streaming-ingestion-other-streaming-ingestion-improvements"
 
href="#28.0.0-functional-area-and-related-changes-ingestion-streaming-ingestion-other-streaming-ingestion-improvements">#</a>
 Other streaming ingestion improvements
   
   * Added `PropertyNamingStrategies` from Jackson to fix Hadoop ingestion and 
make it compatible with newer Jackson 
[#14671](https://github.com/apache/druid/pull/14671)
   * Added pod name to the `TaskLocation` object for Kubernetes task scheduling 
to make debugging easier [#14758](https://github.com/apache/druid/pull/14758)
   * Added lifecycle hooks to `KubernetesTaskRunner` 
[#14790](https://github.com/apache/druid/pull/14790)
   * Added new method for `SqlStatementResource` and `SqlTaskResource` to set 
request attribute [#14878](https://github.com/apache/druid/pull/14878)
   * Added a sampling factor for `DeterminePartitionsJob` 
[#13840](https://github.com/apache/druid/pull/13840)
   * Added `usedClusterCapacity` to the `GET` `/totalWorkerCapacity` response. 
Use this API to get the total ingestion capacity on the overlord 
[#14888](https://github.com/apache/druid/pull/14888)
   * Improved Kubernetes task runner performance 
[#14649](https://github.com/apache/druid/pull/14649)
   * Improved handling of long data source names. Previously, the Kubernetes 
task runner would throw an error if the name of a data source was too long 
[#14620](https://github.com/apache/druid/pull/14620)
   * Improved the streaming ingestion completion timeout error message 
[#14636](https://github.com/apache/druid/pull/14636)
   * Druid now retries fetching S3 task logs on transient S3 errors 
[#14714](https://github.com/apache/druid/pull/14714)
   * Druid now reports `task/pending/time` metrics for Kubernetes-based 
ingestion [#14698](https://github.com/apache/druid/pull/14698)
   * Druid now reports `k8s/peon/startup/time` metrics for Kubernetes-based 
ingestion [#14771](https://github.com/apache/druid/pull/14771)
   * `handoffConditionTimeout` now defaults to 15 minutes. The default change 
won't affect existing supervisors 
[#14539](https://github.com/apache/druid/pull/14539)
   * Fixed an NPE with checkpoint parsing for streaming ingestion 
[#14353](https://github.com/apache/druid/pull/14353)
   * Fixed an issue with Hadoop ingestion writing arrays as `objects.toString` 
as a result of transform expressions 
[#15127](https://github.com/apache/druid/pull/15127)
   * The `PodTemplateTaskAdapter` now accounts for queryable tasks 
[#14789](https://github.com/apache/druid/pull/14789)
   * The rolling supervisor now restarts at `taskDuration` 
[#14396](https://github.com/apache/druid/pull/14396)
   * S3 `deleteObjects` requests are now retried if the failure state allows 
retry [#14776](https://github.com/apache/druid/pull/14776)
   * You can now ingest the name of a Kafka topic to a datasource 
[#14857](https://github.com/apache/druid/pull/14857)
   
   ## <a name="28.0.0-functional-area-and-related-changes-querying" 
href="#28.0.0-functional-area-and-related-changes-querying">#</a> Querying
   
   ### <a 
name="28.0.0-functional-area-and-related-changes-querying-improved-lookup-function"
 
href="#28.0.0-functional-area-and-related-changes-querying-improved-lookup-function">#</a>
 Improved LOOKUP function
   
   The LOOKUP function now accepts an optional constant string as a third 
argument. This string replaces missing values in results. For example, the query 
`LOOKUP(store, 'store_to_country', 'NA')` returns `NA` if the 
`store_to_country` value is missing for a given `store`.
   
   [#14956](https://github.com/apache/druid/pull/14956)
   
   ### <a 
name="28.0.0-functional-area-and-related-changes-querying-avg-function" 
href="#28.0.0-functional-area-and-related-changes-querying-avg-function">#</a> 
AVG function
   
   The AVG aggregation function now returns a `double` instead of a `long`.
   
   [#15089](https://github.com/apache/druid/pull/15089)
   
   ### <a 
name="28.0.0-functional-area-and-related-changes-querying-improvements-to-earliest-and-latest-operators"
 
href="#28.0.0-functional-area-and-related-changes-querying-improvements-to-earliest-and-latest-operators">#</a>
 Improvements to EARLIEST and LATEST operators
   
   Improved EARLIEST and LATEST operators as follows:
   
   * EARLIEST and LATEST operators now rewrite to EARLIEST_BY and LATEST_BY 
during query processing to make the `__time` column reference explicit to 
Calcite. [#15095](https://github.com/apache/druid/pull/15095)
   * You can now use EARLIEST/EARLIEST_BY and LATEST/LATEST_BY for STRING 
columns without specifying the `maxBytesPerValue` parameter.
   If you omit the `maxBytesPerValue` parameter, the aggregations default to 
1024 bytes for the buffer. [#14848](https://github.com/apache/druid/pull/14848)
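   For example, the following two forms are equivalent; the first is rewritten to the second during planning (the datasource name is hypothetical):

   ```sql
   -- Shorthand form:
   SELECT LATEST(channel) FROM wikipedia
   -- Explicit form produced by the rewrite:
   -- SELECT LATEST_BY(channel, __time) FROM wikipedia
   ```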
   
   ### <a 
name="28.0.0-functional-area-and-related-changes-querying-functions-for-evaluating-distinctness"
 
href="#28.0.0-functional-area-and-related-changes-querying-functions-for-evaluating-distinctness">#</a>
 Functions for evaluating distinctness
   
   New SQL and native query functions allow you to evaluate whether two 
expressions are distinct or not distinct.
   Expressions are distinct if they have different values or if one of them is 
NULL.
   Expressions are not distinct if their values are the same or if both of them 
are NULL.
   
   Because these functions treat NULLs as known values when making comparisons, 
they always return true or false even if one or both expressions are NULL.
   
   The following table shows the difference in behavior between the equals sign 
(=) and IS [NOT] DISTINCT FROM:
   
   | A | B | A=B | A IS NOT DISTINCT FROM B|
   |---|---|-----|-------------------------|
   | 0 | 0 | true | true |
   | 0 | 1 | false | false |
   | 0 | null | unknown | false |
   | null | null | unknown | true |
   
   [#14976](https://github.com/apache/druid/pull/14976)
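   For example, IS NOT DISTINCT FROM yields a definite answer where `=` yields unknown:

   ```sql
   SELECT
     NULL = NULL AS equals_result,                      -- unknown (NULL)
     NULL IS NOT DISTINCT FROM NULL AS distinct_result  -- true: both sides are NULL
   ```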
   
   ### <a 
name="28.0.0-functional-area-and-related-changes-querying-functions-for-evaluating-equalities"
 
href="#28.0.0-functional-area-and-related-changes-querying-functions-for-evaluating-equalities">#</a>
 Functions for evaluating equalities
   
   New SQL and native query functions allow you to evaluate whether a condition 
is true or false. These functions are different from `x == true` and `x != 
true` in that they never return null even when the variable is null.
   
   | SQL function | Native function |
   |--------------|-----------------|
   | `IS_TRUE`    | `istrue()`|
   | `IS_FALSE`   | `isfalse()`|
   | `IS_NOT_TRUE`| `nottrue()`|
   | `IS_NOT_FALSE`|`notfalse()`|
   
   [#14977](https://github.com/apache/druid/pull/14977)
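   As a hedged sketch, the four functions can be modeled in Python with `None` standing in for NULL:

```python
def is_true(x):
    return x is True         # NULL and false both yield false

def is_false(x):
    return x is False        # NULL and true both yield false

def is_not_true(x):
    return x is not True     # true for false and for NULL

def is_not_false(x):
    return x is not False    # true for true and for NULL
```

   Note that `is_not_true(None)` returns true, whereas the expression `x != true` would evaluate to null for a null input.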
   
   ### <a 
name="28.0.0-functional-area-and-related-changes-querying-function-to-decode-base64-encoded-strings"
 
href="#28.0.0-functional-area-and-related-changes-querying-function-to-decode-base64-encoded-strings">#</a>
 Function to decode Base64-encoded strings
   
   The new SQL and native query function `decode_base64_utf8` decodes a Base64-encoded string and returns it as a UTF-8 string. For example, `decode_base64_utf8('aGVsbG8=')` returns `hello`.
   
   [#14943](https://github.com/apache/druid/pull/14943)
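   The same decoding can be reproduced with the Python standard library, which shows exactly what the function computes:

```python
import base64

def decode_base64_utf8(s):
    """Decode a Base64-encoded string and return the UTF-8 string."""
    return base64.b64decode(s).decode("utf-8")

print(decode_base64_utf8("aGVsbG8="))  # hello
```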
   
   ### <a 
name="28.0.0-functional-area-and-related-changes-querying-improved-subquery-guardrail"
 
href="#28.0.0-functional-area-and-related-changes-querying-improved-subquery-guardrail">#</a>
 Improved subquery guardrail
   
   You can now set the `maxSubqueryBytes` guardrail to one of the following:
   
   * `disabled`: Default setting. Druid doesn't apply a guardrail to the number of bytes a subquery can generate.
   * `auto`: Druid calculates the amount of memory to use for materializing results as a portion of the heap's fixed memory.  
     In the query context, Druid uses the following formula to determine the upper limit on the number of bytes a subquery can generate:
   
     ```text
     ((total JVM space - memory occupied by lookups) * 0.5) / maximum queries 
that the system can handle concurrently
     ```
   
   * INTEGER: The number of bytes to use for materializing subquery results. 
Set a specific value if you understand the query patterns and want to optimize 
memory usage.  
     For example, set the `maxSubqueryBytes` parameter to 300000000 (`300 * 
1000 * 1000`) for a 300 MB limit.
     Set the `maxSubqueryBytes` parameter to 314572800 (`300 * 1024 * 1024`) 
for a 300 MiB limit.
   
   [#14808](https://github.com/apache/druid/pull/14808)
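   As an illustration, the `auto` formula can be computed as follows (the heap size, lookup footprint, and concurrency below are hypothetical numbers, not defaults):

```python
def auto_subquery_bytes(jvm_heap_bytes, lookup_bytes, max_concurrent_queries):
    """Per-query upper limit on materialized subquery bytes under 'auto'."""
    return ((jvm_heap_bytes - lookup_bytes) * 0.5) / max_concurrent_queries

# e.g. an 8 GiB heap, 1 GiB occupied by lookups, 16 concurrent queries
limit = auto_subquery_bytes(8 * 1024**3, 1 * 1024**3, 16)
print(limit)  # 234881024.0 bytes, roughly 224 MiB per subquery
```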
   
   ### <a 
name="28.0.0-functional-area-and-related-changes-querying-other-query-improvements"
 
href="#28.0.0-functional-area-and-related-changes-querying-other-query-improvements">#</a>
 Other query improvements
   
   * Added filters to the set of filters that work with UNNEST filter rewrite 
and pushdown [#14777](https://github.com/apache/druid/pull/14777)
   * Enabled [whole-query 
caching](https://druid.apache.org/docs/latest/querying/caching#whole-query-caching)
 on the Broker for [groupBy 
v2](https://druid.apache.org/docs/latest/querying/groupbyquery.html) queries 
[#11595](https://github.com/apache/druid/pull/11595)
   * Improved performance of EARLIEST aggregator with vectorization 
[#14408](https://github.com/apache/druid/pull/14408)
   
   ## <a name="28.0.0-functional-area-and-related-changes-cluster-management" 
href="#28.0.0-functional-area-and-related-changes-cluster-management">#</a> 
Cluster management
   
   ### <a 
name="28.0.0-functional-area-and-related-changes-cluster-management-encrypt-data-in-transit"
 
href="#28.0.0-functional-area-and-related-changes-cluster-management-encrypt-data-in-transit">#</a>
 Encrypt data in transit
   
   The `net.spy.memcached` client has been replaced with the AWS ElastiCache 
client. This change allows Druid to encrypt data in transit using TLS.
   
   Configure it with the following properties:
   
   |Property|Description|Default|
   |--------|-----------|-------|
   | `druid.cache.enableTls` | Enable TLS-based connections for the Memcached client. Boolean. | false |
   | `druid.cache.clientMode` | Client mode. Static mode requires the user to specify individual cluster nodes. Dynamic mode uses the [AutoDiscovery](https://docs.aws.amazon.com/AmazonElastiCache/latest/mem-ug/AutoDiscovery.HowAutoDiscoveryWorks.html) feature of AWS Memcached. String: ["static"](https://docs.aws.amazon.com/AmazonElastiCache/latest/mem-ug/AutoDiscovery.Manual.html) or ["dynamic"](https://docs.aws.amazon.com/AmazonElastiCache/latest/mem-ug/AutoDiscovery.Using.ModifyApp.Java.html). | static |
   | `druid.cache.skipTlsHostnameVerification` | Skip TLS hostname verification. Boolean. | true |
   
   [#14827](https://github.com/apache/druid/pull/14827/)
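   For example, a minimal sketch of the relevant cache properties; the cluster endpoint is a placeholder, and the surrounding Memcached settings (`druid.cache.type`, `druid.cache.hosts`) are shown only for context:

```properties
druid.cache.type=memcached
druid.cache.hosts=my-cluster.cfg.use1.cache.amazonaws.com:11211
druid.cache.enableTls=true
druid.cache.clientMode=dynamic
druid.cache.skipTlsHostnameVerification=false
```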
   
   ### <a 
name="28.0.0-functional-area-and-related-changes-cluster-management-new-metadata-in-the-druid-segments-table"
 
href="#28.0.0-functional-area-and-related-changes-cluster-management-new-metadata-in-the-druid-segments-table">#</a>
 New metadata in the Druid segments table
   
   The Druid segments table now has a column called `used_flag_last_updated` (VARCHAR(255)). This column is a UTC date string corresponding to the last time that the `used` column was modified.
   
   Note that this is an incompatible change to the table. For upgrade 
information, see [Upgrade Druid segments table](#upgrade-druid-segments-table).
   
   [#12599](https://github.com/apache/druid/pull/12599)
   
   ### <a 
name="28.0.0-functional-area-and-related-changes-cluster-management-other-cluster-management-improvements"
 
href="#28.0.0-functional-area-and-related-changes-cluster-management-other-cluster-management-improvements">#</a>
 Other cluster management improvements
   
   * You can now use multiple console appenders in Peon logging 
[#14521](https://github.com/apache/druid/pull/14521)
   * Thread names of the processing pool for Indexer, Peon, and Historical 
processes now include the query ID 
[#15059](https://github.com/apache/druid/pull/15059)
   * The value for `replicationThrottleLimit` used for smart segment loading has been increased from 2% to 5% of the total number of used segments. The total number of replicas in the load queue at the start of a run, plus the replicas assigned during the run, is kept less than or equal to the throttle limit [#14913](https://github.com/apache/druid/pull/14913)
   * The default value for `balancerComputeThreads` is now calculated as the number of CPUs divided by 2. Previously, the value was `1`. Smart segment loading uses this computed value [#14902](https://github.com/apache/druid/pull/14902)
   * Improved `InvalidNullByteFault` errors. They now include the output column 
name instead of the query column name for ease of use 
[#14780](https://github.com/apache/druid/pull/14780)
   * Improved the exception message when `DruidLeaderClient` doesn't find a leader node [#14775](https://github.com/apache/druid/pull/14775)
   * Reduced Coordinator logging under normal operation 
[#14926](https://github.com/apache/druid/pull/14926)
   * Heap usage is now more predictable, at a very minor performance cost, when using nested values [#14919](https://github.com/apache/druid/pull/14919)
   * __Middle Manager-less ingestion__: 
      * The `sys.tasks` metadata table and web console now show the Kubernetes 
pod name rather than the peon location when using Middle Manager-less ingestion 
[#14959](https://github.com/apache/druid/pull/14959)
      * Added support for Middle Manager-less ingestion to migrate with zero 
downtime to and from `WorkerTaskRunners` that use Middle Managers 
[#14918](https://github.com/apache/druid/pull/14918)
   * Druid extensions cannot bind custom Coordinator duties to the duty groups 
`IndexingServiceDuties` and `MetadataStoreManagementDuties` anymore. These are 
meant to be core coordinator built-in flows and should not be affected by 
custom duties. Users can still define a `CustomCoordinatorDuty` with a custom 
duty group and period [#14891](https://github.com/apache/druid/pull/14891)
   * Druid now adjusts `balancerComputeThreads` and `maxSegmentsToMove` 
automatically based on usage skew between the Historical processes in a tier 
[#14584](https://github.com/apache/druid/pull/14584)
   * Removed the configurable property 
`druid.coordinator.compaction.skipLockedIntervals` because it should always be 
`true` [#14807](https://github.com/apache/druid/pull/14807)
   * Updated mm-less task runner lifecycle logic to better match the logic in 
the HTTP and ZooKeeper worker task runners 
[#14895](https://github.com/apache/druid/pull/14895)
   
   ## <a name="28.0.0-functional-area-and-related-changes-data-management" 
href="#28.0.0-functional-area-and-related-changes-data-management">#</a> Data 
management
   
   ### <a 
name="28.0.0-functional-area-and-related-changes-data-management-alert-message-for-segment-assignments"
 
href="#28.0.0-functional-area-and-related-changes-data-management-alert-message-for-segment-assignments">#</a>
 Alert message for segment assignments
   
   Improved the alert message for segment assignments when an invalid tier is specified in a load rule or when no rule applies to a segment.
   
   [#14696](https://github.com/apache/druid/pull/14696)
   
   ### <a 
name="28.0.0-functional-area-and-related-changes-data-management-coordinator-api-for-unused-segments"
 
href="#28.0.0-functional-area-and-related-changes-data-management-coordinator-api-for-unused-segments">#</a>
 Coordinator API for unused segments
   
   Added `includeUnused` as an optional parameter to the Coordinator API.
   You can send a `GET` request to 
`/druid/coordinator/v1/metadata/datasources/{dataSourceName}/segments/{segmentId}?includeUnused=true`
 to retrieve the metadata for a specific segment as stored in the metadata 
store.
   
   The API also returns unused segments if the `includeUnused` parameter is set.
   
   [#14846](https://github.com/apache/druid/pull/14846/)
   
   ### <a 
name="28.0.0-functional-area-and-related-changes-data-management-kill-task-improvements"
 
href="#28.0.0-functional-area-and-related-changes-data-management-kill-task-improvements">#</a>
 Kill task improvements
   
   * Added `killTaskSlotRatio` and `maxKillTaskSlots` dynamic configuration properties to control the resource usage of kill tasks spawned by the `KillUnusedSegments` coordinator duty [#14769](https://github.com/apache/druid/pull/14769)
   * The value for `druid.coordinator.kill.period` can now be greater than or 
equal to `druid.coordinator.period.indexingPeriod`. Previously, it had to be 
greater than `druid.coordinator.period.indexingPeriod`. Additionally, the 
leader Coordinator now keeps track of the last submitted `kill` task for a 
datasource to avoid submitting duplicate `kill` tasks 
[#14831](https://github.com/apache/druid/pull/14831)
   * Added a new config `druid.coordinator.kill.bufferPeriod` for a buffer 
period. This config defines the amount of time that a segment is unused before 
`KillUnusedSegment` can kill it. Using the default `PT24H`, if you mark a 
segment as unused at `2022-06-01T00:05:00.000Z`, then the segment cannot be 
killed until at or after `2022-06-02T00:05:00.000Z` 
[#12599](https://github.com/apache/druid/pull/12599)
   * You can now specify the following parameters for a `kill` task:
     * `batchSize`: The maximum number of segments to delete in one `kill` 
batch [#14642](https://github.com/apache/druid/pull/14642)
     * `limit`: The maximum number of segments for a `kill` task to delete 
[#14662](https://github.com/apache/druid/pull/14662)
   * You can now speed up `kill` tasks by batch deleting multiple segments 
stored in S3 [#14131](https://github.com/apache/druid/pull/14131)
   * Kill tasks that delete unused segments now publish a task report 
containing kill stats such as `numSegmentsKilled`, `numBatchesProcessed`, and 
`numSegmentsMarkedAsUnused` [#15023](https://github.com/apache/druid/pull/15023)
   * `IndexerSQLMetadataStorageCoordinator` now uses the JDBI `PreparedBatch` 
instead of issuing single update statements inside a transaction to mitigate 
scaling challenges [#14639](https://github.com/apache/druid/pull/14639)
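   The `druid.coordinator.kill.bufferPeriod` example above can be checked with Python's standard library (a sketch of the date arithmetic only):

```python
from datetime import datetime, timedelta

buffer_period = timedelta(hours=24)  # druid.coordinator.kill.bufferPeriod=PT24H
marked_unused = datetime.fromisoformat("2022-06-01T00:05:00")

# The segment becomes eligible for kill only after the buffer period elapses.
earliest_kill = marked_unused + buffer_period
print(earliest_kill.isoformat())  # 2022-06-02T00:05:00
```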
   
   ## <a 
name="28.0.0-functional-area-and-related-changes-metrics-and-monitoring" 
href="#28.0.0-functional-area-and-related-changes-metrics-and-monitoring">#</a> 
Metrics and monitoring
   
   ### <a 
name="28.0.0-functional-area-and-related-changes-metrics-and-monitoring-new-ingestion-metrics"
 
href="#28.0.0-functional-area-and-related-changes-metrics-and-monitoring-new-ingestion-metrics">#</a>
 New ingestion metrics
   
   |Metric|Description|Dimensions|Normal value|
   |------|-----------|----------|------------|
   |`ingest/input/bytes`|Number of bytes read from input sources, after 
decompression but prior to parsing. This covers all data read, including data 
that does not end up being fully processed and ingested. For example, this 
includes data that ends up being rejected for being unparseable or filtered 
out.|`dataSource`, `taskId`, `taskType`, `groupId`, `tags`|Depends on the 
amount of data read.|
   
   [#14582](https://github.com/apache/druid/pull/14582)
   
   ### <a 
name="28.0.0-functional-area-and-related-changes-metrics-and-monitoring-new-query-metrics"
 
href="#28.0.0-functional-area-and-related-changes-metrics-and-monitoring-new-query-metrics">#</a>
 New query metrics
   
   |Metric|Description|Dimensions|Normal value|
   |------|-----------|----------|------------|
   |`mergeBuffer/pendingRequests`|Number of requests waiting to acquire a batch 
of buffers from the merge buffer pool.|This metric is exposed through the 
`QueryCountStatsMonitor` module for the Broker.| |
   
   [#15025](https://github.com/apache/druid/pull/15025)
   
   ### <a 
name="28.0.0-functional-area-and-related-changes-metrics-and-monitoring-new-zookeeper-metrics"
 
href="#28.0.0-functional-area-and-related-changes-metrics-and-monitoring-new-zookeeper-metrics">#</a>
 New ZooKeeper metrics
   
   |Metric|Description|Dimensions|Normal value|
   |------|-----------|----------|------------|
   |`zk/connected`|Indicator of connection status. `1` for connected, `0` for 
disconnected. Emitted once per monitor period.|None|1|
   |`zk/reconnect/time`|Amount of time, in milliseconds, that a server was 
disconnected from ZooKeeper before reconnecting. Emitted on reconnection. Not 
emitted if connection to ZooKeeper is permanently lost, because in this case, 
there is no reconnection.|None|Not present|
   
   [#14333](https://github.com/apache/druid/pull/14333)
   
   ### <a 
name="28.0.0-functional-area-and-related-changes-metrics-and-monitoring-new-subquery-metrics-for-the-broker"
 
href="#28.0.0-functional-area-and-related-changes-metrics-and-monitoring-new-subquery-metrics-for-the-broker">#</a>
 New subquery metrics for the Broker
   
   The new `SubqueryCountStatsMonitor` emits metrics corresponding to subqueries and their execution.
   
   |Metric|Description|Dimensions|Normal value|
   |------|-----------|----------|------------|
   |`subquery/rowLimit/count`|Number of subqueries whose results are 
materialized as rows (Java objects on heap).|This metric is only available if 
the `SubqueryCountStatsMonitor` module is included.| |
   |`subquery/byteLimit/count`|Number of subqueries whose results are 
materialized as frames (Druid's internal byte representation of rows).|This 
metric is only available if the `SubqueryCountStatsMonitor` module is 
included.| |
   |`subquery/fallback/count`|Number of subqueries which cannot be materialized as frames.|This metric is only available if the `SubqueryCountStatsMonitor` module is included.| |
   |`subquery/fallback/insufficientType/count`|Number of subqueries which cannot be materialized as frames due to insufficient type information in the row signature.|This metric is only available if the `SubqueryCountStatsMonitor` module is included.| |
   |`subquery/fallback/unknownReason/count`|Number of subqueries which cannot be materialized as frames due to other reasons.|This metric is only available if the `SubqueryCountStatsMonitor` module is included.| |
   |`query/rowLimit/exceeded/count`|Number of queries whose inlined subquery results exceeded the given row limit.|This metric is only available if the `SubqueryCountStatsMonitor` module is included.| |
   |`query/byteLimit/exceeded/count`|Number of queries whose inlined subquery results exceeded the given byte limit.|This metric is only available if the `SubqueryCountStatsMonitor` module is included.| |
   
   [#14808](https://github.com/apache/druid/pull/14808)
   
   ### <a 
name="28.0.0-functional-area-and-related-changes-metrics-and-monitoring-new-coordinator-metrics"
 
href="#28.0.0-functional-area-and-related-changes-metrics-and-monitoring-new-coordinator-metrics">#</a>
 New Coordinator metrics
   
   |Metric|Description|Dimensions|Normal value|
   |------|-----------|----------|------------|
   |`killTask/availableSlot/count`| Number of available task slots that can be 
used for auto kill tasks in the auto kill run. This is the max number of task 
slots minus any currently running auto kill tasks.| |Varies|
   |`killTask/maxSlot/count`| Maximum number of task slots available for auto 
kill tasks in the auto kill run.| |Varies|
   |`kill/task/count`| Number of tasks issued in the auto kill run.| |Varies|
   |`kill/pendingSegments/count`|Number of stale pending segments deleted from 
the metadata store.|`dataSource`|Varies|
   
   [#14782](https://github.com/apache/druid/pull/14782)
   [#14951](https://github.com/apache/druid/pull/14951)
   
   ### <a 
name="28.0.0-functional-area-and-related-changes-metrics-and-monitoring-new-compaction-metrics"
 
href="#28.0.0-functional-area-and-related-changes-metrics-and-monitoring-new-compaction-metrics">#</a>
 New compaction metrics
   
   |Metric|Description|Dimensions|Normal value|
   |------|-----------|----------|------------|
   |`compact/segmentAnalyzer/fetchAndProcessMillis`|Time taken to fetch and process segments to infer the schema for the compaction task to run.|`dataSource`, `taskId`, `taskType`, `groupId`, `tags`|Varies. A high value indicates that compaction tasks would speed up if the data schema were set explicitly.|
   
   [#14752](https://github.com/apache/druid/pull/14752)
   
   ### <a 
name="28.0.0-functional-area-and-related-changes-metrics-and-monitoring-segment-scan-metrics"
 
href="#28.0.0-functional-area-and-related-changes-metrics-and-monitoring-segment-scan-metrics">#</a>
 Segment scan metrics
   
   Added a new metric to track the usage of `druid.processing.numThreads` on Historicals, Indexers, and Peons.
   
   |Metric|Description|Dimensions|Normal value|
   |------|-----------|----------|------------|
   |`segment/scan/active`|Number of segments currently being scanned. This metric also indicates how many threads from `druid.processing.numThreads` are currently in use.||Close to `druid.processing.numThreads`|
   
   ### <a 
name="28.0.0-functional-area-and-related-changes-metrics-and-monitoring-new-kafka-consumer-metrics"
 
href="#28.0.0-functional-area-and-related-changes-metrics-and-monitoring-new-kafka-consumer-metrics">#</a>
 New Kafka consumer metrics
   
   Added the following Kafka consumer metrics:
   
   * `kafka/consumer/bytesConsumed`: Equivalent to the Kafka consumer metric 
`bytes-consumed-total`. Only emitted for Kafka tasks.
   * `kafka/consumer/recordsConsumed`: Equivalent to the Kafka consumer metric 
`records-consumed-total`. Only emitted for Kafka tasks.
   
   [#14582](https://github.com/apache/druid/pull/14582)
   
   ### <a 
name="28.0.0-functional-area-and-related-changes-metrics-and-monitoring-service%2Fheartbeat-metrics"
 
href="#28.0.0-functional-area-and-related-changes-metrics-and-monitoring-service%2Fheartbeat-metrics">#</a>
 service/heartbeat metrics
   
   * Exposed `service/heartbeat` metric to `statsd-reporter` 
[#14564](https://github.com/apache/druid/pull/14564)
   * Modified the `service/heartbeat` metric to expose the `leader` dimension 
[#14593](https://github.com/apache/druid/pull/14593)
   
   ### <a 
name="28.0.0-functional-area-and-related-changes-metrics-and-monitoring-tombstone-and-segment-counts"
 
href="#28.0.0-functional-area-and-related-changes-metrics-and-monitoring-tombstone-and-segment-counts">#</a>
 Tombstone and segment counts
   
   Added `ingest/tombstones/count` and `ingest/segments/count` metrics in MSQ 
to report the number of tombstones and segments after Druid finishes publishing 
segments.
   
   [#14980](https://github.com/apache/druid/pull/14980)
   
   ## <a name="28.0.0-functional-area-and-related-changes-extensions" 
href="#28.0.0-functional-area-and-related-changes-extensions">#</a> Extensions
   
   ### <a 
name="28.0.0-functional-area-and-related-changes-extensions-ingestion-task-payloads-for-kubernetes"
 
href="#28.0.0-functional-area-and-related-changes-extensions-ingestion-task-payloads-for-kubernetes">#</a>
 Ingestion task payloads for Kubernetes
   
   You can now provide compressed task payloads larger than 128 KB when you run Middle Manager-less ingestion jobs.
   
   [#14887](https://github.com/apache/druid/pull/14887)
   
   ### <a 
name="28.0.0-functional-area-and-related-changes-extensions-prometheus-emitter" 
href="#28.0.0-functional-area-and-related-changes-extensions-prometheus-emitter">#</a>
 Prometheus emitter
   
   The Prometheus emitter now supports a new optional configuration parameter, 
`druid.emitter.prometheus.extraLabels`.
   This addition offers the flexibility to add arbitrary extra labels to 
Prometheus metrics, providing more granular control in managing and identifying 
data across multiple Druid clusters or other dimensions.
   For more information, see [Prometheus emitter 
extension](https://druid.apache.org/docs/latest/development/extensions-contrib/prometheus).
   
   [#14728](https://github.com/apache/druid/pull/14728)
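   A hypothetical sketch of the emitter configuration; the JSON-map value format for `extraLabels` shown below is an assumption, so verify it against the extension docs before use:

```properties
druid.emitter=prometheus
druid.emitter.prometheus.extraLabels={"cluster": "prod-us-east"}
```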
   
   ## <a 
name="28.0.0-functional-area-and-related-changes-documentation-improvements" 
href="#28.0.0-functional-area-and-related-changes-documentation-improvements">#</a>
 Documentation improvements
   
   We've moved Jupyter notebooks that guide you through query, ingestion, and 
data management with Apache Druid to the new [Learn 
Druid](https://github.com/implydata/learn-druid) repository.
   The repository also contains a Docker Compose file to get you up and running 
with a learning lab.
   
   [#15136](https://github.com/apache/druid/pull/15136)
   
   

