gianm opened a new pull request #10267:
URL: https://github.com/apache/druid/pull/10267


   DruidInputSource, DruidSegmentReader changes:
   
   1) Remove "dimensions" and "metrics". They are not necessary, because we
      can compute which columns we need to read based on what is going to
      be used by the timestamp, transform, dimensions, and metrics.
   2) Start using ColumnsFilter (see below) to decide which columns we need
      to read.
   3) Actually respect the "timestampSpec". Previously, it was ignored, and
      the timestamp of the returned InputRows was set to the `__time` column
      of the input datasource.
   
   (1) and (2) together fix a bug in which the DruidInputSource would not
   properly read columns that are used as inputs to a transformSpec.
   
   (3) fixes a bug where the timestampSpec would be ignored if you attempted
   to set the column to something other than `__time`.
   
   (1) and (3) are breaking changes.
   
   Web console changes:
   
   1) Remove "Dimensions" and "Metrics" from the Druid input source.
   2) Set timestampSpec to `{"column": "__time", "format": "millis"}` for
      compatibility with the new behavior.
   
   Other changes:
   
   1) Add ColumnsFilter, a new class that allows input readers to determine
      which columns they need to read. Currently, it's only used by the
      DruidInputSource, but it could be used by other columnar input sources
      in the future.
   2) Add a ColumnsFilter to InputRowSchema.
   3) Remove the metric names from InputRowSchema (they were unused).
   4) Add InputRowSchemas.fromDataSchema method that computes the proper
      ColumnsFilter for given timestamp, dimensions, transform, and metrics.
   5) Add "getRequiredColumns" method to TransformSpec to support the above.


----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
[email protected]



---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to