SaschaC created CALCITE-2256:
--------------------------------

Summary: Exception in Druid Adapter: JsonSegmentMetadata, numRows field out of range
Key: CALCITE-2256
URL: https://issues.apache.org/jira/browse/CALCITE-2256
Project: Calcite
Issue Type: Bug
Components: druid
Affects Versions: 1.16.0
Reporter: SaschaC
Assignee: Julian Hyde

When connecting to a Druid datasource, Calcite first pulls metadata from Druid. Because Druid is a database for big data, it is not uncommon for a datasource to contain billions of records. The metadata for a datasource includes a record count, namely the total number of records residing in that datasource, returned in a property named "numRows".

The Calcite Druid adapter reads the numRows field (which Druid returns in the JSON response to a metadata request) into an int field. A Java int is signed 32-bit, so it overflows for record counts above 2,147,483,647 (Integer.MAX_VALUE, roughly 2.1 billion), which triggers the exception below:

{code}
Caused by: com.fasterxml.jackson.databind.JsonMappingException: Numeric value (13452491153) out of range of int
 at [Source: (sun.net.www.protocol.http.HttpURLConnection$HttpInputStream); line: 565, column: 26]
 at [Source: (sun.net.www.protocol.http.HttpURLConnection$HttpInputStream); line: 565, column: 15] (through reference chain: java.util.ArrayList[0]->org.apache.calcite.adapter.druid.DruidConnectionImpl$JsonSegmentMetadata["numRows"])
{code}

In class org.apache.calcite.adapter.druid.*DruidConnectionImpl*, the inner class JsonSegmentMetadata uses int fields to read Druid metadata such as size and numRows. These ought to be of type *long* so that they can hold larger numbers:

{code}
private static class JsonSegmentMetadata {
  public String id;
  public List<String> intervals;
  public Map<String, JsonColumn> columns;
  public int size;     // overflows for large segments; should be long
  public int numRows;  // overflows above Integer.MAX_VALUE; should be long
  public Map<String, JsonAggregator> aggregators;
}
{code}
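
To illustrate the range problem described above, here is a minimal, self-contained sketch. It is not the actual patch and does not use Jackson: the class names NumRowsOverflowDemo and SegmentMetadataSketch and the helper fitsInInt are hypothetical, introduced only for this example. It uses the value from the exception message to show that it does not fit in an int but is held without loss by a long field.

```java
public class NumRowsOverflowDemo {

    /**
     * Hypothetical stand-in for JsonSegmentMetadata with the two
     * counters widened to long, as the report suggests.
     */
    static class SegmentMetadataSketch {
        public long size;
        public long numRows;
    }

    /** Returns true if the value can be stored in a Java int without loss. */
    static boolean fitsInInt(long value) {
        return value >= Integer.MIN_VALUE && value <= Integer.MAX_VALUE;
    }

    public static void main(String[] args) {
        // Row count taken from the exception message in this report.
        String reported = "13452491153";

        // Parsing as long succeeds; the value is far above Integer.MAX_VALUE.
        long asLong = Long.parseLong(reported);
        System.out.println("fits in int: " + fitsInInt(asLong));

        // Parsing the same text as int is rejected by the JDK as out of range.
        try {
            Integer.parseInt(reported);
        } catch (NumberFormatException e) {
            System.out.println("int parse failed: " + e.getMessage());
        }

        // A long field holds the count unchanged.
        SegmentMetadataSketch metadata = new SegmentMetadataSketch();
        metadata.numRows = asLong;
        System.out.println("numRows as long: " + metadata.numRows);
    }
}
```

A long can hold values up to 9,223,372,036,854,775,807, so widening the two fields leaves ample headroom for row counts and byte sizes of this magnitude.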