Todd Lipcon created KUDU-1802: --------------------------------- Summary: Deserializing scan tokens should avoid round-trip to master Key: KUDU-1802 URL: https://issues.apache.org/jira/browse/KUDU-1802 Project: Kudu Issue Type: Improvement Components: client, perf Affects Versions: 1.2.0 Reporter: Todd Lipcon
Currently, KuduScanToken::DeserializeIntoScanner calls KuduClient::OpenTable() which makes a GetTableSchema call to the master. This round trip is a bit expensive because it's always a "thundering herd" for an Impala query or Spark job -- every host deserializes a bunch of scan tokens at the same time and ends up having to back off. We should consider some ways to avoid this. -- This message was sent by Atlassian JIRA (v6.3.4#6332)