Junegunn Choi created PHOENIX-3073:
--------------------------------------

             Summary: Fast path for single-key point lookups
                 Key: PHOENIX-3073
                 URL: https://issues.apache.org/jira/browse/PHOENIX-3073
             Project: Phoenix
          Issue Type: Improvement
            Reporter: Junegunn Choi
            Assignee: Junegunn Choi

While comparing the Phoenix JDBC client to the native HBase Java client, I noticed that the Phoenix client uses significantly more CPU time on the client machine. Profiling revealed that the majority of the time was spent in {{BaseResultIterators.getParallelScans()}}. This was surprising to me, as I was only testing with simple point lookup queries.

Here's how I tested:

- {{SELECT /*+ SMALL SERIAL */ ID, DOCID FROM IMAGE WHERE ID = ?}}
- {{IMAGE}} is a salted table with 100 salt buckets
- {{ID}}, the primary key, was randomly selected from a small range so that the requests are served without disk I/O
- 20K/sec concurrent requests using 128 threads

{{getParallelScans()}} is quite expensive because it iterates over all regions of the table, which can be many, only to return a single Scan object for this query. Since a single-key point lookup is one of the most frequent types of requests in a typical OLTP application, I believe it makes sense to have a fast path for it.

With the patch, the average CPU usage of the client during the workload dropped from 56.7% to 18.8%.
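
To illustrate why a fast path helps, here is a minimal, self-contained sketch of the idea. It is a toy model, not Phoenix or HBase code and not the attached patch: the class {{PointLookupSketch}}, its methods, the string row keys, and the {{hashCode()}}-based salt are all hypothetical stand-ins. It contrasts walking every region boundary to find the one region that holds a key with locating that region directly in a sorted map.

```java
import java.util.ArrayList;
import java.util.Collections;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;

// Toy model only: every name here is hypothetical, not a Phoenix or HBase API.
final class PointLookupSketch {

    // One region per salt bucket, keyed by its inclusive start key ("00|", "01|", ...).
    // Assumes at most 100 buckets so the two-digit prefixes sort in bucket order.
    static TreeMap<String, String> buildRegions(int buckets) {
        TreeMap<String, String> regions = new TreeMap<>();
        for (int b = 0; b < buckets; b++) {
            regions.put(String.format("%02d|", b), "region-" + b);
        }
        return regions;
    }

    // Assumed salting scheme for illustration: prefix the id with hash(id) mod buckets.
    static String saltedKey(String id, int buckets) {
        return String.format("%02d|%s", Math.floorMod(id.hashCode(), buckets), id);
    }

    // Slow path: visit every region and test whether the key falls inside it.
    // The work grows with the number of regions even though one region matches.
    static List<String> scanAllRegions(TreeMap<String, String> regions, String rowKey) {
        List<String> starts = new ArrayList<>(regions.keySet());
        List<String> hits = new ArrayList<>();
        for (int i = 0; i < starts.size(); i++) {
            String start = starts.get(i);
            // A null end key means the last region is unbounded above.
            String end = (i + 1 < starts.size()) ? starts.get(i + 1) : null;
            boolean atOrAfterStart = rowKey.compareTo(start) >= 0;
            boolean beforeEnd = (end == null) || rowKey.compareTo(end) < 0;
            if (atOrAfterStart && beforeEnd) {
                hits.add(regions.get(start));
            }
        }
        return hits;
    }

    // Fast path: a single-key lookup needs only the region whose start key is
    // the greatest one not exceeding the row key, found with one floor lookup.
    static String locateDirectly(TreeMap<String, String> regions, String rowKey) {
        Map.Entry<String, String> entry = regions.floorEntry(rowKey);
        if (entry == null) {
            throw new IllegalArgumentException("no region covers key " + rowKey);
        }
        return entry.getValue();
    }

    public static void main(String[] args) {
        TreeMap<String, String> regions = buildRegions(100);
        String key = saltedKey("img-42", 100);
        List<String> slow = scanAllRegions(regions, key);
        String fast = locateDirectly(regions, key);
        System.out.println("slow path: " + slow + ", fast path: " + fast);
        System.out.println("same result: " + slow.equals(Collections.singletonList(fast)));
    }
}
```

Both methods return the same single region; the difference is that the slow path touches all 100 region boundaries per request, while the fast path does one logarithmic lookup. At 20K requests per second, that per-request difference is the kind of cost that would show up in a client-side CPU profile.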