[
https://issues.apache.org/jira/browse/KUDU-2666?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
wkhapy123 updated KUDU-2666:
----------------------------
Description:
Spark version: 2.3.0
My Kudu cluster has 3 nodes.
Each tablet has 3 replicas.
When I use the SparkContext to read a Kudu table, the task Locality Level is RACK_LOCAL.
How can I make it NODE_LOCAL?
The query looks like this:
spark.sqlContext.sql(s"select * from tablea where event_day>=1546185600 and tenant_id=1 and channel_id='15850513729' limit 1 ").collect
was:
Spark version: 2.3.0
My Kudu cluster has 3 nodes.
Each tablet has 3 replicas.
When I use the SparkContext to read a Kudu table, the task [Locality
Level|http://spark-01:8088/proxy/application_1547717455873_0092/stages/stage?id=111&attempt=0&task.sort=Locality+Level&task.pageSize=100]
is RACK_LOCAL.
How can I make it NODE_LOCAL?
The query looks like this:
spark.sqlContext.sql(s"select * from tablea where event_day>=1546185600 and tenant_id=1 and channel_id='15850513729' limit 1 ").collect
> kudu spark integration taskRead Locality Level is RACK_LOCAL
> -------------------------------------------------------------
>
> Key: KUDU-2666
> URL: https://issues.apache.org/jira/browse/KUDU-2666
> Project: Kudu
> Issue Type: Improvement
> Components: spark
> Affects Versions: 1.8.0
> Reporter: wkhapy123
> Priority: Major
> Attachments: 1.png, 2.png
>
>
> Spark version: 2.3.0
> My Kudu cluster has 3 nodes.
> Each tablet has 3 replicas.
> When I use the SparkContext to read a Kudu table, the task Locality Level is RACK_LOCAL.
> How can I make it NODE_LOCAL?
> The query looks like this:
> spark.sqlContext.sql(s"select * from tablea where event_day>=1546185600 and tenant_id=1 and channel_id='15850513729' limit 1 ").collect