[ 
https://issues.apache.org/jira/browse/PHOENIX-2435?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15018049#comment-15018049
 ] 

Josh Mahonin commented on PHOENIX-2435:
---------------------------------------

Hi [~diego.fustes]

Your approach was very close, but internally the phoenix-spark code takes care 
of building the select statement from the tableName, columns and predicate. It 
then updates the configuration from those, and since you had already set it, it 
ends up getting overridden. If you're curious about the process, the code is 
here:

https://github.com/apache/phoenix/blob/master/phoenix-spark/src/main/scala/org/apache/phoenix/spark/PhoenixRDD.scala#L64

I've been considering adjusting the RDD API so that it can consume something 
closer to a regular SQL statement, but since Spark is moving towards DataFrames 
/ Spark SQL anyway, I've kept it on the back burner.

> Where clause does not work for phoenix-spark create phoenix RDD
> ---------------------------------------------------------------
>
>                 Key: PHOENIX-2435
>                 URL: https://issues.apache.org/jira/browse/PHOENIX-2435
>             Project: Phoenix
>          Issue Type: Bug
>    Affects Versions: 4.4.0
>            Reporter: Diego Fustes Villadóniga
>
> I create a phoenix RDD by passing the next query:
> SELECT RN,ODO,GSN,ANG,MFL FROM SHOT WHERE RN = 'TUBFL04M0'
> It seems that the where clause is not taking effect and all rows are readed 
> into the RDD



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to