Filed: https://issues.apache.org/jira/browse/SPARK-6967
Shouldn't they be null? > Statistics are only used to eliminate partitions that can't possibly hold matching values. So while you are right this might result in a false positive, that will not result in a wrong answer.