Andre Schumacher created SPARK-1649:
---------------------------------------

             Summary: DataType should contain nullable bit
                 Key: SPARK-1649
                 URL: https://issues.apache.org/jira/browse/SPARK-1649
             Project: Spark
          Issue Type: Improvement
          Components: SQL
    Affects Versions: 1.1.0
            Reporter: Andre Schumacher
            Priority: Critical


For the underlying storage layer it would simplify things such as schema 
conversions, predicate filter determination and such to record in the data type 
itself whether a column can be nullable. So the DataType type could look like 
like this:

abstract class DataType(nullable: Boolean = true)

Concrete subclasses could then override the nullable val. Mostly this could be 
left as the default but when types can be contained in nested types one could 
optimize for, e.g., arrays with elements that are nullable and those that are 
not.



--
This message was sent by Atlassian JIRA
(v6.2#6252)

Reply via email to