Andras Nemeth created SPARK-12264:
-------------------------------------
Summary: Could DataType provide a TypeTag?
Key: SPARK-12264
URL: https://issues.apache.org/jira/browse/SPARK-12264
Project: Spark
Issue Type: New Feature
Components: SQL
Reporter: Andras Nemeth
Priority: Minor
We are writing code that's dealing with generic DataFrames as inputs and
further processes their contents with normal RDD operations (not SQL). We need
some mechanism that tells us exactly what Scala types we will find inside a Row
of a given DataFrame.
The schema of the DataFrame contains this information in an abstract sense. But
we need to map it to TypeTags, as that's what the rest of the system uses to
identify what RDD contains what type of data - quite the natural choice in
Scala.
As far as I can tell, there is no good way to do this today. For now we have a
hand coded mapping, but that feels very fragile as spark evolves. Is there a
better way I'm missing? And if not, could we create one? Adding a typeTag or
scalaTypeTag method to DataType, or at least to AtomicType seems easy enough.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]