so you can use a input output format & read it whichever way you write... You can additionally provide variables in hadoop configuration to configure.
Mayur Rustagi Ph: +1 (760) 203 3257 http://www.sigmoidanalytics.com @mayur_rustagi <https://twitter.com/mayur_rustagi> On Thu, May 8, 2014 at 10:39 AM, Debasish Das <debasish.da...@gmail.com>wrote: > Hi, > > For each line that we read as textLine from HDFS, we have a schema..if > there is an API that takes the schema as List[Symbol] and maps each token > to the Symbol it will be helpful... > > Does RDDs provide a schema view of the dataset on HDFS ? > > Thanks. > Deb >