Could you provide an example of what you mean? I know it's possible to create an RDD from a path with wildcards, like in the subject.
For example, sc.textFile('s3n://bucket/2014-??-??/*.gz'). You can also
provide a comma delimited list of paths.
Nick
2014년 6월 1일 일요일, Oleg Proudnikov<[email protected]>님이 작성한 메시지:
> Hi All,
>
> Is it possible to create an RDD from a directory tree of the following
> form?
>
> RDD[(PATH, Seq[TEXT])]
>
> Thank you,
> Oleg
>
>
