I am aware that PySpark currently cannot load sequence files directly. Are there workarounds people are using (short of duplicating all the data to text files) for accessing this data?
- Workarounds for accessing sequence file data via PySpark? Gary Malouf
- Re: Workarounds for accessing sequence file data via P... Nick Pentreath