Github user mengxr commented on the pull request:
https://github.com/apache/spark/pull/685#issuecomment-42727421
I wrote a parser, which should be much faster than Scala's parser
combinators. For the schema, an alternative would be JSON. JSON doesn't
separate array from tuple. So we may need either object with named fields or
nested array to represent vectors and labeled points.
---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at [email protected] or file a JIRA ticket
with INFRA.
---