GitHub user alessandrozucca00 commented on the issue:
https://github.com/apache/spark/pull/17136
Hi,
I have some concerns about not treating shorter records as malformed:
this could lead to corrupt or inconsistent data, since nothing guarantees
that a record's missing tokens are at the end of the record rather than
somewhere in the middle.
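To make the concern concrete, here is a minimal sketch in plain Python, not Spark's actual parser. The schema and the `pad_short_record` helper are hypothetical, and I am assuming that a short record is filled with nulls at the end:

```python
# Hypothetical schema and helper, for illustration only; this is not Spark code.
SCHEMA = ["id", "name", "city"]

def pad_short_record(tokens, schema=SCHEMA):
    """Pad a short record with None at the end (the assumed behaviour)."""
    padded = tokens + [None] * (len(schema) - len(tokens))
    return dict(zip(schema, padded))

# The writer dropped the 'name' field, so the record has only two tokens.
row = pad_short_record("1,Rome".split(","))
print(row)  # 'Rome' ends up under 'name', and 'city' is None
```

The record is accepted without any error, but the value that belongs to `city` is silently stored under `name`.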
At a minimum, I think it would be useful to add an option that lets users
define a policy for this.
If you prefer, I can open a separate issue for this enhancement.