Github user ygcao commented on the pull request:
https://github.com/apache/spark/pull/10152#issuecomment-164112022
update codes:
1. passed lint-scala check by adding spaces and cut comment line lengths
2. removed distance functions as we discussed, including the existing
zombie one(not used and not accessible from outside)
3. made the changes to add back maxSentenceSize as a configurable variable
while still respect sentence boundary by default, just to meet demands of
people who will construct sentences in a unimaginable way. I'll personally
always set it large enough(up to document size) to never affect nature sentence
boundary. Anyway, it's good to have an option for different people's need.
---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at [email protected] or file a JIRA ticket
with INFRA.
---
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]