Github user jkbradley commented on a diff in the pull request:

    https://github.com/apache/spark/pull/11536#discussion_r56709463
  
    --- Diff: 
mllib/src/main/scala/org/apache/spark/ml/feature/CountVectorizer.scala ---
    @@ -207,6 +207,27 @@ class CountVectorizerModel(override val uid: String, 
val vocabulary: Array[Strin
       /** @group setParam */
       def setMinTF(value: Double): this.type = set(minTF, value)
     
    +  /**
    +    * Binary toggle to control the output vector values.
    +    * If True, all non zero counts are set to 1. This is useful for 
discrete probabilistic
    --- End diff --
    
    Could you clarify that this ("all non-zero counts") is after the minTF 
filter has been applied?
    (Also, as long as you're at it, could you please correct the indentation?)


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at [email protected] or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to