[ https://issues.apache.org/jira/browse/SPARK-20795?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Turan Gojayev updated SPARK-20795: ---------------------------------- Description: In current implementation of IDF there is no way for setting maximum number of documents for filtering the terms. I assume that the functionality is the same for minimum document frequency, and was wondering, if there is a special reason for not having maxDocFreq parameter and filtering. > Maximum document frequency for IDF > ---------------------------------- > > Key: SPARK-20795 > URL: https://issues.apache.org/jira/browse/SPARK-20795 > Project: Spark > Issue Type: Improvement > Components: ML > Affects Versions: 2.1.0 > Reporter: Turan Gojayev > Priority: Minor > > In current implementation of IDF there is no way for setting maximum number > of documents for filtering the terms. I assume that the functionality is the > same for minimum document frequency, and was wondering, if there is a special > reason for not having maxDocFreq parameter and filtering. -- This message was sent by Atlassian JIRA (v6.3.15#6346) --------------------------------------------------------------------- To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org For additional commands, e-mail: issues-h...@spark.apache.org