[
https://issues.apache.org/jira/browse/SPARK-21066?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16055336#comment-16055336
]
Yan Facai (颜发才) commented on SPARK-21066:
-----------------------------------------
[~sowen] I believe that the API has explained well in details.
If unspecified or nonpositive, the number of features will be determined
automatically at the cost of one additional pass.
> LibSVM load just one input file
> -------------------------------
>
> Key: SPARK-21066
> URL: https://issues.apache.org/jira/browse/SPARK-21066
> Project: Spark
> Issue Type: Bug
> Components: ML
> Affects Versions: 2.1.1
> Reporter: darion yaphet
>
> Currently when we using SVM to train dataset we found the input files limit
> only one .
> The file store on the Distributed File System such as HDFS is split into
> mutil piece and I think this limit is not necessary .
> We can join input paths into a string split with comma.
--
This message was sent by Atlassian JIRA
(v6.4.14#64029)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]