Hi team :
Currently when we using SVM to train dataset we found the input files limit
only one .
the source code as following :
valpath=if (dataFiles.length ==1) {
dataFiles.head.getPath.toUri.toString
} elseif (dataFiles.isEmpty) {
thrownewIOException("No input path specified for libsvm data")
} else {
thrownewIOException("Multiple input paths are not supported for libsvm data.")
}
The file store on the Distributed File System such as HDFS is split into mutil
piece and I think this limit is not necessary . I'm not sure is it a bug ? or
something I'm using not correctly .
thanks a lot ~~~