[ 
https://issues.apache.org/jira/browse/MADLIB-1037?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15667937#comment-15667937
 ] 

Xiaocheng Tang commented on MADLIB-1037:
----------------------------------------

The first PR is created here: https://github.com/apache/incubator-madlib/pull/75

This PR implements multi-class support vector machine and a new training 
mechanism using buffered mini-batch stochastic gradient descent (hence 
"in-memory" computations across epochs). Preliminary experiments demonstrate 
potential on greatly improving training speed and accuracy in MADlib. For 
example, training on mnist datasets (10 classes) for 1s on HAWQ 2.0 half-rack 
DCA and the model is able to achieve over 90% accuracy on holdout data. The 
training and testing set in this case consist of 46900 and 23100 images, 
respectively.

> Multi-class SVM with mini-batching 
> -----------------------------------
>
>                 Key: MADLIB-1037
>                 URL: https://issues.apache.org/jira/browse/MADLIB-1037
>             Project: Apache MADlib
>          Issue Type: New Feature
>          Components: Module: Support Vector Machines
>            Reporter: Frank McQuillan
>            Assignee: Xiaocheng Tang
>             Fix For: v1.10
>
>
> Prototype of structure svm trained with mini-batch
> Acceptance
> 1) train on mnist and achieve acceptable accuracy



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to