FYI, I've used sklearn's LogisticRegression in an online/real-time text
classification app without having to dig into the internals and gotten
~2.5ms response time (including vectorizing; vocab size ~200k).
On 23 September 2013 06:37, Peter Prettenhofer <peter.prettenho...@gmail.com
> wrote:
> We don't have a PMML interface yet [1] - so you need to write custom code
> to extract internal state each individual classifier.
>
> What do you mean by performance critical (<1ms, <<1ms)? Do you make
> predictions per sample or can you buffer samples and make predictions for
> batches?
> In general, what kills performance is the overhead of python function
> calls - its usually way larger than the actual prediction (which usually
> happens in C-land).
>
> [1] http://en.wikipedia.org/wiki/Predictive_Model_Markup_Language
>
>
> 2013/9/23 Fred Baba <fred.b...@gmail.com>
>
>> I'd like to use classifiers trained via sklearn in a real-time
>> application, performance critical application. How do I access the internal
>> representation of trained classifiers?
>>
>> For linear classifiers/regressions, I can simply store the coefficients
>> and generate the linear combination myself. For tree regressions, I can use
>> sklearn.tree.export_graphviz. Ideally there would be an export facility for
>> all classifiers (particularly for examining the structure of generated
>> models). Is there a general solution way to do this?
>>
>>
>>
>> ------------------------------------------------------------------------------
>> LIMITED TIME SALE - Full Year of Microsoft Training For Just $49.99!
>> 1,500+ hours of tutorials including VisualStudio 2012, Windows 8,
>> SharePoint
>> 2013, SQL 2012, MVC 4, more. BEST VALUE: New Multi-Library Power Pack
>> includes
>> Mobile, Cloud, Java, and UX Design. Lowest price ever! Ends 9/20/13.
>>
>> http://pubads.g.doubleclick.net/gampad/clk?id=58041151&iu=/4140/ostg.clktrk
>> _______________________________________________
>> Scikit-learn-general mailing list
>> Scikit-learn-general@lists.sourceforge.net
>> https://lists.sourceforge.net/lists/listinfo/scikit-learn-general
>>
>>
>
>
> --
> Peter Prettenhofer
>
>
> ------------------------------------------------------------------------------
> LIMITED TIME SALE - Full Year of Microsoft Training For Just $49.99!
> 1,500+ hours of tutorials including VisualStudio 2012, Windows 8,
> SharePoint
> 2013, SQL 2012, MVC 4, more. BEST VALUE: New Multi-Library Power Pack
> includes
> Mobile, Cloud, Java, and UX Design. Lowest price ever! Ends 9/20/13.
> http://pubads.g.doubleclick.net/gampad/clk?id=58041151&iu=/4140/ostg.clktrk
> _______________________________________________
> Scikit-learn-general mailing list
> Scikit-learn-general@lists.sourceforge.net
> https://lists.sourceforge.net/lists/listinfo/scikit-learn-general
>
>
------------------------------------------------------------------------------
LIMITED TIME SALE - Full Year of Microsoft Training For Just $49.99!
1,500+ hours of tutorials including VisualStudio 2012, Windows 8, SharePoint
2013, SQL 2012, MVC 4, more. BEST VALUE: New Multi-Library Power Pack includes
Mobile, Cloud, Java, and UX Design. Lowest price ever! Ends 9/20/13.
http://pubads.g.doubleclick.net/gampad/clk?id=58041151&iu=/4140/ostg.clktrk
_______________________________________________
Scikit-learn-general mailing list
Scikit-learn-general@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/scikit-learn-general