Dear Scikit-learn community,

This is my first email to the ML (mailing list), so let me start by expressing my appreciation to all involved in the scikit-learn project. I have learned much even from just following the ML.
I would like to know whether anyone using scikit-learn has considered, or has already started, combining the open source tranSMART biomedical big data platform (http://transmartfoundation.org/) with scikit-learn to perform analytics on large life science datasets.

tranSMART has been geared towards R, but I am not the biggest fan of R and would prefer to work in Python with scipy/scikit-learn. I wonder how challenging the integration is or would be, notably with respect to passing data in memory between the two platforms while avoiding intermediate text file processing.

With thanks,
J.B. Brown
Assistant Professor, Kyoto University Graduate School of Medicine
Department of Clinical System Onco-Informatics

_______________________________________________
Scikit-learn-general mailing list
Scikit-learn-general@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/scikit-learn-general