Hi Issam.
Sorry to break this to you, but sklearn will not add any GPU code in the near future.
Also, we will probably not use numba for quite a while.
I think it is possible that we want to replace some cython with numba, but I don't see this happening this year.

Regarding your proposal: actually, Deep Boltzmann machines are strictly more general than Deep Belief Networks. If you don't know this, I'm not sure you know enough about these algorithms to implement them. Also, stacked denoising autoencoders are synonymous with deep autoencoders (modulo modifying the input).

You write "Learn and implement GPU accelerated Python techniques (eg. shared variables) to improve speed". Are you talking about theano there? So you want to add a theano-dependency to sklearn?
No, sorry, that won't happen either.

I don't think there is any point in submitting your proposal.

Sorry.
Andy


On 05/02/2013 09:33 AM, Issam wrote:
Hi Scikit-learn,

Apologies for not replying to the comments I received on the pre-proposal; for some reason I had the mailing-list notifications turned off :(. To address them:
- It is very closely related to MLP and RBM.
- For performance, I will exploit numba and GPU techniques.
- On the concern of duplicating Theano:
Scikit-learn has a very friendly, flexible API that lets users run complex algorithms as simple black boxes. Eventually, scikit-learn will need an easy-to-use neural network package, which this project aims to provide. Deep learning is an extension of classic neural network techniques. Implementing the foundations of deep learning algorithms in scikit-learn has long-term benefits, since the research area is evolving quickly and many new algorithms are being proposed; Theano may not cover all of them. In addition, scikit-learn's conventions make such implementations easy to extend as research demands change.
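To make the black-box idea concrete, a hypothetical estimator following scikit-learn's fit/transform convention might look like the sketch below. The class name, parameters, and the random-projection "training" are placeholders of my own for illustration, not an existing scikit-learn API:

```python
import numpy as np


class DeepBeliefNetwork:
    """Hypothetical sketch of a scikit-learn-style estimator interface.

    Only the fit/transform convention is shown; real training
    (layer-wise RBM pretraining etc.) is stubbed out with a random
    projection per layer.
    """

    def __init__(self, layer_sizes=(64, 32), random_state=0):
        self.layer_sizes = layer_sizes
        self.random_state = random_state

    def fit(self, X, y=None):
        rng = np.random.RandomState(self.random_state)
        self.weights_ = []
        n_in = X.shape[1]
        for n_out in self.layer_sizes:
            # placeholder for real layer-wise training
            self.weights_.append(rng.randn(n_in, n_out) * 0.01)
            n_in = n_out
        return self

    def transform(self, X):
        H = X
        for W in self.weights_:
            H = np.tanh(H @ W)  # propagate through each layer
        return H


X = np.random.RandomState(1).rand(10, 8)
codes = DeepBeliefNetwork(layer_sizes=(6, 3)).fit(X).transform(X)
print(codes.shape)  # (10, 3)
```

The point of the sketch is only the interface: construction with hyperparameters, `fit` returning `self`, and `transform` producing the learned representation.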

I submitted a proposal to GSoC 2013, structured as follows:

*Title,*

(scikit-learn) Deep Learning capabilities for scikit-learn

*Short Description*,
I am proposing to implement deep learning algorithms in scikit-learn. Deep learning is a relatively new research area that is progressing fast, with a lot of potential for contributions. Its interesting idea is to imitate the brain by using many levels (hidden layers) of processing, where higher levels capture increasingly abstract representations.

In this project, I plan to work through each step carefully: first "Deep Boltzmann machines", then "Deep belief networks", "Deep auto-encoders", "Stacked denoising auto-encoders", and more.

My plan is to establish an easy-to-use, fast (GPU-enhanced), black-box plugin for the scikit-learn library.

This is necessary because deep learning is evolving quickly, and having it behind an attractive API such as scikit-learn's would allow the library to adapt easily to new developments in this research area.
*Content,*

The plan is as follows,

Please note that all implementations will follow scikit-learn's conventions, while exploiting numba and GPU techniques for high-speed performance.

*Until Jun 17*

Familiarize myself with deep learning and scikit-learn's API

Analyze related algorithms in the pull request (scikit-learn GitHub) and ensure they are bug-free

Implement/reuse dependent algorithms (e.g. MLP, RBM)
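For reference, the core update such implementations build on is contrastive divergence. A minimal CD-1 step for a binary RBM can be sketched in plain NumPy; the sizes, learning rate, and the omission of bias terms are simplifications for illustration, not the code under review in the pull request:

```python
import numpy as np

rng = np.random.RandomState(0)


def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))


def cd1_step(v0, W, lr=0.1):
    """One contrastive-divergence (CD-1) update for a binary RBM.

    v0 : (n_samples, n_visible) batch of visible units
    W  : (n_visible, n_hidden) weight matrix (biases omitted for brevity)
    """
    h0 = sigmoid(v0 @ W)                          # hidden probabilities
    h0_sample = (rng.rand(*h0.shape) < h0) * 1.0  # sample hidden states
    v1 = sigmoid(h0_sample @ W.T)                 # reconstruction
    h1 = sigmoid(v1 @ W)                          # hidden probs of reconstruction
    # positive phase minus negative phase, averaged over the batch
    return W + lr * (v0.T @ h0 - v1.T @ h1) / v0.shape[0]


v = (rng.rand(20, 6) < 0.5) * 1.0   # toy binary data
W = 0.01 * rng.randn(6, 4)
W = cd1_step(v, W)
print(W.shape)  # (6, 4)
```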

*Jun 17 - Jun 24*

Read and sketch out "Deep directed networks" in pseudo code

*Jun 24 - Jul 1*

Implement and document "Deep directed networks"

*Jul 1 - Jul 8*

Read and sketch out "Deep Boltzmann machines" in pseudo code

*Jul 8 - Jul 15*

Implement and document "Deep Boltzmann machines"

*Jul 15 - Jul 22*

Learn and implement GPU-accelerated Python techniques (e.g. shared variables) to improve speed

Add examples to the aforementioned algorithms

*Jul 22 - Jul 29*

Inspect implemented code thoroughly and submit for mid-term evaluation

*Jul 29 - Aug 5*

Read and sketch out "Deep belief networks" in pseudo code

*Aug 5 - Aug 12*

Implement and document "Deep belief networks"

*Aug 12 - Aug 19*

Read and sketch out "Greedy layer-wise learning of DBNs" in pseudo code

*Aug 19 - Aug 26*

Implement and document "Greedy layer-wise learning of DBNs"
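The greedy layer-wise idea itself is simple to state: train one layer on the data, then feed its hidden representation to the next layer as that layer's input. A toy NumPy sketch follows, with a stand-in random-projection "training" step rather than a real RBM fit:

```python
import numpy as np

rng = np.random.RandomState(0)


def train_layer(X, n_hidden):
    """Stand-in for fitting one RBM layer; here just a random projection."""
    return 0.01 * rng.randn(X.shape[1], n_hidden)


def greedy_pretrain(X, layer_sizes):
    """Greedy layer-wise pretraining: each layer is trained on the
    hidden activations of the layer below it."""
    weights, H = [], X
    for n_hidden in layer_sizes:
        W = train_layer(H, n_hidden)
        weights.append(W)
        H = 1.0 / (1.0 + np.exp(-(H @ W)))  # propagate activations up
    return weights, H


X = rng.rand(15, 10)
weights, top = greedy_pretrain(X, [8, 5, 2])
print([W.shape for W in weights])  # [(10, 8), (8, 5), (5, 2)]
```

Swapping `train_layer` for a real CD-trained RBM turns this loop into DBN pretraining in the usual sense.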

*Aug 26 - Sep 2*

Read and sketch out "Deep multi-layer perceptrons" in pseudo code

Read and sketch out "Deep auto-encoders" in pseudo code

*Sep 2 - Sep 9*

Implement and document "Deep multi-layer perceptrons"

Implement and document "Deep auto-encoders"
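As an illustration of what a deep auto-encoder computes, here is a forward pass with tied weights (encode with each W, decode with the transposed weights in reverse order); the weights are random placeholders rather than trained parameters:

```python
import numpy as np

rng = np.random.RandomState(0)


def autoencode(X, layer_sizes):
    """Encode X through successive layers, then decode back with the
    transposed (tied) weights; returns the code and the reconstruction."""
    weights, H = [], X
    n_in = X.shape[1]
    for n_out in layer_sizes:
        W = 0.01 * rng.randn(n_in, n_out)
        weights.append(W)
        H = np.tanh(H @ W)       # encoder layer
        n_in = n_out
    code = H
    for W in reversed(weights):  # decoder mirrors the encoder
        H = np.tanh(H @ W.T)
    return code, H


X = rng.rand(12, 9)
code, recon = autoencode(X, [6, 3])
print(code.shape, recon.shape)  # (12, 3) (12, 9)
```

Training would minimize the reconstruction error between `X` and `recon`; the narrow middle layer is what makes the code useful for visualization and retrieval.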

*Sep 9 - Sep 16*

Inspect all code thoroughly, and fix defects

Add proper examples:

  * Handwritten digit classification using DBNs
  * Data visualization and feature discovery using deep auto-encoders
  * Information retrieval using deep auto-encoders
  * Learning audio features using 1d convolutional DBNs
  * Learning image features using 2d convolutional DBNs

*Sep 16 - Sep 23*

Write user-friendly, global documentation giving step-by-step guidance for getting started with deep learning in scikit-learn

*Link to a patch/code sample:*

http://deeplearning.net/tutorial/gettingstarted.html

*Additional Information:*

[1] Kevin P. Murphy. /Machine Learning: A Probabilistic Perspective/.

[2] Lee, Honglak, et al. "Convolutional deep belief networks for scalable unsupervised learning of hierarchical representations." /Proceedings of the 26th Annual International Conference on Machine Learning/. ACM, 2009.

[3] Hinton, Geoffrey E., Simon Osindero, and Yee-Whye Teh. "A fast learning algorithm for deep belief nets." /Neural Computation/ 18.7 (2006): 1527-1554.






_______________________________________________
Scikit-learn-general mailing list
Scikit-learn-general@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/scikit-learn-general

