In that case, Doug King, would you be interested in leading a team in a competition? I'll help you gather interested parties on the mailing list and elsewhere.
--------- Matt Taylor OS Community Flag-Bearer Numenta On Tue, Nov 19, 2013 at 4:54 AM, Pedro Tabacof <[email protected]> wrote: > I'm sorry for disappearing, but I work full time and it has been hell those > past weeks (and I also moved to a new place). I don't know when I will be > available for this kind of endeavor again. > > Pedro. > > > On Fri, Nov 15, 2013 at 5:16 PM, Matthew Taylor <[email protected]> wrote: >> >> I don't think it matters whether it's a Kaggle competition or another ML >> competition or public data set that is regularly used for this type of >> thing. >> >> I started this wiki with ML competitions, and I'm interested in adding to >> it. (Anyone else want to help out filling this list?) >> >> https://github.com/numenta/nupic/wiki/Machine-Learning-Competitions >> >> So there are some people in the community interested in working on a >> competition with NuPIC, we just need a way to identify and organize them. >> Since Pedro started this initiative, maybe he'd still be interested in >> leading the charge? >> >> Pedro, what say you? >> >> >> --------- >> Matt Taylor >> OS Community Flag-Bearer >> Numenta >> >> >> On Thu, Nov 14, 2013 at 10:18 AM, Doug King <[email protected]> wrote: >>> >>> Matt, >>> >>> I'm still interested in doing a Kaggle contest. I'm not sure why we did >>> not get the momentum we needed - i assumed the Hackathon took some focus >>> away, and it was ambitious to jump into a Kaggle contest half way to the >>> deadline. >>> >>> I thought the contest we chose, weather prediction, was an good fit for >>> the CLA - time based prediction, practical application, good metric for >>> comparison to other approaches. >>> >>> Anyway, I'm interested in selecting another Kaggle contest. I think it's >>> a great way to prove the value of the CLA approach and the dynamics of a >>> contest using the Kaggle framework provide an excellent platform for this. >>> If enough members want to do this its possible we could even assemble more >>> than one team from our forum. >>> >>> Thoughts? >>> >>> -Doug >>> >>> >>> On Wed, Nov 13, 2013 at 9:01 PM, Azat <[email protected]> wrote: >>>> >>>> Is convolutional network is NuPIC's category then. >>>> How about preprocessing of images as linking the same digits via elastic >>>> temporal transformations - it must be perfect in HTM's terms ? >>>> I'm not sure it is allowed though.. >>>> >>>> Azat >>>> >>>> -------------------------------------------- >>>> On Wed, 11/13/13, Matthew Taylor <[email protected]> wrote: >>>> >>>> Subject: Re: [nupic-dev] Kaggle team assemble >>>> To: "NuPIC general mailing list." <[email protected]> >>>> Date: Wednesday, November 13, 2013, 9:33 PM >>>> >>>> That sounds good to me. >>>> Numenta has always been criticized[1] for not providing >>>> comparisons between NuPIC and other ML approaches by posting >>>> the results for standard data sets and taking part in >>>> competitions. It would be great if we could mobilize >>>> motivated members of the community to tackle things like >>>> this. I've created a wiki[2] with a list of common data >>>> sets used for ML comparisons, as well as competitions. >>>> >>>> >>>> If anyone is willing to work with one of these >>>> data sets, I would be happy to publicize your work as well >>>> as I can. The more we can interact with the existing ML >>>> community, the more participation we'll get from them. >>>> >>>> >>>> >>>> [1] >>>> http://www.reddit.com/r/MachineLearning/comments/1qdu8i/cla_quiz_office_hour/[2] >>>> https://github.com/numenta/nupic/wiki/Machine-Learning-Competitions >>>> >>>> >>>> >>>> ---------Matt >>>> TaylorOS Community >>>> Flag-BearerNumenta >>>> >>>> >>>> On Wed, Nov 13, 2013 at >>>> 7:11 PM, Chetan Surpur <[email protected]> >>>> wrote: >>>> >>>> >>>> Matt, >>>> I didn't actually see any communication about >>>> it since the last email, so I'm not sure what happened. >>>> I ended up focusing my time on preparing the Pilot framework >>>> for the hackathon. Either way, there's not much time >>>> left in the competition now. Maybe we can shoot for another >>>> one? >>>> >>>> >>>> >>>> >>>> - Chetan >>>> >>>> On Wed, Nov 13, 2013 at >>>> 7:14 AM, Matthew Taylor <[email protected]> >>>> wrote: >>>> >>>> >>>> >>>> >>>> So are you guys making any progress? >>>> >>>> >>>> ---------Matt TaylorOS >>>> Community Flag-BearerNumenta >>>> >>>> >>>> On Tue, >>>> Oct 22, 2013 at 8:23 PM, Chetan Surpur <[email protected]> >>>> wrote: >>>> >>>> >>>> >>>> >>>> Kaggle team, >>>> Hope everyone's okay with communicating >>>> general stuff here, as per request. >>>> I was thinking that we could start simply, by >>>> just feeding in the stream of solar energy readings to the >>>> CLA sequentially. This will allow us to see how well it does >>>> at predicting solar energy from just the temporal sequence, >>>> and the time of day, day of week, etc. >>>> >>>> >>>> >>>> >>>> >>>> >>>> >>>> >>>> In parallel, someone else can learn to extract >>>> what we need from the GEFS weather data, correlating the >>>> weather variables with each solar energy reading time. We >>>> can build a CSV file of all the variables together, and run >>>> a swarm. >>>> >>>> >>>> >>>> >>>> >>>> >>>> >>>> >>>> How does this approach sound? >>>> I can volunteer to set up the project and start >>>> feeding the solar energy with temporal information to the >>>> CLA. Whoever wants to help is welcome to. Anyone up for >>>> massaging the GEFS data as described above? >>>> >>>> >>>> >>>> >>>> >>>> >>>> >>>> >>>> Thanks,Chetan >>>> P.S. We still need a team >>>> name! >>>> >>>> On Mon, Oct 21, 2013 at >>>> 9:31 PM, Tim McNamara <[email protected]> >>>> wrote: >>>> >>>> >>>> >>>> >>>> >>>> >>>> >>>> >>>> +1 >>>> Very interested in observing this >>>> project >>>> On 22/10/2013 12:08 AM, >>>> "Matthew Taylor" <[email protected]> >>>> wrote: >>>> >>>> >>>> >>>> >>>> >>>> >>>> >>>> >>>> >>>> Pedro et al, >>>> >>>> >>>> >>>> Unless you're actually going to be dealing with >>>> proprietary >>>> >>>> information, I encourage you to have your Kaggle discussions >>>> here on >>>> >>>> the mailing list. I'm sure there are others out there >>>> interested in >>>> >>>> what you're doing, but unable to take part (like me). >>>> >>>> >>>> >>>> Feel free to start a new thread, or use this one, but please >>>> use this >>>> >>>> list if you can. No one has complained about too many >>>> emails... yet >>>> >>>> ;). >>>> >>>> >>>> >>>> --------- >>>> >>>> Matt Taylor >>>> >>>> OS Community Flag-Bearer >>>> >>>> Numenta >>>> >>>> >>>> >>>> >>>> >>>> On Fri, Oct 18, 2013 at 11:39 AM, Pedro Tabacof <[email protected]> >>>> wrote: >>>> >>>> > Hello, >>>> >>>> > >>>> >>>> > Thank you all for the interest. I think now it's >>>> time to create a private >>>> >>>> > thread for the competition. The Solar Energy >>>> competition was the most voted >>>> >>>> > one so it can be our first Kaggle attempt. >>>> >>>> > >>>> >>>> > If anyone else is also interested in joining the >>>> competition, just send me a >>>> >>>> > message. >>>> >>>> > >>>> >>>> > Pedro. >>>> >>>> > >>>> >>>> > >>>> >>>> > On Fri, Oct 18, 2013 at 3:13 PM, Doug King <[email protected]> >>>> wrote: >>>> >>>> >> >>>> >>>> >> I'm interested in joining the team. I looked at >>>> the Kaggle contests and I >>>> >>>> >> agree that the best choice would be Solar Energy >>>> Prediction, but I wouldn't >>>> >>>> >> discount the Phone accelerometer contest. I think >>>> its a good fit for the CLA >>>> >>>> >> and and interesting application. The forum posts >>>> for that contest seem to >>>> >>>> >> indicate the sponsors are willing to judge a >>>> 'clean' (no leakage) version of >>>> >>>> >> the contest. >>>> >>>> >> >>>> >>>> >> >>>> >>>> >> On Thu, Oct 17, 2013 at 10:45 AM, Jay Gokhale >>>> <[email protected]> >>>> wrote: >>>> >>>> >>> >>>> >>>> >>> I'd love to use CLA's on a kaggle >>>> project! >>>> >>>> >>> >>>> >>>> >>> If Chetan's description is correct, it >>>> seems to me that the Solar Energy >>>> >>>> >>> Prediction contest would be the most straight >>>> forward one to work on. >>>> >>>> >>> >>>> >>>> >>> Jay Gokhale >>>> >>>> >>> >>>> >>>> >>> >>>> >>>> >>> On Wed, Oct 16, 2013 at 9:07 PM, Matthew Taylor >>>> <[email protected]> >>>> wrote: >>>> >>>> >>>> >>>> >>>> >>>> These also sound like good hackathon >>>> projects! >>>> >>>> >>>> >>>> >>>> >>>> Matt >>>> >>>> >>>> >>>> >>>> >>>> Sent from my MegaPhone >>>> >>>> >>>> >>>> >>>> >>>> On Oct 16, 2013, at 8:03 PM, Chetan Surpur >>>> <[email protected]> >>>> wrote: >>>> >>>> >>>> >>>> >>>> >>>> I'm interested in participating! >>>> >>>> >>>> >>>> >>>> >>>> I haven't decided on one yet, but I >>>> wanted to point out that the CLA >>>> >>>> >>>> looks ideally suited for the AMS 2013-2014 >>>> Solar Energy Prediction contest. >>>> >>>> >>>> From what I can tell, it's about >>>> predicting the daily energy output of solar >>>> >>>> >>>> farms from 12, 15, 18, 21, 24-hourly >>>> training data. There's 29 days to go. >>>> >>>> >>>> >>>> >>>> >>>> >>>> >>>> >>>> On Wed, Oct 16, 2013 at 7:31 PM, Pedro >>>> Tabacof <[email protected]> >>>> >>>> >>>> wrote: >>>> >>>> >>>>> >>>> >>>> >>>>> Hello, >>>> >>>> >>>>> >>>> >>>> >>>>> After the warm reception of the Kaggle >>>> competition idea, I decided to >>>> >>>> >>>>> create the thread asap. For those who >>>> are not aware, Kaggle is a data mining >>>> >>>> >>>>> competition website. It has been hyped >>>> on the machine learning media and has >>>> >>>> >>>>> some great people participating in it. >>>> It rewards with different prizes, but >>>> >>>> >>>>> my focus is not earn money, but rather >>>> to see how good NuPIC would fare >>>> >>>> >>>>> against the state of the art. I found >>>> those following competitions to be a >>>> >>>> >>>>> viable start: >>>> >>>> >>>>> >>>> >>>> >>>>> 1) Accelerometer Biometric Competition >>>> >>>> >>>>> From real acceleration data you have to >>>> guess from which cell phone >>>> >>>> >>>>> came the test samples. >>>> >>>> >>>>> 36 days to go >>>> >>>> >>>>> Pros: >>>> >>>> >>>>> -Multivariable, temporal data seems >>>> ideal for the CLA >>>> >>>> >>>>> Cons: >>>> >>>> >>>>> -The best competitors are using data >>>> leaks (meta-information), so the >>>> >>>> >>>>> chance of winning fairly is zero >>>> (though they will give a consolation prize >>>> >>>> >>>>> to the best "honest" entry) >>>> >>>> >>>>> Questions: >>>> >>>> >>>>> -How fast would NuPIC go through a 1GB >>>> CSV file? >>>> >>>> >>>>> -The test data is not open-ended, that >>>> is, you have to say whether a >>>> >>>> >>>>> recording belongs to a specific cell >>>> phone or not, with 50% chance of being >>>> >>>> >>>>> right. Would NuPIC do well in this >>>> case? >>>> >>>> >>>>> >>>> >>>> >>>>> 2) Multi-Label Bird Species >>>> Classification >>>> >>>> >>>>> Try to guess the bird species present >>>> on some audio recordings >>>> >>>> >>>>> 38 days to go >>>> >>>> >>>>> Pros: >>>> >>>> >>>>> -Temporal data >>>> >>>> >>>>> Cons: >>>> >>>> >>>>> -Multiple labels, so I think we would >>>> need 87 different models, which >>>> >>>> >>>>> is kinda impractical >>>> >>>> >>>>> Question: >>>> >>>> >>>>> -Is there a better way to do >>>> multi-label classification with NuPIC? >>>> >>>> >>>>> >>>> >>>> >>>>> 3) Conway's Reverse Game of Life >>>> >>>> >>>>> Reverse the game of life for 1 to 5 >>>> steps >>>> >>>> >>>>> 4 months to go >>>> >>>> >>>>> Pros: >>>> >>>> >>>>> -Natural input sparseness (have to >>>> check on it) >>>> >>>> >>>>> -Temporal data >>>> >>>> >>>>> -Multistep classification >>>> >>>> >>>>> -We can generate our own training data >>>> if necessary >>>> >>>> >>>>> Cons: >>>> >>>> >>>>> -Maybe there are very straightforward >>>> ways to solve this (rule based, >>>> >>>> >>>>> brute force, etc) >>>> >>>> >>>>> -Too much time before the competition >>>> ends >>>> >>>> >>>>> Questions: >>>> >>>> >>>>> -How sparse does the input need to be >>>> for the CLA to work well? >>>> >>>> >>>>> -Is it possible to use a 2D arrangement >>>> on the cells connection matrix? >>>> >>>> >>>>> >>>> >>>> >>>>> There is also the AMS 2013-2014 Solar >>>> Energy Prediction contest, but I >>>> >>>> >>>>> didn't have time to go over it. The >>>> other competitions don't seem to be >>>> >>>> >>>>> suited for NuPIC. >>>> >>>> >>>>> >>>> >>>> >>>>> Who here is interested on participating >>>> and what competition do you >>>> >>>> >>>>> think to be the most promising one? >>>> >>>> >>>>> >>>> >>>> >>>>> Pedro. >>>> >>>> >>>>> -- >>>> >>>> >>>>> Pedro Tabacof, >>>> >>>> >>>>> Unicamp - Eng. de Computação 08. >>>> >>>> >>>>> >>>> >>>> >>>>> >>>> _______________________________________________ >>>> >>>> >>>>> nupic mailing list >>>> >>>> >>>>> [email protected] >>>> >>>> >>>>> http://lists.numenta.org/mailman/listinfo/nupic_lists.numenta.org >>>> >>>> >>>>> >>>> >>>> >>>> >>>> >>>> >>>> >>>> _______________________________________________ >>>> >>>> >>>> nupic mailing list >>>> >>>> >>>> [email protected] >>>> >>>> >>>> http://lists.numenta.org/mailman/listinfo/nupic_lists.numenta.org >>>> >>>> >>>> >>>> >>>> >>>> >>>> >>>> >>>> >>>> _______________________________________________ >>>> >>>> >>>> nupic mailing list >>>> >>>> >>>> [email protected] >>>> >>>> >>>> http://lists.numenta.org/mailman/listinfo/nupic_lists.numenta.org >>>> >>>> >>>> >>>> >>>> >>> >>>> >>>> >>> >>>> >>>> >>> >>>> _______________________________________________ >>>> >>>> >>> nupic mailing list >>>> >>>> >>> [email protected] >>>> >>>> >>> http://lists.numenta.org/mailman/listinfo/nupic_lists.numenta.org >>>> >>>> >>> >>>> >>>> >> >>>> >>>> >> >>>> >>>> >> _______________________________________________ >>>> >>>> >> nupic mailing list >>>> >>>> >> [email protected] >>>> >>>> >> http://lists.numenta.org/mailman/listinfo/nupic_lists.numenta.org >>>> >>>> >> >>>> >>>> > >>>> >>>> > >>>> >>>> > >>>> >>>> > -- >>>> >>>> > Pedro Tabacof, >>>> >>>> > Unicamp - Eng. de Computação 08. >>>> >>>> > >>>> >>>> > _______________________________________________ >>>> >>>> > nupic mailing list >>>> >>>> > [email protected] >>>> >>>> > http://lists.numenta.org/mailman/listinfo/nupic_lists.numenta.org >>>> >>>> > >>>> >>>> >>>> >>>> _______________________________________________ >>>> >>>> nupic mailing list >>>> >>>> [email protected] >>>> >>>> http://lists.numenta.org/mailman/listinfo/nupic_lists.numenta.org >>>> >>>> >>>> >>>> _______________________________________________ >>>> >>>> nupic mailing list >>>> >>>> [email protected] >>>> >>>> http://lists.numenta.org/mailman/listinfo/nupic_lists.numenta.org >>>> >>>> >>>> >>>> >>>> >>>> _______________________________________________ >>>> >>>> nupic mailing list >>>> >>>> [email protected] >>>> >>>> http://lists.numenta.org/mailman/listinfo/nupic_lists.numenta.org >>>> >>>> >>>> >>>> >>>> >>>> _______________________________________________ >>>> >>>> nupic mailing list >>>> >>>> [email protected] >>>> >>>> http://lists.numenta.org/mailman/listinfo/nupic_lists.numenta.org >>>> >>>> >>>> >>>> >>>> >>>> _______________________________________________ >>>> >>>> nupic mailing list >>>> >>>> [email protected] >>>> >>>> http://lists.numenta.org/mailman/listinfo/nupic_lists.numenta.org >>>> >>>> >>>> >>>> >>>> >>>> -----Inline Attachment Follows----- >>>> >>>> _______________________________________________ >>>> nupic mailing list >>>> [email protected] >>>> http://lists.numenta.org/mailman/listinfo/nupic_lists.numenta.org >>>> >>>> >>>> _______________________________________________ >>>> nupic mailing list >>>> [email protected] >>>> http://lists.numenta.org/mailman/listinfo/nupic_lists.numenta.org >>> >>> >>> >>> _______________________________________________ >>> nupic mailing list >>> [email protected] >>> http://lists.numenta.org/mailman/listinfo/nupic_lists.numenta.org >>> >> >> >> _______________________________________________ >> nupic mailing list >> [email protected] >> http://lists.numenta.org/mailman/listinfo/nupic_lists.numenta.org >> > > > > -- > Pedro Tabacof, > Unicamp - Eng. de Computação 08. > > _______________________________________________ > nupic mailing list > [email protected] > http://lists.numenta.org/mailman/listinfo/nupic_lists.numenta.org > _______________________________________________ nupic mailing list [email protected] http://lists.numenta.org/mailman/listinfo/nupic_lists.numenta.org
