I don't think it matters whether it's a Kaggle competition or another ML competition or public data set that is regularly used for this type of thing.
I started this wiki with ML competitions, and I'm interested in adding to it. (Anyone else want to help out filling this list?) https://github.com/numenta/nupic/wiki/Machine-Learning-Competitions So there are some people in the community interested in working on a competition with NuPIC, we just need a way to identify and organize them. Since Pedro started this initiative, maybe he'd still be interested in leading the charge? Pedro, what say you? --------- Matt Taylor OS Community Flag-Bearer Numenta On Thu, Nov 14, 2013 at 10:18 AM, Doug King <[email protected]> wrote: > Matt, > > I'm still interested in doing a Kaggle contest. I'm not sure why we did > not get the momentum we needed - i assumed the Hackathon took some focus > away, and it was ambitious to jump into a Kaggle contest half way to the > deadline. > > I thought the contest we chose, weather prediction, was an good fit for > the CLA - time based prediction, practical application, good metric for > comparison to other approaches. > > Anyway, I'm interested in selecting another Kaggle contest. I think it's a > great way to prove the value of the CLA approach and the dynamics of a > contest using the Kaggle framework provide an excellent platform for this. > If enough members want to do this its possible we could even assemble more > than one team from our forum. > > Thoughts? > > -Doug > > > On Wed, Nov 13, 2013 at 9:01 PM, Azat <[email protected]> wrote: > >> Is convolutional network is NuPIC's category then. >> How about preprocessing of images as linking the same digits via elastic >> temporal transformations - it must be perfect in HTM's terms ? >> I'm not sure it is allowed though.. >> >> Azat >> >> -------------------------------------------- >> On Wed, 11/13/13, Matthew Taylor <[email protected]> wrote: >> >> Subject: Re: [nupic-dev] Kaggle team assemble >> To: "NuPIC general mailing list." <[email protected]> >> Date: Wednesday, November 13, 2013, 9:33 PM >> >> That sounds good to me. >> Numenta has always been criticized[1] for not providing >> comparisons between NuPIC and other ML approaches by posting >> the results for standard data sets and taking part in >> competitions. It would be great if we could mobilize >> motivated members of the community to tackle things like >> this. I've created a wiki[2] with a list of common data >> sets used for ML comparisons, as well as competitions. >> >> >> If anyone is willing to work with one of these >> data sets, I would be happy to publicize your work as well >> as I can. The more we can interact with the existing ML >> community, the more participation we'll get from them. >> >> >> >> [1] >> http://www.reddit.com/r/MachineLearning/comments/1qdu8i/cla_quiz_office_hour/[2] >> https://github.com/numenta/nupic/wiki/Machine-Learning-Competitions >> >> >> >> ---------Matt >> TaylorOS Community >> Flag-BearerNumenta >> >> >> On Wed, Nov 13, 2013 at >> 7:11 PM, Chetan Surpur <[email protected]> >> wrote: >> >> >> Matt, >> I didn't actually see any communication about >> it since the last email, so I'm not sure what happened. >> I ended up focusing my time on preparing the Pilot framework >> for the hackathon. Either way, there's not much time >> left in the competition now. Maybe we can shoot for another >> one? >> >> >> >> >> - Chetan >> >> On Wed, Nov 13, 2013 at >> 7:14 AM, Matthew Taylor <[email protected]> >> wrote: >> >> >> >> >> So are you guys making any progress? >> >> >> ---------Matt TaylorOS >> Community Flag-BearerNumenta >> >> >> On Tue, >> Oct 22, 2013 at 8:23 PM, Chetan Surpur <[email protected]> >> wrote: >> >> >> >> >> Kaggle team, >> Hope everyone's okay with communicating >> general stuff here, as per request. >> I was thinking that we could start simply, by >> just feeding in the stream of solar energy readings to the >> CLA sequentially. This will allow us to see how well it does >> at predicting solar energy from just the temporal sequence, >> and the time of day, day of week, etc. >> >> >> >> >> >> >> >> >> In parallel, someone else can learn to extract >> what we need from the GEFS weather data, correlating the >> weather variables with each solar energy reading time. We >> can build a CSV file of all the variables together, and run >> a swarm. >> >> >> >> >> >> >> >> >> How does this approach sound? >> I can volunteer to set up the project and start >> feeding the solar energy with temporal information to the >> CLA. Whoever wants to help is welcome to. Anyone up for >> massaging the GEFS data as described above? >> >> >> >> >> >> >> >> >> Thanks,Chetan >> P.S. We still need a team >> name! >> >> On Mon, Oct 21, 2013 at >> 9:31 PM, Tim McNamara <[email protected]> >> wrote: >> >> >> >> >> >> >> >> >> +1 >> Very interested in observing this >> project >> On 22/10/2013 12:08 AM, >> "Matthew Taylor" <[email protected]> >> wrote: >> >> >> >> >> >> >> >> >> >> Pedro et al, >> >> >> >> Unless you're actually going to be dealing with >> proprietary >> >> information, I encourage you to have your Kaggle discussions >> here on >> >> the mailing list. I'm sure there are others out there >> interested in >> >> what you're doing, but unable to take part (like me). >> >> >> >> Feel free to start a new thread, or use this one, but please >> use this >> >> list if you can. No one has complained about too many >> emails... yet >> >> ;). >> >> >> >> --------- >> >> Matt Taylor >> >> OS Community Flag-Bearer >> >> Numenta >> >> >> >> >> >> On Fri, Oct 18, 2013 at 11:39 AM, Pedro Tabacof <[email protected]> >> wrote: >> >> > Hello, >> >> > >> >> > Thank you all for the interest. I think now it's >> time to create a private >> >> > thread for the competition. The Solar Energy >> competition was the most voted >> >> > one so it can be our first Kaggle attempt. >> >> > >> >> > If anyone else is also interested in joining the >> competition, just send me a >> >> > message. >> >> > >> >> > Pedro. >> >> > >> >> > >> >> > On Fri, Oct 18, 2013 at 3:13 PM, Doug King <[email protected]> >> wrote: >> >> >> >> >> >> I'm interested in joining the team. I looked at >> the Kaggle contests and I >> >> >> agree that the best choice would be Solar Energy >> Prediction, but I wouldn't >> >> >> discount the Phone accelerometer contest. I think >> its a good fit for the CLA >> >> >> and and interesting application. The forum posts >> for that contest seem to >> >> >> indicate the sponsors are willing to judge a >> 'clean' (no leakage) version of >> >> >> the contest. >> >> >> >> >> >> >> >> >> On Thu, Oct 17, 2013 at 10:45 AM, Jay Gokhale >> <[email protected]> >> wrote: >> >> >>> >> >> >>> I'd love to use CLA's on a kaggle >> project! >> >> >>> >> >> >>> If Chetan's description is correct, it >> seems to me that the Solar Energy >> >> >>> Prediction contest would be the most straight >> forward one to work on. >> >> >>> >> >> >>> Jay Gokhale >> >> >>> >> >> >>> >> >> >>> On Wed, Oct 16, 2013 at 9:07 PM, Matthew Taylor >> <[email protected]> >> wrote: >> >> >>>> >> >> >>>> These also sound like good hackathon >> projects! >> >> >>>> >> >> >>>> Matt >> >> >>>> >> >> >>>> Sent from my MegaPhone >> >> >>>> >> >> >>>> On Oct 16, 2013, at 8:03 PM, Chetan Surpur >> <[email protected]> >> wrote: >> >> >>>> >> >> >>>> I'm interested in participating! >> >> >>>> >> >> >>>> I haven't decided on one yet, but I >> wanted to point out that the CLA >> >> >>>> looks ideally suited for the AMS 2013-2014 >> Solar Energy Prediction contest. >> >> >>>> From what I can tell, it's about >> predicting the daily energy output of solar >> >> >>>> farms from 12, 15, 18, 21, 24-hourly >> training data. There's 29 days to go. >> >> >>>> >> >> >>>> >> >> >>>> On Wed, Oct 16, 2013 at 7:31 PM, Pedro >> Tabacof <[email protected]> >> >> >>>> wrote: >> >> >>>>> >> >> >>>>> Hello, >> >> >>>>> >> >> >>>>> After the warm reception of the Kaggle >> competition idea, I decided to >> >> >>>>> create the thread asap. For those who >> are not aware, Kaggle is a data mining >> >> >>>>> competition website. It has been hyped >> on the machine learning media and has >> >> >>>>> some great people participating in it. >> It rewards with different prizes, but >> >> >>>>> my focus is not earn money, but rather >> to see how good NuPIC would fare >> >> >>>>> against the state of the art. I found >> those following competitions to be a >> >> >>>>> viable start: >> >> >>>>> >> >> >>>>> 1) Accelerometer Biometric Competition >> >> >>>>> From real acceleration data you have to >> guess from which cell phone >> >> >>>>> came the test samples. >> >> >>>>> 36 days to go >> >> >>>>> Pros: >> >> >>>>> -Multivariable, temporal data seems >> ideal for the CLA >> >> >>>>> Cons: >> >> >>>>> -The best competitors are using data >> leaks (meta-information), so the >> >> >>>>> chance of winning fairly is zero >> (though they will give a consolation prize >> >> >>>>> to the best "honest" entry) >> >> >>>>> Questions: >> >> >>>>> -How fast would NuPIC go through a 1GB >> CSV file? >> >> >>>>> -The test data is not open-ended, that >> is, you have to say whether a >> >> >>>>> recording belongs to a specific cell >> phone or not, with 50% chance of being >> >> >>>>> right. Would NuPIC do well in this >> case? >> >> >>>>> >> >> >>>>> 2) Multi-Label Bird Species >> Classification >> >> >>>>> Try to guess the bird species present >> on some audio recordings >> >> >>>>> 38 days to go >> >> >>>>> Pros: >> >> >>>>> -Temporal data >> >> >>>>> Cons: >> >> >>>>> -Multiple labels, so I think we would >> need 87 different models, which >> >> >>>>> is kinda impractical >> >> >>>>> Question: >> >> >>>>> -Is there a better way to do >> multi-label classification with NuPIC? >> >> >>>>> >> >> >>>>> 3) Conway's Reverse Game of Life >> >> >>>>> Reverse the game of life for 1 to 5 >> steps >> >> >>>>> 4 months to go >> >> >>>>> Pros: >> >> >>>>> -Natural input sparseness (have to >> check on it) >> >> >>>>> -Temporal data >> >> >>>>> -Multistep classification >> >> >>>>> -We can generate our own training data >> if necessary >> >> >>>>> Cons: >> >> >>>>> -Maybe there are very straightforward >> ways to solve this (rule based, >> >> >>>>> brute force, etc) >> >> >>>>> -Too much time before the competition >> ends >> >> >>>>> Questions: >> >> >>>>> -How sparse does the input need to be >> for the CLA to work well? >> >> >>>>> -Is it possible to use a 2D arrangement >> on the cells connection matrix? >> >> >>>>> >> >> >>>>> There is also the AMS 2013-2014 Solar >> Energy Prediction contest, but I >> >> >>>>> didn't have time to go over it. The >> other competitions don't seem to be >> >> >>>>> suited for NuPIC. >> >> >>>>> >> >> >>>>> Who here is interested on participating >> and what competition do you >> >> >>>>> think to be the most promising one? >> >> >>>>> >> >> >>>>> Pedro. >> >> >>>>> -- >> >> >>>>> Pedro Tabacof, >> >> >>>>> Unicamp - Eng. de Computação 08. >> >> >>>>> >> >> >>>>> >> _______________________________________________ >> >> >>>>> nupic mailing list >> >> >>>>> [email protected] >> >> >>>>> http://lists.numenta.org/mailman/listinfo/nupic_lists.numenta.org >> >> >>>>> >> >> >>>> >> >> >>>> >> _______________________________________________ >> >> >>>> nupic mailing list >> >> >>>> [email protected] >> >> >>>> http://lists.numenta.org/mailman/listinfo/nupic_lists.numenta.org >> >> >>>> >> >> >>>> >> >> >>>> >> _______________________________________________ >> >> >>>> nupic mailing list >> >> >>>> [email protected] >> >> >>>> http://lists.numenta.org/mailman/listinfo/nupic_lists.numenta.org >> >> >>>> >> >> >>> >> >> >>> >> >> >>> >> _______________________________________________ >> >> >>> nupic mailing list >> >> >>> [email protected] >> >> >>> http://lists.numenta.org/mailman/listinfo/nupic_lists.numenta.org >> >> >>> >> >> >> >> >> >> >> >> >> _______________________________________________ >> >> >> nupic mailing list >> >> >> [email protected] >> >> >> http://lists.numenta.org/mailman/listinfo/nupic_lists.numenta.org >> >> >> >> >> > >> >> > >> >> > >> >> > -- >> >> > Pedro Tabacof, >> >> > Unicamp - Eng. de Computação 08. >> >> > >> >> > _______________________________________________ >> >> > nupic mailing list >> >> > [email protected] >> >> > http://lists.numenta.org/mailman/listinfo/nupic_lists.numenta.org >> >> > >> >> >> >> _______________________________________________ >> >> nupic mailing list >> >> [email protected] >> >> http://lists.numenta.org/mailman/listinfo/nupic_lists.numenta.org >> >> >> >> _______________________________________________ >> >> nupic mailing list >> >> [email protected] >> >> http://lists.numenta.org/mailman/listinfo/nupic_lists.numenta.org >> >> >> >> >> >> _______________________________________________ >> >> nupic mailing list >> >> [email protected] >> >> http://lists.numenta.org/mailman/listinfo/nupic_lists.numenta.org >> >> >> >> >> >> _______________________________________________ >> >> nupic mailing list >> >> [email protected] >> >> http://lists.numenta.org/mailman/listinfo/nupic_lists.numenta.org >> >> >> >> >> >> _______________________________________________ >> >> nupic mailing list >> >> [email protected] >> >> http://lists.numenta.org/mailman/listinfo/nupic_lists.numenta.org >> >> >> >> >> >> -----Inline Attachment Follows----- >> >> _______________________________________________ >> nupic mailing list >> [email protected] >> http://lists.numenta.org/mailman/listinfo/nupic_lists.numenta.org >> >> >> _______________________________________________ >> nupic mailing list >> [email protected] >> http://lists.numenta.org/mailman/listinfo/nupic_lists.numenta.org >> > > > _______________________________________________ > nupic mailing list > [email protected] > http://lists.numenta.org/mailman/listinfo/nupic_lists.numenta.org > >
_______________________________________________ nupic mailing list [email protected] http://lists.numenta.org/mailman/listinfo/nupic_lists.numenta.org
