I don't think it matters whether it's a Kaggle competition or another ML
competition or public data set that is regularly used for this type of
thing.

I started this wiki with ML competitions, and I'm interested in adding to
it. (Anyone else want to help out filling this list?)

https://github.com/numenta/nupic/wiki/Machine-Learning-Competitions

So there are some people in the community interested in working on a
competition with NuPIC, we just need a way to identify and organize them.
Since Pedro started this initiative, maybe he'd still be interested in
leading the charge?

Pedro, what say you?


---------
Matt Taylor
OS Community Flag-Bearer
Numenta


On Thu, Nov 14, 2013 at 10:18 AM, Doug King <[email protected]> wrote:

> Matt,
>
> I'm still interested in doing a Kaggle contest. I'm not sure why we did
> not get the momentum we needed - i assumed the Hackathon took some focus
> away, and it was ambitious to jump into a Kaggle contest half way to the
> deadline.
>
> I thought the contest we chose, weather prediction, was an good fit for
> the CLA - time based prediction, practical application, good metric for
> comparison to other approaches.
>
> Anyway, I'm interested in selecting another Kaggle contest. I think it's a
> great way to prove the value of the CLA approach and the dynamics of a
> contest using the Kaggle framework provide an excellent platform for this.
> If enough members want to do this its possible we could even assemble more
> than one team from our forum.
>
> Thoughts?
>
> -Doug
>
>
> On Wed, Nov 13, 2013 at 9:01 PM, Azat <[email protected]> wrote:
>
>> Is convolutional network is NuPIC's category then.
>> How about preprocessing of images as linking the same digits via elastic
>> temporal transformations - it must be perfect in HTM's terms ?
>> I'm not sure it is allowed though..
>>
>> Azat
>>
>> --------------------------------------------
>> On Wed, 11/13/13, Matthew Taylor <[email protected]> wrote:
>>
>>  Subject: Re: [nupic-dev] Kaggle team assemble
>>  To: "NuPIC general mailing list." <[email protected]>
>>  Date: Wednesday, November 13, 2013, 9:33 PM
>>
>>  That sounds good to me.
>>  Numenta has always been criticized[1] for not providing
>>  comparisons between NuPIC and other ML approaches by posting
>>  the results for standard data sets and taking part in
>>  competitions. It would be great if we could mobilize
>>  motivated members of the community to tackle things like
>>  this. I've created a wiki[2] with a list of common data
>>  sets used for ML comparisons, as well as competitions.
>>
>>
>>  If anyone is willing to work with one of these
>>  data sets, I would be happy to publicize your work as well
>>  as I can. The more we can interact with the existing ML
>>  community, the more participation we'll get from them.
>>
>>
>>
>>  [1]
>> http://www.reddit.com/r/MachineLearning/comments/1qdu8i/cla_quiz_office_hour/[2]
>>  https://github.com/numenta/nupic/wiki/Machine-Learning-Competitions
>>
>>
>>
>>  ---------Matt
>>  TaylorOS Community
>>  Flag-BearerNumenta
>>
>>
>>  On Wed, Nov 13, 2013 at
>>  7:11 PM, Chetan Surpur <[email protected]>
>>  wrote:
>>
>>
>>  Matt,
>>  I didn't actually see any communication about
>>  it since the last email, so I'm not sure what happened.
>>  I ended up focusing my time on preparing the Pilot framework
>>  for the hackathon. Either way, there's not much time
>>  left in the competition now. Maybe we can shoot for another
>>  one?
>>
>>
>>
>>
>>  - Chetan
>>
>>  On Wed, Nov 13, 2013 at
>>  7:14 AM, Matthew Taylor <[email protected]>
>>  wrote:
>>
>>
>>
>>
>>  So are you guys making any progress?
>>
>>
>>  ---------Matt TaylorOS
>>  Community Flag-BearerNumenta
>>
>>
>>  On Tue,
>>  Oct 22, 2013 at 8:23 PM, Chetan Surpur <[email protected]>
>>  wrote:
>>
>>
>>
>>
>>  Kaggle team,
>>  Hope everyone's okay with communicating
>>  general stuff here, as per request.
>>  I was thinking that we could start simply, by
>>  just feeding in the stream of solar energy readings to the
>>  CLA sequentially. This will allow us to see how well it does
>>  at predicting solar energy from just the temporal sequence,
>>  and the time of day, day of week, etc.
>>
>>
>>
>>
>>
>>
>>
>>
>>  In parallel, someone else can learn to extract
>>  what we need from the GEFS weather data, correlating the
>>  weather variables with each solar energy reading time. We
>>  can build a CSV file of all the variables together, and run
>>  a swarm.
>>
>>
>>
>>
>>
>>
>>
>>
>>  How does this approach sound?
>>  I can volunteer to set up the project and start
>>  feeding the solar energy with temporal information to the
>>  CLA. Whoever wants to help is welcome to. Anyone up for
>>  massaging the GEFS data as described above?
>>
>>
>>
>>
>>
>>
>>
>>
>>  Thanks,Chetan
>>  P.S. We still need a team
>>  name!
>>
>>  On Mon, Oct 21, 2013 at
>>  9:31 PM, Tim McNamara <[email protected]>
>>  wrote:
>>
>>
>>
>>
>>
>>
>>
>>
>>  +1
>>  Very interested in observing this
>>  project
>>  On 22/10/2013 12:08 AM,
>>  "Matthew Taylor" <[email protected]>
>>  wrote:
>>
>>
>>
>>
>>
>>
>>
>>
>>
>>  Pedro et al,
>>
>>
>>
>>  Unless you're actually going to be dealing with
>>  proprietary
>>
>>  information, I encourage you to have your Kaggle discussions
>>  here on
>>
>>  the mailing list. I'm sure there are others out there
>>  interested in
>>
>>  what you're doing, but unable to take part (like me).
>>
>>
>>
>>  Feel free to start a new thread, or use this one, but please
>>  use this
>>
>>  list if you can. No one has complained about too many
>>  emails... yet
>>
>>  ;).
>>
>>
>>
>>  ---------
>>
>>  Matt Taylor
>>
>>  OS Community Flag-Bearer
>>
>>  Numenta
>>
>>
>>
>>
>>
>>  On Fri, Oct 18, 2013 at 11:39 AM, Pedro Tabacof <[email protected]>
>>  wrote:
>>
>>  > Hello,
>>
>>  >
>>
>>  > Thank you all for the interest. I think now it's
>>  time to create a private
>>
>>  > thread for the competition. The Solar Energy
>>  competition was the most voted
>>
>>  > one so it can be our first Kaggle attempt.
>>
>>  >
>>
>>  > If anyone else is also interested in joining the
>>  competition, just send me a
>>
>>  > message.
>>
>>  >
>>
>>  > Pedro.
>>
>>  >
>>
>>  >
>>
>>  > On Fri, Oct 18, 2013 at 3:13 PM, Doug King <[email protected]>
>>  wrote:
>>
>>  >>
>>
>>  >> I'm interested in joining the team. I looked at
>>  the Kaggle contests and I
>>
>>  >> agree that the best choice would be Solar Energy
>>  Prediction, but I wouldn't
>>
>>  >> discount the Phone accelerometer contest. I think
>>  its a good fit for the CLA
>>
>>  >> and and interesting application. The forum posts
>>  for that contest seem to
>>
>>  >> indicate the sponsors are willing to judge a
>>  'clean' (no leakage) version of
>>
>>  >> the contest.
>>
>>  >>
>>
>>  >>
>>
>>  >> On Thu, Oct 17, 2013 at 10:45 AM, Jay Gokhale
>>  <[email protected]>
>>  wrote:
>>
>>  >>>
>>
>>  >>> I'd love to use CLA's on a kaggle
>>  project!
>>
>>  >>>
>>
>>  >>> If Chetan's description is correct, it
>>  seems to me that the Solar Energy
>>
>>  >>> Prediction contest would be the most straight
>>  forward one to work on.
>>
>>  >>>
>>
>>  >>> Jay Gokhale
>>
>>  >>>
>>
>>  >>>
>>
>>  >>> On Wed, Oct 16, 2013 at 9:07 PM, Matthew Taylor
>>  <[email protected]>
>>  wrote:
>>
>>  >>>>
>>
>>  >>>> These also sound like good hackathon
>>  projects!
>>
>>  >>>>
>>
>>  >>>> Matt
>>
>>  >>>>
>>
>>  >>>> Sent from my MegaPhone
>>
>>  >>>>
>>
>>  >>>> On Oct 16, 2013, at 8:03 PM, Chetan Surpur
>>  <[email protected]>
>>  wrote:
>>
>>  >>>>
>>
>>  >>>> I'm interested in participating!
>>
>>  >>>>
>>
>>  >>>> I haven't decided on one yet, but I
>>  wanted to point out that the CLA
>>
>>  >>>> looks ideally suited for the AMS 2013-2014
>>  Solar Energy Prediction contest.
>>
>>  >>>> From what I can tell, it's about
>>  predicting the daily energy output of solar
>>
>>  >>>> farms from 12, 15, 18, 21, 24-hourly
>>  training data. There's 29 days to go.
>>
>>  >>>>
>>
>>  >>>>
>>
>>  >>>> On Wed, Oct 16, 2013 at 7:31 PM, Pedro
>>  Tabacof <[email protected]>
>>
>>  >>>> wrote:
>>
>>  >>>>>
>>
>>  >>>>> Hello,
>>
>>  >>>>>
>>
>>  >>>>> After the warm reception of the Kaggle
>>  competition idea, I decided to
>>
>>  >>>>> create the thread asap. For those who
>>  are not aware, Kaggle is a data mining
>>
>>  >>>>> competition website. It has been hyped
>>  on the machine learning media and has
>>
>>  >>>>> some great people participating in it.
>>  It rewards with different prizes, but
>>
>>  >>>>> my focus is not earn money, but rather
>>  to see how good NuPIC would fare
>>
>>  >>>>> against the state of the art. I found
>>  those following competitions to be a
>>
>>  >>>>> viable start:
>>
>>  >>>>>
>>
>>  >>>>> 1) Accelerometer Biometric Competition
>>
>>  >>>>> From real acceleration data you have to
>>  guess from which cell phone
>>
>>  >>>>> came the test samples.
>>
>>  >>>>> 36 days to go
>>
>>  >>>>> Pros:
>>
>>  >>>>> -Multivariable, temporal data seems
>>  ideal for the CLA
>>
>>  >>>>> Cons:
>>
>>  >>>>> -The best competitors are using data
>>  leaks (meta-information), so the
>>
>>  >>>>> chance of winning fairly is zero
>>  (though they will give a consolation prize
>>
>>  >>>>> to the best "honest" entry)
>>
>>  >>>>> Questions:
>>
>>  >>>>> -How fast would NuPIC go through a 1GB
>>  CSV file?
>>
>>  >>>>> -The test data is not open-ended, that
>>  is, you have to say whether a
>>
>>  >>>>> recording belongs to a specific cell
>>  phone or not, with 50% chance of being
>>
>>  >>>>> right. Would NuPIC do well in this
>>  case?
>>
>>  >>>>>
>>
>>  >>>>> 2) Multi-Label Bird Species
>>  Classification
>>
>>  >>>>> Try to guess the bird species present
>>  on some audio recordings
>>
>>  >>>>> 38 days to go
>>
>>  >>>>> Pros:
>>
>>  >>>>> -Temporal data
>>
>>  >>>>> Cons:
>>
>>  >>>>> -Multiple labels, so I think we would
>>  need 87 different models, which
>>
>>  >>>>> is kinda impractical
>>
>>  >>>>> Question:
>>
>>  >>>>> -Is there a better way to do
>>  multi-label classification with NuPIC?
>>
>>  >>>>>
>>
>>  >>>>> 3) Conway's Reverse Game of Life
>>
>>  >>>>> Reverse the game of life for 1 to 5
>>  steps
>>
>>  >>>>> 4 months to go
>>
>>  >>>>> Pros:
>>
>>  >>>>> -Natural input sparseness (have to
>>  check on it)
>>
>>  >>>>> -Temporal data
>>
>>  >>>>> -Multistep classification
>>
>>  >>>>> -We can generate our own training data
>>  if necessary
>>
>>  >>>>> Cons:
>>
>>  >>>>> -Maybe there are very straightforward
>>  ways to solve this (rule based,
>>
>>  >>>>> brute force, etc)
>>
>>  >>>>> -Too much time before the competition
>>  ends
>>
>>  >>>>> Questions:
>>
>>  >>>>> -How sparse does the input need to be
>>  for the CLA to work well?
>>
>>  >>>>> -Is it possible to use a 2D arrangement
>>  on the cells connection matrix?
>>
>>  >>>>>
>>
>>  >>>>> There is also the AMS 2013-2014 Solar
>>  Energy Prediction contest, but I
>>
>>  >>>>> didn't have time to go over it. The
>>  other competitions don't seem to be
>>
>>  >>>>> suited for NuPIC.
>>
>>  >>>>>
>>
>>  >>>>> Who here is interested on participating
>>  and what competition do you
>>
>>  >>>>> think to be the most promising one?
>>
>>  >>>>>
>>
>>  >>>>> Pedro.
>>
>>  >>>>> --
>>
>>  >>>>> Pedro Tabacof,
>>
>>  >>>>> Unicamp - Eng. de Computação 08.
>>
>>  >>>>>
>>
>>  >>>>>
>>  _______________________________________________
>>
>>  >>>>> nupic mailing list
>>
>>  >>>>> [email protected]
>>
>>  >>>>> http://lists.numenta.org/mailman/listinfo/nupic_lists.numenta.org
>>
>>  >>>>>
>>
>>  >>>>
>>
>>  >>>>
>>  _______________________________________________
>>
>>  >>>> nupic mailing list
>>
>>  >>>> [email protected]
>>
>>  >>>> http://lists.numenta.org/mailman/listinfo/nupic_lists.numenta.org
>>
>>  >>>>
>>
>>  >>>>
>>
>>  >>>>
>>  _______________________________________________
>>
>>  >>>> nupic mailing list
>>
>>  >>>> [email protected]
>>
>>  >>>> http://lists.numenta.org/mailman/listinfo/nupic_lists.numenta.org
>>
>>  >>>>
>>
>>  >>>
>>
>>  >>>
>>
>>  >>>
>>  _______________________________________________
>>
>>  >>> nupic mailing list
>>
>>  >>> [email protected]
>>
>>  >>> http://lists.numenta.org/mailman/listinfo/nupic_lists.numenta.org
>>
>>  >>>
>>
>>  >>
>>
>>  >>
>>
>>  >> _______________________________________________
>>
>>  >> nupic mailing list
>>
>>  >> [email protected]
>>
>>  >> http://lists.numenta.org/mailman/listinfo/nupic_lists.numenta.org
>>
>>  >>
>>
>>  >
>>
>>  >
>>
>>  >
>>
>>  > --
>>
>>  > Pedro Tabacof,
>>
>>  > Unicamp - Eng. de Computação 08.
>>
>>  >
>>
>>  > _______________________________________________
>>
>>  > nupic mailing list
>>
>>  > [email protected]
>>
>>  > http://lists.numenta.org/mailman/listinfo/nupic_lists.numenta.org
>>
>>  >
>>
>>
>>
>>  _______________________________________________
>>
>>  nupic mailing list
>>
>>  [email protected]
>>
>>  http://lists.numenta.org/mailman/listinfo/nupic_lists.numenta.org
>>
>>
>>
>>  _______________________________________________
>>
>>  nupic mailing list
>>
>>  [email protected]
>>
>>  http://lists.numenta.org/mailman/listinfo/nupic_lists.numenta.org
>>
>>
>>
>>
>>
>>  _______________________________________________
>>
>>  nupic mailing list
>>
>>  [email protected]
>>
>>  http://lists.numenta.org/mailman/listinfo/nupic_lists.numenta.org
>>
>>
>>
>>
>>
>>  _______________________________________________
>>
>>  nupic mailing list
>>
>>  [email protected]
>>
>>  http://lists.numenta.org/mailman/listinfo/nupic_lists.numenta.org
>>
>>
>>
>>
>>
>>  _______________________________________________
>>
>>  nupic mailing list
>>
>>  [email protected]
>>
>>  http://lists.numenta.org/mailman/listinfo/nupic_lists.numenta.org
>>
>>
>>
>>
>>
>>  -----Inline Attachment Follows-----
>>
>>  _______________________________________________
>>  nupic mailing list
>>  [email protected]
>>  http://lists.numenta.org/mailman/listinfo/nupic_lists.numenta.org
>>
>>
>> _______________________________________________
>> nupic mailing list
>> [email protected]
>> http://lists.numenta.org/mailman/listinfo/nupic_lists.numenta.org
>>
>
>
> _______________________________________________
> nupic mailing list
> [email protected]
> http://lists.numenta.org/mailman/listinfo/nupic_lists.numenta.org
>
>
_______________________________________________
nupic mailing list
[email protected]
http://lists.numenta.org/mailman/listinfo/nupic_lists.numenta.org

Reply via email to