I'm sorry for disappearing, but I work full time and it has been hell those
past weeks (and I also moved to a new place). I don't know when I will be
available for this kind of endeavor again.

Pedro.


On Fri, Nov 15, 2013 at 5:16 PM, Matthew Taylor <[email protected]> wrote:

> I don't think it matters whether it's a Kaggle competition or another ML
> competition or public data set that is regularly used for this type of
> thing.
>
> I started this wiki with ML competitions, and I'm interested in adding to
> it. (Anyone else want to help out filling this list?)
>
> https://github.com/numenta/nupic/wiki/Machine-Learning-Competitions
>
> So there are some people in the community interested in working on a
> competition with NuPIC, we just need a way to identify and organize them.
> Since Pedro started this initiative, maybe he'd still be interested in
> leading the charge?
>
> Pedro, what say you?
>
>
> ---------
> Matt Taylor
> OS Community Flag-Bearer
> Numenta
>
>
> On Thu, Nov 14, 2013 at 10:18 AM, Doug King <[email protected]> wrote:
>
>> Matt,
>>
>> I'm still interested in doing a Kaggle contest. I'm not sure why we did
>> not get the momentum we needed - i assumed the Hackathon took some focus
>> away, and it was ambitious to jump into a Kaggle contest half way to the
>> deadline.
>>
>> I thought the contest we chose, weather prediction, was an good fit for
>> the CLA - time based prediction, practical application, good metric for
>> comparison to other approaches.
>>
>> Anyway, I'm interested in selecting another Kaggle contest. I think it's
>> a great way to prove the value of the CLA approach and the dynamics of a
>> contest using the Kaggle framework provide an excellent platform for this.
>> If enough members want to do this its possible we could even assemble more
>> than one team from our forum.
>>
>> Thoughts?
>>
>> -Doug
>>
>>
>> On Wed, Nov 13, 2013 at 9:01 PM, Azat <[email protected]> wrote:
>>
>>> Is convolutional network is NuPIC's category then.
>>> How about preprocessing of images as linking the same digits via elastic
>>> temporal transformations - it must be perfect in HTM's terms ?
>>> I'm not sure it is allowed though..
>>>
>>> Azat
>>>
>>> --------------------------------------------
>>> On Wed, 11/13/13, Matthew Taylor <[email protected]> wrote:
>>>
>>>  Subject: Re: [nupic-dev] Kaggle team assemble
>>>  To: "NuPIC general mailing list." <[email protected]>
>>>  Date: Wednesday, November 13, 2013, 9:33 PM
>>>
>>>  That sounds good to me.
>>>  Numenta has always been criticized[1] for not providing
>>>  comparisons between NuPIC and other ML approaches by posting
>>>  the results for standard data sets and taking part in
>>>  competitions. It would be great if we could mobilize
>>>  motivated members of the community to tackle things like
>>>  this. I've created a wiki[2] with a list of common data
>>>  sets used for ML comparisons, as well as competitions.
>>>
>>>
>>>  If anyone is willing to work with one of these
>>>  data sets, I would be happy to publicize your work as well
>>>  as I can. The more we can interact with the existing ML
>>>  community, the more participation we'll get from them.
>>>
>>>
>>>
>>>  [1]
>>> http://www.reddit.com/r/MachineLearning/comments/1qdu8i/cla_quiz_office_hour/[2]
>>>  https://github.com/numenta/nupic/wiki/Machine-Learning-Competitions
>>>
>>>
>>>
>>>  ---------Matt
>>>  TaylorOS Community
>>>  Flag-BearerNumenta
>>>
>>>
>>>  On Wed, Nov 13, 2013 at
>>>  7:11 PM, Chetan Surpur <[email protected]>
>>>  wrote:
>>>
>>>
>>>  Matt,
>>>  I didn't actually see any communication about
>>>  it since the last email, so I'm not sure what happened.
>>>  I ended up focusing my time on preparing the Pilot framework
>>>  for the hackathon. Either way, there's not much time
>>>  left in the competition now. Maybe we can shoot for another
>>>  one?
>>>
>>>
>>>
>>>
>>>  - Chetan
>>>
>>>  On Wed, Nov 13, 2013 at
>>>  7:14 AM, Matthew Taylor <[email protected]>
>>>  wrote:
>>>
>>>
>>>
>>>
>>>  So are you guys making any progress?
>>>
>>>
>>>  ---------Matt TaylorOS
>>>  Community Flag-BearerNumenta
>>>
>>>
>>>  On Tue,
>>>  Oct 22, 2013 at 8:23 PM, Chetan Surpur <[email protected]>
>>>  wrote:
>>>
>>>
>>>
>>>
>>>  Kaggle team,
>>>  Hope everyone's okay with communicating
>>>  general stuff here, as per request.
>>>  I was thinking that we could start simply, by
>>>  just feeding in the stream of solar energy readings to the
>>>  CLA sequentially. This will allow us to see how well it does
>>>  at predicting solar energy from just the temporal sequence,
>>>  and the time of day, day of week, etc.
>>>
>>>
>>>
>>>
>>>
>>>
>>>
>>>
>>>  In parallel, someone else can learn to extract
>>>  what we need from the GEFS weather data, correlating the
>>>  weather variables with each solar energy reading time. We
>>>  can build a CSV file of all the variables together, and run
>>>  a swarm.
>>>
>>>
>>>
>>>
>>>
>>>
>>>
>>>
>>>  How does this approach sound?
>>>  I can volunteer to set up the project and start
>>>  feeding the solar energy with temporal information to the
>>>  CLA. Whoever wants to help is welcome to. Anyone up for
>>>  massaging the GEFS data as described above?
>>>
>>>
>>>
>>>
>>>
>>>
>>>
>>>
>>>  Thanks,Chetan
>>>  P.S. We still need a team
>>>  name!
>>>
>>>  On Mon, Oct 21, 2013 at
>>>  9:31 PM, Tim McNamara <[email protected]>
>>>  wrote:
>>>
>>>
>>>
>>>
>>>
>>>
>>>
>>>
>>>  +1
>>>  Very interested in observing this
>>>  project
>>>  On 22/10/2013 12:08 AM,
>>>  "Matthew Taylor" <[email protected]>
>>>  wrote:
>>>
>>>
>>>
>>>
>>>
>>>
>>>
>>>
>>>
>>>  Pedro et al,
>>>
>>>
>>>
>>>  Unless you're actually going to be dealing with
>>>  proprietary
>>>
>>>  information, I encourage you to have your Kaggle discussions
>>>  here on
>>>
>>>  the mailing list. I'm sure there are others out there
>>>  interested in
>>>
>>>  what you're doing, but unable to take part (like me).
>>>
>>>
>>>
>>>  Feel free to start a new thread, or use this one, but please
>>>  use this
>>>
>>>  list if you can. No one has complained about too many
>>>  emails... yet
>>>
>>>  ;).
>>>
>>>
>>>
>>>  ---------
>>>
>>>  Matt Taylor
>>>
>>>  OS Community Flag-Bearer
>>>
>>>  Numenta
>>>
>>>
>>>
>>>
>>>
>>>  On Fri, Oct 18, 2013 at 11:39 AM, Pedro Tabacof <[email protected]>
>>>  wrote:
>>>
>>>  > Hello,
>>>
>>>  >
>>>
>>>  > Thank you all for the interest. I think now it's
>>>  time to create a private
>>>
>>>  > thread for the competition. The Solar Energy
>>>  competition was the most voted
>>>
>>>  > one so it can be our first Kaggle attempt.
>>>
>>>  >
>>>
>>>  > If anyone else is also interested in joining the
>>>  competition, just send me a
>>>
>>>  > message.
>>>
>>>  >
>>>
>>>  > Pedro.
>>>
>>>  >
>>>
>>>  >
>>>
>>>  > On Fri, Oct 18, 2013 at 3:13 PM, Doug King <[email protected]>
>>>  wrote:
>>>
>>>  >>
>>>
>>>  >> I'm interested in joining the team. I looked at
>>>  the Kaggle contests and I
>>>
>>>  >> agree that the best choice would be Solar Energy
>>>  Prediction, but I wouldn't
>>>
>>>  >> discount the Phone accelerometer contest. I think
>>>  its a good fit for the CLA
>>>
>>>  >> and and interesting application. The forum posts
>>>  for that contest seem to
>>>
>>>  >> indicate the sponsors are willing to judge a
>>>  'clean' (no leakage) version of
>>>
>>>  >> the contest.
>>>
>>>  >>
>>>
>>>  >>
>>>
>>>  >> On Thu, Oct 17, 2013 at 10:45 AM, Jay Gokhale
>>>  <[email protected]>
>>>  wrote:
>>>
>>>  >>>
>>>
>>>  >>> I'd love to use CLA's on a kaggle
>>>  project!
>>>
>>>  >>>
>>>
>>>  >>> If Chetan's description is correct, it
>>>  seems to me that the Solar Energy
>>>
>>>  >>> Prediction contest would be the most straight
>>>  forward one to work on.
>>>
>>>  >>>
>>>
>>>  >>> Jay Gokhale
>>>
>>>  >>>
>>>
>>>  >>>
>>>
>>>  >>> On Wed, Oct 16, 2013 at 9:07 PM, Matthew Taylor
>>>  <[email protected]>
>>>  wrote:
>>>
>>>  >>>>
>>>
>>>  >>>> These also sound like good hackathon
>>>  projects!
>>>
>>>  >>>>
>>>
>>>  >>>> Matt
>>>
>>>  >>>>
>>>
>>>  >>>> Sent from my MegaPhone
>>>
>>>  >>>>
>>>
>>>  >>>> On Oct 16, 2013, at 8:03 PM, Chetan Surpur
>>>  <[email protected]>
>>>  wrote:
>>>
>>>  >>>>
>>>
>>>  >>>> I'm interested in participating!
>>>
>>>  >>>>
>>>
>>>  >>>> I haven't decided on one yet, but I
>>>  wanted to point out that the CLA
>>>
>>>  >>>> looks ideally suited for the AMS 2013-2014
>>>  Solar Energy Prediction contest.
>>>
>>>  >>>> From what I can tell, it's about
>>>  predicting the daily energy output of solar
>>>
>>>  >>>> farms from 12, 15, 18, 21, 24-hourly
>>>  training data. There's 29 days to go.
>>>
>>>  >>>>
>>>
>>>  >>>>
>>>
>>>  >>>> On Wed, Oct 16, 2013 at 7:31 PM, Pedro
>>>  Tabacof <[email protected]>
>>>
>>>  >>>> wrote:
>>>
>>>  >>>>>
>>>
>>>  >>>>> Hello,
>>>
>>>  >>>>>
>>>
>>>  >>>>> After the warm reception of the Kaggle
>>>  competition idea, I decided to
>>>
>>>  >>>>> create the thread asap. For those who
>>>  are not aware, Kaggle is a data mining
>>>
>>>  >>>>> competition website. It has been hyped
>>>  on the machine learning media and has
>>>
>>>  >>>>> some great people participating in it.
>>>  It rewards with different prizes, but
>>>
>>>  >>>>> my focus is not earn money, but rather
>>>  to see how good NuPIC would fare
>>>
>>>  >>>>> against the state of the art. I found
>>>  those following competitions to be a
>>>
>>>  >>>>> viable start:
>>>
>>>  >>>>>
>>>
>>>  >>>>> 1) Accelerometer Biometric Competition
>>>
>>>  >>>>> From real acceleration data you have to
>>>  guess from which cell phone
>>>
>>>  >>>>> came the test samples.
>>>
>>>  >>>>> 36 days to go
>>>
>>>  >>>>> Pros:
>>>
>>>  >>>>> -Multivariable, temporal data seems
>>>  ideal for the CLA
>>>
>>>  >>>>> Cons:
>>>
>>>  >>>>> -The best competitors are using data
>>>  leaks (meta-information), so the
>>>
>>>  >>>>> chance of winning fairly is zero
>>>  (though they will give a consolation prize
>>>
>>>  >>>>> to the best "honest" entry)
>>>
>>>  >>>>> Questions:
>>>
>>>  >>>>> -How fast would NuPIC go through a 1GB
>>>  CSV file?
>>>
>>>  >>>>> -The test data is not open-ended, that
>>>  is, you have to say whether a
>>>
>>>  >>>>> recording belongs to a specific cell
>>>  phone or not, with 50% chance of being
>>>
>>>  >>>>> right. Would NuPIC do well in this
>>>  case?
>>>
>>>  >>>>>
>>>
>>>  >>>>> 2) Multi-Label Bird Species
>>>  Classification
>>>
>>>  >>>>> Try to guess the bird species present
>>>  on some audio recordings
>>>
>>>  >>>>> 38 days to go
>>>
>>>  >>>>> Pros:
>>>
>>>  >>>>> -Temporal data
>>>
>>>  >>>>> Cons:
>>>
>>>  >>>>> -Multiple labels, so I think we would
>>>  need 87 different models, which
>>>
>>>  >>>>> is kinda impractical
>>>
>>>  >>>>> Question:
>>>
>>>  >>>>> -Is there a better way to do
>>>  multi-label classification with NuPIC?
>>>
>>>  >>>>>
>>>
>>>  >>>>> 3) Conway's Reverse Game of Life
>>>
>>>  >>>>> Reverse the game of life for 1 to 5
>>>  steps
>>>
>>>  >>>>> 4 months to go
>>>
>>>  >>>>> Pros:
>>>
>>>  >>>>> -Natural input sparseness (have to
>>>  check on it)
>>>
>>>  >>>>> -Temporal data
>>>
>>>  >>>>> -Multistep classification
>>>
>>>  >>>>> -We can generate our own training data
>>>  if necessary
>>>
>>>  >>>>> Cons:
>>>
>>>  >>>>> -Maybe there are very straightforward
>>>  ways to solve this (rule based,
>>>
>>>  >>>>> brute force, etc)
>>>
>>>  >>>>> -Too much time before the competition
>>>  ends
>>>
>>>  >>>>> Questions:
>>>
>>>  >>>>> -How sparse does the input need to be
>>>  for the CLA to work well?
>>>
>>>  >>>>> -Is it possible to use a 2D arrangement
>>>  on the cells connection matrix?
>>>
>>>  >>>>>
>>>
>>>  >>>>> There is also the AMS 2013-2014 Solar
>>>  Energy Prediction contest, but I
>>>
>>>  >>>>> didn't have time to go over it. The
>>>  other competitions don't seem to be
>>>
>>>  >>>>> suited for NuPIC.
>>>
>>>  >>>>>
>>>
>>>  >>>>> Who here is interested on participating
>>>  and what competition do you
>>>
>>>  >>>>> think to be the most promising one?
>>>
>>>  >>>>>
>>>
>>>  >>>>> Pedro.
>>>
>>>  >>>>> --
>>>
>>>  >>>>> Pedro Tabacof,
>>>
>>>  >>>>> Unicamp - Eng. de Computação 08.
>>>
>>>  >>>>>
>>>
>>>  >>>>>
>>>  _______________________________________________
>>>
>>>  >>>>> nupic mailing list
>>>
>>>  >>>>> [email protected]
>>>
>>>  >>>>> http://lists.numenta.org/mailman/listinfo/nupic_lists.numenta.org
>>>
>>>  >>>>>
>>>
>>>  >>>>
>>>
>>>  >>>>
>>>  _______________________________________________
>>>
>>>  >>>> nupic mailing list
>>>
>>>  >>>> [email protected]
>>>
>>>  >>>> http://lists.numenta.org/mailman/listinfo/nupic_lists.numenta.org
>>>
>>>  >>>>
>>>
>>>  >>>>
>>>
>>>  >>>>
>>>  _______________________________________________
>>>
>>>  >>>> nupic mailing list
>>>
>>>  >>>> [email protected]
>>>
>>>  >>>> http://lists.numenta.org/mailman/listinfo/nupic_lists.numenta.org
>>>
>>>  >>>>
>>>
>>>  >>>
>>>
>>>  >>>
>>>
>>>  >>>
>>>  _______________________________________________
>>>
>>>  >>> nupic mailing list
>>>
>>>  >>> [email protected]
>>>
>>>  >>> http://lists.numenta.org/mailman/listinfo/nupic_lists.numenta.org
>>>
>>>  >>>
>>>
>>>  >>
>>>
>>>  >>
>>>
>>>  >> _______________________________________________
>>>
>>>  >> nupic mailing list
>>>
>>>  >> [email protected]
>>>
>>>  >> http://lists.numenta.org/mailman/listinfo/nupic_lists.numenta.org
>>>
>>>  >>
>>>
>>>  >
>>>
>>>  >
>>>
>>>  >
>>>
>>>  > --
>>>
>>>  > Pedro Tabacof,
>>>
>>>  > Unicamp - Eng. de Computação 08.
>>>
>>>  >
>>>
>>>  > _______________________________________________
>>>
>>>  > nupic mailing list
>>>
>>>  > [email protected]
>>>
>>>  > http://lists.numenta.org/mailman/listinfo/nupic_lists.numenta.org
>>>
>>>  >
>>>
>>>
>>>
>>>  _______________________________________________
>>>
>>>  nupic mailing list
>>>
>>>  [email protected]
>>>
>>>  http://lists.numenta.org/mailman/listinfo/nupic_lists.numenta.org
>>>
>>>
>>>
>>>  _______________________________________________
>>>
>>>  nupic mailing list
>>>
>>>  [email protected]
>>>
>>>  http://lists.numenta.org/mailman/listinfo/nupic_lists.numenta.org
>>>
>>>
>>>
>>>
>>>
>>>  _______________________________________________
>>>
>>>  nupic mailing list
>>>
>>>  [email protected]
>>>
>>>  http://lists.numenta.org/mailman/listinfo/nupic_lists.numenta.org
>>>
>>>
>>>
>>>
>>>
>>>  _______________________________________________
>>>
>>>  nupic mailing list
>>>
>>>  [email protected]
>>>
>>>  http://lists.numenta.org/mailman/listinfo/nupic_lists.numenta.org
>>>
>>>
>>>
>>>
>>>
>>>  _______________________________________________
>>>
>>>  nupic mailing list
>>>
>>>  [email protected]
>>>
>>>  http://lists.numenta.org/mailman/listinfo/nupic_lists.numenta.org
>>>
>>>
>>>
>>>
>>>
>>>  -----Inline Attachment Follows-----
>>>
>>>  _______________________________________________
>>>  nupic mailing list
>>>  [email protected]
>>>  http://lists.numenta.org/mailman/listinfo/nupic_lists.numenta.org
>>>
>>>
>>> _______________________________________________
>>> nupic mailing list
>>> [email protected]
>>> http://lists.numenta.org/mailman/listinfo/nupic_lists.numenta.org
>>>
>>
>>
>> _______________________________________________
>> nupic mailing list
>> [email protected]
>> http://lists.numenta.org/mailman/listinfo/nupic_lists.numenta.org
>>
>>
>
> _______________________________________________
> nupic mailing list
> [email protected]
> http://lists.numenta.org/mailman/listinfo/nupic_lists.numenta.org
>
>


-- 
Pedro Tabacof,
Unicamp - Eng. de Computação 08.
_______________________________________________
nupic mailing list
[email protected]
http://lists.numenta.org/mailman/listinfo/nupic_lists.numenta.org

Reply via email to