Re: [Scikit-learn-general] HMM: Determination of the state numbers

Didier Vila Thu, 18 Oct 2012 01:03:01 -0700

Mathieu,


Thanks for your email. 

 

Problem 1:

 

*        I will have a look to the grid search model.

*        If you have any example of testing hmm using the grid search
module, it will be brilliant. (basic code are more than welcome !!!)

 

Problem 2 ( classification)

 

It's not clear for me (I will work first on the problem 1).

 

Regards and merci encore.

 

Didier

 

 

Didier Vila, PhD | Risk | CapQuest Group Ltd | Fleet 27 | Rye Close |
Fleet | Hampshire | GU51 2QQ | Fax: 0871 574 2992 | Email:
dv...@capquestco.com <mailto:mbruna...@capquestco.com>  

 

From: mblon...@gmail.com [mailto:mblon...@gmail.com] On Behalf Of
Mathieu Blondel
Sent: 18 October 2012 06:00
To: Didier Vila
Cc: scikit-learn-general@lists.sourceforge.net
Subject: Re: [Scikit-learn-general] HMM: Determination of the state
numbers

 

 

On Thu, Oct 18, 2012 at 1:37 AM, Didier Vila <dv...@capquestco.com>
wrote:

Problem 1: My problem is a risk problem at the moment.

 

*        I want to represent the behaviour of my 20 000 time series and
generates some monte carlo simulations  using the method.sample()

*        Intuitively, i have to choose between 1 or 3 ( max 4) the
number of states.

*        Then,  I want to capture the risk for each time series and the
risk at the aggregate level.( I will generate 100 Monte Carlo
Simulations)

Alright, then optimizing "score" may make sense it that case.

 

        *        Question 1: I don t see how I can train and cross
validate my HMM  in Scikit Learn ( First time I use Scikit Learn for
this purpose)

        *        Question 2: the lenght of the time serie is 32 periods,
is it enough to  make cross validation and validation ?

 

Have a look at http://scikit-learn.org/0.12/modules/grid_search.html. 

         

        Problem 2: Classification

        *        In the near future, I will try to make some
classification of time series but I have no ideas how to handle the
problem ? Should I use an SVM ? Can you refer any paper ?

 

You can group your time series per class and train one HMM per class
with those time series. Then given a new time series, you can decide its
class by the argmax of the Bayes rule:

Class = argmax P(Class | Time Series) = argmax P(Time Series | Class) *
P(Class) / P(Time Series) = argmax P(Time Series | Class) * P(Class)

 

P(Time Series | Class) can be computed by the forward algorithm or can
be approximated by the Viterbi algorithm (which is more numerically
stable).

P(Class) can be computed by counting the number of time series in each
class.

 

        Generic Questions: I was wondering if your algorithm is
developed iin Python ? Do you think your algo is relevant to apply to my
business problem ?

 

My method is useful for classifying time series which are made of
smaller parts whose label you don't know. So I don't think it would work
for you. 

 

Mathieu

 

This e-mail is intended solely for the addressee, is strictly confidential and 
may also be legally privileged. If you are not the addressee please do not 
read, print, re-transmit, store or act in reliance on it or any attachments. 
Instead, please email it back to the sender and then immediately permanently 
delete it. E-mail communications cannot be guaranteed to be secure or error 
free, as information could be intercepted, corrupted, amended, lost, destroyed, 
arrive late or incomplete, or contain viruses. We do not accept liability for 
any such matters or their consequences. Anyone who communicates with us by 
e-mail is taken to accept the risks in doing so. Opinions, conclusions and 
other information in this e-mail and any attachments are solely those of the 
author and do not represent those of CapQuest Group Limited or any of its 
subsidiaries unless otherwise stated. CapQuest Group Limited (registered number 
4936030), CapQuest Debt Recovery Limited (registered number 3772278), CapQuest 
Investments Limited (registered number 5245825), CapQuest Asset Management 
Limited (registered number 5245829) and CapQuest Mortgage Servicing Limited 
(registered number 05821008) are all limited companies registered in England 
and Wales with their registered offices at Fleet 27, Rye Close, Fleet, 
Hampshire, GU51 2QQ. Each company is a separate and independent legal entity. 
None of the companies have any liability for each other's acts or omissions. 
This communication is from the company named in the sender's details above.

------------------------------------------------------------------------------
Everyone hates slow websites. So do we.
Make your web apps faster with AppDynamics
Download AppDynamics Lite for free today:
http://p.sf.net/sfu/appdyn_sfd2d_oct

_______________________________________________
Scikit-learn-general mailing list
Scikit-learn-general@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/scikit-learn-general

Re: [Scikit-learn-general] HMM: Determination of the state numbers

Reply via email to