There has been quite some interest in this in the last couple of months, so I'm 
sure it will get some momentum. The question is whether Jake and Olivier's 
points about the inappropriateness of the data structures can actually get a 
workaround or if this is (more or less) pointless.

If crfsuite cannot be cast into a scikit-learn style API, then I think 
pyCRFsuite should remain as an external package, possibly with convenience 
functions for converting between numpy arrays and crfsuite files.

I feel it would be ugly to do such conversions behind the scenes just for the 
sake of including CRFs in scikit-learn.

However, if you conceive of a possible solution, that changes the whole problem.

Just my 2 bani (Romanian fractionary currency)

On May 2, 2012, at 17:07 , Jacob VanderPlas wrote:

> Hi Rob,
> The crfsuite wrapper is still far from complete (a couple test cases 
> even seg-fault).  I also echo Olivier's input on the problem that numpy 
> arrays/scipy sparse matrices cannot be mapped to the crfsuite internal 
> data structure.  For that reason, it would be very hard to use crfsuite 
> in scikit-learn for the purpose it's designed: efficient computations 
> for large datasets.
> 
> That being said, if you have the time to work on it, that would be 
> great!  I currently don't have the extra bandwidth.  Thanks!
>   Jake
> 
> Rob Zinkov wrote:
>> Hello,
>> 
>> I would like to merge Jake Vanderplas's work
>> https://github.com/jakevdp/pyCRFsuite into scikit-learn. I have gotten
>> tacit approval in the irc channel.
>> 
>> I am curious since this code requires crfsuite. Should that also be
>> bundled, or should I just throw an exception if it can't be found?
>> 
>> Cheers,
>> Rob Zinkov
>> 
>> ------------------------------------------------------------------------------
>> Live Security Virtual Conference
>> Exclusive live event will cover all the ways today's security and 
>> threat landscape has changed and how IT managers can respond. Discussions 
>> will include endpoint security, mobile security and the latest in malware 
>> threats. http://www.accelacomm.com/jaw/sfrnl04242012/114/50122263/
>> _______________________________________________
>> Scikit-learn-general mailing list
>> [email protected]
>> https://lists.sourceforge.net/lists/listinfo/scikit-learn-general
>> 
> 
> ------------------------------------------------------------------------------
> Live Security Virtual Conference
> Exclusive live event will cover all the ways today's security and 
> threat landscape has changed and how IT managers can respond. Discussions 
> will include endpoint security, mobile security and the latest in malware 
> threats. http://www.accelacomm.com/jaw/sfrnl04242012/114/50122263/
> _______________________________________________
> Scikit-learn-general mailing list
> [email protected]
> https://lists.sourceforge.net/lists/listinfo/scikit-learn-general

------------------
Vlad N.
http://vene.ro





------------------------------------------------------------------------------
Live Security Virtual Conference
Exclusive live event will cover all the ways today's security and 
threat landscape has changed and how IT managers can respond. Discussions 
will include endpoint security, mobile security and the latest in malware 
threats. http://www.accelacomm.com/jaw/sfrnl04242012/114/50122263/
_______________________________________________
Scikit-learn-general mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/scikit-learn-general

Reply via email to