[
https://issues.apache.org/jira/browse/MAHOUT-1252?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13982431#comment-13982431
]
Suneel Marthi commented on MAHOUT-1252:
---------------------------------------
Drew, I am leaning towards moving existing seq2sparse to Spark and do away with
MR, given that all of the existing MR has already been moved under 'mrlegacy'.
Question for u - If we were to do seq2sparse all over again, would word2vec be
a good candidate to look at? I believe it was either u (or someone else from
Booz Allen) who had a prezo on word2vec at DC Big Data Conf in March.
> Add support for Finite State Transducers (FST) as a DictionaryType.
> -------------------------------------------------------------------
>
> Key: MAHOUT-1252
> URL: https://issues.apache.org/jira/browse/MAHOUT-1252
> Project: Mahout
> Issue Type: Improvement
> Components: Integration
> Affects Versions: 0.7
> Reporter: Suneel Marthi
> Assignee: Suneel Marthi
> Fix For: 1.0
>
>
> Add support for Finite State Transducers (FST) as a DictionaryType, this
> should result in an order of magnitude speedup of seq2sparse.
--
This message was sent by Atlassian JIRA
(v6.2#6252)