[
https://issues.apache.org/jira/browse/MAHOUT-1287?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13721528#comment-13721528
]
Suneel Marthi commented on MAHOUT-1287:
---------------------------------------
[~alex.franchuk] I committed this patch to trunk. To do so, I had to add the
Solr 4.3 solr-common-csv.jar as a dependency in core/pom.xml.
> classifier.sgd.CsvRecordFactory incorrectly parses CSV format
> -------------------------------------------------------------
>
> Key: MAHOUT-1287
> URL: https://issues.apache.org/jira/browse/MAHOUT-1287
> Project: Mahout
> Issue Type: Bug
> Components: Classification
> Affects Versions: 0.7
> Reporter: Alex Franchuk
> Assignee: Suneel Marthi
> Priority: Minor
> Labels: csv, parser
> Fix For: 0.9
>
> Attachments: CsvRecordFactory_CsvParseFix.patch
>
> Original Estimate: 0h
> Remaining Estimate: 0h
>
> CsvRecordFactory uses very simplistic CSV parsing and incorrectly parses CSV
> strings that contain double-quoted fields with embedded commas.
> This problem also affects the command-line demo programs which use
> CsvRecordFactory (mostly the sgd-related programs).
> Attached is a patch to fix the problem.
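
For readers without the patch at hand, here is a minimal sketch of the kind of failure the description reports. It is not the attached patch and not Mahout's code: the class and method names (CsvSplitSketch, naiveSplit, quoteAwareSplit) are hypothetical, and it assumes that "simplistic parsing" amounts to splitting on every comma. The quote-aware version follows the common CSV convention that a doubled quote inside a quoted field is a literal quote.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

public class CsvSplitSketch {
    // Assumed "simplistic" behaviour: split on every comma, ignoring quotes.
    static List<String> naiveSplit(String line) {
        return Arrays.asList(line.split(",", -1));
    }

    // Quote-aware split: commas inside double quotes stay in the field,
    // and "" inside a quoted field is read as one literal quote character.
    static List<String> quoteAwareSplit(String line) {
        List<String> fields = new ArrayList<>();
        StringBuilder current = new StringBuilder();
        boolean inQuotes = false;
        for (int i = 0; i < line.length(); i++) {
            char c = line.charAt(i);
            if (inQuotes) {
                if (c == '"') {
                    if (i + 1 < line.length() && line.charAt(i + 1) == '"') {
                        current.append('"'); // escaped quote
                        i++;                 // skip the second quote
                    } else {
                        inQuotes = false;    // closing quote
                    }
                } else {
                    current.append(c);
                }
            } else if (c == '"') {
                inQuotes = true;             // opening quote
            } else if (c == ',') {
                fields.add(current.toString());
                current.setLength(0);
            } else {
                current.append(c);
            }
        }
        fields.add(current.toString());      // last field
        return fields;
    }

    public static void main(String[] args) {
        String line = "1,\"Smith, John\",42";
        System.out.println(naiveSplit(line).size());      // 4 fields: the quoted field is cut in two
        System.out.println(quoteAwareSplit(line).size()); // 3 fields
    }
}
```

In practice a maintained CSV library is usually a better choice than a hand-written splitter, which is in line with the committed fix pulling in a CSV jar rather than extending the existing parsing.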