[
https://issues.apache.org/jira/browse/MAHOUT-799?focusedWorklogId=1001325&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-1001325
]
ASF GitHub Bot logged work on MAHOUT-799:
-----------------------------------------
Author: ASF GitHub Bot
Created on: 22/Jan/26 12:15
Start Date: 22/Jan/26 12:15
Worklog Time Spent: 10m
Work Description: ryankert01 commented on PR #868:
URL: https://github.com/apache/mahout/pull/868#issuecomment-3784084030
PTAL @guan404ming @rich7420
Issue Time Tracking
-------------------
Worklog Id: (was: 1001325)
Time Spent: 1h (was: 50m)
> Cannot run SequenceFilesFromCsvFilter, ever
> -------------------------------------------
>
> Key: MAHOUT-799
> URL: https://issues.apache.org/jira/browse/MAHOUT-799
> Project: Mahout
> Issue Type: Bug
> Components: classic
> Affects Versions: 0.5
> Reporter: Jack Tanner
> Assignee: Sean R. Owen
> Priority: Major
> Fix For: 0.6
>
> Attachments: MAHOUT-799.patch, MAHOUT-799.patch
>
> Time Spent: 1h
> Remaining Estimate: 0h
>
> As described here:
> http://mail-archives.apache.org/mod_mbox/mahout-user/201106.mbox/%[email protected]%3E
> SequenceFilesFromCsvFilter cannot be invoked with default parameter values,
> because it dies like so:
> bin/mahout seqdirectory -i input -o output -filter
> org.apache.mahout.text.SequenceFilesFromCsvFilter
> ...
> Caused by: java.lang.NumberFormatException: null
> at java.lang.Integer.parseInt(Integer.java:417)
> at java.lang.Integer.parseInt(Integer.java:499)
> at
> org.apache.mahout.text.SequenceFilesFromCsvFilter.<init>(SequenceFilesFromCsvFilter.java:56)
> If one adds the parameters -kcol 0 -vcol 0 (or their long-form versions), it
> dies like so:
> Unexpected -kcol while processing Job-Specific Options
> Commenting out SequenceFilesFromCsvFilter:56 and
> SequenceFilesFromCsvFilter:57, like so, allows the run to proceed
> // this.keyColumn = Integer.parseInt(options.get(KEY_COLUMN_OPTION[0]));
> // this.valueColumn =
> Integer.parseInt(options.get(VALUE_COLUMN_OPTION[0]));
--
This message was sent by Atlassian Jira
(v8.20.10#820010)