[jira] Commented: (MAHOUT-388) Upgrade Lucene

2010-05-05 Thread Drew Farris (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-388?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12864630#action_12864630 ] Drew Farris commented on MAHOUT-388: Any objections to this? If not I'll plan on

[jira] Updated: (MAHOUT-388) Upgrade Lucene

2010-05-01 Thread Drew Farris (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-388?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Drew Farris updated MAHOUT-388: --- Status: Patch Available (was: Open) Upgrade Lucene -- Key:

[jira] Updated: (MAHOUT-388) Upgrade Lucene

2010-05-01 Thread Drew Farris (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-388?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Drew Farris updated MAHOUT-388: --- Attachment: MAHOUT-388.patch Updates to Lucene 3.0.1, created DefaultAnalyzer in mahout-util which

[jira] Created: (MAHOUT-373) VectorDumper/VectorHelper doesn't dump values when dictionary is present

2010-04-09 Thread Drew Farris (JIRA)
VectorDumper/VectorHelper doesn't dump values when dictionary is present Key: MAHOUT-373 URL: https://issues.apache.org/jira/browse/MAHOUT-373 Project: Mahout Issue

[jira] Commented: (MAHOUT-361) SLF4J dependency structure leads to unpleasant surproses

2010-04-02 Thread Drew Farris (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-361?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12852792#action_12852792 ] Drew Farris commented on MAHOUT-361: I've run into this too as a result of having the

[jira] Commented: (MAHOUT-361) SLF4J dependency structure leads to unpleasant surproses

2010-04-02 Thread Drew Farris (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-361?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12852842#action_12852842 ] Drew Farris commented on MAHOUT-361: Sorry, I probably wasn't being clear; I'm not

[jira] Commented: (MAHOUT-350) add one JobName and reduceNumber parameter to org.apache.mahout.cf.taste.hadoop.item.RecommenderJob

2010-04-01 Thread Drew Farris (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-350?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12852360#action_12852360 ] Drew Farris commented on MAHOUT-350: bq. (Incidentally, now might be a good time to

[jira] Commented: (MAHOUT-350) add one JobName and reduceNumber parameter to org.apache.mahout.cf.taste.hadoop.item.RecommenderJob

2010-03-31 Thread Drew Farris (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-350?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12852012#action_12852012 ] Drew Farris commented on MAHOUT-350: Not sure if this is helpful Sean,

[jira] Commented: (MAHOUT-350) add one JobName and reduceNumber parameter to org.apache.mahout.cf.taste.hadoop.item.RecommenderJob

2010-03-31 Thread Drew Farris (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-350?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12852224#action_12852224 ] Drew Farris commented on MAHOUT-350: {quote} Hmm, I don't understand that.

[jira] Commented: (MAHOUT-344) Minhash based clustering

2010-03-30 Thread Drew Farris (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-344?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12851686#action_12851686 ] Drew Farris commented on MAHOUT-344: Hi Cristi, Sounds like a great start. Answers for

[jira] Commented: (MAHOUT-325) Switch from hadoop-0.20.2-SNAPSHOT to hadoop-0.20.2 (release)

2010-03-10 Thread Drew Farris (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-325?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12843569#action_12843569 ] Drew Farris commented on MAHOUT-325: What issue did you run into? Error messages, etc?

[jira] Commented: (MAHOUT-325) Switch from hadoop-0.20.2-SNAPSHOT to hadoop-0.20.2 (release)

2010-03-10 Thread Drew Farris (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-325?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12843572#action_12843572 ] Drew Farris commented on MAHOUT-325: Did it happen to indicate which urls it attempted

[jira] Closed: (MAHOUT-325) Switch from hadoop-0.20.2-SNAPSHOT to hadoop-0.20.2 (release)

2010-03-10 Thread Drew Farris (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-325?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Drew Farris closed MAHOUT-325. -- Switch from hadoop-0.20.2-SNAPSHOT to hadoop-0.20.2 (release)

[jira] Updated: (MAHOUT-325) Switch from hadoop-0.20.2-SNAPSHOT to hadoop-0.20.2 (release)

2010-03-09 Thread Drew Farris (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-325?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Drew Farris updated MAHOUT-325: --- Status: Patch Available (was: Open) Switch from hadoop-0.20.2-SNAPSHOT to hadoop-0.20.2 (release)

[jira] Created: (MAHOUT-325) Switch from hadoop-0.20.2-SNAPSHOT to hadoop-0.20.2 (release)

2010-03-09 Thread Drew Farris (JIRA)
Switch from hadoop-0.20.2-SNAPSHOT to hadoop-0.20.2 (release) - Key: MAHOUT-325 URL: https://issues.apache.org/jira/browse/MAHOUT-325 Project: Mahout Issue Type: Improvement

[jira] Assigned: (MAHOUT-317) Collocations: Eliminate in-memory frequency calculation

2010-03-06 Thread Drew Farris (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-317?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Drew Farris reassigned MAHOUT-317: -- Assignee: Drew Farris Collocations: Eliminate in-memory frequency calculation

[jira] Resolved: (MAHOUT-317) Collocations: Eliminate in-memory frequency calculation

2010-03-06 Thread Drew Farris (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-317?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Drew Farris resolved MAHOUT-317. Resolution: Fixed Committed in r919798 Collocations: Eliminate in-memory frequency calculation

[jira] Commented: (MAHOUT-317) Collocations: Eliminate in-memory frequency calculation

2010-03-03 Thread Drew Farris (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-317?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12840597#action_12840597 ] Drew Farris commented on MAHOUT-317: Thanks for trying it out Robin. I'll take a closer

[jira] Commented: (MAHOUT-320) Modify IntPairWritable in LDA implementation to be binary comparable to improve performance.

2010-03-03 Thread Drew Farris (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-320?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12840617#action_12840617 ] Drew Farris commented on MAHOUT-320: I certainlly can't argure about the space savings.

[jira] Updated: (MAHOUT-317) Collocations: Eliminate in-memory frequency calculation

2010-03-03 Thread Drew Farris (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-317?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Drew Farris updated MAHOUT-317: --- Attachment: MAHOUT-317.patch re-added missing minSupport, thanks for pointing this out Robin. Fixed

[jira] Created: (MAHOUT-320) Modify IntPairWritable in LDA implementation to be binary comparable to improve performance.

2010-03-02 Thread Drew Farris (JIRA)
Modify IntPairWritable in LDA implementation to be binary comparable to improve performance. Key: MAHOUT-320 URL: https://issues.apache.org/jira/browse/MAHOUT-320

[jira] Updated: (MAHOUT-320) Modify IntPairWritable in LDA implementation to be binary comparable to improve performance.

2010-03-02 Thread Drew Farris (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-320?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Drew Farris updated MAHOUT-320: --- Assignee: Robin Anil Status: Patch Available (was: Open) Modify IntPairWritable in LDA

[jira] Updated: (MAHOUT-320) Modify IntPairWritable in LDA implementation to be binary comparable to improve performance.

2010-03-02 Thread Drew Farris (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-320?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Drew Farris updated MAHOUT-320: --- Attachment: MAHOUT-320.patch binary comparable implementation plus unit test for get/set, writable

[jira] Updated: (MAHOUT-317) Collocations: Eliminate in-memory frequency calculation

2010-03-01 Thread Drew Farris (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-317?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Drew Farris updated MAHOUT-317: --- Attachment: MAHOUT-317.patch Replaced GramTuple with GramKey which achieves the same end in a more

[jira] Updated: (MAHOUT-317) Collocations: Eliminate in-memory frequency calculation

2010-02-28 Thread Drew Farris (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-317?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Drew Farris updated MAHOUT-317: --- Attachment: MAHOUT-317.patch This patch addresses the original problem by using the

[jira] Created: (MAHOUT-311) Update assemblies to include components of launcher script from MAHOUT-301

2010-02-26 Thread Drew Farris (JIRA)
Update assemblies to include components of launcher script from MAHOUT-301 -- Key: MAHOUT-311 URL: https://issues.apache.org/jira/browse/MAHOUT-311 Project: Mahout

[jira] Updated: (MAHOUT-311) Update assemblies to include components of launcher script from MAHOUT-301

2010-02-26 Thread Drew Farris (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-311?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Drew Farris updated MAHOUT-311: --- Attachment: MAHOUT-311.patch In addition to the goals of this issue, this patch adjusts the way that

[jira] Updated: (MAHOUT-311) Update assemblies to include components of launcher script from MAHOUT-301

2010-02-26 Thread Drew Farris (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-311?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Drew Farris updated MAHOUT-311: --- Status: Patch Available (was: Open) Update assemblies to include components of launcher script

[jira] Commented: (MAHOUT-301) Improve command-line shell script by allowing default properties files

2010-02-26 Thread Drew Farris (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-301?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12838868#action_12838868 ] Drew Farris commented on MAHOUT-301: bq. Can you upload the patch for the maven

[jira] Commented: (MAHOUT-301) Improve command-line shell script by allowing default properties files

2010-02-25 Thread Drew Farris (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-301?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12838694#action_12838694 ] Drew Farris commented on MAHOUT-301: Had a chance to take this out for a spin tonight.

[jira] Commented: (MAHOUT-301) Improve command-line shell script by allowing default properties files

2010-02-24 Thread Drew Farris (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-301?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12837763#action_12837763 ] Drew Farris commented on MAHOUT-301: This sounds great. I will take it for a spin when

[jira] Updated: (MAHOUT-301) Improve command-line shell script by allowing default properties files

2010-02-24 Thread Drew Farris (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-301?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Drew Farris updated MAHOUT-301: --- Attachment: MAHOUT-301-drew.patch Jake, this is looking really great. Here's a partial patch that

[jira] Commented: (MAHOUT-301) Improve command-line shell script by allowing default properties files

2010-02-23 Thread Drew Farris (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-301?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12837234#action_12837234 ] Drew Farris commented on MAHOUT-301: bq. including the job jar is much cleaner than

[jira] Commented: (MAHOUT-301) Improve command-line shell script by allowing default properties files

2010-02-23 Thread Drew Farris (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-301?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12837243#action_12837243 ] Drew Farris commented on MAHOUT-301: bq. BTW. How is hadoop execution done using shell

[jira] Commented: (MAHOUT-301) Improve command-line shell script by allowing default properties files

2010-02-23 Thread Drew Farris (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-301?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12837376#action_12837376 ] Drew Farris commented on MAHOUT-301: {quote} This wasn't a problem with my patch,

[jira] Commented: (MAHOUT-301) Improve command-line shell script by allowing default properties files

2010-02-23 Thread Drew Farris (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-301?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12837434#action_12837434 ] Drew Farris commented on MAHOUT-301: Jake, the basic idea is that you would always use

[jira] Commented: (MAHOUT-301) Improve command-line shell script by allowing default properties files

2010-02-23 Thread Drew Farris (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-301?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12837448#action_12837448 ] Drew Farris commented on MAHOUT-301: {quote} Hmm... ok. I'm a little reticent about

[jira] Commented: (MAHOUT-301) Improve command-line shell script by allowing default properties files

2010-02-23 Thread Drew Farris (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-301?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12837477#action_12837477 ] Drew Farris commented on MAHOUT-301: bq. Cool, so why not just check to see if

[jira] Commented: (MAHOUT-301) Improve command-line shell script by allowing default properties files

2010-02-23 Thread Drew Farris (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-301?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12837607#action_12837607 ] Drew Farris commented on MAHOUT-301: It doesn't appear that the following command works

[jira] Updated: (MAHOUT-301) Improve command-line shell script by allowing default properties files

2010-02-22 Thread Drew Farris (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-301?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Drew Farris updated MAHOUT-301: --- Attachment: MAHOUT-301-drew.patch Did some testing, here's a patch to clean some of these things up

[jira] Commented: (MAHOUT-299) Collocations: improve performance by making Gram BinaryComparable

2010-02-20 Thread Drew Farris (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-299?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12836207#action_12836207 ] Drew Farris commented on MAHOUT-299: Thanks for the review Sean, I'll get it committed

[jira] Commented: (MAHOUT-301) Improve command-line shell script by allowing default properties files

2010-02-20 Thread Drew Farris (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-301?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12836209#action_12836209 ] Drew Farris commented on MAHOUT-301: This is pretty nice, it gets to the point where

[jira] Commented: (MAHOUT-299) Collocations: improve performance by making Gram BinaryComparable

2010-02-20 Thread Drew Farris (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-299?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12836224#action_12836224 ] Drew Farris commented on MAHOUT-299: bq. I'd not throw RuntimeException -

[jira] Assigned: (MAHOUT-299) Collocations: improve performance by making Gram BinaryComparable

2010-02-20 Thread Drew Farris (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-299?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Drew Farris reassigned MAHOUT-299: -- Assignee: Drew Farris Collocations: improve performance by making Gram BinaryComparable

[jira] Updated: (MAHOUT-299) Collocations: improve performance by making Gram BinaryComparable

2010-02-20 Thread Drew Farris (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-299?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Drew Farris updated MAHOUT-299: --- Resolution: Fixed Status: Resolved (was: Patch Available) resolved in r912189

[jira] Commented: (MAHOUT-301) Improve command-line shell script by allowing default properties files

2010-02-20 Thread Drew Farris (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-301?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12836268#action_12836268 ] Drew Farris commented on MAHOUT-301: {blockquote} What does GenericOptionsParser do if

[jira] Created: (MAHOUT-299) Collocations: improve performance by making Gram BinaryComparable

2010-02-18 Thread Drew Farris (JIRA)
Collocations: improve performance by making Gram BinaryComparable - Key: MAHOUT-299 URL: https://issues.apache.org/jira/browse/MAHOUT-299 Project: Mahout Issue Type:

[jira] Updated: (MAHOUT-299) Collocations: improve performance by making Gram BinaryComparable

2010-02-18 Thread Drew Farris (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-299?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Drew Farris updated MAHOUT-299: --- Attachment: MAHOUT-299.patch Patch as described above: Included other cleanups: * Gram is no

[jira] Updated: (MAHOUT-299) Collocations: improve performance by making Gram BinaryComparable

2010-02-18 Thread Drew Farris (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-299?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Drew Farris updated MAHOUT-299: --- Status: Patch Available (was: Open) Collocations: improve performance by making Gram

[jira] Commented: (MAHOUT-291) Mahout Code Cleanup

2010-02-15 Thread Drew Farris (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-291?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12833806#action_12833806 ] Drew Farris commented on MAHOUT-291: Thanks very much Robin for posting a patch to

[jira] Updated: (MAHOUT-274) Use avro for serialization of structured documents.

2010-02-15 Thread Drew Farris (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-274?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Drew Farris updated MAHOUT-274: --- Attachment: mahout-avro-examples.tar.bz Status update w/ new tarball which contains a maven project

[jira] Updated: (MAHOUT-274) Use avro for serialization of structured documents.

2010-02-15 Thread Drew Farris (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-274?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Drew Farris updated MAHOUT-274: --- Attachment: mahout-colloc.tar.gz re-added latest tarball with proper extension. Use avro for

[jira] Updated: (MAHOUT-274) Use avro for serialization of structured documents.

2010-02-15 Thread Drew Farris (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-274?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Drew Farris updated MAHOUT-274: --- Attachment: mahout-avro-examples.tar.gz (this is really the right tarball this time, honest) Use

[jira] Updated: (MAHOUT-274) Use avro for serialization of structured documents.

2010-02-15 Thread Drew Farris (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-274?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Drew Farris updated MAHOUT-274: --- Attachment: (was: mahout-colloc.tar.gz) Use avro for serialization of structured documents.

[jira] Updated: (MAHOUT-274) Use avro for serialization of structured documents.

2010-02-15 Thread Drew Farris (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-274?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Drew Farris updated MAHOUT-274: --- Comment: was deleted (was: re-added latest tarball with proper extension.) Use avro for

[jira] Updated: (MAHOUT-274) Use avro for serialization of structured documents.

2010-02-15 Thread Drew Farris (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-274?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Drew Farris updated MAHOUT-274: --- Attachment: (was: mahout-avro-examples.tar.bz) Use avro for serialization of structured

[jira] Commented: (MAHOUT-285) Wrap up collocation and dictionary vectorizer integration

2010-02-10 Thread Drew Farris (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-285?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12832047#action_12832047 ] Drew Farris commented on MAHOUT-285: Yes, I'm very close on this and should be able to

[jira] Updated: (MAHOUT-285) Wrap up collocation and dictionary vectorizer integration

2010-02-10 Thread Drew Farris (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-285?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Drew Farris updated MAHOUT-285: --- Attachment: MAHOUT-285.patch Robin, check out the DocumentProcessor integration here, is this what

[jira] Created: (MAHOUT-285) Wrap up collocation and dictionary vectorizer integration

2010-02-09 Thread Drew Farris (JIRA)
Wrap up collocation and dictionary vectorizer integration - Key: MAHOUT-285 URL: https://issues.apache.org/jira/browse/MAHOUT-285 Project: Mahout Issue Type: Improvement Affects

[jira] Created: (MAHOUT-282) Remove assembly from core, re-add commons-cli 1.x (no longer exluced from hadoop dependency)

2010-02-08 Thread Drew Farris (JIRA)
Remove assembly from core, re-add commons-cli 1.x (no longer exluced from hadoop dependency) Key: MAHOUT-282 URL: https://issues.apache.org/jira/browse/MAHOUT-282

[jira] Updated: (MAHOUT-282) Remove assembly from core, re-add commons-cli 1.x (no longer exluced from hadoop dependency)

2010-02-08 Thread Drew Farris (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-282?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Drew Farris updated MAHOUT-282: --- Attachment: MAHOUT-282.patch Remove assembly from core, re-add commons-cli 1.x (no longer exluced

[jira] Updated: (MAHOUT-282) Remove assembly from core, re-add commons-cli 1.x (no longer exluced from hadoop dependency)

2010-02-08 Thread Drew Farris (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-282?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Drew Farris updated MAHOUT-282: --- Status: Patch Available (was: Open) Remove assembly from core, re-add commons-cli 1.x (no longer

[jira] Updated: (MAHOUT-242) LLR Collocation Identifier

2010-02-08 Thread Drew Farris (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-242?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Drew Farris updated MAHOUT-242: --- Attachment: MAHOUT-242.patch Updated patch now includes a combiner for pass1 LLR Collocation

[jira] Created: (MAHOUT-283) Update assemblies to include mahout-collections for release build

2010-02-08 Thread Drew Farris (JIRA)
Update assemblies to include mahout-collections for release build - Key: MAHOUT-283 URL: https://issues.apache.org/jira/browse/MAHOUT-283 Project: Mahout Issue Type: Sub-task

[jira] Commented: (MAHOUT-282) Remove assembly from core, re-add commons-cli 1.x (no longer exluced from hadoop dependency)

2010-02-08 Thread Drew Farris (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-282?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12830970#action_12830970 ] Drew Farris commented on MAHOUT-282: Mahout doesn't pull commons-cli in directly,

[jira] Updated: (MAHOUT-242) LLR Collocation Identifier

2010-02-08 Thread Drew Farris (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-242?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Drew Farris updated MAHOUT-242: --- Attachment: MAHOUT-242.patch Moved to utils based on discussion on the dev list. This can be

[jira] Commented: (MAHOUT-274) Use avro for serialization of structured documents.

2010-02-06 Thread Drew Farris (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-274?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12830630#action_12830630 ] Drew Farris commented on MAHOUT-274: I suspect providing a writable wrapper that

[jira] Created: (MAHOUT-274) Use avro for serialization of structured documents.

2010-02-05 Thread Drew Farris (JIRA)
Use avro for serialization of structured documents. --- Key: MAHOUT-274 URL: https://issues.apache.org/jira/browse/MAHOUT-274 Project: Mahout Issue Type: Improvement Reporter: Drew

[jira] Updated: (MAHOUT-274) Use avro for serialization of structured documents.

2010-02-05 Thread Drew Farris (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-274?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Drew Farris updated MAHOUT-274: --- Attachment: mahout-avro-examples.tar.gz Very rudimentary exploration of using avro to produce

[jira] Created: (MAHOUT-272) Add licences for 3rd party jars to mahout binary release and remove additional unused dependencies.

2010-02-02 Thread Drew Farris (JIRA)
Add licences for 3rd party jars to mahout binary release and remove additional unused dependencies. --- Key: MAHOUT-272 URL:

[jira] Updated: (MAHOUT-272) Add licences for 3rd party jars to mahout binary release and remove additional unused dependencies.

2010-02-02 Thread Drew Farris (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-272?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Drew Farris updated MAHOUT-272: --- Attachment: MAHOUT-272.patch * Added exclusion for eclipse core to hadoop dependency in

[jira] Updated: (MAHOUT-272) Add licences for 3rd party jars to mahout binary release and remove additional unused dependencies.

2010-02-02 Thread Drew Farris (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-272?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Drew Farris updated MAHOUT-272: --- Status: Patch Available (was: Open) Add licences for 3rd party jars to mahout binary release and

[jira] Updated: (MAHOUT-272) Add licenses for 3rd party jars to mahout binary release and remove additional unused dependencies.

2010-02-02 Thread Drew Farris (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-272?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Drew Farris updated MAHOUT-272: --- Summary: Add licenses for 3rd party jars to mahout binary release and remove additional unused

[jira] Updated: (MAHOUT-242) LLR Collocation Identifier

2010-02-02 Thread Drew Farris (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-242?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Drew Farris updated MAHOUT-242: --- Attachment: MAHOUT-242.patch Updated patch, removed pom modifications checked in as a part of

[jira] Commented: (MAHOUT-215) Provide jars with mahout release.

2010-02-01 Thread Drew Farris (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-215?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12828430#action_12828430 ] Drew Farris commented on MAHOUT-215: It looks like there might have been a problem with

[jira] Commented: (MAHOUT-242) LLR Collocation Identifier

2010-01-29 Thread Drew Farris (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-242?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12806357#action_12806357 ] Drew Farris commented on MAHOUT-242: bq. Hey Drew, I'm not much of a maven guy - what's

[jira] Closed: (MAHOUT-215) Provide jars with mahout release.

2010-01-28 Thread Drew Farris (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-215?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Drew Farris closed MAHOUT-215. -- Provide jars with mahout release. - Key: MAHOUT-215

[jira] Commented: (MAHOUT-215) Provide jars with mahout release.

2010-01-28 Thread Drew Farris (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-215?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12805910#action_12805910 ] Drew Farris commented on MAHOUT-215: Thanks for the review and commit Jake Provide

[jira] Commented: (MAHOUT-215) Provide jars with mahout release.

2010-01-28 Thread Drew Farris (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-215?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12805942#action_12805942 ] Drew Farris commented on MAHOUT-215: bq. Just an FYI, we need to make sure we can

[jira] Updated: (MAHOUT-215) Provide jars with mahout release.

2010-01-27 Thread Drew Farris (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-215?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Drew Farris updated MAHOUT-215: --- Status: Patch Available (was: Open) Provide jars with mahout release.

[jira] Updated: (MAHOUT-215) Provide jars with mahout release.

2010-01-27 Thread Drew Farris (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-215?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Drew Farris updated MAHOUT-215: --- Attachment: MAHOUT-215.patch This patch adds build directives that produce a number of artifacts

[jira] Commented: (MAHOUT-215) Provide jars with mahout release.

2010-01-24 Thread Drew Farris (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-215?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12804344#action_12804344 ] Drew Farris commented on MAHOUT-215: I need to do a bit more work on this one, the

[jira] Commented: (MAHOUT-242) LLR Collocation Identifier

2010-01-23 Thread Drew Farris (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-242?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12804158#action_12804158 ] Drew Farris commented on MAHOUT-242: Ted, Thanks for the advice, I'll take a look at

[jira] Updated: (MAHOUT-242) LLR Collocation Identifier

2010-01-22 Thread Drew Farris (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-242?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Drew Farris updated MAHOUT-242: --- Attachment: MAHOUT-242.patch Thanks for the review Isabel, here's an updated patch. {quote}

[jira] Commented: (MAHOUT-242) LLR Collocation Identifier

2010-01-22 Thread Drew Farris (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-242?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12803944#action_12803944 ] Drew Farris commented on MAHOUT-242: Decoupling the tokenization logic from the

[jira] Updated: (MAHOUT-185) Add mahout shell script for easy launching of various algorithms

2010-01-16 Thread Drew Farris (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-185?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Drew Farris updated MAHOUT-185: --- Attachment: MAHOUT-185.patch This patch adds bin/mahout, a simple bash script based heavily on

[jira] Commented: (MAHOUT-252) Sets (primitive types)

2010-01-16 Thread Drew Farris (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-252?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12801204#action_12801204 ] Drew Farris commented on MAHOUT-252: Is this committed? It seems like there are classes

[jira] Commented: (MAHOUT-252) Sets (primitive types)

2010-01-16 Thread Drew Farris (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-252?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12801247#action_12801247 ] Drew Farris commented on MAHOUT-252: It was:

[jira] Commented: (MAHOUT-244) Add root log-likelihood method to LogLikehood class.

2010-01-14 Thread Drew Farris (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-244?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12800196#action_12800196 ] Drew Farris commented on MAHOUT-244: Thanks Isabel Add root log-likelihood method to

[jira] Created: (MAHOUT-244) Add root log-likelihood method to LogLikehood class.

2010-01-13 Thread Drew Farris (JIRA)
Add root log-likelihood method to LogLikehood class. Key: MAHOUT-244 URL: https://issues.apache.org/jira/browse/MAHOUT-244 Project: Mahout Issue Type: Improvement Components:

[jira] Updated: (MAHOUT-244) Add root log-likelihood method to LogLikehood class.

2010-01-13 Thread Drew Farris (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-244?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Drew Farris updated MAHOUT-244: --- Attachment: MAHOUT-244.patch Add root log-likelihood method to LogLikehood class.

[jira] Updated: (MAHOUT-242) LLR Collocation Identifier

2010-01-13 Thread Drew Farris (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-242?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Drew Farris updated MAHOUT-242: --- Attachment: mahout-colloc.tar.gz Thanks for taking a look and providing some great feedback Robin.

[jira] Created: (MAHOUT-242) LLR Collocation Identifier

2010-01-10 Thread Drew Farris (JIRA)
LLR Collocation Identifier -- Key: MAHOUT-242 URL: https://issues.apache.org/jira/browse/MAHOUT-242 Project: Mahout Issue Type: New Feature Affects Versions: 0.3 Reporter: Drew Farris

[jira] Updated: (MAHOUT-242) LLR Collocation Identifier

2010-01-10 Thread Drew Farris (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-242?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Drew Farris updated MAHOUT-242: --- Attachment: mahout-colloc.tar.gz LLR Collocation Identifier --

[jira] Commented: (MAHOUT-238) Further Dependency Cleanup

2010-01-07 Thread Drew Farris (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-238?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12797719#action_12797719 ] Drew Farris commented on MAHOUT-238: Thanks Sean. Further Dependency Cleanup

[jira] Commented: (MAHOUT-205) Pull Writable (and anything else hadoop dependent) out of the matrix module

2010-01-06 Thread Drew Farris (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-205?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12797221#action_12797221 ] Drew Farris commented on MAHOUT-205: Works for me, with a clean checkout latest patch

[jira] Created: (MAHOUT-238) Further Dependency Cleanup

2010-01-06 Thread Drew Farris (JIRA)
Further Dependency Cleanup -- Key: MAHOUT-238 URL: https://issues.apache.org/jira/browse/MAHOUT-238 Project: Mahout Issue Type: Sub-task Reporter: Drew Farris Priority: Minor Fix

[jira] Updated: (MAHOUT-238) Further Dependency Cleanup

2010-01-06 Thread Drew Farris (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-238?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Drew Farris updated MAHOUT-238: --- Attachment: MAHOUT-238.patch patch added Further Dependency Cleanup --

[jira] Updated: (MAHOUT-238) Further Dependency Cleanup

2010-01-06 Thread Drew Farris (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-238?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Drew Farris updated MAHOUT-238: --- Affects Version/s: 0.2 Status: Patch Available (was: Open) Further Dependency

[jira] Commented: (MAHOUT-235) GenericSorting.java also needs replacing

2010-01-04 Thread Drew Farris (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-235?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12796521#action_12796521 ] Drew Farris commented on MAHOUT-235: Ok, applied patch against r895535 by running the

  1   2   >