[
https://issues.apache.org/jira/browse/MAHOUT-997?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13242903#comment-13242903
]
Lance Norskog commented on MAHOUT-997:
--
This is a general problem, not a splitData
[
https://issues.apache.org/jira/browse/MAHOUT-994?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13236249#comment-13236249
]
Lance Norskog commented on MAHOUT-994:
--
Don't the job jars pack up Hadoop libs like
[
https://issues.apache.org/jira/browse/MAHOUT-944?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13221492#comment-13221492
]
Lance Norskog commented on MAHOUT-944:
--
Would the bugfix also apply over HDFS or S3?
[
https://issues.apache.org/jira/browse/MAHOUT-944?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13212988#comment-13212988
]
Lance Norskog commented on MAHOUT-944:
--
Can the configuration object also store
[
https://issues.apache.org/jira/browse/MAHOUT-944?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13209102#comment-13209102
]
Lance Norskog commented on MAHOUT-944:
--
bq. Why the need to get the scorer, etc.? I
[
https://issues.apache.org/jira/browse/MAHOUT-975?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13206660#comment-13206660
]
Lance Norskog commented on MAHOUT-975:
--
The newest patch does not compile against the
[
https://issues.apache.org/jira/browse/MAHOUT-944?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13206670#comment-13206670
]
Lance Norskog commented on MAHOUT-944:
--
This is a Lucene query. It's already sorted!
[
https://issues.apache.org/jira/browse/MAHOUT-947?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13206332#comment-13206332
]
Lance Norskog commented on MAHOUT-947:
--
There is a sequencefile utility code pattern
[
https://issues.apache.org/jira/browse/MAHOUT-944?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13206348#comment-13206348
]
Lance Norskog commented on MAHOUT-944:
--
A map-reduce version:
# Lets you handle much
[
https://issues.apache.org/jira/browse/MAHOUT-784?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13206349#comment-13206349
]
Lance Norskog commented on MAHOUT-784:
--
I'm sure there are a lot of diffs. My general
[
https://issues.apache.org/jira/browse/MAHOUT-784?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13206047#comment-13206047
]
Lance Norskog commented on MAHOUT-784:
--
Joe- which Mahout formatter do you use? It
[
https://issues.apache.org/jira/browse/MAHOUT-947?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13205003#comment-13205003
]
Lance Norskog commented on MAHOUT-947:
--
I won't be able to try it. The patch looks
[
https://issues.apache.org/jira/browse/MAHOUT-946?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13203290#comment-13203290
]
Lance Norskog commented on MAHOUT-946:
--
bq. My current view of AbstractJob is that it
[
https://issues.apache.org/jira/browse/MAHOUT-970?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13199576#comment-13199576
]
Lance Norskog commented on MAHOUT-970:
--
There is a similar situation with Lucene,
[
https://issues.apache.org/jira/browse/MAHOUT-946?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13195416#comment-13195416
]
Lance Norskog commented on MAHOUT-946:
--
Sweet!
Map-reduce job
[
https://issues.apache.org/jira/browse/MAHOUT-946?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13188152#comment-13188152
]
Lance Norskog commented on MAHOUT-946:
--
A job shutdown method should remember to
[
https://issues.apache.org/jira/browse/MAHOUT-947?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13187291#comment-13187291
]
Lance Norskog commented on MAHOUT-947:
--
mahout/src/conf/driver.classes.props lists
[
https://issues.apache.org/jira/browse/MAHOUT-946?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13187369#comment-13187369
]
Lance Norskog commented on MAHOUT-946:
--
Yup, you're right. Shell script should be a
[
https://issues.apache.org/jira/browse/MAHOUT-946?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13186626#comment-13186626
]
Lance Norskog commented on MAHOUT-946:
--
Don't forget the example shell scripts!
[
https://issues.apache.org/jira/browse/MAHOUT-947?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13186630#comment-13186630
]
Lance Norskog commented on MAHOUT-947:
--
VectorDumper is a custom class just for
[
https://issues.apache.org/jira/browse/MAHOUT-863?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13185410#comment-13185410
]
Lance Norskog commented on MAHOUT-863:
--
Just tested this- works great. Thanks for
[
https://issues.apache.org/jira/browse/MAHOUT-939?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13183871#comment-13183871
]
Lance Norskog commented on MAHOUT-939:
--
In answer to the previous comment: I only ran
[
https://issues.apache.org/jira/browse/MAHOUT-939?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13183874#comment-13183874
]
Lance Norskog commented on MAHOUT-939:
--
asf_samples_list.txt is a complete listing of
[
https://issues.apache.org/jira/browse/MAHOUT-939?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13182447#comment-13182447
]
Lance Norskog commented on MAHOUT-939:
--
Another oddity: using the full apache commons
[
https://issues.apache.org/jira/browse/MAHOUT-939?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13182299#comment-13182299
]
Lance Norskog commented on MAHOUT-939:
--
SGD now does:
* 88% with just the subject,
[
https://issues.apache.org/jira/browse/MAHOUT-939?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13182326#comment-13182326
]
Lance Norskog commented on MAHOUT-939:
--
I did my testing on the Apache commons/cocoon
[
https://issues.apache.org/jira/browse/MAHOUT-939?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13179255#comment-13179255
]
Lance Norskog commented on MAHOUT-939:
--
Details please.
# Data set and provenance
#
[
https://issues.apache.org/jira/browse/MAHOUT-904?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13172014#comment-13172014
]
Lance Norskog commented on MAHOUT-904:
--
Hi-
Don't see the 'add review' button. Can
[
https://issues.apache.org/jira/browse/MAHOUT-923?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13167337#comment-13167337
]
Lance Norskog commented on MAHOUT-923:
--
MatrixRowMeanJob writes
[
https://issues.apache.org/jira/browse/MAHOUT-913?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13163310#comment-13163310
]
Lance Norskog commented on MAHOUT-913:
--
Eclipse has Checkstyle PMD available as
[
https://issues.apache.org/jira/browse/MAHOUT-913?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13163317#comment-13163317
]
Lance Norskog commented on MAHOUT-913:
--
Does this break any active JIRA patches?
[
https://issues.apache.org/jira/browse/MAHOUT-910?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13161952#comment-13161952
]
Lance Norskog commented on MAHOUT-910:
--
{code}
return userIDs1.size()
[
https://issues.apache.org/jira/browse/MAHOUT-880?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13161394#comment-13161394
]
Lance Norskog commented on MAHOUT-880:
--
Another problem I've seen in some places is
[
https://issues.apache.org/jira/browse/MAHOUT-880?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13161398#comment-13161398
]
Lance Norskog commented on MAHOUT-880:
--
Oops sorry. This is about the set of pairwise
[
https://issues.apache.org/jira/browse/MAHOUT-895?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13158181#comment-13158181
]
Lance Norskog commented on MAHOUT-895:
--
+1. Anything to make the examples more clear.
[
https://issues.apache.org/jira/browse/MAHOUT-840?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13158193#comment-13158193
]
Lance Norskog commented on MAHOUT-840:
--
A couple of points:
# (int) on a double means
[
https://issues.apache.org/jira/browse/MAHOUT-899?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13158236#comment-13158236
]
Lance Norskog commented on MAHOUT-899:
--
Suggestion on sampling: use reservoir
[
https://issues.apache.org/jira/browse/MAHOUT-869?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13157647#comment-13157647
]
Lance Norskog commented on MAHOUT-869:
--
I suggest changing all the dumpers to: dump*.
[
https://issues.apache.org/jira/browse/MAHOUT-884?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13157669#comment-13157669
]
Lance Norskog commented on MAHOUT-884:
--
Map-reduce does not handle this well. There
[
https://issues.apache.org/jira/browse/MAHOUT-884?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13151074#comment-13151074
]
Lance Norskog commented on MAHOUT-884:
--
bq. Then this should be a map-reduce job, not
[
https://issues.apache.org/jira/browse/MAHOUT-884?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13151075#comment-13151075
]
Lance Norskog commented on MAHOUT-884:
--
What is the scope of matrix sizes where this
[
https://issues.apache.org/jira/browse/MAHOUT-845?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13149373#comment-13149373
]
Lance Norskog commented on MAHOUT-845:
--
1) Is this feature useful in any other code
[
https://issues.apache.org/jira/browse/MAHOUT-884?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13149469#comment-13149469
]
Lance Norskog commented on MAHOUT-884:
--
I forgot about NamedVectors :(
[
https://issues.apache.org/jira/browse/MAHOUT-784?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13148249#comment-13148249
]
Lance Norskog commented on MAHOUT-784:
--
Hi-
Yes, that sounds like a great idea.
[
https://issues.apache.org/jira/browse/MAHOUT-830?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13146672#comment-13146672
]
Lance Norskog commented on MAHOUT-830:
--
Never checked in. It's a fine idea.
[
https://issues.apache.org/jira/browse/MAHOUT-784?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13145166#comment-13145166
]
Lance Norskog commented on MAHOUT-784:
--
No, I meant that the script that runs 'mahout
[
https://issues.apache.org/jira/browse/MAHOUT-874?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13144616#comment-13144616
]
Lance Norskog commented on MAHOUT-874:
--
If you're going to unify clustering and
[
https://issues.apache.org/jira/browse/MAHOUT-784?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13144924#comment-13144924
]
Lance Norskog commented on MAHOUT-784:
--
The tools are there to get this output. You
[
https://issues.apache.org/jira/browse/MAHOUT-838?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13143784#comment-13143784
]
Lance Norskog commented on MAHOUT-838:
--
The November 1 patch is correct:
[
https://issues.apache.org/jira/browse/MAHOUT-838?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13144458#comment-13144458
]
Lance Norskog commented on MAHOUT-838:
--
Now we're all set to ditch matrix labels and
[
https://issues.apache.org/jira/browse/MAHOUT-838?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13143685#comment-13143685
]
Lance Norskog commented on MAHOUT-838:
--
There's something very wrong with how I'm
[
https://issues.apache.org/jira/browse/MAHOUT-838?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13139560#comment-13139560
]
Lance Norskog commented on MAHOUT-838:
--
Joe-
I see your point. I have code to
[
https://issues.apache.org/jira/browse/MAHOUT-849?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13133493#comment-13133493
]
Lance Norskog commented on MAHOUT-849:
--
Error: wanted 43, received 43. This usually
[
https://issues.apache.org/jira/browse/MAHOUT-849?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13133566#comment-13133566
]
Lance Norskog commented on MAHOUT-849:
--
My code attempted to multiply two MxN
[
https://issues.apache.org/jira/browse/MAHOUT-847?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13133234#comment-13133234
]
Lance Norskog commented on MAHOUT-847:
--
How does this compare with
[
https://issues.apache.org/jira/browse/MAHOUT-838?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13133241#comment-13133241
]
Lance Norskog commented on MAHOUT-838:
--
MAHOUT-812 was committed on Oct. 3. If your
[
https://issues.apache.org/jira/browse/MAHOUT-847?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13132186#comment-13132186
]
Lance Norskog commented on MAHOUT-847:
--
A instance of this class will probably be
[
https://issues.apache.org/jira/browse/MAHOUT-847?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13132191#comment-13132191
]
Lance Norskog commented on MAHOUT-847:
--
As a perpetual beginner, it is daunting to
[
https://issues.apache.org/jira/browse/MAHOUT-832?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13123604#comment-13123604
]
Lance Norskog commented on MAHOUT-832:
--
For the output of the mail archives
[
https://issues.apache.org/jira/browse/MAHOUT-828?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13121688#comment-13121688
]
Lance Norskog commented on MAHOUT-828:
--
Yes, please stop printing the classpath.
[
https://issues.apache.org/jira/browse/MAHOUT-824?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13119116#comment-13119116
]
Lance Norskog commented on MAHOUT-824:
--
Sure, this is fine.
MemoryDiffStorage
[
https://issues.apache.org/jira/browse/MAHOUT-824?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13118947#comment-13118947
]
Lance Norskog commented on MAHOUT-824:
--
The new MemoryDiffStorage2 is revamped to use
[
https://issues.apache.org/jira/browse/MAHOUT-812?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13116204#comment-13116204
]
Lance Norskog commented on MAHOUT-812:
--
Yeah, it did not seem right to me either.
[
https://issues.apache.org/jira/browse/MAHOUT-778?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13116223#comment-13116223
]
Lance Norskog commented on MAHOUT-778:
--
Could the final iteration files be renamed at
64 matches
Mail list logo