[jira] Commented: (LUCENENET-357) Mostly won't work on shared hosts / Mosso cloud etc due to 'Trust' levels

2010-04-21 Thread Digy (JIRA)
[ https://issues.apache.org/jira/browse/LUCENENET-357?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12859414#action_12859414 ] Digy commented on LUCENENET-357: {quote} Btw: Should I be using this version (2.9)

[jira] Commented: (LUCENENET-357) Mostly won't work on shared hosts / Mosso cloud etc due to 'Trust' levels

2010-04-21 Thread Frank West (JIRA)
[ https://issues.apache.org/jira/browse/LUCENENET-357?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12859480#action_12859480 ] Frank West commented on LUCENENET-357: -- I've checked a couple of times for anything

[jira] Commented: (LUCENENET-357) Mostly won't work on shared hosts / Mosso cloud etc due to 'Trust' levels

2010-04-21 Thread Frank West (JIRA)
[ https://issues.apache.org/jira/browse/LUCENENET-357?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12859505#action_12859505 ] Frank West commented on LUCENENET-357: -- Absolutely agree re: divergence - good

[jira] Commented: (LUCENENET-357) Mostly won't work on shared hosts / Mosso cloud etc due to 'Trust' levels

2010-04-21 Thread Digy (JIRA)
[ https://issues.apache.org/jira/browse/LUCENENET-357?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12859521#action_12859521 ] Digy commented on LUCENENET-357: Sorry Frank, It is not that easy just to say use cache

[jira] Updated: (LUCENENET-357) Mostly won't work on shared hosts / Mosso cloud etc due to 'Trust' levels

2010-04-21 Thread Robert Jordan (JIRA)
[ https://issues.apache.org/jira/browse/LUCENENET-357?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Robert Jordan updated LUCENENET-357: Attachment: WeakKey.diff Diggi, the patch fixes the issue by wrapping a WeakReference

[jira] Commented: (LUCENENET-357) Mostly won't work on shared hosts / Mosso cloud etc due to 'Trust' levels

2010-04-21 Thread Robert Jordan (JIRA)
[ https://issues.apache.org/jira/browse/LUCENENET-357?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12859532#action_12859532 ] Robert Jordan commented on LUCENENET-357: - DIGY, of course. Sorry for that :)

[jira] Resolved: (MAHOUT-316) CardinalityException and IndexException should remove the default constructor, and always construct with arguments saying what the error was

2010-04-21 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-316?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved MAHOUT-316. -- Assignee: Sean Owen Resolution: Fixed Good idea, I made this happen. CardinalityException and

[jira] Updated: (MAHOUT-379) SequentialAccessSparseVector.equals does not agree with AbstractVector.equivalent

2010-04-21 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-379?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated MAHOUT-379: - Status: Resolved (was: Patch Available) Fix Version/s: 0.4 (was: 0.3)

Re: Status of Mahout TLP

2010-04-21 Thread Robin Anil
Today is the day :) On Tue, Apr 13, 2010 at 5:16 AM, Benson Margulies bimargul...@gmail.comwrote: Here's a practical matter: svn layout. starting at the root we get, I propose: - sandboxes - mahout(/trunk,tag,branches) - collections(/trunk/tag/branches) sandboxes gives us a

Re: Status of Mahout TLP

2010-04-21 Thread Grant Ingersoll
On Apr 21, 2010, at 5:28 AM, Robin Anil wrote: Today is the day :) Assuming it passes... (which it should.) We'll have some heavy lifting to do for a few days/weeks before any practical part of it is noticeable, just so people have reasonable expectations. Anyone up for a website

Re: Status of Mahout TLP

2010-04-21 Thread Robin Anil
I can help out in the redesign. Is there a CMS approved by apache security, something which will get patched automatically? Robin On Wed, Apr 21, 2010 at 3:34 PM, Grant Ingersoll gsing...@apache.orgwrote: On Apr 21, 2010, at 5:28 AM, Robin Anil wrote: Today is the day :) Assuming it

Re: Status of Mahout TLP

2010-04-21 Thread Benson Margulies
Confluence? On Wed, Apr 21, 2010 at 7:10 AM, Robin Anil robin.a...@gmail.com wrote: I can help out in the redesign. Is there a CMS approved by apache security, something which will get patched automatically? Robin On Wed, Apr 21, 2010 at 3:34 PM, Grant Ingersoll gsing...@apache.org

Re: Status of Mahout TLP

2010-04-21 Thread Grant Ingersoll
On Apr 21, 2010, at 7:55 AM, Benson Margulies wrote: Confluence? Yeah, that's pretty much it. I'm thinking we have a static front page and then backed by some pages on Confluence. Check out what http://ofbiz.apache.org. -Grant On Wed, Apr 21, 2010 at 7:10 AM, Robin Anil

Re: Status of Mahout TLP

2010-04-21 Thread Benson Margulies
Yup, same scheme at CXF. On Wed, Apr 21, 2010 at 8:31 AM, Grant Ingersoll gsing...@apache.orgwrote: On Apr 21, 2010, at 7:55 AM, Benson Margulies wrote: Confluence? Yeah, that's pretty much it. I'm thinking we have a static front page and then backed by some pages on Confluence. Check

TLP Status

2010-04-21 Thread Grant Ingersoll
The Board has approved Mahout, Tika, and Nutch moving to be top level status. Congrats! Now begins the fun part of changing mailing lists, domains, etc. -Grant

Re: TLP Status

2010-04-21 Thread Jeff Eastman
Yeay team! On 4/21/10 1:09 PM, Grant Ingersoll wrote: The Board has approved Mahout, Tika, and Nutch moving to be top level status. Congrats! Now begins the fun part of changing mailing lists, domains, etc. -Grant

Re: TLP Status

2010-04-21 Thread Drew Farris
Contratulations everyone! On Wed, Apr 21, 2010 at 4:42 PM, Jeff Eastman j...@windwardsolutions.com wrote: Yeay team!

Re: TLP Status

2010-04-21 Thread Drew Farris
urm.. Congratulations :-D On Wed, Apr 21, 2010 at 5:00 PM, Drew Farris drew.far...@gmail.com wrote: Contratulations everyone!

Re: Status of Mahout TLP

2010-04-21 Thread Drew Farris
On Wed, Apr 21, 2010 at 6:04 AM, Grant Ingersoll gsing...@apache.org wrote: Assuming it passes...  (which it should.)  We'll have some heavy lifting to do for a few days/weeks before any practical part of it is noticeable, just so people have reasonable expectations. What sort of things

[Idea] Support Facebook Opengraph JSON format as an input

2010-04-21 Thread Robin Anil
The details are not clear at the moment. But, I am sure this will help adoption of the mahout quickly. Things to do. Parse JSON and make the SequenceFiles for use for clustering, classification and recommendation. Robin

Re: Status of Mahout TLP

2010-04-21 Thread Benson Margulies
I thought it would be a bit less confusing if the current toplevel moved directly to REPO/mahout/mahout instead of moving to REPO/mahout. On Wed, Apr 21, 2010 at 5:43 PM, Grant Ingersoll gsing...@apache.orgwrote: On Apr 21, 2010, at 5:01 PM, Drew Farris wrote: On Wed, Apr 21, 2010 at 6:04

Re: [Idea] Support Facebook Opengraph JSON format as an input

2010-04-21 Thread Jeff Eastman
Mahout Vectors and Clusters currently support JSON encodings for input and output. What else is needed? Jeff On 4/21/10 4:18 PM, Robin Anil wrote: The details are not clear at the moment. But, I am sure this will help adoption of the mahout quickly. Things to do. Parse JSON and make the

Mahout TLP to-do list

2010-04-21 Thread Drew Farris
Probably worth starting another thread for this. Mahout TLP to-do's: Website design: Robin SVN: Grant will take care can take care of the move when we are ready. Mailing lists: JIRA's for INFRA Jira/Confluence: the same as today. Can we mod the confluence theme to match the website?

[jira] Created: (MAHOUT-384) Implement of AVF algorithm

2010-04-21 Thread tony cui (JIRA)
Implement of AVF algorithm -- Key: MAHOUT-384 URL: https://issues.apache.org/jira/browse/MAHOUT-384 Project: Mahout Issue Type: New Feature Components: Collaborative Filtering Reporter: tony cui

[jira] Updated: (MAHOUT-384) Implement of AVF algorithm

2010-04-21 Thread tony cui (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-384?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] tony cui updated MAHOUT-384: Attachment: mahout-384.patch Implement of AVF algorithm --

[jira] Commented: (MAHOUT-384) Implement of AVF algorithm

2010-04-21 Thread tony cui (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-384?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12859654#action_12859654 ] tony cui commented on MAHOUT-384: - I just committed this patch which realize avf algorithm.

[jira] Commented: (MAHOUT-384) Implement of AVF algorithm

2010-04-21 Thread tony cui (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-384?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12859655#action_12859655 ] tony cui commented on MAHOUT-384: - I mean, what am I supposed to do next? Implement of

Re: [Idea] Support Facebook Opengraph JSON format as an input

2010-04-21 Thread Robin Anil
Basically support extraction of fields and vector. Here is my public info graph.facebook.com/robin.anil. Here is coco cola graph.facebook.com/cocacola http://graph.facebook.com/robin.anilRobin On Thu, Apr 22, 2010 at 6:08 AM, Jeff Eastman j...@windwardsolutions.comwrote: Mahout Vectors and

[jira] Commented: (NUTCH-710) Support for rel=canonical attribute

2010-04-21 Thread Julien Nioche (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-710?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12859286#action_12859286 ] Julien Nioche commented on NUTCH-710: - As suggested previously we could either treat

TLP Status

2010-04-21 Thread Grant Ingersoll
The Board has approved Mahout, Tika, and Nutch moving to be top level status. Congrats! Now begins the fun part of changing mailing lists, domains, etc. -Grant

[jira] Updated: (TIKA-242) Incremental configuration AutoDetectParser

2010-04-21 Thread Aaron Kaplan (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-242?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Aaron Kaplan updated TIKA-242: -- Attachment: TikaConfig-patch Here's a patch that implements this. It adds an optional boolean argument

[jira] Commented: (LUCENE-2405) Benchmark DocMaker no longer allows off prescription usage

2010-04-21 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2405?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12859352#action_12859352 ] Michael McCandless commented on LUCENE-2405: Grant was this really Won't Fix?

[jira] Commented: (LUCENE-2402) Add an explicit method to invoke IndexDeletionPolicy

2010-04-21 Thread Shai Erera (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2402?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12859377#action_12859377 ] Shai Erera commented on LUCENE-2402: bq. I think we need to open up a new package

[jira] Created: (LUCENE-2408) Add Document.set/getSourceID, as an optional hint to IndexWriter to improve indexing performance

2010-04-21 Thread Michael McCandless (JIRA)
Add Document.set/getSourceID, as an optional hint to IndexWriter to improve indexing performance Key: LUCENE-2408 URL:

Re: Proposal about Version API relaxation

2010-04-21 Thread Michael McCandless
I like these separate levels, to characterize index compatibility. As far as I know we've never had a level 3 major release :) Mike On Mon, Apr 19, 2010 at 4:28 AM, Doron Cohen cdor...@gmail.com wrote: Late joining... could we agree on an intention to provide an index migration tool when/if

Re: Proposal about Version API relaxation

2010-04-21 Thread Michael McCandless
Trying to summarize what we seem to be roughly converging to, here: * Up front: consolidate all Solr core, Lucene core, contrib analyzers into one place (contrib/analyzers). Don't use Version in there; instead, the released JAR is versioned. The app picks its required version

Re: Per-Thread DW and IW

2010-04-21 Thread Michael Busch
Yeah, sounds like we have the same things in mind here. In fact, this is pretty similar to what we discussed a while ago on LUCENE-2026 I think. SegmentWriter could be a higher level interface with more than one implementation. E.g. there could be one SegmentWriter that supports appending

Re: Per-Thread DW and IW

2010-04-21 Thread Shai Erera
I don't advocate to develop PI as an external entity to Lucene, you've already done that ! :) We should open up IW enough to develop PI efficiently, but I think we should always allow some freedom and flexibility to using applications. If IW simply created a Parallel DW, handle the merges on its

Re: Proposal about Version API relaxation

2010-04-21 Thread Shai Erera
So basically, API-wise, the stable branch will remain like it is today: API changes under deprecation path, bw breaks as long as they are documented in CHANGES etc. Trunk will be allowed to change the API as it sees fit (but still document the changes in CHANGES). Index-format wise, we adopt