[jira] [Commented] (OAK-2663) Unique property index can trigger OOM during upgrade of large repository

2015-05-21 Thread Thomas Mueller (JIRA)

[ 
https://issues.apache.org/jira/browse/OAK-2663?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14553771#comment-14553771
 ] 

Thomas Mueller commented on OAK-2663:
-

https://svn.apache.org/r1680748 (1.0 branch) - a few changes were need to apply 
the patch

 Unique property index can trigger OOM during upgrade of large repository
 

 Key: OAK-2663
 URL: https://issues.apache.org/jira/browse/OAK-2663
 Project: Jackrabbit Oak
  Issue Type: Bug
  Components: upgrade
Reporter: Chetan Mehrotra
Assignee: Thomas Mueller
  Labels: performance
 Fix For: 1.3.0, 1.2.3, 1.0.15

 Attachments: OAK-2663.patch


 {{PropertyIndexEditor}} when configured for unique index maintains an in 
 memory state of indexed property in {{keysToCheckForUniqueness}}. This set 
 would accumulate all the unique values being indexed.
 In case of upgrade where the complete upgrade is performed in single commit 
 this state can become very large. Further later while exiting the editor 
 validates that all such values are actually unique by iterating over all such 
 values.
 We should look into other possible ways to enforce uniqueness constraint



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (OAK-2663) Unique property index can trigger OOM during upgrade of large repository

2015-05-20 Thread Thomas Mueller (JIRA)

[ 
https://issues.apache.org/jira/browse/OAK-2663?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14552304#comment-14552304
 ] 

Thomas Mueller commented on OAK-2663:
-

https://svn.apache.org/r1680559 (trunk)

 Unique property index can trigger OOM during upgrade of large repository
 

 Key: OAK-2663
 URL: https://issues.apache.org/jira/browse/OAK-2663
 Project: Jackrabbit Oak
  Issue Type: Bug
  Components: upgrade
Reporter: Chetan Mehrotra
Assignee: Thomas Mueller
  Labels: performance
 Fix For: 1.3.0, 1.2.3, 1.0.15

 Attachments: OAK-2663.patch


 {{PropertyIndexEditor}} when configured for unique index maintains an in 
 memory state of indexed property in {{keysToCheckForUniqueness}}. This set 
 would accumulate all the unique values being indexed.
 In case of upgrade where the complete upgrade is performed in single commit 
 this state can become very large. Further later while exiting the editor 
 validates that all such values are actually unique by iterating over all such 
 values.
 We should look into other possible ways to enforce uniqueness constraint



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (OAK-2663) Unique property index can trigger OOM during upgrade of large repository

2015-05-20 Thread Thomas Mueller (JIRA)

[ 
https://issues.apache.org/jira/browse/OAK-2663?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14552311#comment-14552311
 ] 

Thomas Mueller commented on OAK-2663:
-

I made some changes to the patch, for example I split the combined method 
filterFailingUniques. I used a more conservative approach to detect duplicates: 
the node itself is checked (same as in the old code), not just the existence of 
the node (as in the patch). While adding entries, it is now checked that the 
same path is only indexed once. I made those changes because a test case showed 
problems with the existing approach: if an index is outdated (for example by 
removing nodes directly in MongoDB), then re-adding the nodes that are already 
indexed will result in a duplicate key exception. This is now prevented. I also 
removed some untested, and currently unused code 
(ContentMirrorStoreStrategy.exists), but added a comment to this issue if this 
code is ever needed.

 Unique property index can trigger OOM during upgrade of large repository
 

 Key: OAK-2663
 URL: https://issues.apache.org/jira/browse/OAK-2663
 Project: Jackrabbit Oak
  Issue Type: Bug
  Components: upgrade
Reporter: Chetan Mehrotra
Assignee: Thomas Mueller
  Labels: performance
 Fix For: 1.3.0, 1.2.3, 1.0.15

 Attachments: OAK-2663.patch


 {{PropertyIndexEditor}} when configured for unique index maintains an in 
 memory state of indexed property in {{keysToCheckForUniqueness}}. This set 
 would accumulate all the unique values being indexed.
 In case of upgrade where the complete upgrade is performed in single commit 
 this state can become very large. Further later while exiting the editor 
 validates that all such values are actually unique by iterating over all such 
 values.
 We should look into other possible ways to enforce uniqueness constraint



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (OAK-2663) Unique property index can trigger OOM during upgrade of large repository

2015-05-20 Thread Thomas Mueller (JIRA)

[ 
https://issues.apache.org/jira/browse/OAK-2663?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14552345#comment-14552345
 ] 

Thomas Mueller commented on OAK-2663:
-

https://svn.apache.org/r1680577 (1.2 branch)

 Unique property index can trigger OOM during upgrade of large repository
 

 Key: OAK-2663
 URL: https://issues.apache.org/jira/browse/OAK-2663
 Project: Jackrabbit Oak
  Issue Type: Bug
  Components: upgrade
Reporter: Chetan Mehrotra
Assignee: Thomas Mueller
  Labels: performance
 Fix For: 1.3.0, 1.2.3, 1.0.15

 Attachments: OAK-2663.patch


 {{PropertyIndexEditor}} when configured for unique index maintains an in 
 memory state of indexed property in {{keysToCheckForUniqueness}}. This set 
 would accumulate all the unique values being indexed.
 In case of upgrade where the complete upgrade is performed in single commit 
 this state can become very large. Further later while exiting the editor 
 validates that all such values are actually unique by iterating over all such 
 values.
 We should look into other possible ways to enforce uniqueness constraint



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (OAK-2663) Unique property index can trigger OOM during upgrade of large repository

2015-05-19 Thread Thomas Mueller (JIRA)

[ 
https://issues.apache.org/jira/browse/OAK-2663?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14550551#comment-14550551
 ] 

Thomas Mueller commented on OAK-2663:
-

There was some change since the patch, but other than that it looks OK.
I'm trying to write a special test case for it.

 Unique property index can trigger OOM during upgrade of large repository
 

 Key: OAK-2663
 URL: https://issues.apache.org/jira/browse/OAK-2663
 Project: Jackrabbit Oak
  Issue Type: Bug
  Components: upgrade
Reporter: Chetan Mehrotra
Assignee: Thomas Mueller
  Labels: performance
 Fix For: 1.3.0, 1.2.3, 1.0.15

 Attachments: OAK-2663.patch


 {{PropertyIndexEditor}} when configured for unique index maintains an in 
 memory state of indexed property in {{keysToCheckForUniqueness}}. This set 
 would accumulate all the unique values being indexed.
 In case of upgrade where the complete upgrade is performed in single commit 
 this state can become very large. Further later while exiting the editor 
 validates that all such values are actually unique by iterating over all such 
 values.
 We should look into other possible ways to enforce uniqueness constraint



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (OAK-2663) Unique property index can trigger OOM during upgrade of large repository

2015-05-14 Thread Chetan Mehrotra (JIRA)

[ 
https://issues.apache.org/jira/browse/OAK-2663?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14543388#comment-14543388
 ] 

Chetan Mehrotra commented on OAK-2663:
--

ping [~tmueller] [~mduerig] Can you have a look at the patch. This behaviour 
causing large memory requirement for bigger migration and also slows done the 
migration. So having this fixed would be helpful 

 Unique property index can trigger OOM during upgrade of large repository
 

 Key: OAK-2663
 URL: https://issues.apache.org/jira/browse/OAK-2663
 Project: Jackrabbit Oak
  Issue Type: Bug
  Components: upgrade
Reporter: Chetan Mehrotra
  Labels: performance
 Fix For: 1.3.0

 Attachments: OAK-2663.patch


 {{PropertyIndexEditor}} when configured for unique index maintains an in 
 memory state of indexed property in {{keysToCheckForUniqueness}}. This set 
 would accumulate all the unique values being indexed.
 In case of upgrade where the complete upgrade is performed in single commit 
 this state can become very large. Further later while exiting the editor 
 validates that all such values are actually unique by iterating over all such 
 values.
 We should look into other possible ways to enforce uniqueness constraint



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (OAK-2663) Unique property index can trigger OOM during upgrade of large repository

2015-03-25 Thread Chetan Mehrotra (JIRA)

[ 
https://issues.apache.org/jira/browse/OAK-2663?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14379923#comment-14379923
 ] 

Chetan Mehrotra commented on OAK-2663:
--

[~tmueller] [~mduerig] Can you review the patch. It would be helpful to have 
this included

 Unique property index can trigger OOM during upgrade of large repository
 

 Key: OAK-2663
 URL: https://issues.apache.org/jira/browse/OAK-2663
 Project: Jackrabbit Oak
  Issue Type: Bug
  Components: upgrade
Reporter: Chetan Mehrotra
Assignee: Chetan Mehrotra
 Fix For: 1.1.8

 Attachments: OAK-2663.patch


 {{PropertyIndexEditor}} when configured for unique index maintains an in 
 memory state of indexed property in {{keysToCheckForUniqueness}}. This set 
 would accumulate all the unique values being indexed.
 In case of upgrade where the complete upgrade is performed in single commit 
 this state can become very large. Further later while exiting the editor 
 validates that all such values are actually unique by iterating over all such 
 values.
 We should look into other possible ways to enforce uniqueness constraint



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (OAK-2663) Unique property index can trigger OOM during upgrade of large repository

2015-03-21 Thread Chetan Mehrotra (JIRA)

[ 
https://issues.apache.org/jira/browse/OAK-2663?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14372712#comment-14372712
 ] 

Chetan Mehrotra commented on OAK-2663:
--

In one the upgrade (to segment) disabling the last check for uniqueness 
constraint saved 8 mins (time in index editor reduced from 40 min to 32 min for 
~13 M nodes). I believe the saving would be higher when using Mongo

 Unique property index can trigger OOM during upgrade of large repository
 

 Key: OAK-2663
 URL: https://issues.apache.org/jira/browse/OAK-2663
 Project: Jackrabbit Oak
  Issue Type: Bug
  Components: upgrade
Reporter: Chetan Mehrotra
Assignee: Chetan Mehrotra
 Fix For: 1.1.8


 {{PropertyIndexEditor}} when configured for unique index maintains an in 
 memory state of indexed property in {{keysToCheckForUniqueness}}. This set 
 would accumulate all the unique values being indexed.
 In case of upgrade where the complete upgrade is performed in single commit 
 this state can become very large. Further later while exiting the editor 
 validates that all such values are actually unique by iterating over all such 
 values.
 We should look into other possible ways to enforce uniqueness constraint



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)