[jira] [Created] (OAK-10736) Collect DetailedGC Stats for DryRun mode

2024-04-02 Thread Rishabh Daim (Jira)
Rishabh Daim created OAK-10736:
--

 Summary: Collect DetailedGC Stats for DryRun mode
 Key: OAK-10736
 URL: https://issues.apache.org/jira/browse/OAK-10736
 Project: Jackrabbit Oak
  Issue Type: Task
Reporter: Rishabh Daim
Assignee: Rishabh Daim






--
This message was sent by Atlassian Jira
(v8.20.10#820010)


[jira] [Resolved] (OAK-10695) oak-benchmarks/oak-it-osgi: update commons-compress dependency to 1.26.1

2024-04-02 Thread Julian Reschke (Jira)


 [ 
https://issues.apache.org/jira/browse/OAK-10695?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Julian Reschke resolved OAK-10695.
--
Resolution: Fixed

> oak-benchmarks/oak-it-osgi: update commons-compress dependency to 1.26.1
> 
>
> Key: OAK-10695
> URL: https://issues.apache.org/jira/browse/OAK-10695
> Project: Jackrabbit Oak
>  Issue Type: Task
>  Components: benchmarks, it, osgi
>Reporter: Julian Reschke
>Assignee: Julian Reschke
>Priority: Minor
>  Labels: candidate_oak_1_22
> Fix For: 1.62.0
>
>




--
This message was sent by Atlassian Jira
(v8.20.10#820010)


[jira] (OAK-10695) oak-benchmarks/oak-it-osgi: update commons-compress dependency to 1.26.1

2024-04-02 Thread Julian Reschke (Jira)


[ https://issues.apache.org/jira/browse/OAK-10695 ]


Julian Reschke deleted comment on OAK-10695:
--

was (Author: reschke):
trunk: 
[5e054a5fa1|https://github.com/apache/jackrabbit-oak/commit/5e054a5fa1a0f5c7c11427413f86ff2566a3512d]

> oak-benchmarks/oak-it-osgi: update commons-compress dependency to 1.26.1
> 
>
> Key: OAK-10695
> URL: https://issues.apache.org/jira/browse/OAK-10695
> Project: Jackrabbit Oak
>  Issue Type: Task
>  Components: benchmarks, it, osgi
>Reporter: Julian Reschke
>Assignee: Julian Reschke
>Priority: Minor
>  Labels: candidate_oak_1_22
> Fix For: 1.62.0
>
>




--
This message was sent by Atlassian Jira
(v8.20.10#820010)


[jira] [Commented] (OAK-10695) oak-benchmarks/oak-it-osgi: update commons-compress dependency to 1.26.1

2024-04-02 Thread Julian Reschke (Jira)


[ 
https://issues.apache.org/jira/browse/OAK-10695?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17833266#comment-17833266
 ] 

Julian Reschke commented on OAK-10695:
--

trunk: 
[089a42407f|https://github.com/apache/jackrabbit-oak/commit/089a42407f4b0429ce5d415b95553c1c8e4d7b64]
 
[5e054a5fa1|https://github.com/apache/jackrabbit-oak/commit/5e054a5fa1a0f5c7c11427413f86ff2566a3512d]

> oak-benchmarks/oak-it-osgi: update commons-compress dependency to 1.26.1
> 
>
> Key: OAK-10695
> URL: https://issues.apache.org/jira/browse/OAK-10695
> Project: Jackrabbit Oak
>  Issue Type: Task
>  Components: benchmarks, it, osgi
>Reporter: Julian Reschke
>Assignee: Julian Reschke
>Priority: Minor
>  Labels: candidate_oak_1_22
> Fix For: 1.62.0
>
>




--
This message was sent by Atlassian Jira
(v8.20.10#820010)


[jira] [Updated] (OAK-10708) DocumentNodeStore: error-log failures to update the journal

2024-04-02 Thread Julian Reschke (Jira)


 [ 
https://issues.apache.org/jira/browse/OAK-10708?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Julian Reschke updated OAK-10708:
-
Fix Version/s: 1.64.0
   (was: 1.62.0)

> DocumentNodeStore: error-log failures to update the journal 
> 
>
> Key: OAK-10708
> URL: https://issues.apache.org/jira/browse/OAK-10708
> Project: Jackrabbit Oak
>  Issue Type: Technical task
>  Components: documentmk
>Reporter: Julian Reschke
>Assignee: Julian Reschke
>Priority: Minor
> Fix For: 1.64.0
>
>




--
This message was sent by Atlassian Jira
(v8.20.10#820010)


[jira] [Resolved] (OAK-10725) Upgrade jackson-databind dependency to 2.16.2

2024-04-02 Thread Julian Reschke (Jira)


 [ 
https://issues.apache.org/jira/browse/OAK-10725?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Julian Reschke resolved OAK-10725.
--
Resolution: Fixed

> Upgrade jackson-databind dependency to 2.16.2
> -
>
> Key: OAK-10725
> URL: https://issues.apache.org/jira/browse/OAK-10725
> Project: Jackrabbit Oak
>  Issue Type: Task
>  Components: parent
>Reporter: Fabrizio Fortino
>Assignee: Julian Reschke
>Priority: Major
>  Labels: candidate_oak_1_22
> Fix For: 1.62.0
>
>




--
This message was sent by Atlassian Jira
(v8.20.10#820010)


[jira] [Resolved] (OAK-10734) DetailedGC must keep entries in "_revisions" for non branch commits, unless older than sweep

2024-04-02 Thread Stefan Egli (Jira)


 [ 
https://issues.apache.org/jira/browse/OAK-10734?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Stefan Egli resolved OAK-10734.
---
Resolution: Done

PR merged

> DetailedGC must keep entries in "_revisions" for non branch commits, unless 
> older than sweep
> 
>
> Key: OAK-10734
> URL: https://issues.apache.org/jira/browse/OAK-10734
> Project: Jackrabbit Oak
>  Issue Type: Task
>  Components: documentmk
>Reporter: Stefan Egli
>Assignee: Stefan Egli
>Priority: Major
>  Labels: DetailedGC
>
> Entries in "_revisions" (for non root documents) could be referenced by 
> children in case of non branch commits. They must thus be kept. Unless older 
> than sweep.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)


[jira] [Created] (OAK-10735) Release Oak 1.62.0

2024-04-02 Thread Julian Reschke (Jira)
Julian Reschke created OAK-10735:


 Summary: Release Oak 1.62.0
 Key: OAK-10735
 URL: https://issues.apache.org/jira/browse/OAK-10735
 Project: Jackrabbit Oak
  Issue Type: Task
Reporter: Julian Reschke
Assignee: Julian Reschke






--
This message was sent by Atlassian Jira
(v8.20.10#820010)


[jira] [Resolved] (OAK-10682) [Indexing job] Improve Mongo regex filter to only use positive conditions (no negations)

2024-04-02 Thread Nuno Santos (Jira)


 [ 
https://issues.apache.org/jira/browse/OAK-10682?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Nuno Santos resolved OAK-10682.
---
Fix Version/s: 1.62.0
   Resolution: Done

> [Indexing job] Improve Mongo regex filter to only use positive conditions (no 
> negations)
> 
>
> Key: OAK-10682
> URL: https://issues.apache.org/jira/browse/OAK-10682
> Project: Jackrabbit Oak
>  Issue Type: Improvement
>  Components: indexing
> Environment: The current implementation of filtering excluded paths 
> and custom regex is using a condition like
> {noformat}
> { _id:  { $nin: [ /^[0-9]{1,3}:\/content\/dam\/.*$/ ]} {noformat}
> Mongo cannot evaluate this condition without retrieving the full document, 
> because a value of {{_null}} would also match this condition and the index 
> does not contain {{null}} values. Therefore, when the index contains excluded 
> paths, the download will be much slower because Mongo has to retrieve every 
> single document to evaluate the condition.
> As a workaround, we can transform the regex on an equivalent one that matches 
> the complement of the original regex using [negative 
> lookahead|https://stackoverflow.com/questions/1240275/how-to-negate-specific-word-in-regex].
>  This allows rewriting the filter condition using only positive conditions, 
> which can be evaluated using only the index.
>Reporter: Nuno Santos
>Priority: Major
>  Labels: indexing
> Fix For: 1.62.0
>
>




--
This message was sent by Atlassian Jira
(v8.20.10#820010)


[jira] [Commented] (OAK-10535) Clean up old revisions in a document

2024-04-02 Thread Stefan Egli (Jira)


[ 
https://issues.apache.org/jira/browse/OAK-10535?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17833215#comment-17833215
 ] 

Stefan Egli commented on OAK-10535:
---

[~corderob], I've now un-ignored the test case originally added that was 
blocked by OAK-10535, as we now have different variants and amongst them one 
that fixes it, so I'd like to have it unignored. The PR is 
[PR#1396|https://github.com/apache/jackrabbit-oak/pull/1396] (it is against 
another PR of mine, but is otherwise just an unignore and adjustments in 
asserts)

> Clean up old revisions in a document
> 
>
> Key: OAK-10535
> URL: https://issues.apache.org/jira/browse/OAK-10535
> Project: Jackrabbit Oak
>  Issue Type: Improvement
>  Components: documentmk
>Reporter: José Andrés Cordero Benítez
>Assignee: José Andrés Cordero Benítez
>Priority: Minor
>
> Introduce a way to safely detect and delete old revisions in a document. This 
> could be useful to cleanup documents that sometimes grows above the supported 
> size in MongoDB (16MB).
> It could be also integrate into the detailed GC.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)


[jira] [Commented] (OAK-10734) DetailedGC must keep entries in "_revisions" for non branch commits, unless older than sweep

2024-04-02 Thread Stefan Egli (Jira)


[ 
https://issues.apache.org/jira/browse/OAK-10734?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17833096#comment-17833096
 ] 

Stefan Egli commented on OAK-10734:
---

* created https://github.com/apache/jackrabbit-oak/pull/1393

> DetailedGC must keep entries in "_revisions" for non branch commits, unless 
> older than sweep
> 
>
> Key: OAK-10734
> URL: https://issues.apache.org/jira/browse/OAK-10734
> Project: Jackrabbit Oak
>  Issue Type: Task
>  Components: documentmk
>Reporter: Stefan Egli
>Assignee: Stefan Egli
>Priority: Major
>  Labels: DetailedGC
>
> Entries in "_revisions" (for non root documents) could be referenced by 
> children in case of non branch commits. They must thus be kept. Unless older 
> than sweep.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)


[jira] [Updated] (OAK-10734) DetailedGC must keep entries in "_revisions" for non branch commits, unless older than sweep

2024-04-02 Thread Stefan Egli (Jira)


 [ 
https://issues.apache.org/jira/browse/OAK-10734?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Stefan Egli updated OAK-10734:
--
Labels: DetailedGC  (was: )

> DetailedGC must keep entries in "_revisions" for non branch commits, unless 
> older than sweep
> 
>
> Key: OAK-10734
> URL: https://issues.apache.org/jira/browse/OAK-10734
> Project: Jackrabbit Oak
>  Issue Type: Task
>  Components: documentmk
>Reporter: Stefan Egli
>Assignee: Stefan Egli
>Priority: Major
>  Labels: DetailedGC
>
> Entries in "_revisions" (for non root documents) could be referenced by 
> children in case of non branch commits. They must thus be kept. Unless older 
> than sweep.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)


[jira] [Created] (OAK-10734) DetailedGC must keep entries in "_revisions" for non branch commits, unless older than sweep

2024-04-02 Thread Stefan Egli (Jira)
Stefan Egli created OAK-10734:
-

 Summary: DetailedGC must keep entries in "_revisions" for non 
branch commits, unless older than sweep
 Key: OAK-10734
 URL: https://issues.apache.org/jira/browse/OAK-10734
 Project: Jackrabbit Oak
  Issue Type: Task
  Components: documentmk
Reporter: Stefan Egli
Assignee: Stefan Egli


Entries in "_revisions" (for non root documents) could be referenced by 
children in case of non branch commits. They must thus be kept. Unless older 
than sweep.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)


[jira] [Resolved] (OAK-10546) Tika 1.28.5 references a vulnerable Guava version

2024-04-02 Thread Fabrizio Fortino (Jira)


 [ 
https://issues.apache.org/jira/browse/OAK-10546?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Fabrizio Fortino resolved OAK-10546.

Resolution: Won't Do

The linked PR won't be enough to fix the issue that will be addressed by 
https://issues.apache.org/jira/browse/OAK-9752.

> Tika 1.28.5 references a vulnerable Guava version
> -
>
> Key: OAK-10546
> URL: https://issues.apache.org/jira/browse/OAK-10546
> Project: Jackrabbit Oak
>  Issue Type: Improvement
>  Components: oak-examples, oak-run, oak-search-elastic, oak-solr-core
>Reporter: Fabrizio Fortino
>Assignee: Fabrizio Fortino
>Priority: Major
>
> Guava 31.1 has a critical vulnerability [0]. It is included as a transient 
> dependency of Tika 1.28.5 [1]. This is the latest 1.x available release of 
> Tika. Being EOL it won't receive any security-related updates [2].
> The work to upgrade to Tika 2.x would require some time.
> If possible, we should find an alternative solution to avoid including this 
> vulnerable dependency.
> [0] [https://www.opencve.io/cve/CVE-2023-2976] 
> [1] [https://mvnrepository.com/artifact/org.apache.tika/tika-parsers/1.28.5]
> [2] [https://lists.apache.org/thread/yq6n7o01kw544dvj1jsoqk29g6yqjkp3] 



--
This message was sent by Atlassian Jira
(v8.20.10#820010)


[jira] [Assigned] (OAK-10733) Filter out hidden properties from content in FlatFileStore

2024-04-02 Thread Nitin Gupta (Jira)


 [ 
https://issues.apache.org/jira/browse/OAK-10733?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Nitin Gupta reassigned OAK-10733:
-

Assignee: Nitin Gupta

> Filter out hidden properties from content in FlatFileStore
> --
>
> Key: OAK-10733
> URL: https://issues.apache.org/jira/browse/OAK-10733
> Project: Jackrabbit Oak
>  Issue Type: Task
>Reporter: Nitin Gupta
>Assignee: Nitin Gupta
>Priority: Major
>
> Currently we ignore/filter out hidden nodes while building the FFS but not 
> the hidden properties.
> We however ignore any changes to hidden properties (using the VisibleEditor) 
> during async indexing cycles, so it makes little sense to have these in the 
> FFS.
>  
> This task is to see if these can be removed, and if gives some benefit during 
> reindexing phase.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)


[jira] [Created] (OAK-10733) Filter out hidden properties from content in FlatFileStore

2024-04-02 Thread Nitin Gupta (Jira)
Nitin Gupta created OAK-10733:
-

 Summary: Filter out hidden properties from content in FlatFileStore
 Key: OAK-10733
 URL: https://issues.apache.org/jira/browse/OAK-10733
 Project: Jackrabbit Oak
  Issue Type: Task
Reporter: Nitin Gupta


Currently we ignore/filter out hidden nodes while building the FFS but not the 
hidden properties.

We however ignore any changes to hidden properties (using the VisibleEditor) 
during async indexing cycles, so it makes little sense to have these in the FFS.

 

This task is to see if these can be removed, and if gives some benefit during 
reindexing phase.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)