[jira] [Created] (PDFBOX-5823) StringUtil.PATTERN_SPACE memory optmisation

2024-05-20 Thread Jonathan Prates (Jira)
Jonathan Prates created PDFBOX-5823: --- Summary: StringUtil.PATTERN_SPACE memory optmisation Key: PDFBOX-5823 URL: https://issues.apache.org/jira/browse/PDFBOX-5823 Project: PDFBox Issue Type

[jira] [Updated] (PDFBOX-5823) StringUtil.PATTERN_SPACE memory optmisation

2024-05-20 Thread Jonathan Prates (Jira)
[ https://issues.apache.org/jira/browse/PDFBOX-5823?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jonathan Prates updated PDFBOX-5823: Description: PDAbstractContentStream uses StringUtil.PATTERN_SPACE regexp to evaluate if a

[jira] [Commented] (PDFBOX-5660) Improve code quality (5)

2024-05-20 Thread ASF subversion and git services (Jira)
[ https://issues.apache.org/jira/browse/PDFBOX-5660?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17847839#comment-17847839 ] ASF subversion and git services commented on PDFBOX-5660: - Commi

[jira] [Commented] (PDFBOX-5660) Improve code quality (5)

2024-05-20 Thread ASF subversion and git services (Jira)
[ https://issues.apache.org/jira/browse/PDFBOX-5660?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17847841#comment-17847841 ] ASF subversion and git services commented on PDFBOX-5660: - Commi

[jira] [Commented] (PDFBOX-5660) Improve code quality (5)

2024-05-20 Thread ASF subversion and git services (Jira)
[ https://issues.apache.org/jira/browse/PDFBOX-5660?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17847842#comment-17847842 ] ASF subversion and git services commented on PDFBOX-5660: - Commi

[jira] [Commented] (PDFBOX-5823) StringUtil.PATTERN_SPACE memory optmisation

2024-05-20 Thread Tilman Hausherr (Jira)
[ https://issues.apache.org/jira/browse/PDFBOX-5823?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17847844#comment-17847844 ] Tilman Hausherr commented on PDFBOX-5823: - Isn't your solution slower? It would

[jira] [Commented] (PDFBOX-5823) StringUtil.PATTERN_SPACE memory optmisation

2024-05-20 Thread Jonathan Prates (Jira)
[ https://issues.apache.org/jira/browse/PDFBOX-5823?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17847855#comment-17847855 ] Jonathan Prates commented on PDFBOX-5823: - Sure, I mean, contains() is slower fo

[jira] [Updated] (PDFBOX-5823) StringUtil.PATTERN_SPACE memory optmisation

2024-05-20 Thread Jonathan Prates (Jira)
[ https://issues.apache.org/jira/browse/PDFBOX-5823?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jonathan Prates updated PDFBOX-5823: Attachment: Main.java > StringUtil.PATTERN_SPACE memory optmisation >

[jira] [Comment Edited] (PDFBOX-5823) StringUtil.PATTERN_SPACE memory optmisation

2024-05-20 Thread Jonathan Prates (Jira)
[ https://issues.apache.org/jira/browse/PDFBOX-5823?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17847855#comment-17847855 ] Jonathan Prates edited comment on PDFBOX-5823 at 5/20/24 11:26 AM: ---

[jira] [Comment Edited] (PDFBOX-5823) StringUtil.PATTERN_SPACE memory optmisation

2024-05-20 Thread Jonathan Prates (Jira)
[ https://issues.apache.org/jira/browse/PDFBOX-5823?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17847855#comment-17847855 ] Jonathan Prates edited comment on PDFBOX-5823 at 5/20/24 12:44 PM: ---

[jira] [Comment Edited] (PDFBOX-5823) StringUtil.PATTERN_SPACE memory optmisation

2024-05-20 Thread Jonathan Prates (Jira)
[ https://issues.apache.org/jira/browse/PDFBOX-5823?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17847855#comment-17847855 ] Jonathan Prates edited comment on PDFBOX-5823 at 5/20/24 12:44 PM: ---

[jira] [Commented] (PDFBOX-5823) StringUtil.PATTERN_SPACE memory optmisation

2024-05-20 Thread Jira
[ https://issues.apache.org/jira/browse/PDFBOX-5823?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17847901#comment-17847901 ] Andreas Lehmkühler commented on PDFBOX-5823: Those tokens either doesn't con

[jira] [Commented] (PDFBOX-5823) StringUtil.PATTERN_SPACE memory optmisation

2024-05-20 Thread Jonathan Prates (Jira)
[ https://issues.apache.org/jira/browse/PDFBOX-5823?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17847912#comment-17847912 ] Jonathan Prates commented on PDFBOX-5823: - [~lehmi] I tested it locally and inde

[jira] [Comment Edited] (PDFBOX-5823) StringUtil.PATTERN_SPACE memory optmisation

2024-05-20 Thread Jonathan Prates (Jira)
[ https://issues.apache.org/jira/browse/PDFBOX-5823?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17847912#comment-17847912 ] Jonathan Prates edited comment on PDFBOX-5823 at 5/20/24 4:08 PM:

[jira] [Comment Edited] (PDFBOX-5823) StringUtil.PATTERN_SPACE memory optmisation

2024-05-20 Thread Jonathan Prates (Jira)
[ https://issues.apache.org/jira/browse/PDFBOX-5823?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17847912#comment-17847912 ] Jonathan Prates edited comment on PDFBOX-5823 at 5/20/24 4:50 PM:

Re: [PR] PDFBOX-5823: remove regexp matcher to reduce memory utilisation [pdfbox]

2024-05-20 Thread via GitHub
DvonHolten commented on PR #195: URL: https://github.com/apache/pdfbox/pull/195#issuecomment-2120894730 but what is the purpose of this? if you just want/need to know, if some string contains a space ' ', the fastest and cheapest way is to use indexOf( ' ' ) >= 0. if you need to check fo

Re: [PR] PDFBOX-5823: remove regexp matcher to reduce memory utilisation [pdfbox]

2024-05-20 Thread via GitHub
jonathansp commented on PR #195: URL: https://github.com/apache/pdfbox/pull/195#issuecomment-2121248909 @DvonHolten patterns work fine but also cause a **massive** memory overhead while creating large documents. In order to cover whitespaces, we'd need to call `indexOf()` a few times, provi