[jira] [Comment Edited] (PDFBOX-5134) Very slow rendering on PageDrawer.shadingFill

2021-03-17 Thread Oliver Schmidtmer (Jira)


[ 
https://issues.apache.org/jira/browse/PDFBOX-5134?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17303733#comment-17303733
 ] 

Oliver Schmidtmer edited comment on PDFBOX-5134 at 3/17/21, 9:28 PM:
-

That is correct, the slow painting is Java itself. However, the clipping area 
is null and the paint applied to the whole image while those shades are 
PDTriangleBasedShadingTypes which only cover a very small area.

For example with the device bounds [x=0,y=0,width=4031,height=2024] a bounding 
rectangle, calculated in GouraudShadingContext.setTriangleList, around the 
triangles for shading COSName\{Sh0} is only 
[x=2916.300048828125,y=897.4940185546875,w=16.97998046875,h=17.0400390625]
 Applying a Paint to such a small area should be much faster. I probably need 
more time to look into how to calculate that beforehand and setting the 
clipping area.

I'm sorry I hadn't already analyzed it that deeply a few hours ago. I only 
suspected there should be some way to set a better clipping area for a such 
small painted area. :)

 


was (Author: schmidor):
That is correct, the slow painting is Java itself. However, the clipping area 
is null and the paint applied to the whole image while those shades are 
PDTriangleBasedShadingTypes which only cover a very small area.

For example with the device bounds [x=0,y=0,width=4031,height=2024] a bounding 
rectangle, calculated in GouraudShadingContext.setTriangleList, around the 
triangles for shading COSName\{Sh0} is only 
[x=2916.300048828125,y=897.4940185546875,w=16.97998046875,h=17.0400390625]
 Applying a Paint to such a small area should be much faster. I probably need 
more time to look into how to calculate that beforehand and setting the 
clipping area.

 

> Very slow rendering on PageDrawer.shadingFill
> -
>
> Key: PDFBOX-5134
> URL: https://issues.apache.org/jira/browse/PDFBOX-5134
> Project: PDFBox
>  Issue Type: Bug
>Reporter: Oliver Schmidtmer
>Priority: Major
>  Labels: shading
> Attachments: Plan zum Testen.pdf, image-2021-03-17-20-16-49-221.png
>
>
> The PDF contains many effectively very small shadings, the light cones. As 
> the gradient painting for each light cone is applied on the whole image, 
> rendering is very slow: 29 of 33 seconds are spent in shadingFill.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org
For additional commands, e-mail: dev-h...@pdfbox.apache.org



[jira] [Commented] (PDFBOX-5134) Very slow rendering on PageDrawer.shadingFill

2021-03-17 Thread Oliver Schmidtmer (Jira)


[ 
https://issues.apache.org/jira/browse/PDFBOX-5134?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17303733#comment-17303733
 ] 

Oliver Schmidtmer commented on PDFBOX-5134:
---

That is correct, the slow painting is Java itself. However, the clipping area 
is null and the paint applied to the whole image while those shades are 
PDTriangleBasedShadingTypes which only cover a very small area.

For example with the device bounds [x=0,y=0,width=4031,height=2024] a bounding 
rectangle, calculated in GouraudShadingContext.setTriangleList, around the 
triangles for shading COSName\{Sh0} is only 
[x=2916.300048828125,y=897.4940185546875,w=16.97998046875,h=17.0400390625]
 Applying a Paint to such a small area should be much faster. I probably need 
more time to look into how to calculate that beforehand and setting the 
clipping area.

 

> Very slow rendering on PageDrawer.shadingFill
> -
>
> Key: PDFBOX-5134
> URL: https://issues.apache.org/jira/browse/PDFBOX-5134
> Project: PDFBox
>  Issue Type: Bug
>Reporter: Oliver Schmidtmer
>Priority: Major
>  Labels: shading
> Attachments: Plan zum Testen.pdf, image-2021-03-17-20-16-49-221.png
>
>
> The PDF contains many effectively very small shadings, the light cones. As 
> the gradient painting for each light cone is applied on the whole image, 
> rendering is very slow: 29 of 33 seconds are spent in shadingFill.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org
For additional commands, e-mail: dev-h...@pdfbox.apache.org



[jira] [Commented] (PDFBOX-5134) Very slow rendering on PageDrawer.shadingFill

2021-03-17 Thread Tilman Hausherr (Jira)


[ 
https://issues.apache.org/jira/browse/PDFBOX-5134?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17303676#comment-17303676
 ] 

Tilman Hausherr commented on PDFBOX-5134:
-

I started the profiler and yes, the shadingFill takes the time, but then it's 
java itself?!

> Very slow rendering on PageDrawer.shadingFill
> -
>
> Key: PDFBOX-5134
> URL: https://issues.apache.org/jira/browse/PDFBOX-5134
> Project: PDFBox
>  Issue Type: Bug
>Reporter: Oliver Schmidtmer
>Priority: Major
>  Labels: shading
> Attachments: Plan zum Testen.pdf, image-2021-03-17-20-16-49-221.png
>
>
> The PDF contains many effectively very small shadings, the light cones. As 
> the gradient painting for each light cone is applied on the whole image, 
> rendering is very slow: 29 of 33 seconds are spent in shadingFill.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org
For additional commands, e-mail: dev-h...@pdfbox.apache.org



[jira] [Updated] (PDFBOX-5134) Very slow rendering on PageDrawer.shadingFill

2021-03-17 Thread Tilman Hausherr (Jira)


 [ 
https://issues.apache.org/jira/browse/PDFBOX-5134?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Tilman Hausherr updated PDFBOX-5134:

Attachment: image-2021-03-17-20-16-49-221.png

> Very slow rendering on PageDrawer.shadingFill
> -
>
> Key: PDFBOX-5134
> URL: https://issues.apache.org/jira/browse/PDFBOX-5134
> Project: PDFBox
>  Issue Type: Bug
>Reporter: Oliver Schmidtmer
>Priority: Major
>  Labels: shading
> Attachments: Plan zum Testen.pdf, image-2021-03-17-20-16-49-221.png
>
>
> The PDF contains many effectively very small shadings, the light cones. As 
> the gradient painting for each light cone is applied on the whole image, 
> rendering is very slow: 29 of 33 seconds are spent in shadingFill.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org
For additional commands, e-mail: dev-h...@pdfbox.apache.org



[jira] [Commented] (PDFBOX-4892) Improve code quality (4)

2021-03-17 Thread ASF subversion and git services (Jira)


[ 
https://issues.apache.org/jira/browse/PDFBOX-4892?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17303563#comment-17303563
 ] 

ASF subversion and git services commented on PDFBOX-4892:
-

Commit 1887756 from le...@apache.org in branch 'pdfbox/trunk'
[ https://svn.apache.org/r1887756 ]

PDFBOX-4892: use convenience methods to simplify code, add new constants, 
optimize COSDictionary methods

> Improve code quality (4)
> 
>
> Key: PDFBOX-4892
> URL: https://issues.apache.org/jira/browse/PDFBOX-4892
> Project: PDFBox
>  Issue Type: Improvement
>Affects Versions: 2.0.20
>Reporter: Tilman Hausherr
>Priority: Minor
>
> This is a longterm issue for the task to improve code quality, by using the 
> [SonarQube report|https://sonarcloud.io/project/issues?id=pdfbox-reactor], 
> hints in different IDEs, the FindBugs tool and other code quality tools.
> This is a follow-up of PDFBOX-4071, which was getting too long.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org
For additional commands, e-mail: dev-h...@pdfbox.apache.org



[jira] [Created] (PDFBOX-5134) Very slow rendering on PageDrawer.shadingFill

2021-03-17 Thread Oliver Schmidtmer (Jira)
Oliver Schmidtmer created PDFBOX-5134:
-

 Summary: Very slow rendering on PageDrawer.shadingFill
 Key: PDFBOX-5134
 URL: https://issues.apache.org/jira/browse/PDFBOX-5134
 Project: PDFBox
  Issue Type: Bug
Reporter: Oliver Schmidtmer
 Attachments: Plan zum Testen.pdf

The PDF contains many effectively very small shadings, the light cones. As the 
gradient painting for each light cone is applied on the whole image, rendering 
is very slow: 29 of 33 seconds are spent in shadingFill.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org
For additional commands, e-mail: dev-h...@pdfbox.apache.org



[jira] [Commented] (PDFBOX-5128) Support parsing non standardized XMP

2021-03-17 Thread Maruan Sahyoun (Jira)


[ 
https://issues.apache.org/jira/browse/PDFBOX-5128?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17303529#comment-17303529
 ] 

Maruan Sahyoun commented on PDFBOX-5128:


Thank you for providing the files. Will try to add handling some non standard 
files first and then run using the test bed. Not likely before early next week.

> Support parsing non standardized XMP 
> -
>
> Key: PDFBOX-5128
> URL: https://issues.apache.org/jira/browse/PDFBOX-5128
> Project: PDFBox
>  Issue Type: Task
>  Components: XmpBox
>Reporter: Maruan Sahyoun
>Assignee: Maruan Sahyoun
>Priority: Major
> Attachments: PDFBOX.zip, image-2021-03-17-09-00-57-653.png
>
>
> XMP currently only supports parsing known XMP schema as has been discussed. 
> That shall be extended to support arbitrary but valid  XMP.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org
For additional commands, e-mail: dev-h...@pdfbox.apache.org



[jira] [Commented] (PDFBOX-5128) Support parsing non standardized XMP

2021-03-17 Thread Tim Allison (Jira)


[ 
https://issues.apache.org/jira/browse/PDFBOX-5128?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17303522#comment-17303522
 ] 

Tim Allison commented on PDFBOX-5128:
-

The process hasn't finished, but I'm dumping the files here:

[https://corpora.tika.apache.org/base/xmps/]

I'm roughly binning them by the file type of the container file, including: 
[https://corpora.tika.apache.org/base/xmps/pdf/] 

 

Let me know if I can do any processing on these or if I botched the extraction.

 

> Support parsing non standardized XMP 
> -
>
> Key: PDFBOX-5128
> URL: https://issues.apache.org/jira/browse/PDFBOX-5128
> Project: PDFBox
>  Issue Type: Task
>  Components: XmpBox
>Reporter: Maruan Sahyoun
>Assignee: Maruan Sahyoun
>Priority: Major
> Attachments: PDFBOX.zip, image-2021-03-17-09-00-57-653.png
>
>
> XMP currently only supports parsing known XMP schema as has been discussed. 
> That shall be extended to support arbitrary but valid  XMP.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org
For additional commands, e-mail: dev-h...@pdfbox.apache.org



[jira] [Comment Edited] (PDFBOX-5128) Support parsing non standardized XMP

2021-03-17 Thread Tim Allison (Jira)


[ 
https://issues.apache.org/jira/browse/PDFBOX-5128?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17303391#comment-17303391
 ] 

Tim Allison edited comment on PDFBOX-5128 at 3/17/21, 1:01 PM:
---

Side note...I'm looking at the EOFs for my xmp byte scanner, and I notice that 
Oracle Outside In (at least back in 2011) didn't include a closing packet – 
PDFBOX-1192

!image-2021-03-17-09-00-57-653.png! --


was (Author: talli...@mitre.org):
Side note...I'm looking at the EOFs for my xmp byte scanner, and I notice that 
Oracle Outsid !image-2021-03-17-09-00-57-653.png! e In (at least back in 2011) 
didn't include a closing packet – PDFBOX-1192

> Support parsing non standardized XMP 
> -
>
> Key: PDFBOX-5128
> URL: https://issues.apache.org/jira/browse/PDFBOX-5128
> Project: PDFBox
>  Issue Type: Task
>  Components: XmpBox
>Reporter: Maruan Sahyoun
>Assignee: Maruan Sahyoun
>Priority: Major
> Attachments: PDFBOX.zip, image-2021-03-17-09-00-57-653.png
>
>
> XMP currently only supports parsing known XMP schema as has been discussed. 
> That shall be extended to support arbitrary but valid  XMP.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org
For additional commands, e-mail: dev-h...@pdfbox.apache.org



[jira] [Updated] (PDFBOX-5128) Support parsing non standardized XMP

2021-03-17 Thread Tim Allison (Jira)


 [ 
https://issues.apache.org/jira/browse/PDFBOX-5128?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Tim Allison updated PDFBOX-5128:

Attachment: image-2021-03-17-09-00-57-653.png

> Support parsing non standardized XMP 
> -
>
> Key: PDFBOX-5128
> URL: https://issues.apache.org/jira/browse/PDFBOX-5128
> Project: PDFBox
>  Issue Type: Task
>  Components: XmpBox
>Reporter: Maruan Sahyoun
>Assignee: Maruan Sahyoun
>Priority: Major
> Attachments: PDFBOX.zip, image-2021-03-17-09-00-57-653.png
>
>
> XMP currently only supports parsing known XMP schema as has been discussed. 
> That shall be extended to support arbitrary but valid  XMP.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org
For additional commands, e-mail: dev-h...@pdfbox.apache.org



[jira] [Commented] (PDFBOX-5128) Support parsing non standardized XMP

2021-03-17 Thread Tim Allison (Jira)


[ 
https://issues.apache.org/jira/browse/PDFBOX-5128?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17303391#comment-17303391
 ] 

Tim Allison commented on PDFBOX-5128:
-

Side note...I'm looking at the EOFs for my xmp byte scanner, and I notice that 
Oracle Outsid !image-2021-03-17-09-00-57-653.png! e In (at least back in 2011) 
didn't include a closing packet – PDFBOX-1192

> Support parsing non standardized XMP 
> -
>
> Key: PDFBOX-5128
> URL: https://issues.apache.org/jira/browse/PDFBOX-5128
> Project: PDFBox
>  Issue Type: Task
>  Components: XmpBox
>Reporter: Maruan Sahyoun
>Assignee: Maruan Sahyoun
>Priority: Major
> Attachments: PDFBOX.zip, image-2021-03-17-09-00-57-653.png
>
>
> XMP currently only supports parsing known XMP schema as has been discussed. 
> That shall be extended to support arbitrary but valid  XMP.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org
For additional commands, e-mail: dev-h...@pdfbox.apache.org



[jira] [Commented] (PDFBOX-5128) Support parsing non standardized XMP

2021-03-17 Thread Maruan Sahyoun (Jira)


[ 
https://issues.apache.org/jira/browse/PDFBOX-5128?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17303266#comment-17303266
 ] 

Maruan Sahyoun commented on PDFBOX-5128:


[~tallison] yes, that's fine
[~pwyatt] thank's for the information. I'll look into that as soon as I have 
the base stuff working

> Support parsing non standardized XMP 
> -
>
> Key: PDFBOX-5128
> URL: https://issues.apache.org/jira/browse/PDFBOX-5128
> Project: PDFBox
>  Issue Type: Task
>  Components: XmpBox
>Reporter: Maruan Sahyoun
>Assignee: Maruan Sahyoun
>Priority: Major
> Attachments: PDFBOX.zip
>
>
> XMP currently only supports parsing known XMP schema as has been discussed. 
> That shall be extended to support arbitrary but valid  XMP.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org
For additional commands, e-mail: dev-h...@pdfbox.apache.org



[GitHub] [pdfbox] wdick opened a new pull request #108: PDFBOX-xxxx: Add COSName for DocMDP

2021-03-17 Thread GitBox


wdick opened a new pull request #108:
URL: https://github.com/apache/pdfbox/pull/108


   Add COSNames needed to read DocMDP settings
   see PDF 32000-1:2008, Table 253 – Entries in a signature reference dictionary



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org
For additional commands, e-mail: dev-h...@pdfbox.apache.org



[jira] [Commented] (PDFBOX-4892) Improve code quality (4)

2021-03-17 Thread ASF subversion and git services (Jira)


[ 
https://issues.apache.org/jira/browse/PDFBOX-4892?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17303149#comment-17303149
 ] 

ASF subversion and git services commented on PDFBOX-4892:
-

Commit 1887747 from le...@apache.org in branch 'pdfbox/trunk'
[ https://svn.apache.org/r1887747 ]

PDFBOX-4892: use convenience methods to simplify code

> Improve code quality (4)
> 
>
> Key: PDFBOX-4892
> URL: https://issues.apache.org/jira/browse/PDFBOX-4892
> Project: PDFBox
>  Issue Type: Improvement
>Affects Versions: 2.0.20
>Reporter: Tilman Hausherr
>Priority: Minor
>
> This is a longterm issue for the task to improve code quality, by using the 
> [SonarQube report|https://sonarcloud.io/project/issues?id=pdfbox-reactor], 
> hints in different IDEs, the FindBugs tool and other code quality tools.
> This is a follow-up of PDFBOX-4071, which was getting too long.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org
For additional commands, e-mail: dev-h...@pdfbox.apache.org