[jira] [Commented] (PDFBOX-3611) Processing file over 2GB failure

2019-06-13 Thread Tilman Hausherr (JIRA)


[ 
https://issues.apache.org/jira/browse/PDFBOX-3611?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16863291#comment-16863291
 ] 

Tilman Hausherr commented on PDFBOX-3611:
-

Parse on demand is now available in PDFBOX-4569 thanks to Andreas. 


> Processing file over 2GB failure
> 
>
> Key: PDFBOX-3611
> URL: https://issues.apache.org/jira/browse/PDFBOX-3611
> Project: PDFBox
>  Issue Type: Bug
>  Components: PDModel
>Affects Versions: 2.0.3
> Environment: OS:  Windows 10 x64.
> Memory:   16GB
>Reporter: Yan Bing
>Priority: Major
>
> I try to retrieve PDF's any property by PDFBox2.0.3 and 
> PDFBox2.0.4(snapshot),  but when the PDF file size is over 2GB, it throws 
> out-of-memory exception. Is it PDFBox's specification?



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

-
To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org
For additional commands, e-mail: dev-h...@pdfbox.apache.org



[jira] [Commented] (PDFBOX-3611) Processing file over 2GB failure

2016-12-02 Thread Tilman Hausherr (JIRA)

[ 
https://issues.apache.org/jira/browse/PDFBOX-3611?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15715663#comment-15715663
 ] 

Tilman Hausherr commented on PDFBOX-3611:
-

Oops, I haven't read that one for a long time. Lets just say it won't be fixed 
any time soon. Unless somebody comes up from nowhere and wants to do this.

> Processing file over 2GB failure
> 
>
> Key: PDFBOX-3611
> URL: https://issues.apache.org/jira/browse/PDFBOX-3611
> Project: PDFBox
>  Issue Type: Bug
>  Components: PDModel
>Affects Versions: 2.0.3
> Environment: OS:  Windows 10 x64.
> Memory:   16GB
>Reporter: Yan Bing
>
> I try to retrieve PDF's any property by PDFBox2.0.3 and 
> PDFBox2.0.4(snapshot),  but when the PDF file size is over 2GB, it throws 
> out-of-memory exception. Is it PDFBox's specification?



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

-
To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org
For additional commands, e-mail: dev-h...@pdfbox.apache.org



[jira] [Commented] (PDFBOX-3611) Processing file over 2GB failure

2016-12-02 Thread simon steiner (JIRA)

[ 
https://issues.apache.org/jira/browse/PDFBOX-3611?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15715114#comment-15715114
 ] 

simon steiner commented on PDFBOX-3611:
---

Why is it listed on this page https://pdfbox.apache.org/ideas.html if this wont 
be fixed

> Processing file over 2GB failure
> 
>
> Key: PDFBOX-3611
> URL: https://issues.apache.org/jira/browse/PDFBOX-3611
> Project: PDFBox
>  Issue Type: Bug
>  Components: PDModel
>Affects Versions: 2.0.3
> Environment: OS:  Windows 10 x64.
> Memory:   16GB
>Reporter: Yan Bing
>
> I try to retrieve PDF's any property by PDFBox2.0.3 and 
> PDFBox2.0.4(snapshot),  but when the PDF file size is over 2GB, it throws 
> out-of-memory exception. Is it PDFBox's specification?



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

-
To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org
For additional commands, e-mail: dev-h...@pdfbox.apache.org



[jira] [Commented] (PDFBOX-3611) Processing file over 2GB failure

2016-12-01 Thread Yan Bing (JIRA)

[ 
https://issues.apache.org/jira/browse/PDFBOX-3611?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15711276#comment-15711276
 ] 

Yan Bing commented on PDFBOX-3611:
--

I got it..

Our system manages huge PDF files with high resolutions.
File over 2GB is normal.
I can view it with Acrobat DC or Acrobat Pro.
 

> Processing file over 2GB failure
> 
>
> Key: PDFBOX-3611
> URL: https://issues.apache.org/jira/browse/PDFBOX-3611
> Project: PDFBox
>  Issue Type: Bug
>  Components: PDModel
>Affects Versions: 2.0.3
> Environment: OS:  Windows 10 x64.
> Memory:   16GB
>Reporter: Yan Bing
>
> I try to retrieve PDF's any property by PDFBox2.0.3 and 
> PDFBox2.0.4(snapshot),  but when the PDF file size is over 2GB, it throws 
> out-of-memory exception. Is it PDFBox's specification?



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

-
To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org
For additional commands, e-mail: dev-h...@pdfbox.apache.org



[jira] [Commented] (PDFBOX-3611) Processing file over 2GB failure

2016-11-30 Thread Tilman Hausherr (JIRA)

[ 
https://issues.apache.org/jira/browse/PDFBOX-3611?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15711227#comment-15711227
 ] 

Tilman Hausherr commented on PDFBOX-3611:
-

There is no work around this problem, and there are no plans for the future, 
sorry. There are many other "big" topics (251 open issues), e.g. fix 
transparency rendering, fix JPG2000, support complex scripts, improve acroform 
handling, support PDF 2.0, etc.

Even with parse on demand, I wonder if it would be possible to view such a huge 
file. Try PDF.js, they do parse on demand.
https://mozilla.github.io/pdf.js/web/viewer.html
Press the icon on the right that looks like a folder to open your file.

> Processing file over 2GB failure
> 
>
> Key: PDFBOX-3611
> URL: https://issues.apache.org/jira/browse/PDFBOX-3611
> Project: PDFBox
>  Issue Type: Bug
>  Components: PDModel
>Affects Versions: 2.0.3
> Environment: OS:  Windows 10 x64.
> Memory:   16GB
>Reporter: Yan Bing
>
> I try to retrieve PDF's any property by PDFBox2.0.3 and 
> PDFBox2.0.4(snapshot),  but when the PDF file size is over 2GB, it throws 
> out-of-memory exception. Is it PDFBox's specification?



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

-
To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org
For additional commands, e-mail: dev-h...@pdfbox.apache.org



[jira] [Commented] (PDFBOX-3611) Processing file over 2GB failure

2016-11-30 Thread Yan Bing (JIRA)

[ 
https://issues.apache.org/jira/browse/PDFBOX-3611?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15711205#comment-15711205
 ] 

Yan Bing commented on PDFBOX-3611:
--

Thank you, Tilman.
Is there any work around for this problem?
Is there any plan for PDFBox to add function such as parse file partially, 
parse on demand in the future? 

> Processing file over 2GB failure
> 
>
> Key: PDFBOX-3611
> URL: https://issues.apache.org/jira/browse/PDFBOX-3611
> Project: PDFBox
>  Issue Type: Bug
>  Components: PDModel
>Affects Versions: 2.0.3
> Environment: OS:  Windows 10 x64.
> Memory:   16GB
>Reporter: Yan Bing
>
> I try to retrieve PDF's any property by PDFBox2.0.3 and 
> PDFBox2.0.4(snapshot),  but when the PDF file size is over 2GB, it throws 
> out-of-memory exception. Is it PDFBox's specification?



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

-
To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org
For additional commands, e-mail: dev-h...@pdfbox.apache.org



[jira] [Commented] (PDFBOX-3611) Processing file over 2GB failure

2016-11-30 Thread Tilman Hausherr (JIRA)

[ 
https://issues.apache.org/jira/browse/PDFBOX-3611?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15711165#comment-15711165
 ] 

Tilman Hausherr commented on PDFBOX-3611:
-

There is no specification on some maximum. But internally the memory usage is 
always much larger than the PDF itself. PDFBox does not "parse on demand", it 
parses the whole PDF.

> Processing file over 2GB failure
> 
>
> Key: PDFBOX-3611
> URL: https://issues.apache.org/jira/browse/PDFBOX-3611
> Project: PDFBox
>  Issue Type: Bug
>  Components: PDModel
>Affects Versions: 2.0.3
> Environment: OS:  Windows 10 x64.
> Memory:   16GB
>Reporter: Yan Bing
>
> I try to retrieve PDF's any property by PDFBox2.0.3 and 
> PDFBox2.0.4(snapshot),  but when the PDF file size is over 2GB, it throws 
> out-of-memory exception. Is it PDFBox's specification?



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

-
To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org
For additional commands, e-mail: dev-h...@pdfbox.apache.org