[Libreoffice-bugs] [Bug 156303] Saving a PDF Import to doc/docx/rtf moves all text boxes to the first page

2023-07-20 Thread bugzilla-daemon
https://bugs.documentfoundation.org/show_bug.cgi?id=156303

V Stuart Foote  changed:

   What|Removed |Added

 Blocks||113123


Referenced Bugs:

https://bugs.documentfoundation.org/show_bug.cgi?id=113123
[Bug 113123] [META] PDF import filter in Writer
-- 
You are receiving this mail because:
You are the assignee for the bug.

[Libreoffice-bugs] [Bug 156303] Saving a PDF Import to doc/docx/rtf moves all text boxes to the first page

2023-07-20 Thread bugzilla-daemon
https://bugs.documentfoundation.org/show_bug.cgi?id=156303

V Stuart Foote  changed:

   What|Removed |Added

 Blocks|113123  |108254
 CC||vsfo...@libreoffice.org
  Component|Writer  |filters and storage

--- Comment #4 from V Stuart Foote  ---
Confirmed. Do we have a faulty "text:anchor-page-number" during pdfio import?

Be sure to use the correct PDF import filter, but I do reproduce during
"Save As" export to OOXML and on opening that file with Word 2021 or Writer
7.6.0.

PDF filter import (pdfio) into Writer should be done with:

"PDF - Portable Document Format (Writer) (*.pdf)"

The sample PDF is parsed into a four-page Writer document, and each text run of
the PDF ends up on its correct page on the Writer canvas.

But writing out to ODF also seems incorrect, in addition to the issues noted
for the doc/docx (MS Binary and OOXML) formats.

Opening the ODF archive and examining content.xml for the text-box spans, each
of the T2 spans holding text is written with a "page" anchor, but the
associated "text:anchor-page-number" is set to "1".
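
The check described above can be scripted. What follows is only a minimal
sketch, not part of the original comment: it assumes the anchors appear as
text:anchor-type / text:anchor-page-number attributes on elements in
content.xml, and the function names are hypothetical. It counts how many
page-anchored elements claim each page number.

```python
import xml.etree.ElementTree as ET
import zipfile
from collections import Counter

# ODF "text" namespace, which carries the anchor attributes.
TEXT_NS = "urn:oasis:names:tc:opendocument:xmlns:text:1.0"
ANCHOR_TYPE = f"{{{TEXT_NS}}}anchor-type"
ANCHOR_PAGE = f"{{{TEXT_NS}}}anchor-page-number"


def page_anchor_counts(content_xml: bytes) -> Counter:
    """Count page-anchored elements per text:anchor-page-number value."""
    root = ET.fromstring(content_xml)
    counts = Counter()
    for elem in root.iter():
        # Only elements anchored "to page" are of interest here.
        if elem.get(ANCHOR_TYPE) == "page":
            # The key is the raw attribute string, or None if it is absent.
            counts[elem.get(ANCHOR_PAGE)] += 1
    return counts


def page_anchor_counts_from_odt(odt):
    """Same count, reading content.xml out of an ODT (path or file object)."""
    with zipfile.ZipFile(odt) as archive:
        return page_anchor_counts(archive.read("content.xml"))
```

If every page-anchored element in a multi-page document lands under the key
"1", that would match the observation above.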

Not too sure, but I assume that would be OK for a relative page reference. I
suspect, though, that the page number is then getting parsed when the document
is opened as OOXML or MS Binary, or when those formats are opened back into
LibreOffice.

It seems the import filter's parsing of the PDF text runs is correct, but we
then do the wrong thing when referencing the text span anchors. Is the issue
with the filter import of the PDF elements, or with the filter export from ODF
to MS Binary or OOXML? Or both?


Referenced Bugs:

https://bugs.documentfoundation.org/show_bug.cgi?id=108254
[Bug 108254] [META] File format filters (import/export) bugs and enhancements
https://bugs.documentfoundation.org/show_bug.cgi?id=113123
[Bug 113123] [META] PDF import filter in Writer

[Libreoffice-bugs] [Bug 156303] Saving a PDF Import to doc/docx/rtf moves all text boxes to the first page

2023-07-19 Thread bugzilla-daemon
https://bugs.documentfoundation.org/show_bug.cgi?id=156303

BogdanB  changed:

   What|Removed |Added

 Blocks||113123
 CC||buzea.bog...@libreoffice.org


Referenced Bugs:

https://bugs.documentfoundation.org/show_bug.cgi?id=113123
[Bug 113123] [META] PDF import filter in Writer

[Libreoffice-bugs] [Bug 156303] Saving a PDF Import to doc/docx/rtf moves all text boxes to the first page

2023-07-17 Thread bugzilla-daemon
https://bugs.documentfoundation.org/show_bug.cgi?id=156303

--- Comment #3 from Stéphane Guillou (stragu) ---
Created attachment 188418
  --> https://bugs.documentfoundation.org/attachment.cgi?id=188418&action=edit
sample ODT

The bug can be tested directly from this ODT, created in LO 7.5.4 after
importing the sample PDF with Writer.


[Libreoffice-bugs] [Bug 156303] Saving a PDF Import to doc/docx/rtf moves all text boxes to the first page

2023-07-17 Thread bugzilla-daemon
https://bugs.documentfoundation.org/show_bug.cgi?id=156303

Stéphane Guillou (stragu)  changed:

   What|Removed |Added

 Ever confirmed|0   |1
 Status|UNCONFIRMED |NEW
   Keywords||filter:doc, filter:docx
Version|7.4.4.2 release |6.0.0.3 release
 CC||stephane.guillou@libreoffice.org

--- Comment #2 from Stéphane Guillou (stragu) ---
Reproduced when saved as DOCX and DOC with:

Version: 7.4.7.2 / LibreOffice Community
Build ID: 723314e595e8007d3cf785c16538505a1c878ca5
CPU threads: 8; OS: Linux 5.15; UI render: default; VCL: gtk3
Locale: en-AU (en_AU.UTF-8); UI: en-US
Calc: threaded

And in recent master build:

Version: 24.2.0.0.alpha0+ (X86_64) / LibreOffice Community
Build ID: 77fca616e0bd79e0b405fd0b3543cf8e94e15df3
CPU threads: 8; OS: Linux 5.15; UI render: default; VCL: gtk3
Locale: en-AU (en_AU.UTF-8); UI: en-US
Calc: threaded

Already the case in 6.0.0.3.

In 5.4, text boxes would disappear and LO would hang.


[Libreoffice-bugs] [Bug 156303] Saving a PDF Import to doc/docx/rtf moves all text boxes to the first page

2023-07-15 Thread bugzilla-daemon
https://bugs.documentfoundation.org/show_bug.cgi?id=156303

Martin Minchev  changed:

   What|Removed |Added

 Attachment #188383 description|PDF test file with multiple pages|multiple-pages.pdf - PDF test file with multiple pages


[Libreoffice-bugs] [Bug 156303] Saving a PDF Import to doc/docx/rtf moves all text boxes to the first page

2023-07-15 Thread bugzilla-daemon
https://bugs.documentfoundation.org/show_bug.cgi?id=156303

Martin Minchev  changed:

   What|Removed |Added

 CC||martiminc...@gmail.com

--- Comment #1 from Martin Minchev  ---
Created attachment 188383
  --> https://bugs.documentfoundation.org/attachment.cgi?id=188383&action=edit
PDF test file with multiple pages
