https://bugs.documentfoundation.org/show_bug.cgi?id=169628
Bug ID: 169628
Summary: PDF → DOCX: File not opening in MS Word 2007/2010 +
text boxes issue in Word 2013 and above
Product: LibreOffice
Version: 26.2.0.0 alpha0+ master
Hardware: All
OS: Linux (All)
Status: UNCONFIRMED
Severity: minor
Priority: medium
Component: LibreOffice
Assignee: [email protected]
Reporter: [email protected]
Description:
Hi team,
I run a PDF conversion website (pdflinx.com) using LibreOffice headless on
Ubuntu VPS.
PDF → DOCX has two major issues:
1. Files do NOT open in MS Word 2007 and 2010
→ Error: "The file is corrupt" or "Word experienced an error trying to open
the file"
2. Files open in MS Word 2013+ but all text appears inside "text boxes"
→ When selecting text, it selects as a box (drawing object) instead of
normal flowing text
→ Makes editing very difficult
Command I am using:
xvfb-run -a libreoffice --headless --convert-to docx --outdir /output
/input/file.pdf
LibreOffice version: 25.2 / 24.8
OS: Ubuntu 22.04
Already tried:
--convert-to docx:"MS Word 2007 XML"
--convert-to doc
Different filters → no clean result
Request:
Please suggest a command/filter so that:
• File opens in Word 2007/2010
• No text boxes in Word 2019+, text should be normal flowing text
If DOCX is not possible, is clean .doc (Word 97-2003) export possible?
Attaching sample PDF + converted DOCX + screenshot of text boxes.
Thank you so much for your help!
Steps to Reproduce:
1. Upload any PDF to pdflinx.com → Convert to Word
2. Download the DOCX file
3. Try to open in MS Word 2007/2010 → fails
4. Open in Word 2013 → opens but text is inside text boxes
Actual Results:
- Word 2007/2010: File does not open (corrupt error)
- Word 2013+: Text appears in text boxes (not normal editable text)
Expected Results:
- File should open in Word 2007, 2010, 2013+
- Text should be normal flowing text (no text boxes)
Reproducible: Always
User Profile Reset: Yes
Additional Info:
LibreOffice version (exact):
$ libreoffice --version
LibreOffice 25.2.0
Running on Ubuntu 22.04 LTS (Hostinger VPS)
Node.js + child_process.spawn with xvfb-run for headless mode
Website: https://pdflinx.com (commercial use)
I have tested with multiple PDFs – same issue every time
Happy to provide SSH access or more logs if needed
--
You are receiving this mail because:
You are the assignee for the bug.