https://bugs.documentfoundation.org/show_bug.cgi?id=169628

            Bug ID: 169628
           Summary: PDF → DOCX: File not opening in MS Word 2007/2010 +
                    text boxes issue in Word 2013 and above
           Product: LibreOffice
           Version: 26.2.0.0 alpha0+ master
          Hardware: All
                OS: Linux (All)
            Status: UNCONFIRMED
          Severity: minor
          Priority: medium
         Component: LibreOffice
          Assignee: [email protected]
          Reporter: [email protected]

Description:
Hi team,

I run a PDF conversion website (pdflinx.com) using LibreOffice headless on
Ubuntu VPS.

PDF → DOCX has two major issues:

1. Files do NOT open in MS Word 2007 and 2010
   → Error: "The file is corrupt" or "Word experienced an error trying to open
the file"

2. Files open in MS Word 2013+ but all text appears inside "text boxes"
   → When selecting text, it selects as a box (drawing object) instead of
normal flowing text
   → Makes editing very difficult

Command I am using:
xvfb-run -a libreoffice --headless --convert-to docx --outdir /output
/input/file.pdf

LibreOffice version: 25.2 / 24.8
OS: Ubuntu 22.04

Already tried:
--convert-to docx:"MS Word 2007 XML"
--convert-to doc
Different filters → no clean result

Request:
Please suggest a command/filter so that:
• File opens in Word 2007/2010
• No text boxes in Word 2019+, text should be normal flowing text

If DOCX is not possible, is clean .doc (Word 97-2003) export possible?

Attaching sample PDF + converted DOCX + screenshot of text boxes.

Thank you so much for your help!

Steps to Reproduce:
1. Upload any PDF to pdflinx.com → Convert to Word
2. Download the DOCX file
3. Try to open in MS Word 2007/2010 → fails
4. Open in Word 2013 → opens but text is inside text boxes

Actual Results:
- Word 2007/2010: File does not open (corrupt error)
- Word 2013+: Text appears in text boxes (not normal editable text)

Expected Results:
- File should open in Word 2007, 2010, 2013+
- Text should be normal flowing text (no text boxes)


Reproducible: Always


User Profile Reset: Yes

Additional Info:
LibreOffice version (exact):
$ libreoffice --version
LibreOffice 25.2.0

Running on Ubuntu 22.04 LTS (Hostinger VPS)  
Node.js + child_process.spawn with xvfb-run for headless mode  
Website: https://pdflinx.com (commercial use)  
I have tested with multiple PDFs – same issue every time  
Happy to provide SSH access or more logs if needed

-- 
You are receiving this mail because:
You are the assignee for the bug.

Reply via email to