https://bugs.documentfoundation.org/show_bug.cgi?id=32249
--- Comment #46 from Dave Gilbert <[email protected]> ---
(In reply to Lightsky from comment #45)
> Hi,
> Not sure if this is the right place as it's unclear if the bug is specific to
> Draw or Writer import.
> I think for PDF import in Writer one does NOT usually need to preserve the
> 100% exact original layout.
> When importing PDF in Writer one usually needs a text without boxes at all
> b/c Writer is an editor, right?
> Formatting/layout is way easier to fix than copy-pasting every single line
> of text.
> BTW, Adobe Acrobat Online Convert PDF to Word produced DOCX without any
> boxes and it also preserved layout.
> https://www.adobe.com/acrobat/online/pdf-to-word.html
>
> Why can't PDF import in Writer simply get rid of all text boxes?
>
> The requirements for PDF import in Draw may be very different though.

That distinction between Draw and Writer is interesting; I don't think
that's something we currently look at.

Having said that, at the moment we just don't have any code to work out how
to glue the text rows back into paragraphs correctly; the tricky part is
not gluing together unrelated text.
There are a few open source projects that manage it better than we do, so
it feels like we can improve on it somehow.
(For example, Okular's text selection generally works reasonably well; if it
can get the text in the right order, that's an indication we should be able
to thread it together.)

-- 
You are receiving this mail because:
You are the assignee for the bug.
