Re: [libreoffice-users] A word of warning about PDF text

2014-01-31 Thread Cley Faye
2014-01-31 Peter West li...@pbw.id.au: A word of warning about text retrieved from PDF documents. Recovering text blocks from PDFs is inherently risky. PDF is a page definition format, and so it has no notion of the semantics of the text it contains. It places bits of text at certain

Re: [libreoffice-users] A word of warning about PDF text

2014-01-31 Thread Dominique Michel
On Fri, 31 Jan 2014 13:22:41 +1000, Peter West li...@pbw.id.au wrote: A word of warning about text retrieved from PDF documents. Recovering text blocks from PDFs is inherently risky. PDF is a page definition format, and so it has no notion of the semantics of the text it contains. It

Re: [libreoffice-users] A word of warning about PDF text

2014-01-31 Thread Dominique Michel
On Sat, 1 Feb 2014 01:18:22 +0100, Dominique Michel dominique.mic...@vtxnet.ch wrote: On Fri, 31 Jan 2014 13:22:41 +1000, Peter West li...@pbw.id.au wrote: A word of warning about text retrieved from PDF documents. Recovering text blocks from PDFs is inherently risky. PDF is a

[libreoffice-users] A word of warning about PDF text

2014-01-30 Thread Peter West
A word of warning about text retrieved from PDF documents. Recovering text blocks from PDFs is inherently risky. PDF is a page definition format, and so it has no notion of the semantics of the text it contains. It places bits of text at certain positions on the page. You can create a whole
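
To illustrate the point about text being placed at positions on the page, here is a minimal Python sketch. It is not taken from the thread and does not parse a real PDF: the `Fragment` type, the sample coordinates, and both helper functions are hypothetical, and the line-grouping rule is only one possible heuristic. It shows that reading fragments in the order they were drawn can scramble the text, and that any "correct" order has to be guessed from coordinates.

```python
from typing import List, NamedTuple


class Fragment(NamedTuple):
    """A piece of text placed at a position (hypothetical model of PDF text)."""
    x: float  # distance from the left edge, in points
    y: float  # distance from the bottom edge, in points (default PDF user space)
    text: str


# Invented sample page: fragments listed in the order they were drawn,
# which need not match the order a person would read them in.
page = [
    Fragment(72, 700, "Recovering text"),
    Fragment(72, 686, "is inherently"),
    Fragment(160, 700, "from PDFs"),
    Fragment(150, 686, "risky."),
]


def stream_order(fragments: List[Fragment]) -> str:
    """Join fragments exactly in drawing order, ignoring positions."""
    return " ".join(f.text for f in fragments)


def guessed_reading_order(fragments: List[Fragment],
                          line_tolerance: float = 2.0) -> str:
    """Guess reading order from coordinates alone.

    Fragments whose y values differ by at most line_tolerance from the
    previous fragment are treated as one line; each line is then read
    left to right. This is a heuristic and breaks on multi-column pages.
    """
    lines: List[List[Fragment]] = []
    # Walk from the top of the page (largest y) to the bottom.
    for frag in sorted(fragments, key=lambda f: -f.y):
        if lines and abs(lines[-1][-1].y - frag.y) <= line_tolerance:
            lines[-1].append(frag)
        else:
            lines.append([frag])
    return "\n".join(
        " ".join(f.text for f in sorted(line, key=lambda f: f.x))
        for line in lines
    )


if __name__ == "__main__":
    print(stream_order(page))
    print(guessed_reading_order(page))
```

Even on this tiny example the two functions disagree, and nothing in the fragments themselves says which result the author intended. A two-column layout would defeat the simple top-to-bottom, left-to-right guess as well.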