Hello,
From time to time, someone sends me a PDF containing tables of
product price updates. I found xpdf and its pdftotext command-line
tool, which exports the text from a PDF file. I want to extract the
interesting lines from this text file in order to import the relevant
data into a database. I have worked out how to use grep to do that.
Most of the lines look like this one:
26390 PRODUCT
0,28 5,60 0,30 5,90
But sometimes the label is too long for the PDF table cell, and
pdftotext exports this instead:
START OF A VERY VERY VERY LONG LABEL WITH
14604
0,30 14,90 0,30 15,00
SOME OTHER INFORMATION AT THE END
Can sed help me rebuild the full product label on a single line?
Can you help me write that program?
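To make the goal concrete, here is a minimal sketch of the merging I
have in mind. It is written in Python rather than sed only because it
is easier to read, and it rests on guesses drawn from the two samples
above: a product code is a run of digits at the start of a line, a
price line holds only numbers with a decimal comma and two decimals, a
text line directly before a bare code line is the start of that
product's label, and any other text line continues the previous label.
The names rebuild_records, PRICE_LINE and CODE_LINE are made up for
this illustration.

```python
import re

# Assumption: price lines contain only numbers such as "0,28" or "14,90".
PRICE_LINE = re.compile(r"^\d+,\d{2}(\s+\d+,\d{2})*$")
# Assumption: a product code is a run of digits, optionally followed by a label.
CODE_LINE = re.compile(r"^(\d+)(?:\s+(.*))?$")


def rebuild_records(lines):
    """Group pdftotext output lines into dicts with code, label and prices."""
    lines = [ln.strip() for ln in lines if ln.strip()]
    records = []
    head = ""  # label fragment seen just before a bare code line
    for i, line in enumerate(lines):
        code_match = CODE_LINE.match(line)
        if PRICE_LINE.match(line):
            # Prices belong to the most recent product.
            if records:
                records[-1]["prices"] = line.split()
        elif code_match:
            # Join a pending label start with any label on the code line.
            parts = [p for p in (head, code_match.group(2)) if p]
            records.append({"code": code_match.group(1),
                            "label": " ".join(parts),
                            "prices": []})
            head = ""
        else:
            nxt = lines[i + 1] if i + 1 < len(lines) else ""
            next_code = CODE_LINE.match(nxt)
            if next_code and next_code.group(2) is None:
                # A bare code follows: this text starts the next label.
                head = line
            elif records:
                # Otherwise treat the text as the end of the previous label.
                records[-1]["label"] = (records[-1]["label"] + " " + line).strip()
    return records


if __name__ == "__main__":
    sample = [
        "26390 PRODUCT",
        "0,28 5,60 0,30 5,90",
        "START OF A VERY VERY VERY LONG LABEL WITH",
        "14604",
        "0,30 14,90 0,30 15,00",
        "SOME OTHER INFORMATION AT THE END",
    ]
    for rec in rebuild_records(sample):
        print(rec["code"], rec["label"], " ".join(rec["prices"]))
```

A label that itself begins with digits, or page headers that pdftotext
drops between products, would confuse this sketch, so it shows the
intended result and is not a finished solution.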
Many thanks in advance, best regards,
--
Pierre Y.