Greetings, That's the way to go.
The N lines equals a record is a good way to handle "semi" or even non structured data - e.g. a document containing "notes". I've processed files where I set it to be 1 line per record and had only one field starting at the beginning of the row for example. Regards, Thom Thom C. Blackwell Product Manager Boston Software Systems (866) 653-5105 ex 807 www.bossoft.com <http://www.bossoft.com/> Sign up for my weekly webinar! <http://www.bostonworkstation.com/customer_center/special_events.aspx> LEGAL NOTICE Unless expressly stated otherwise, this message is confidential and may be privileged. It is intended for the addressee(s) only. Access to this E-mail by anyone else is unauthorized. If you are not an addressee, any disclosure or copying of the contents of this E-mail or any action taken (or not taken) in reliance on it is unauthorized and may be unlawful. If you are not an addressee, please inform the sender immediately, then delete this message and empty from your trash. From: [email protected] [mailto:[email protected]] Sent: Friday, August 13, 2010 7:47 PM To: Talk Subject: RE: [talkbws] Parsing PDF file with DataStation Thom, Thanks for the speedy reply! I tried the Page Length options: Each page ends with a Form Feed, and Beginning of Page Begins Like. They produce similar results, 1 record and then EOF. The PDF report looks a bit like this - PID VISITNO VISITLOC ADMITTED DISCHARGED 123456 1012345678 4W1001 07/26/2010 08/12/2010 123457 1012345679 4W1002 07/26/2010 08/12/2010 . . . I'm trying to process the PDF in the same manner as a fixed width text file - one record per line. PDF/Page only returns one record per page. What seems to be working for this particular PDF file is setting the Lines per Page to 1. Thanks again, David Garcia Sr. Programmer/Analyst Washington Hospital Healthcare System 2000 Mowry Avenue Fremont, CA 94538 (510) 745-6477 From: [email protected] [mailto:[email protected]] Sent: Friday, August 13, 2010 12:33 PM To: [email protected] Subject: RE: [talkbws] Parsing PDF file with DataStation Greetings, There are a series of radio buttons on the parsing config screen that related to page length and it sounds like you need to set one of these. Now I don't know what is the best option for your file, but here's the rundown to help you choose: Each page ends with a Form Feed - this indicates there's a "pagebreak" of sorts in the data - may be worth trying first If that doesn't work Beginning of page/End of page like buttons/dialog is used if there is a consistent something at the bottom / top of the page - e.g. End of Page or Page: Number of lines per page is the last resort - here you can specify there are n lines per page. Regards, Thom Thom C. Blackwell Product Manager Boston Software Systems (866) 653-5105 ex 807 www.bossoft.com <http://www.bossoft.com/> Sign up for my weekly webinar! <http://www.bostonworkstation.com/customer_center/special_events.aspx> LEGAL NOTICE Unless expressly stated otherwise, this message is confidential and may be privileged. It is intended for the addressee(s) only. Access to this E-mail by anyone else is unauthorized. If you are not an addressee, any disclosure or copying of the contents of this E-mail or any action taken (or not taken) in reliance on it is unauthorized and may be unlawful. If you are not an addressee, please inform the sender immediately, then delete this message and empty from your trash. From: [email protected] [mailto:[email protected]] Sent: Friday, August 13, 2010 3:28 PM To: Talk Subject: [talkbws] Parsing PDF file with DataStation Greetings, New BWS user here... I have a fairly simple PDF report file I'm trying to parse out using DataStation. The one page report has 7 fixed columns with titles and one detail record per line. DataStation is set up to use PDF/page format. I can view the report and select fields in the first detail line. The problem is that DataStation only returns the first record on the page and then goes to EOF. I can't figure out how to specify in the page dialog that there can be multiple records on a page. Any suggestions would be greatly appreciated Thanks! David Garcia Sr. Programmer/Analyst Washington Hospital Healthcare System 2000 Mowry Avenue Fremont, CA 94538 (510) 745-6477 --- To post a message to this list, send mail to: [email protected] You are currently subscribed as: [email protected] Unsubscribe in the customer center on our website: http://www.bostonworkstation.com/customer_center/virtual_user_group_talk .aspx --- To post a message to this list, send mail to: [email protected] You are currently subscribed as: [email protected] Unsubscribe in the customer center on our website: http://www.bostonworkstation.com/customer_center/virtual_user_group_talk .aspx --- To post a message to this list, send mail to: [email protected] You are currently subscribed as: [email protected] Unsubscribe in the customer center on our website: http://www.bostonworkstation.com/customer_center/virtual_user_group_talk .aspx --- To post a message to this list, send mail to: [email protected] You are currently subscribed as: [email protected] Unsubscribe in the customer center on our website: http://www.bostonworkstation.com/customer_center/virtual_user_group_talk.aspx
