At the request of Twayne, I'm sending this to the list. I took it offline originally since Twayne's problem was solved, but as he rightly says, others might benefit from our comments.
On Jan 20, 2010, at 19:14 , Twayne wrote: > Behold the writings of James Wilde from <[email protected]> who > wrote: > >> Hi Twayne: >> >> I'm taking this offline since I see you have found a solution. >> >> I gather this problem arose when you scanned in a document consisting >> of several pages, and the program saved them in reverse order. I >> have a big scanning job in front of me, scanning my first book for >> the Gutenberg.org project, and don't want to find myself in the same >> boat. But just in case I do... :( > > WOW, good for you! I wish you all good luck in your endeavors; that's great. > > LOL! I'll try to explain, but 1. My healthcare worker is due about anytime > and 2. I have mental memory retrieval problems. > > If you can see the thumbs of the pages you're scanning, then it'll be obvious > if they're in the wrong order I think. I had the guy who did the actual > scanning send me his .opd file (OmniPage Pro 14) and he clearly should have > known it was being reversed. > Each new page scanned went to the top of the thumbnail display and became > page 1, all the other pages incrementing by one. He had set the application > to put the newly scanned pages ABOVE the current page, where it should have > been set to add them BELOW the current page! Current page always being the > last page scanned unless the user moved the focus. By adding AFTER the last > page, then the page numbers assigned in the thumbs always stayed correct, and > in order. > He said he changed where new pages went to so he wouldn't have to keep > scrolling down to see what the bottom thumbnail was - something that's seldom > necessary to do, inadvisable and just plain wrong IMO. > Hope that makes sense<g>. > >> >> When the scanning program saved the pages, did it save them as whole >> pages with a hard page return between each? > > Yes, it did. At least in the OCR Text Window. Rather than page feeds, it used > Word's Section breaks for each page with the bottom Section being to start > next section on a new page. Same thing - same results; just done differently. > Occasionally it turned out slightly longer than a full page in Word, so Word > used a page and a fraction, but it still ended the section with start new > section on new page. Same thing when I opened it in OO.o too, the final > target. > > If not, I'm a bit >> baffled how Word managed not to take the part page which was your >> last page and add a bit from the next to the last page, and for the >> next to the last page take a bit of the page before - if you see what >> I mean. Word - and Writer - have a tendency to fill a page with >> following text if there isn't a hard page break. > > True. How it's handled is really up to the OCR converter itself. It -could- > have put everything all together in one long flowing document but one of the > settable options I have is to retain Font and Paragraph settings, which makes > it use Words Sections (OO.o's Styles). > >> >> And finally, just what method did you use? I saw a reference to >> p=p-1 in the message which seemed to give you the solution, but p was >> clearly not the page number but the name of the field. I couldn't >> see how this would work. Can you briefly describe what you did? > > The p=p-1 was part of the description for the macro I used, not a literal. I > actually did things a bit differently than the post suggested, but the > general information worked with the exception of any pages with page-size > graphics on them. Fortunately there were only a few of those that failed to > move. My fault - I'm not good with code! > > You shouldn't need to do any macroes: Rather than try to explain the macro, > and since you're just getting started: If I were you I'd scan/OCR say 3 pages > or so, save them as a document or whatever your app allows, and just look to > be sure they're organized in the right order. > I can 't imagine an app being defaulted to provide reversed pages; as I > said, the user made an intentional decision for improper reasons and I got > the sludge, I mean, results, of his efforts. > > What do you use to do your scanning and OCR (optical character recognition) > work? Just curious; I may know the program. I have an old Canoscan N1240U which I have managed to get working on a new installation of XP SP3 (and coincidentally on the dual-booting Ubuntu 9-10). The ocr software is a copy of Omnipage, and I think the version is 9. I think it's from the last century. On Ubuntu I have installed - but not yet tested - gocr. Since I know that Omnipage works well with the scanner I'll probably be using that for my project. Although I might just spend a day or so experimenting with gocr. The problem is to link the Xsane scanning software which comes with Ubuntu with gocr. Xsane actually suggests gocr as the ocr of choice, but there doesn't appear to be a simple method of choosing ocr at the scanning stage. Maybe Xsane just saves the material in a form that gocr can use later - i.e. a two-phase ocr operation, which might actually be better, as opposed to the one-stage operation which Omnipage uses, and which means that you have the hassle of checking the input of each page before scanning the next. > > Please reserve further communications for the group so that all may benefit > from either of our comments. Normally I don't accept mails from newsgroups > but I recognized your name. > > HTH, > > Twayne > > > >> >> Thanks a lot if you have the time. >> >> James Wilde > > --------------------------------------------------------------------- To unsubscribe, e-mail: [email protected] For additional commands, e-mail: [email protected]
