2008/10/14 <[EMAIL PROTECTED]>: > > Dotan> Can Python go through a directory of files and replace each > Dotan> instance of "newline-space" with nothing? > > Sure. Something like (*completely* untested, so caveat emptor): > > import glob > import os > > for f in glob.glob('*.vcf'): > # corrupt data > uncooked = open(f, 'rb').read() > # fix it > cooked = uncooked.replace('\n ', '') > # backup original file for safety > os.rename(f, '%s.orig' % f) > # and save it > open(f, 'wb').write(cooked) >
The regex seems to be catching more than I would expect it to: [EMAIL PROTECTED]:~/test$ cat test.vcf BEGIN:VCARD FN:First Last N:Last;First;;; NOTE;CHARSET=UTF-8;ENCODING=QUOTED-PRINTABLE:First Line.\nThe Second Line i s long so that it will wrap. Long\, long\, and wrapping!=\n\nThird Line.\n UID:frh74xvYZ9 VERSION:2.1 END:VCARD BEGIN:VCARD FN;CHARSET=UTF-8;ENCODING=QUOTED-PRINTABLE:=D7=90=D7=90=D7=A4=D7=A8=D7=98=D 7=99 =D7=9E=D7=A9=D7=A4=D7=97=D7=94 N;CHARSET=UTF-8;ENCODING=QUOTED-PRINTABLE:=D7=9E=D7=A9=D7=A4=D7=97=D7=94;=D 7=90=D7=90=D7=A4=D7=A8=D7=98=D7=99;;; NOTE;CHARSET=UTF-8;ENCODING=QUOTED-PRINTABLE:=D7=A9=D7=95=D7=A8=D7=94 =D7=A 8=D7=90=D7=A9=D7=95=D7=A0=D7=94.\n=D7=A9=D7=95=D7=A8=D7=94 =D7=A9=D7=A0=D7= 99=D7=94 =D7=94=D7=99=D7=90 =D7=\n=90=D7=A8=D7=95=D7=9B=D7=94\, =D7=9B=D7=9 3=D7=99 =D7=A9=D7=A0=D7=A8=D7=90=\n =D7=90=D7=95=D7=AA=D7=94 =D7=92=D7=95=D 7=9C=D7=A9=D7=AA. =D7=90=D7=A8=D7=\n=95=D7=9B=D7=94\, =D7=90=D7=A8=D7=95=D7 =9B=D7=94\, =D7=95=D7=92=D7=95=D7=9C=\n=D7=A9=D7=AA!\n=D7=A9=D7=95=D7=A8=D7 =94 =D7=A9=D7=9C=D7=99=D7=A9=D7=99=D7=AA.\n UID:KqbQKbfBaF VERSION:2.1 END:VCARD [EMAIL PROTECTED]:~/test$ python ./fix.py [EMAIL PROTECTED]:~/test$ cat test.vcf BEGIN:VCARD FN:First Last N:Last;First;;; s long so that it will wrap. Long\, long\, and wrapping!=\n\nThird Line.\ni UID:frh74xvYZ9 VERSION:2.1 END:VCARD BEGIN:VCARD 7=99 =D7=9E=D7=A9=D7=A4=D7=97=D7=94INTABLE:=D7=90=D7=90=D7=A4=D7=A8=D7=98=D 7=90=D7=90=D7=A4=D7=A8=D7=98=D7=99;;;ABLE:=D7=9E=D7=A9=D7=A4=D7=97=D7=94;=D =94 =D7=A9=D7=9C=D7=99=D7=A9=D7=99=D7=AA.\nA9=D7=AA!\n=D7=A9=D7=95=D7=A8=D7 UID:KqbQKbfBaF VERSION:2.1 END:VCARD [EMAIL PROTECTED]:~/test$ Why is that? I tried to fix it, but I do not even see what is wrong to fix! It looks god to me. -- Dotan Cohen http://what-is-what.com http://gibberish.co.il א-ב-ג-ד-ה-ו-ז-ח-ט-י-ך-כ-ל-ם-מ-ן-נ-ס-ע-ף-פ-ץ-צ-ק-ר-ש-ת ä-ö-ü-ß-Ä-Ö-Ü -- http://mail.python.org/mailman/listinfo/python-list