On Dec 14, 2004, at 7:46 AM, Randy Padawer wrote:
> The text file is made up of many individual emails concatenated end-to-end.
Rather than parsing on the lead line, maybe you can parse on the email format. Something like this, only in regex?
    Optional blank lines
    Start of capture for single email
        Some non-blank lines
        Blank line
        Optional lines
    End of capture
    dot line
    Start of capture for rest of emails
        blah-blah
        end of file
    End of capture
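As a rough sketch, the outline above might translate into a regex like the one below. It uses Python's `re` module as a stand-in, not Revolution's matchText, so the syntax would need adapting. The names `FIRST_AND_REST` and `split_first_email` are made up for illustration, and it assumes line endings are already plain LF and that each email ends with a line holding only a dot.

```python
import re

# Hypothetical pattern following the outline line by line.
# Assumes LF line endings and a lone "." line closing each email.
FIRST_AND_REST = re.compile(
    r"""
    \A(?:[ \t]*\n)*                 # optional blank lines
    (                               # start of capture for single email
      (?:[ \t]*[^ \t\n][^\n]*\n)+   # some non-blank lines
      [ \t]*\n                      # blank line
      (?:[^\n]*\n)*?                # optional lines, as few as possible
    )                               # end of capture
    \.[ \t]*(?:\n|\Z)               # dot line
    (.*)\Z                          # capture rest of emails up to end of file
    """,
    re.VERBOSE | re.DOTALL,
)


def split_first_email(text):
    """Return (first_email, rest) or None if no complete email is found."""
    m = FIRST_AND_REST.match(text)
    if m is None:
        return None
    return m.group(1), m.group(2)
```

Calling `split_first_email` repeatedly on the returned rest would walk through the whole file, one email at a time.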
If the file is long, the second capture might be slow. You might want to consider matchChunk instead of matchText and match on either a single char or the end. That would get you the char position of the end. It might be possible to capture nothing.
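The position-based idea could look roughly like this. Again it is Python rather than Revolution: `match(text, pos)` together with `m.end()` plays the part that matchChunk's character positions would play, so the rest of the file is never captured or copied. `ONE_EMAIL` and `iter_emails` are hypothetical names, and the same LF and dot-line assumptions apply.

```python
import re

# Hypothetical pattern for one email only; there is no capture for the rest.
# Assumes LF line endings and a lone "." line closing each email.
ONE_EMAIL = re.compile(
    r"""
    (?:[ \t]*\n)*                   # optional blank lines
    (                               # start of capture for one email
      (?:[ \t]*[^ \t\n][^\n]*\n)+   # some non-blank lines
      [ \t]*\n                      # blank line
      (?:[^\n]*\n)*?                # optional lines, as few as possible
    )                               # end of capture
    \.[ \t]*(?:\n|\Z)               # dot line
    """,
    re.VERBOSE,
)


def iter_emails(text):
    """Yield each email, resuming from the end position of the last match."""
    pos = 0
    while True:
        m = ONE_EMAIL.match(text, pos)  # match must start exactly at pos
        if m is None:
            break
        yield m.group(1)
        pos = m.end()  # char position just past the dot line
```

Only the end position is carried from one step to the next, so each pass scans forward from where the previous email stopped.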
If you want the dot line in the mail, just move the end of capture.
This might have problems with some special cases such as an empty body; you might have to look at the syntax for that.
You can read the file as binary, do the parsing on CR LF line endings, and then convert to your platform's line endings, if you want.
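A minimal sketch of that binary route, in Python for illustration. Instead of parsing on CR LF directly, this variant first rewrites every line ending to LF, which is a swapped-in simplification. The function names are hypothetical, and decoding as Latin-1 is an assumption chosen only because it maps every byte to a character.

```python
import os


def normalize_newlines(raw):
    """Decode raw bytes and turn CR LF (and bare CR) line endings into LF."""
    text = raw.decode("latin-1")  # assumption: any byte-preserving encoding
    return text.replace("\r\n", "\n").replace("\r", "\n")


def read_normalized(path):
    """Read a file in binary mode so no line-ending translation happens."""
    with open(path, "rb") as f:
        return normalize_newlines(f.read())


def to_platform(text):
    """Convert LF line endings to the current platform's convention."""
    return text.replace("\n", os.linesep)
```

Parsing would happen on the LF-only text, with `to_platform` applied only at the point where the result is written back out.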
Dar Scott
****************************************
DSC
http://www.swcp.com/dsc/
Programming Services
****************************************
