I've been given the job of writing a CGI script to receive the data from a form and append it to a text file. Later, the text file will be analyzed using MS Access. My problem is escaping characters that are often used as delimiters in text-based import formats, such as ' or " or \t or \n. Any of these could legitimately be entered by a user into the text fields of the form. I'd like to capture these rather than discard them, and in such a way that they can easily be converted back into the original characters after importing into Access.
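
To make the requirement concrete, here is a minimal sketch of one reversible encoding, assuming URL-style percent escapes are acceptable in the text file. The sub names escape_field and unescape_field, the sample data, and the choice of a tab as the field separator are all made up for illustration; this is not an established Access import convention.

```perl
use strict;
use warnings;

# Hypothetical helper: replace each delimiter-like character with %XX
# (two uppercase hex digits). The '%' sign itself is also encoded, so
# decoding is unambiguous.
sub escape_field {
    my ($text) = @_;
    $text =~ s/([%'"\t\r\n])/sprintf('%%%02X', ord($1))/ge;
    return $text;
}

# Hypothetical inverse: turn every %XX sequence back into its character.
sub unescape_field {
    my ($text) = @_;
    $text =~ s/%([0-9A-Fa-f]{2})/chr(hex($1))/ge;
    return $text;
}

# Made-up sample fields containing quotes, a tab, a newline and a percent sign.
my @fields = ("O'Brien", "said \"hi\"\tthen left\n", "100%");

# One record per line, fields separated by a real tab (an assumption).
my $record = join("\t", map { escape_field($_) } @fields);
print "$record\n";
```

After encoding, the only raw tabs and newlines left in the file are the ones written as separators, so the file should import cleanly. Decoding on the Access side would need an equivalent routine, which this sketch does not cover.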
Is there a standardized or commonly accepted way of doing this? I first looked at HTML::Entities, but it doesn't look as if it converts \t or \n. Otherwise, it would be a good choice. I also looked at Unicode::Strings, but it seemed as if this would convert the entire string to Unicode. I don't know whether Access accepts that, or whether it would even solve my problem.

The form is an Adobe .pdf form with editable fields, which returns the data as an .fdf file. I don't think this is important, but you can learn more about it at http://www.adobe.com/support/techdocs/27f9a.htm.

I tried searching CPAN for 'encoding', but that didn't seem to be the right term.

Thanks for your help and advice.

-Kevin

-----
E. Kevin Zembower
Internet Systems Group manager
Johns Hopkins University
Bloomberg School of Public Health
Center for Communications Programs
111 Market Place, Suite 310
Baltimore, MD 21202
410-659-6139