I've been given the job of writing a CGI script to receive the data from a form and 
append it to a text file. Later, the text file will be analyzed using MS Access. My 
problem is escaping characters which are often used as delimiters in text-based 
importing formats, such as ' or " or \t or \n. Any of these could be legitimately 
entered by a user into the text fields of the form. I'd like to preserve these rather 
than discard them, and in such a way that they can be easily converted back into the 
original characters after importing into Access.
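
To make the requirement concrete, here is a rough sketch of the kind of reversible 
escaping I mean. It assumes percent-style encoding (as used in URLs), which is only 
one possibility, and I don't know that Access can decode it without extra work on 
that side. The names escape_field and unescape_field are made up for illustration 
and don't come from any module.

```perl
use strict;
use warnings;

# Hypothetical helper: percent-encode only the characters that could be
# mistaken for delimiters, plus '%' itself so decoding is unambiguous.
sub escape_field {
    my ($text) = @_;
    $text =~ s/([%\t\r\n'"])/sprintf("%%%02X", ord($1))/ge;
    return $text;
}

# Hypothetical helper: turn every %XX pair back into the original character.
sub unescape_field {
    my ($text) = @_;
    $text =~ s/%([0-9A-Fa-f]{2})/chr(hex($1))/ge;
    return $text;
}

my $original = "It's a \"test\"\twith a tab\nand a 100% newline";
my $stored   = escape_field($original);

print "$stored\n";
print "round trip ok\n" if unescape_field($stored) eq $original;
```

With something like this, each record would stay on one line of the text file, and 
the original characters could be restored after the import.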

Is there a standardized or commonly accepted way of doing this?

I first looked at HTML::Entities, but it doesn't look as if it converts \t or \n. 
Otherwise, it would be a good choice. I also looked at Unicode::Strings, but it 
seems to convert the entire string to Unicode. I don't know whether Access accepts 
that, or whether it would even solve my problem.

The form is an Adobe .pdf form with editable fields, which returns the data as an .fdf 
file. I don't think this is important, but you can learn more about it at 
http://www.adobe.com/support/techdocs/27f9a.htm.

I tried searching CPAN for 'encoding', but that didn't seem to be the right term.

Thanks for your help and advice.

-Kevin


-----
E. Kevin Zembower
Internet Systems Group manager
Johns Hopkins University
Bloomberg School of Public Health
Center for Communications Programs
111 Market Place, Suite 310
Baltimore, MD  21202
410-659-6139

