On Mon, Sep 01, 2003, Subba Rao wrote about "Re: VI(M) - Changing & Merging 2 lines?":
> One other question is, how can I find white space (non-printable) characters
> in a text file. The records file was created from PDF to text with the help of
> "pdftotext". I don't what characters are in this file, but when I try to
> import it into an application, the data gets mangled. The file opens fine in
> any of the spreadsheets. I want to see the characters in the file (not \n or
> <SPACE> etc).
Whitespace and non-printable characters aren't the same thing, obviously.
I'm assuming that you want to see non-printable characters (control characters
and non-ASCII characters).
One thing you can use is "cat -v", as in
cat -vE file | less
This will show you control characters as ^A, non-ascii characters as M-A,
and line ends as $ (this is useful for seeing spaces in the end of the line).
'less file' might also be enough, depending on what you need.
If you want to see the character codes of all the characters in your file,
you can also use "od -c file".
--
Nadav Har'El | Tuesday, Sep 2 2003, 5 Elul 5763
[EMAIL PROTECTED] |-----------------------------------------
Phone: +972-53-245868, ICQ 13349191 |As far as we know, our computer has never
http://nadav.harel.org.il |had an undetected error.
=================================================================
To unsubscribe, send mail to [EMAIL PROTECTED] with
the word "unsubscribe" in the message body, e.g., run the command
echo unsubscribe | mail [EMAIL PROTECTED]