According to Greg Fenton:
> --- Pub Litics <[EMAIL PROTECTED]> wrote:
> > it was loaded with M-underline (carriage returns). I stripped all of
> > those out and re-indexed and, behold, when I ran a search I did not
> > come up with any forum-related, database entries for the
> > sub-directories I was trying to exclude.
>
> BTW: You can get a utility to strip out carriage returns. There are a
> number of different ones, but the one typically found on Unix and in
> Cygwin is "dos2unix".

Oddly enough, the ht://Dig programs strip carriage returns from the ends of lines, so lines terminated with CR/LF instead of just LF shouldn't pose a problem. What I was saying earlier is that lines ending with just CR, and no LF, are a problem.

An ASCII CR control character (carriage return) has the hex value 0D, while an ASCII LF control character (line feed) has the hex value 0A. Unix/Linux programs need the 0A value, also called the newline character, as the line terminator. Some Linux programs also accept CR/LF (hex 0D 0A) as a line terminator, simply because so many DOS/Windows-based editors produce it, but not all Linux programs do.

But what is this M-underline character? Do you mean the "M-_" that a cat -v command would show? If so, that is actually a hex DF value, i.e. the underscore (hex 5F) with the top bit set, or ß (German Eszett) in ISO-8859-1. If your editor was adding these, there is indeed a problem.

--
Gilles R. Detillieux              E-mail: <[EMAIL PROTECTED]>
Spinal Cord Research Centre       WWW:    http://www.scrc.umanitoba.ca/
Dept. Physiology, U. of Manitoba  Winnipeg, MB  R3E 3J7  (Canada)

htdig-general mailing list <[EMAIL PROTECTED]>
FAQ: http://htdig.sourceforge.net/FAQ.html
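
For anyone wanting to check a file for the byte values described above, here is a rough Python sketch. It is not part of ht://Dig, and the function names (count_line_endings, normalize_to_lf, cat_v) are made up for illustration. It counts CR/LF pairs, lone CRs and lone LFs, converts everything to plain LF, and shows how a single byte would look in "cat -v" style notation. The cat_v helper is simplified: a real cat -v leaves LF and TAB untouched, while this one prints them in caret form too.

```python
def count_line_endings(data: bytes) -> dict:
    """Count CR/LF pairs, lone CRs (0x0D) and lone LFs (0x0A) in raw bytes."""
    crlf = data.count(b"\r\n")
    return {
        "crlf": crlf,
        # Every CR/LF pair contains one CR and one LF, so subtract the pairs.
        "cr_only": data.count(b"\r") - crlf,
        "lf_only": data.count(b"\n") - crlf,
    }


def normalize_to_lf(data: bytes) -> bytes:
    """Turn CR/LF and lone CR terminators into plain LF."""
    # Replace the pairs first so a CR/LF does not become two LFs.
    return data.replace(b"\r\n", b"\n").replace(b"\r", b"\n")


def cat_v(byte: int) -> str:
    """Render one byte roughly the way 'cat -v' would (simplified)."""
    prefix = ""
    if byte >= 0x80:
        # Top bit set: show "M-" and then the byte with that bit cleared.
        prefix = "M-"
        byte -= 0x80
    if byte < 0x20:
        # Control characters are shown as ^ plus the matching letter.
        return prefix + "^" + chr(byte + 0x40)
    if byte == 0x7F:
        return prefix + "^?"
    return prefix + chr(byte)


if __name__ == "__main__":
    sample = b"first\r\nsecond\rthird\n"
    print(count_line_endings(sample))
    print(normalize_to_lf(sample))
    print(cat_v(0x0D), cat_v(0xDF))
    # 0xDF decoded as ISO-8859-1 (latin-1) is the German sharp s.
    print(bytes([0xDF]).decode("latin-1"))
```

If cr_only comes back non-zero for a config file, that is the case that causes trouble; a non-zero crlf count alone should be harmless to ht://Dig.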

