According to Greg Fenton:
> --- Pub Litics <[EMAIL PROTECTED]> wrote:
> > it was loaded with M-underline (carriage returns).  I stripped all of
> > those out and re-indexed and, behold, when I ran a search I did not
> > come up with any forum-related, database entries for the
> > sub-directories I was trying to exclude.
> 
> BTW: You can get a utility to strip out carriage returns.  There are a
> number of different ones, but the one typically found on Unix and in
> Cygwin is "dos2unix".

Oddly enough, ht://Dig programs strip off the carriage returns from
the end of the lines, so lines terminated with CR/LF instead of just
LF shouldn't pose a problem.  What I was saying earlier is that if the
lines end with just CR, and no LF, then it is a problem.

An ASCII CR control character (carriage return) has a hex 0D value,
while an ASCII LF control character (line feed) has a hex 0A value.
Unix/Linux programs need the 0A value, also called the newline character,
as a line terminator.  Some Linux programs also allow CR/LF, hex 0D0A,
as line terminator, just because so many DOS/Windows based editors use
these, but not all Linux programs allow these.

But what is this M-underline character?  Do you mean the "M-_" that a
cat -v command would show?  If so, that is actually a hex DF value,
i.e. the underscore with the top bit set, or � (German es-tset) in
ISO-8859-1.  If your editor was adding these, there is indeed a problem.

-- 
Gilles R. Detillieux              E-mail: <[EMAIL PROTECTED]>
Spinal Cord Research Centre       WWW:    http://www.scrc.umanitoba.ca/
Dept. Physiology, U. of Manitoba  Winnipeg, MB  R3E 3J7  (Canada)


-------------------------------------------------------
This sf.net email is sponsored by:ThinkGeek
Welcome to geek heaven.
http://thinkgeek.com/sf
_______________________________________________
htdig-general mailing list <[EMAIL PROTECTED]>
To unsubscribe, send a message to <[EMAIL PROTECTED]> with a 
subject of unsubscribe
FAQ: http://htdig.sourceforge.net/FAQ.html

Reply via email to