Re: Cleaning XML - Unicode 0x0 SOLVED sorta

2006-11-07 Thread Rob Wilkerson
On 11/7/06, Josh Nathanson <[EMAIL PROTECTED]> wrote: > Hey Rob, > > > True. Does lead you to wonder, though, how they're sneaking in there. > > Folks don't just type in null characters... > > It leads me to wonder allright!! Maybe forms autofill or something? I wish I had an answer for you. It

Re: Cleaning XML - Unicode 0x0 SOLVED sorta

2006-11-07 Thread Paul Hastings
Josh Nathanson wrote: > 3) If I did the rereplace on the way into the db, it still may not catch all > offending control characters. There may be a new one that isn't in the > regex yet. Additionally, I don't want to disrupt the shopper's checkout > process if at all possible. there can't be.

Re: Cleaning XML - Unicode 0x0 SOLVED sorta

2006-11-07 Thread Josh Nathanson
Hey Rob, > True. Does lead you to wonder, though, how they're sneaking in there. > Folks don't just type in null characters... It leads me to wonder allright!! Maybe forms autofill or something? > REReplace ( mystring, '[^\x00-\x7f]', '', 'ALL' ) > Again, it's a pretty broad brush, but it shou

Re: Cleaning XML - Unicode 0x0 SOLVED sorta

2006-11-07 Thread Rob Wilkerson
On 11/7/06, Josh Nathanson <[EMAIL PROTECTED]> wrote: > > Maybe I can summarize: > 1) CDATA is not helpful when encountering control characters. True. Does lead you to wonder, though, how they're sneaking in there. Folks don't just type in null characters... > 2) Thus, I have to use rereplace w

Re: Cleaning XML - Unicode 0x0 SOLVED sorta

2006-11-07 Thread Josh Nathanson
From: "Rob Wilkerson" <[EMAIL PROTECTED]> To: "CF-Talk" Sent: Tuesday, November 07, 2006 2:04 PM Subject: Re: Cleaning XML - Unicode 0x0 SOLVED sorta > On 11/7/06, Matt Quackenbush <[EMAIL PROTECTED]> wrote: >> Josh, >> >> I think the

Re: Cleaning XML - Unicode 0x0 SOLVED sorta

2006-11-07 Thread Rob Wilkerson
On 11/7/06, Matt Quackenbush <[EMAIL PROTECTED]> wrote: > Josh, > > I think the point that Rob and others were making is that your data should > be validated and cleaned up BEFORE being inserted into the database - > whether it's inserted as XML or not is completely and utterly irrelevant. That's

RE: Cleaning XML - Unicode 0x0 SOLVED sorta

2006-11-07 Thread Matt Quackenbush
overs up the real issue, which is invalid data being entered into the database. Thanks, Matt -Original Message- From: Josh Nathanson [mailto:[EMAIL PROTECTED] Sent: Tuesday, November 07, 2006 1:14 PM To: CF-Talk Subject: Re: Cleaning XML - Unicode 0x0 SOLVED sorta OK, I added this

Re: Cleaning XML - Unicode 0x0 SOLVED sorta

2006-11-07 Thread Rob Wilkerson
On 11/7/06, Josh Nathanson <[EMAIL PROTECTED]> wrote: > > Yes it's non scalable...but, since the data is not going into the database > as xml, just plain old form fields, I can't use CDATA on the way in anyway, > correct? I would have to run the same regex on each of the incoming form > fields tha

Re: Cleaning XML - Unicode 0x0 SOLVED sorta

2006-11-07 Thread Josh Nathanson
OK, I added this to my regex: \x00 Which is a hex representation of the character 0. And it worked. Not sure why chr(0) didn't work. Yes it's non scalable...but, since the data is not going into the database as xml, just plain old form fields, I can't use CDATA on the way in anyway, correct?