Need RegEx help for unknown/odd characters.

2013-05-17 Thread Che Vilnonis

Good afternoon. Occasionally, when parsing an RSS feed, I get RSS data I
cannot parse. See the screenshot linked below (text highlighted in green). In
this example, Montreal should return Montréal but does not. What regex
could I use to remove or replace such odd characters?

http://www.asitv.com/images/_funkychar.jpg

Thanks, Che



~|
Order the Adobe Coldfusion Anthology now!
http://www.amazon.com/Adobe-Coldfusion-Anthology/dp/1430272155/?tag=houseoffusion
Archive: 
http://www.houseoffusion.com/groups/cf-talk/message.cfm/messageid:355737
Subscription: http://www.houseoffusion.com/groups/cf-talk/subscribe.cfm
Unsubscribe: http://www.houseoffusion.com/groups/cf-talk/unsubscribe.cfm


Re: Need RegEx help for unknown/odd characters.

2013-05-17 Thread Nathan Strutz

How about deAccent()?
http://cflib.org/udf/deAccent
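
For readers outside ColdFusion, the general idea of accent stripping can be sketched in Python. This is only an illustration of the technique, not the implementation of the cflib UDF, which has not been checked here; the function name strip_accents is hypothetical.

```python
import unicodedata


def strip_accents(text: str) -> str:
    """Return text with combining accent marks removed (hypothetical helper)."""
    # NFD decomposition splits "é" into "e" plus a combining acute accent.
    decomposed = unicodedata.normalize("NFD", text)
    # Keep only the characters that are not combining marks.
    return "".join(ch for ch in decomposed if not unicodedata.combining(ch))


print(strip_accents("Montréal"))  # Montreal
```

Note that this only helps when the accented characters arrive intact. It does nothing for text that was already damaged by a wrong charset.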


nathan strutz
[www.dopefly.com] [hi.im/nathanstrutz]


On Wed, May 15, 2013 at 9:20 AM, Che Vilnonis ch...@asitv.com wrote:


 Good morning. Occasionally, when parsing an RSS feed, I get RSS data I cannot
 parse. See the screenshot linked below (text highlighted in green). In this
 example, Montreal should return Montréal but does not. What regex could I
 use to remove or replace such odd characters?

 http://www.asitv.com/images/_funkychar.jpg

 Thanks, Che




 

Archive: 
http://www.houseoffusion.com/groups/cf-talk/message.cfm/messageid:355757


RE: Need RegEx help for unknown/odd characters.

2013-05-17 Thread Che Vilnonis

Nathan, the problem is that within the feed itself, the characters don't
have accents. When output to a browser, the funky chars display on screen
as odd geometric shapes, like an upright rectangle. They're like odd ASCII
chars that I have only seen a few times before. In the end, I can't insert
them into the MySQL db.
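
If the offending characters turn out to be control characters (an assumption, since the screenshot does not say which code points they are), a regex along these lines would remove or replace them. It is a Python sketch; CONTROL_CHARS and strip_control_chars are made-up names, and the same character class should carry over to other regex engines with minor syntax changes.

```python
import re

# C0 control characters except tab, newline and carriage return,
# plus DEL (0x7F) and the C1 control range (0x80-0x9F).
CONTROL_CHARS = re.compile(r"[\x00-\x08\x0b\x0c\x0e-\x1f\x7f-\x9f]")


def strip_control_chars(text: str, replacement: str = "") -> str:
    """Remove, or replace with `replacement`, every control character in text."""
    return CONTROL_CHARS.sub(replacement, text)


cleaned = strip_control_chars("Montr\x1aal")       # control char dropped
marked = strip_control_chars("Montr\x1aal", "?")   # control char replaced
```

Stripping is lossy: the "é" is gone either way, so this only makes the string safe to store, it does not recover the original text.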

Ché


Archive: 
http://www.houseoffusion.com/groups/cf-talk/message.cfm/messageid:355762


Re: Need RegEx help for unknown/odd characters.

2013-05-17 Thread Kris Jones

How about making sure the charset is set properly? I have seen this kind of
thing when the charset is not set to UTF-8. This can be a problem either when
ingesting the feed or when the feed is written.
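
To make the charset point concrete, here is a small Python sketch of how a mismatch garbles "Montréal". It assumes the feed bytes are UTF-8 and are wrongly read as Latin-1; the actual encodings in this case are not known, and the helper name repair_mojibake is hypothetical.

```python
# The feed author writes the text as UTF-8 bytes.
raw_bytes = "Montréal".encode("utf-8")

# Reading those bytes with the wrong charset turns the two-byte "é" into two characters.
wrong = raw_bytes.decode("latin-1")

# Reading them with the matching charset gives the text back intact.
right = raw_bytes.decode("utf-8")


def repair_mojibake(text: str) -> str:
    """Undo a UTF-8-read-as-Latin-1 mix-up (only valid under that assumption)."""
    return text.encode("latin-1").decode("utf-8")


print(wrong)                   # MontrÃ©al
print(right)                   # Montréal
print(repair_mojibake(wrong))  # Montréal
```

The same applies at every hop: the HTTP fetch, the XML parse, the database connection, and the table's character set all need to agree.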



On Fri, May 17, 2013 at 2:25 PM, Che Vilnonis ch...@asitv.com wrote:


 Nathan, the problem is that within the feed itself, the characters don't
 have accents. When output to a browser, the funky chars display on screen
 as odd geometric shapes, like an upright rectangle. They're like odd ASCII
 chars that I have only seen a few times before. In the end, I can't insert
 them into the MySQL db.

 Ché


Archive: 
http://www.houseoffusion.com/groups/cf-talk/message.cfm/messageid:355763


Need RegEx help for unknown/odd characters.

2013-05-16 Thread Che Vilnonis

Good morning. Occasionally, when parsing an RSS feed, I get RSS data I cannot
parse. See the screenshot linked below (text highlighted in green). In this
example, Montreal should return Montréal but does not. What regex could I
use to remove or replace such odd characters?

http://www.asitv.com/images/_funkychar.jpg

Thanks, Che




Archive: 
http://www.houseoffusion.com/groups/cf-talk/message.cfm/messageid:355733