Hi Bill,

Ok, so maybe I can blame JRoller ;-)

I'd still like to find a way to enter Unicode characters into the title and body of an entry directly, though. From looking at the source of other Roller blog pages and RSS feeds I can see that they're getting UTF-8 encoding glyphs in there somehow... I'd really rather not have to type HTML entity escapes by hand!

Unfortunately I'm a total newcomer to this, so there may be an obvious way.

On a related note: Let's say I wanted to embed some code examples in a blog entry or title, for example:

Enum <E extends Enum<E>>

The left & right angle brackets are HTML chars, so I'd have to escape them. Doing it manually is a pain. If I want to put them in a title then I get complaints from RSS validators... Is there any way to embed HTML characters in a blog title that won't generate validation warnings? I can live with JRoller not displaying things correctly ;-)

Thanks again,

Sean

On Apr 21, 2006, at 12:15 AM, Bill Tribley wrote:

Hi Sean,
It sounds like the jroller blog viewer is not properly set up to handle true utf-8 encoding. When you put the character into HTML then you are causing it to pass through jroller, your browser is picking up on the utf-8, recognizing Chinese and displaying it, as long as all the characters make it.
Bill


On Thu, 20 Apr 2006 23:11:26 -0400, Sean Montgomery wrote:
Greetings,

I use Safari under OS X 10.4.6 to access my Roller blog at www.jroller.com
using, well, whatever version of Roller they're using today ;-)

I'd like to be able to use Unicode characters in the titles of my blog entries. If I use OS X's built in Chinese input method editor to enter a Chinese character in an entry title via the Edit Entry screen I'll see the correct character show up in the entry listing (to the right of the Edit Entry screen under Recent Entries) but all I get is a '?' when I view the
blog.

If I try entering an HTML entity like "&#x80d6;" into the title then I see those seven characters under Recent Entries, but I do see the correct single (Chinese) character when I view the blog. The correct character also shows up in the RSS feed when viewed via Safari. The trouble comes when I try to view the new entry via the front page of the JRoller website - it displays
"&#x80d6;" explicitly.

Sure, I could just blame JRoller ;-) Instead I pointed feedvalidator.org at my RSS feed and validated it - they flagged the "&amp;#x80d6;" and gave a warning that the title should not contain HTML and that I shouldn't be surprised if some viewers strip the characters or leave them there - like I
saw at JRoller.

I've seen Roller blogs that contain entries with titles containing explicit
Unicode characters - I've check out their RSS source (using Safari's
View:View Source command). Their RSS feed source, like mine, contains
charset="UTF-8", so that makes sense.

So what am I doing wrong? It looks like there's no way for me to input Unicode via the Chinese input method using the existing web- based Roller interface that JRoller has configured. Is it a configuration issue? Or do I need to use an alternative method of posting entries that uses the Blogger or
MetaWeblog APIs?

I didn't find anything useful on the Roller user guides and wiki about
this... Any suggestions on where to turn?

??!

Reply via email to