Re: [sphinx-dev] Re: Some characters rendering wrongly in HTML output</span></a></span> </h1> <p class="darkgray font13"> <span class="sender pipe"><a href="/search?l=sphinx-dev@googlegroups.com&q=from:%22Friedrich+Romstedt%22" rel="nofollow"><span itemprop="author" itemscope itemtype="http://schema.org/Person"><span itemprop="name">Friedrich Romstedt</span></span></a></span> <span class="date"><a href="/search?l=sphinx-dev@googlegroups.com&q=date:20111125" rel="nofollow">Fri, 25 Nov 2011 00:17:49 -0800</a></span> </p> </div> <div itemprop="articleBody" class="msgBody"> <!--X-Body-of-Message--> <pre>Am 25.11.2011 um 08:44 schrieb Guenter Milde <mi...@users.sf.net>: > On 2011-11-24, Friedrich Romstedt wrote: >> Hi, > >> I'm experiencing some problem with Sphinx 1.1.2 (and also an earlier >> version from July), that some characters in my HTML <title> are >> occuring as strange Unicode character sequences in the HTML. Here's >> an example: > >> <a rel="nofollow" href="http://www.roentgen.physik.uni-goettingen.de/~fromstedt/">http://www.roentgen.physik.uni-goettingen.de/~fromstedt/</a> > >> Watch the title displayed in the browser (not the headline, but the >> title). The two-character sequence is hardcoded like this in the >> HTML, apparently (inspection with Firefox "Show Source"). > > Show source reveals: > > <title>Welcome to Friedrich Romstedt’s IRP Home! &mdash; I > RP - Friedrich Romstedt</title> > > Is this the same as in your *.html source (if you look at it in a text > editor) or is there some additional change introduced on its way over the web? > > Looks like an encoding problem.</pre><pre> Yes, that's what I've done too, with the same diagnosis. >> I have no idea how to boil this down since it's just the <title> which >> is affected. > > There is a similar problem with the "pop-up > anchors" of the section headings. If I move the mouse over a section > heading, I see something like: > > Welcome to Friedrich Romstedt’s IRP Home!¶ Precisely. Forgot to mention that. >> In that example, the two-char sequence results from an apostroph ' . > > AFAIK, Sphinx uses "smartypants" to convert > > Character ' (39, 0x27) APOSTROPHE > * neutral (vertical) glyph with mixed usage > > to > > Character '’' (8217, 0x2019) > 2019 RIGHT SINGLE QUOTATION MARK > = single comma quotation mark > * this is the preferred character to use for apostrophe > > At least this is what I see in the first section heading. > > Maybe you can disable this replacement. If I know how; although I would prefer a real fix / explanation. >> I think that my vim on my institute computer writes out just plain >> ASCII (i.e., not 16bit Unicode). ``file`` says:: > >> ASCII English text > > If it really writes just ASCII, how do you write Röntgen in your source > files? I don't. I write Roentgen. I switched my keyboard to US layout since DE layout is terrible for programming too. >> If anyone has an idea how to track this down, it's very much welcome. > > In "Romstedt’s", the three bytes of the UTF-8 representation of the > Character '’' (8217, 0x2019) RIGHT SINGLE QUOTATION MARK > are treated as three characters. Three bytes? Meaning ' is an digraph even though it is Unicoded? I remember there are several possibilities to encode some characters in Unicode. > How do you specify the title? Currently by the headline. I will try the .. title:: next. > What is the locale encoding? Mixed. The system has german i18n but I changed keyboard layout to US so I guess LOCALE is us_US.something. I remember MATLAB complaining about inconsistencies upon startup. I will investigate it. CU Friedrich -- You received this message because you are subscribed to the Google Groups "sphinx-dev" group. To post to this group, send email to sphinx-dev@googlegroups.com. To unsubscribe from this group, send email to sphinx-dev+unsubscr...@googlegroups.com. For more options, visit this group at <a rel="nofollow" href="http://groups.google.com/group/sphinx-dev?hl=en">http://groups.google.com/group/sphinx-dev?hl=en</a>. </pre> </div> <div class="msgButtons margintopdouble"> <ul class="overflow"> <li class="msgButtonItems"><a class="button buttonleft " accesskey="p" href="msg05275.html">Previous message</a></li> <li class="msgButtonItems textaligncenter"><a class="button" accesskey="c" href="index.html#05276">View by thread</a></li> <li class="msgButtonItems textaligncenter"><a class="button" accesskey="i" href="maillist.html#05276">View by date</a></li> <li class="msgButtonItems textalignright"><a class="button buttonright " accesskey="n" href="msg05277.html">Next message</a></li> </ul> </div> <a name="tslice"></a> <div class="tSliceList margintopdouble"> <ul class="icons monospace"> <li class="icons-email"><span class="subject"><a href="msg05269.html">[sphinx-dev] Some characters rendering wrongly in HTML ...</a></span> <span class="sender italic">Friedrich Romstedt</span></li> <li><ul> <li class="icons-email"><span class="subject"><a href="msg05273.html">[sphinx-dev] Re: Some characters rendering wrongly...</a></span> <span class="sender italic">Viktor Haag</span></li> <li><ul> <li class="icons-email"><span class="subject"><a href="msg05274.html">Re: [sphinx-dev] Re: Some characters rendering...</a></span> <span class="sender italic">Friedrich Romstedt</span></li> </ul></li> <li class="icons-email"><span class="subject"><a href="msg05275.html">[sphinx-dev] Re: Some characters rendering wrongly...</a></span> <span class="sender italic">Guenter Milde</span></li> <li><ul> <li class="icons-email tSliceCur"><span class="subject">Re: [sphinx-dev] Re: Some characters rendering...</span> <span class="sender italic">Friedrich Romstedt</span></li> <li><ul> <li class="icons-email"><span class="subject"><a href="msg05277.html">Re: [sphinx-dev] Re: Some characters rende...</a></span> <span class="sender italic">Friedrich Romstedt</span></li> <li class="icons-email"><span class="subject"><a href="msg05278.html">Re: [sphinx-dev] Re: Some characters rende...</a></span> <span class="sender italic">Friedrich Romstedt</span></li> <li class="icons-email"><span class="subject"><a href="msg05279.html">[sphinx-dev] Re: Some characters rendering...</a></span> <span class="sender italic">Guenter Milde</span></li> <li><ul> <li class="icons-email"><span class="subject"><a href="msg05280.html">[sphinx-dev] Re: Some characters rende...</a></span> <span class="sender italic">Guenter Milde</span></li> <li><ul> <li class="icons-email"><span class="subject"><a href="msg05281.html">Re: [sphinx-dev] Re: Some charact...</a></span> <span class="sender italic">Georg Brandl</span></li> <li class="icons-email"><span class="subject"><a href="msg05282.html">Re: [sphinx-dev] Re: Some charact...</a></span> <span class="sender italic">Friedrich Romstedt</span></li> </ul></li> </ul></li> <li class="icons-email"><span class="subject"><a href="msg05288.html">AW: [sphinx-dev] Re: Some characters rende...</a></span> <span class="sender italic">Lothar Braun</span></li> </ul> </ul> </ul> </ul> </div> <div class="overflow msgActions margintopdouble"> <div class="msgReply" > <h2> Reply via email to </h2> <form method="POST" action="/mailto.php"> <input type="hidden" name="subject" value="Re: [sphinx-dev] Re: Some characters rendering wrongly in HTML <title> output"> <input type="hidden" name="msgid" value="09EF1A4F-B3DE-434E-B5A5-EB68A9822AEF@gmail.com"> <input type="hidden" name="relpath" value="sphinx-dev@googlegroups.com/msg05276.html"> <input type="submit" value=" Friedrich Romstedt "> </form> </div> </div> </div> <div class="aside" role="complementary"> <div class="logo"> <a href="/"><img src="/logo.png" width=247 height=88 alt="The Mail Archive"></a> </div> <form class="overflow" action="/search" method="get"> <input type="hidden" name="l" value="sphinx-dev@googlegroups.com"> <label class="hidden" for="q">Search the site</label> <input class="submittext" type="text" id="q" name="q" placeholder="Search sphinx-dev"> <input class="submitbutton" name="submit" type="image" src="/submit.png" alt="Submit"> </form> <div class="nav margintop" id="nav" role="navigation"> <ul class="icons font16"> <li class="icons-home"><a href="/">The Mail Archive home</a></li> <li class="icons-list"><a href="/sphinx-dev@googlegroups.com/">sphinx-dev - all messages</a></li> <li class="icons-about"><a href="/sphinx-dev@googlegroups.com/info.html">sphinx-dev - about the list</a></li> <li class="icons-expand"><a href="/search?l=sphinx-dev@googlegroups.com&q=subject:%22Re%5C%3A+%5C%5Bsphinx%5C-dev%5C%5D+Re%5C%3A+Some+characters+rendering+wrongly+in+HTML+%3Ctitle%3E+output%22&o=newest&f=1" title="e" id="e">Expand</a></li> <li class="icons-prev"><a href="msg05275.html" title="p">Previous message</a></li> <li class="icons-next"><a href="msg05277.html" title="n">Next message</a></li> </ul> </div> <div class="listlogo margintopdouble"> </div> <div class="margintopdouble"> </div> </div> </div> <div class="footer" role="contentinfo"> <ul> <li><a href="/">The Mail Archive home</a></li> <li><a href="/faq.html#newlist">Add your mailing list</a></li> <li><a href="/faq.html">FAQ</a></li> <li><a href="/faq.html#support">Support</a></li> <li><a href="/faq.html#privacy">Privacy</a></li> <li class="darkgray">09EF1A4F-B3DE-434E-B5A5-EB68A9822AEF@gmail.com</li> </ul> </div> </body> </html>