On Thu, Jan 09, 2025 at 11:15:23AM -0700, Brian Inglis wrote:
> Hi folks,
>
> Many sites are now using Character Entity Names defined under
>
> https://www.w3.org/TR/xml-entity-names/

https://www.w3.org/TR/xml-entity-names/#source

    Note: unicode.xml is over 5MB in size and may not really be
    suitable for direct viewing in a browser. You may prefer to save
    the file rather than follow the above link to unicode.xml in a
    browser.

(sounds like a lot of data - bigger than the current lynx executable)

> https://github.com/w3c/xml-entities/blob/gh-pages/unicode.xml
>
> https://github.com/w3c/xml-entities/raw/refs/heads/gh-pages/unicode.xml
>
> https://html.spec.whatwg.org/multipage/named-characters.html#named-character-references
>
> https://en.wikipedia.org/wiki/List_of_XML_and_HTML_character_entity_references
>
> which currently render verbatim rather than being ignored or blanked, for
> example:
>
> $ lynx -dump -nonumbers -nolist libera.chat | grep '&[^;]\+;'
> Libera.​Chat
> Libera.​Chat
>
> Please consider updating your entities from what I can see in your
> snapshots, or provide an innocuous default?
>
> [Not subscribed, please CC:]
>
> --
> Take care. Thanks, Brian Inglis              Calgary, Alberta, Canada
>
> La perfection est atteinte                   Perfection is achieved
> non pas lorsqu'il n'y a plus rien à ajouter  not when there is no more to add
> mais lorsqu'il n'y a plus rien à retirer     but when there is no more to cut
>                                 -- Antoine de Saint-Exupéry

--
Thomas E. Dickey <dic...@invisible-island.net>
https://invisible-island.net