[issue16245] Update html.entities.html5 dictionary and parseentities.py

2019-11-08 Thread Terry J. Reedy


Terry J. Reedy  added the comment:

According to git blame, the html5 dict in 
https://github.com/python/cpython/blob/master/Lib/html/entities.py has changed 
in 7 years.  On the other hand, the standard on which it is based, 
https://html.spec.whatwg.org/multipage/named-characters.html, was last revised 
yesterday, and I presume several other times since.  On the third hand, I just 
ran the update script and there was no change to entities.py, so maybe is has 
been run with every release.

Should a comment be added to the file listing the unicode source and the update 
script?

--
nosy: +terry.reedy

___
Python tracker 

___
___
Python-bugs-list mailing list
Unsubscribe: 
https://mail.python.org/mailman/options/python-bugs-list/archive%40mail-archive.com



[issue16245] Update html.entities.html5 dictionary and parseentities.py

2014-03-16 Thread Éric Araujo

Éric Araujo added the comment:

I just ran the script:

$ Tools/scripts/parse_html5_entities.py 
The current dictionary is updated.

This is done :‑)

--
resolution:  - fixed
status: open - closed

___
Python tracker rep...@bugs.python.org
http://bugs.python.org/issue16245
___
___
Python-bugs-list mailing list
Unsubscribe: 
https://mail.python.org/mailman/options/python-bugs-list/archive%40mail-archive.com



[issue16245] Update html.entities.html5 dictionary and parseentities.py

2014-03-16 Thread Éric Araujo

Éric Araujo added the comment:

BTW this message does not mean that the dictionary was just updated, but that 
is was already up to date.

--

___
Python tracker rep...@bugs.python.org
http://bugs.python.org/issue16245
___
___
Python-bugs-list mailing list
Unsubscribe: 
https://mail.python.org/mailman/options/python-bugs-list/archive%40mail-archive.com



[issue16245] Update html.entities.html5 dictionary and parseentities.py

2014-03-15 Thread Larry Hastings

Larry Hastings added the comment:

Was this done?  I'm tagging 3.4.0 final soon.

--

___
Python tracker rep...@bugs.python.org
http://bugs.python.org/issue16245
___
___
Python-bugs-list mailing list
Unsubscribe: 
https://mail.python.org/mailman/options/python-bugs-list/archive%40mail-archive.com



[issue16245] Update html.entities.html5 dictionary and parseentities.py

2013-08-06 Thread Ezio Melotti

Ezio Melotti added the comment:

I run Tools/scripts/parse_html5_entities.py and it says that The current 
dictionary is updated..  We should check this again, and eventually close the 
issue, when 3.4 is released.

--

___
Python tracker rep...@bugs.python.org
http://bugs.python.org/issue16245
___
___
Python-bugs-list mailing list
Unsubscribe: 
http://mail.python.org/mailman/options/python-bugs-list/archive%40mail-archive.com



[issue16245] Update html.entities.html5 dictionary and parseentities.py

2013-08-01 Thread Larry Hastings

Larry Hastings added the comment:

I'm downgrading this to deferred blocker.  We'll make sure it happens before 
Python 3.4.0, but there's no need to hold up Python 3.4a1 for this.

--
priority: release blocker - deferred blocker

___
Python tracker rep...@bugs.python.org
http://bugs.python.org/issue16245
___
___
Python-bugs-list mailing list
Unsubscribe: 
http://mail.python.org/mailman/options/python-bugs-list/archive%40mail-archive.com



[issue16245] Update html.entities.html5 dictionary and parseentities.py

2013-07-31 Thread Larry Hastings

Larry Hastings added the comment:

This is still marked as a release blocker.  I guess this is a tickler for 
Ezio to go check and see if there's a new entities file.

Ezio: can you get this issue closed or downgraded in the next two days?

--

___
Python tracker rep...@bugs.python.org
http://bugs.python.org/issue16245
___
___
Python-bugs-list mailing list
Unsubscribe: 
http://mail.python.org/mailman/options/python-bugs-list/archive%40mail-archive.com



[issue16245] Update html.entities.html5 dictionary and parseentities.py

2013-02-20 Thread Ramchandra Apte

Ramchandra Apte added the comment:

Shouldn't this be deferred blocker?

--
nosy: +Ramchandra Apte

___
Python tracker rep...@bugs.python.org
http://bugs.python.org/issue16245
___
___
Python-bugs-list mailing list
Unsubscribe: 
http://mail.python.org/mailman/options/python-bugs-list/archive%40mail-archive.com



[issue16245] Update html.entities.html5 dictionary and parseentities.py

2013-02-10 Thread Antoine Pitrou

Changes by Antoine Pitrou pit...@free.fr:


--
versions:  -Python 3.3

___
Python tracker rep...@bugs.python.org
http://bugs.python.org/issue16245
___
___
Python-bugs-list mailing list
Unsubscribe: 
http://mail.python.org/mailman/options/python-bugs-list/archive%40mail-archive.com



[issue16245] Update html.entities.html5 dictionary and parseentities.py

2012-11-01 Thread Serhiy Storchaka

Changes by Serhiy Storchaka storch...@gmail.com:


--
nosy:  -serhiy.storchaka

___
Python tracker rep...@bugs.python.org
http://bugs.python.org/issue16245
___
___
Python-bugs-list mailing list
Unsubscribe: 
http://mail.python.org/mailman/options/python-bugs-list/archive%40mail-archive.com



[issue16245] Update html.entities.html5 dictionary and parseentities.py

2012-10-23 Thread Ezio Melotti

Ezio Melotti added the comment:

I think it's ok to have a separate file rather than patching the existing one 
(see attached patch).  If the old script is not used anymore it could be 
removed, otherwise we could just leave it there.

--
stage: needs patch - patch review

___
Python tracker rep...@bugs.python.org
http://bugs.python.org/issue16245
___
___
Python-bugs-list mailing list
Unsubscribe: 
http://mail.python.org/mailman/options/python-bugs-list/archive%40mail-archive.com



[issue16245] Update html.entities.html5 dictionary and parseentities.py

2012-10-23 Thread Ezio Melotti

Changes by Ezio Melotti ezio.melo...@gmail.com:


Added file: http://bugs.python.org/file27670/issue16245-2.diff

___
Python tracker rep...@bugs.python.org
http://bugs.python.org/issue16245
___
___
Python-bugs-list mailing list
Unsubscribe: 
http://mail.python.org/mailman/options/python-bugs-list/archive%40mail-archive.com



[issue16245] Update html.entities.html5 dictionary and parseentities.py

2012-10-23 Thread Iuliia Proskurnia

Iuliia Proskurnia added the comment:

Version with --patch to modify Lib/html/entities.py automatically

--
nosy: +iuliia.proskurnia
Added file: http://bugs.python.org/file27676/issue16245-3.diff

___
Python tracker rep...@bugs.python.org
http://bugs.python.org/issue16245
___
___
Python-bugs-list mailing list
Unsubscribe: 
http://mail.python.org/mailman/options/python-bugs-list/archive%40mail-archive.com



[issue16245] Update html.entities.html5 dictionary and parseentities.py

2012-10-23 Thread Roundup Robot

Roundup Robot added the comment:

New changeset dd8b969d7459 by Ezio Melotti in branch 'default':
#16245: add a script to generate the html.entities.html5 dict.
http://hg.python.org/cpython/rev/dd8b969d7459

--
nosy: +python-dev

___
Python tracker rep...@bugs.python.org
http://bugs.python.org/issue16245
___
___
Python-bugs-list mailing list
Unsubscribe: 
http://mail.python.org/mailman/options/python-bugs-list/archive%40mail-archive.com



[issue16245] Update html.entities.html5 dictionary and parseentities.py

2012-10-23 Thread Roundup Robot

Roundup Robot added the comment:

New changeset 1eb1c6942ac8 by Ezio Melotti in branch '3.3':
#16245: Fix the value of a few entities in html.entities.html5.
http://hg.python.org/cpython/rev/1eb1c6942ac8

New changeset 70fab10cd542 by Ezio Melotti in branch 'default':
#16245: merge with 3.3.
http://hg.python.org/cpython/rev/70fab10cd542

--

___
Python tracker rep...@bugs.python.org
http://bugs.python.org/issue16245
___
___
Python-bugs-list mailing list
Unsubscribe: 
http://mail.python.org/mailman/options/python-bugs-list/archive%40mail-archive.com



[issue16245] Update html.entities.html5 dictionary and parseentities.py

2012-10-23 Thread Roundup Robot

Roundup Robot added the comment:

New changeset fb80df16c4ff by Ezio Melotti in branch 'default':
Add Misc/NEWS entry for dd8b969d7459/#16245.
http://hg.python.org/cpython/rev/fb80df16c4ff

--

___
Python tracker rep...@bugs.python.org
http://bugs.python.org/issue16245
___
___
Python-bugs-list mailing list
Unsubscribe: 
http://mail.python.org/mailman/options/python-bugs-list/archive%40mail-archive.com



[issue16245] Update html.entities.html5 dictionary and parseentities.py

2012-10-23 Thread Ezio Melotti

Ezio Melotti added the comment:

I now committed an improved version of the script (thanks Iuliia!) and updated 
the html.entities.html5 dictionary accordingly.

I'm leaving this open because we will have to check if the dictionary is still 
updated before the release of Python 3.4.

--
components: +Library (Lib)
stage: patch review - committed/rejected

___
Python tracker rep...@bugs.python.org
http://bugs.python.org/issue16245
___
___
Python-bugs-list mailing list
Unsubscribe: 
http://mail.python.org/mailman/options/python-bugs-list/archive%40mail-archive.com



[issue16245] Update html.entities.html5 dictionary and parseentities.py

2012-10-19 Thread Éric Araujo

Éric Araujo added the comment:

I say replace the code.  HTML 4.01 won’t be updated.

--
nosy: +eric.araujo, georg.brandl, larry

___
Python tracker rep...@bugs.python.org
http://bugs.python.org/issue16245
___
___
Python-bugs-list mailing list
Unsubscribe: 
http://mail.python.org/mailman/options/python-bugs-list/archive%40mail-archive.com



[issue16245] Update html.entities.html5 dictionary and parseentities.py

2012-10-16 Thread Ezio Melotti

New submission from Ezio Melotti:

A JSON file containing all the HTML5 entities is now available at 
http://dev.w3.org/html5/spec/entities.json.
I tested from the interpreter to see if it matches the values in 
html.entities.html5 and there are a dozen of entities that need to be updated:

 s = json.load(open('entities.json'))
 from html.entities import html5
 for (k1,i1),(k2,i2) in zip(sorted(s.items()), sorted(html5.items())):
...   if i1['characters'] != i2: (k1, k2, i1['characters'], i2, 
i1['codepoints'], list(map(ord, i2)))
... 
('DotDot;', 'DotDot;', '⃜', '◌⃜', [8412], [9676, 8412])
('DownBreve;', 'DownBreve;', '̑', '◌̑', [785], [9676, 785])
('LeftAngleBracket;', 'LeftAngleBracket;', '⟨', '〈', [10216], [9001])
('NewLine;', 'NewLine;', '\n', '␊', [10], [9226])
('RightAngleBracket;', 'RightAngleBracket;', '⟩', '〉', [10217], [9002])
('Tab;', 'Tab;', '\t', '␉', [9], [9225])
('TripleDot;', 'TripleDot;', '⃛', '◌⃛', [8411], [9676, 8411])
('lang;', 'lang;', '⟨', '〈', [10216], [9001])
('langle;', 'langle;', '⟨', '〈', [10216], [9001])
('rang;', 'rang;', '⟩', '〉', [10217], [9002])
('rangle;', 'rangle;', '⟩', '〉', [10217], [9002])
('tdot;', 'tdot;', '⃛', '◌⃛', [8411], [9676, 8411])

The Tools/scripts/parseentities.py script should also be updated (or possibly a 
new, separate script should be added), so it can be used to generate the html5 
dict.  I'm setting this as release blocker so that the update gets done before 
the release (other values might change in the meanwhile).

--
assignee: ezio.melotti
messages: 173021
nosy: ezio.melotti
priority: release blocker
severity: normal
stage: needs patch
status: open
title: Update html.entities.html5 dictionary and parseentities.py
type: behavior
versions: Python 3.3, Python 3.4

___
Python tracker rep...@bugs.python.org
http://bugs.python.org/issue16245
___
___
Python-bugs-list mailing list
Unsubscribe: 
http://mail.python.org/mailman/options/python-bugs-list/archive%40mail-archive.com



[issue16245] Update html.entities.html5 dictionary and parseentities.py

2012-10-16 Thread Kushal Das

Changes by Kushal Das kushal...@gmail.com:


--
nosy: +kushaldas

___
Python tracker rep...@bugs.python.org
http://bugs.python.org/issue16245
___
___
Python-bugs-list mailing list
Unsubscribe: 
http://mail.python.org/mailman/options/python-bugs-list/archive%40mail-archive.com



[issue16245] Update html.entities.html5 dictionary and parseentities.py

2012-10-16 Thread Ezio Melotti

Changes by Ezio Melotti ezio.melo...@gmail.com:


--
keywords: +patch
Added file: http://bugs.python.org/file27593/issue16245.diff

___
Python tracker rep...@bugs.python.org
http://bugs.python.org/issue16245
___
___
Python-bugs-list mailing list
Unsubscribe: 
http://mail.python.org/mailman/options/python-bugs-list/archive%40mail-archive.com



[issue16245] Update html.entities.html5 dictionary and parseentities.py

2012-10-16 Thread Serhiy Storchaka

Changes by Serhiy Storchaka storch...@gmail.com:


--
nosy: +serhiy.storchaka

___
Python tracker rep...@bugs.python.org
http://bugs.python.org/issue16245
___
___
Python-bugs-list mailing list
Unsubscribe: 
http://mail.python.org/mailman/options/python-bugs-list/archive%40mail-archive.com