[Bug 54617] Replace Tidy with a library that doesn't suck
https://bugzilla.wikimedia.org/show_bug.cgi?id=54617 C. Scott Ananian changed: What|Removed |Added CC||canan...@wikimedia.org --- Comment #19 from C. Scott Ananian --- At some point I would like to replace tidy with a API-compatible binary which uses the standard HTML5 parser mechanism. It's on my list of 'free time projects'. There are lots of HTML5 parser libraries now. -- You are receiving this mail because: You are the assignee for the bug. You are on the CC list for the bug. ___ Wikibugs-l mailing list Wikibugs-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikibugs-l
[Bug 54617] Replace Tidy with a library that doesn't suck
https://bugzilla.wikimedia.org/show_bug.cgi?id=54617 --- Comment #18 from Gabriel Wicke --- (In reply to Bartosz Dziewoński from comment #17) > $30 is not within the reach of everyone. There's also the fact that you have > to have a credit card to get any reputable paid hosting, and that's also not > a given in the whole world. Depending on your use case there are also free options like Wikia and other non-profit options without ads. Free shared hosting is not automatically going to be more reputable than free VM hosting, nor do I see systematic differences in payment methods. You are free to work on MediaWiki on shared hosting of course. All I'm saying is that there are few remaining reasons for us to - spend major resources on shared hosting support, and - let it hold back our architectural development at the expense of security, performance and maintainability -- You are receiving this mail because: You are the assignee for the bug. You are on the CC list for the bug. ___ Wikibugs-l mailing list Wikibugs-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikibugs-l
[Bug 54617] Replace Tidy with a library that doesn't suck
https://bugzilla.wikimedia.org/show_bug.cgi?id=54617 --- Comment #17 from Bartosz Dziewoński --- (In reply to Gabriel Wicke from comment #16) > Nor can you typically run tidy there. Citation needed. http://www.php.net/manual/en/book.tidy.php It's definitely more likely to be accessible than having node and being able to shell out. > Virtual machines are really cheap > these days (starting at about $30 / year), so cost is no longer the issue > that prevents people from installing better tools for the job. $30 is not within the reach of everyone. There's also the fact that you have to have a credit card to get any reputable paid hosting, and that's also not a given in the whole world. -- You are receiving this mail because: You are the assignee for the bug. You are on the CC list for the bug. ___ Wikibugs-l mailing list Wikibugs-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikibugs-l
[Bug 54617] Replace Tidy with a library that doesn't suck
https://bugzilla.wikimedia.org/show_bug.cgi?id=54617 --- Comment #16 from Gabriel Wicke --- (In reply to Bartosz Dziewoński from comment #11) > (In reply to comment #10) > > Which issues do you see apart from rendering quality / compatibility? > > Compatibility/availability is the single showstopper issue here. I can't run > server-side JavaScript on most free hostings. Nor can you typically run tidy there. Virtual machines are really cheap these days (starting at about $30 / year), so cost is no longer the issue that prevents people from installing better tools for the job. Missing packaging is another point, but that is also being addressed (parsoid is now debianized). In any case, we are working on being ready to start using Parsoid HTML for normal page views this summer. We might not want to maintain the PHP parser in the longer term, and are thus less likely to spend much effort on replacing tidy right now. -- You are receiving this mail because: You are the assignee for the bug. You are on the CC list for the bug. ___ Wikibugs-l mailing list Wikibugs-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikibugs-l
[Bug 54617] Replace Tidy with a library that doesn't suck
https://bugzilla.wikimedia.org/show_bug.cgi?id=54617 --- Comment #15 from Erwin Dokter --- Fount a lilbrary called HTML Purifier, but that's more of a 'evil code' filter with some 'Tidy inspired' features. Probably not what we want. There is also tidy-html5 [1], a fork that aims for full HTML5 support. [1] https://github.com/w3c/tidy-html5 -- You are receiving this mail because: You are the assignee for the bug. You are on the CC list for the bug. ___ Wikibugs-l mailing list Wikibugs-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikibugs-l
[Bug 54617] Replace Tidy with a library that doesn't suck
https://bugzilla.wikimedia.org/show_bug.cgi?id=54617 Erwin Dokter changed: What|Removed |Added CC||er...@darcoury.nl --- Comment #14 from Erwin Dokter --- I run into too many problems because of Tidy. It's main flaw is that it is not compatible with HTML5; it hasn't been updated since 2008(!). Most problems stem from Tidy not allowing any block elements inside inline elements (which is allowed in HTML5), and kicks them out which results in broken HTML, even though its goal is to prevent exactly that. Is there no library that has the same functionality and is up to date? -- You are receiving this mail because: You are the assignee for the bug. You are on the CC list for the bug. ___ Wikibugs-l mailing list Wikibugs-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikibugs-l
[Bug 54617] Replace Tidy with a library that doesn't suck
https://bugzilla.wikimedia.org/show_bug.cgi?id=54617 --- Comment #13 from Bartosz Dziewoński --- MediaWiki is intended to be both, if you ask me. I don't see how your question is relevant to the bug, since I am not proposing to make it a hodge-podge. -- You are receiving this mail because: You are the assignee for the bug. You are on the CC list for the bug. ___ Wikibugs-l mailing list Wikibugs-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikibugs-l
[Bug 54617] Replace Tidy with a library that doesn't suck
https://bugzilla.wikimedia.org/show_bug.cgi?id=54617 --- Comment #12 from James Forrester --- (In reply to comment #11) > (In reply to comment #10) > > Which issues do you see apart from rendering quality / compatibility? > > Compatibility/availability is the single showstopper issue here. I can't run > server-side JavaScript on most free hostings. So it's back to the policy question of what MediaWiki is intended to be - a great wiki for large- and medium-scale wikis, or a hodge-podge of tools which are limited by ease of download-the-zip-file installation over a proper management tool, rather than by what is best for users? -- You are receiving this mail because: You are the assignee for the bug. You are on the CC list for the bug. ___ Wikibugs-l mailing list Wikibugs-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikibugs-l
[Bug 54617] Replace Tidy with a library that doesn't suck
https://bugzilla.wikimedia.org/show_bug.cgi?id=54617 Bartosz Dziewoński changed: What|Removed |Added See Also||https://bugzilla.wikimedia. ||org/show_bug.cgi?id=53784 Depends on|53784 | -- You are receiving this mail because: You are the assignee for the bug. You are on the CC list for the bug. ___ Wikibugs-l mailing list Wikibugs-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikibugs-l
[Bug 54617] Replace Tidy with a library that doesn't suck
https://bugzilla.wikimedia.org/show_bug.cgi?id=54617 --- Comment #11 from Bartosz Dziewoński --- (In reply to comment #10) > Which issues do you see apart from rendering quality / compatibility? Compatibility/availability is the single showstopper issue here. I can't run server-side JavaScript on most free hostings. -- You are receiving this mail because: You are the assignee for the bug. You are on the CC list for the bug. ___ Wikibugs-l mailing list Wikibugs-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikibugs-l
[Bug 54617] Replace Tidy with a library that doesn't suck
https://bugzilla.wikimedia.org/show_bug.cgi?id=54617 Gabriel Wicke changed: What|Removed |Added Depends on||53784 -- You are receiving this mail because: You are the assignee for the bug. You are on the CC list for the bug. ___ Wikibugs-l mailing list Wikibugs-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikibugs-l
[Bug 54617] Replace Tidy with a library that doesn't suck
https://bugzilla.wikimedia.org/show_bug.cgi?id=54617 --- Comment #10 from Gabriel Wicke --- (In reply to comment #9) > Using Parsoid for page view is just not workable in short or mid term, no > matter how much we would want it. Which issues do you see apart from rendering quality / compatibility? -- You are receiving this mail because: You are the assignee for the bug. You are on the CC list for the bug. ___ Wikibugs-l mailing list Wikibugs-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikibugs-l
[Bug 54617] Replace Tidy with a library that doesn't suck
https://bugzilla.wikimedia.org/show_bug.cgi?id=54617 --- Comment #9 from Bartosz Dziewoński --- (In reply to comment #8) > VE is an HTML editor, so can be used without Parsoid. Well yeah, okay, this could work. VE, however, has certain software and hardware requirements not all computers meet. And there's the entire issue of "templating" which you dismissed with a single word, which I assume is currently not implemented without wikitext backing it. VE also currently doesn't work for, say, talk pages (and please don't mention Flow, it will not be ready by next summer) or edit summaries, and there are certain pieces of the interface which show raw source code like diffs (I don't think anybody has implemented rich text diffs yet in MediaWiki, but this is something I'd really like to see). Using Parsoid for page view is just not workable in short or mid term, no matter how much we would want it. /offtopic -- You are receiving this mail because: You are the assignee for the bug. You are on the CC list for the bug. ___ Wikibugs-l mailing list Wikibugs-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikibugs-l
[Bug 54617] Replace Tidy with a library that doesn't suck
https://bugzilla.wikimedia.org/show_bug.cgi?id=54617 --- Comment #8 from Gabriel Wicke --- (In reply to comment #7) > You can't possibly want to require every MediaWiki installation everywhere to > switch to editing raw HTML by hand (VE depends on Parsoid…). VE is an HTML editor, so can be used without Parsoid. -- You are receiving this mail because: You are the assignee for the bug. You are on the CC list for the bug. ___ Wikibugs-l mailing list Wikibugs-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikibugs-l
[Bug 54617] Replace Tidy with a library that doesn't suck
https://bugzilla.wikimedia.org/show_bug.cgi?id=54617 --- Comment #7 from Bartosz Dziewoński --- (In reply to comment #6) > Parsoid would only be needed for wikitext editing and -templating. HTML-only > wikis would basically serve XHTML straight from storage. You can't possibly want to require every MediaWiki installation everywhere to switch to editing raw HTML by hand (VE depends on Parsoid…). -- You are receiving this mail because: You are the assignee for the bug. You are on the CC list for the bug. ___ Wikibugs-l mailing list Wikibugs-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikibugs-l
[Bug 54617] Replace Tidy with a library that doesn't suck
https://bugzilla.wikimedia.org/show_bug.cgi?id=54617 --- Comment #6 from Gabriel Wicke --- Parsoid would only be needed for wikitext editing and -templating. HTML-only wikis would basically serve XHTML straight from storage. -- You are receiving this mail because: You are the assignee for the bug. You are on the CC list for the bug. ___ Wikibugs-l mailing list Wikibugs-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikibugs-l
[Bug 54617] Replace Tidy with a library that doesn't suck
https://bugzilla.wikimedia.org/show_bug.cgi?id=54617 Krinkle changed: What|Removed |Added CC||krinklem...@gmail.com Summary|Tidy sucks, needs to die, |Replace Tidy with a library |let's kill it |that doesn't suck --- Comment #5 from Krinkle --- Rephrasing summary to reflect that we don't intend to get rid of fixing unclosed tags, but Tidy specifically (we shouldn't kill Tidy without adding something else, so that makes the bug more "atomic") -- You are receiving this mail because: You are the assignee for the bug. You are on the CC list for the bug. ___ Wikibugs-l mailing list Wikibugs-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikibugs-l