[Bug 54617] Replace Tidy with a library that doesn't suck

2014-07-10 Thread bugzilla-daemon
https://bugzilla.wikimedia.org/show_bug.cgi?id=54617

C. Scott Ananian  changed:

   What |Removed |Added
   CC   |        |canan...@wikimedia.org

--- Comment #19 from C. Scott Ananian  ---
At some point I would like to replace tidy with an API-compatible binary which
uses the standard HTML5 parser mechanism.  It's on my list of 'free time
projects'.  There are lots of HTML5 parser libraries now.

-- 
You are receiving this mail because:
You are the assignee for the bug.
You are on the CC list for the bug.
___
Wikibugs-l mailing list
Wikibugs-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikibugs-l


[Bug 54617] Replace Tidy with a library that doesn't suck

2014-02-20 Thread bugzilla-daemon
https://bugzilla.wikimedia.org/show_bug.cgi?id=54617

--- Comment #18 from Gabriel Wicke  ---
(In reply to Bartosz Dziewoński from comment #17) 
> $30 is not within the reach of everyone. There's also the fact that you have
> to have a credit card to get any reputable paid hosting, and that's also not
> a given in the whole world.

Depending on your use case there are also free options like Wikia and other
non-profit options without ads. Free shared hosting is not automatically going
to be more reputable than free VM hosting, nor do I see systematic differences
in payment methods.

You are free to work on MediaWiki on shared hosting, of course. All I'm saying
is that there are few remaining reasons for us to

- spend major resources on shared hosting support, and
- let it hold back our architectural development at the expense of security,
performance and maintainability.



[Bug 54617] Replace Tidy with a library that doesn't suck

2014-02-20 Thread bugzilla-daemon
https://bugzilla.wikimedia.org/show_bug.cgi?id=54617

--- Comment #17 from Bartosz Dziewoński  ---
(In reply to Gabriel Wicke from comment #16)
> Nor can you typically run tidy there.

Citation needed. http://www.php.net/manual/en/book.tidy.php It's definitely
more likely to be accessible than having node and being able to shell out.


> Virtual machines are really cheap
> these days (starting at about $30 / year), so cost is no longer the issue
> that prevents people from installing better tools for the job.

$30 is not within the reach of everyone. There's also the fact that you have to
have a credit card to get any reputable paid hosting, and that's also not a
given in the whole world.



[Bug 54617] Replace Tidy with a library that doesn't suck

2014-02-20 Thread bugzilla-daemon
https://bugzilla.wikimedia.org/show_bug.cgi?id=54617

--- Comment #16 from Gabriel Wicke  ---
(In reply to Bartosz Dziewoński from comment #11)
> (In reply to comment #10)
> > Which issues do you see apart from rendering quality / compatibility?
> 
> Compatibility/availability is the single showstopper issue here. I can't run
> server-side JavaScript on most free hostings.

Nor can you typically run tidy there. Virtual machines are really cheap these
days (starting at about $30 / year), so cost is no longer the issue that
prevents people from installing better tools for the job. Missing packaging is
another point, but that is also being addressed (parsoid is now debianized).

In any case, we are working on being ready to start using Parsoid HTML for
normal page views this summer. We might not want to maintain the PHP parser in
the longer term, and are thus less likely to spend much effort on replacing
tidy right now.



[Bug 54617] Replace Tidy with a library that doesn't suck

2014-02-20 Thread bugzilla-daemon
https://bugzilla.wikimedia.org/show_bug.cgi?id=54617

--- Comment #15 from Erwin Dokter  ---
Found a library called HTML Purifier, but that's more of an 'evil code' filter
with some 'Tidy inspired' features. Probably not what we want.

There is also tidy-html5 [1], a fork that aims for full HTML5 support.

[1] https://github.com/w3c/tidy-html5



[Bug 54617] Replace Tidy with a library that doesn't suck

2014-02-20 Thread bugzilla-daemon
https://bugzilla.wikimedia.org/show_bug.cgi?id=54617

Erwin Dokter  changed:

   What |Removed |Added
   CC   |        |er...@darcoury.nl

--- Comment #14 from Erwin Dokter  ---
I run into too many problems because of Tidy. Its main flaw is that it is not
compatible with HTML5; it hasn't been updated since 2008(!). Most problems stem
from Tidy not allowing any block elements inside inline elements (which is
allowed in HTML5): it kicks them out, which results in broken HTML, even though
its goal is to prevent exactly that.

Is there no library that has the same functionality and is up to date?
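
To make the nesting problem concrete, here is a rough Python sketch that lists
block elements opened inside inline elements, i.e. the markup Tidy is said to
kick out. It is an illustration only, not Tidy's logic: the INLINE and BLOCK
sets are a small assumed sample, and NestingChecker and find_block_in_inline
are hypothetical names.

```python
from html.parser import HTMLParser

# Assumed, deliberately incomplete samples of inline and block elements.
INLINE = {"a", "b", "i", "em", "strong", "span", "small", "code"}
BLOCK = {"div", "p", "ul", "ol", "table", "blockquote", "h1", "h2", "h3"}


class NestingChecker(HTMLParser):
    """Record (inline, block) pairs where a block opens inside an inline."""

    def __init__(self):
        super().__init__()
        self.stack = []      # currently open INLINE/BLOCK elements
        self.findings = []   # list of (inline_tag, block_tag) tuples

    def handle_starttag(self, tag, attrs):
        if tag in BLOCK:
            # Look for the nearest open inline ancestor, if any.
            for open_tag in reversed(self.stack):
                if open_tag in INLINE:
                    self.findings.append((open_tag, tag))
                    break
        if tag in INLINE or tag in BLOCK:
            self.stack.append(tag)

    def handle_endtag(self, tag):
        # Ignore stray close tags; otherwise unwind to the matching open tag.
        if tag in self.stack:
            while self.stack.pop() != tag:
                pass


def find_block_in_inline(fragment):
    checker = NestingChecker()
    checker.feed(fragment)
    checker.close()
    return checker.findings
```

For example, find_block_in_inline('<a href="#"><div>x</div></a>') reports the
div inside the a, while '<div><span>x</span></div>' reports nothing.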



[Bug 54617] Replace Tidy with a library that doesn't suck

2013-11-08 Thread bugzilla-daemon
https://bugzilla.wikimedia.org/show_bug.cgi?id=54617

--- Comment #13 from Bartosz Dziewoński  ---
MediaWiki is intended to be both, if you ask me. I don't see how your question
is relevant to the bug, since I am not proposing to make it a hodge-podge.



[Bug 54617] Replace Tidy with a library that doesn't suck

2013-11-08 Thread bugzilla-daemon
https://bugzilla.wikimedia.org/show_bug.cgi?id=54617

--- Comment #12 from James Forrester  ---
(In reply to comment #11)
> (In reply to comment #10)
> > Which issues do you see apart from rendering quality / compatibility?
> 
> Compatibility/availability is the single showstopper issue here. I can't run
> server-side JavaScript on most free hostings.

So it's back to the policy question of what MediaWiki is intended to be - a
great wiki for large- and medium-scale wikis, or a hodge-podge of tools which
are limited by ease of download-the-zip-file installation over a proper
management tool, rather than by what is best for users?



[Bug 54617] Replace Tidy with a library that doesn't suck

2013-11-08 Thread bugzilla-daemon
https://bugzilla.wikimedia.org/show_bug.cgi?id=54617

Bartosz Dziewoński  changed:

   What       |Removed |Added
   See Also   |        |https://bugzilla.wikimedia.org/show_bug.cgi?id=53784
   Depends on |53784   |



[Bug 54617] Replace Tidy with a library that doesn't suck

2013-11-08 Thread bugzilla-daemon
https://bugzilla.wikimedia.org/show_bug.cgi?id=54617

--- Comment #11 from Bartosz Dziewoński  ---
(In reply to comment #10)
> Which issues do you see apart from rendering quality / compatibility?

Compatibility/availability is the single showstopper issue here. I can't run
server-side JavaScript on most free hostings.



[Bug 54617] Replace Tidy with a library that doesn't suck

2013-11-08 Thread bugzilla-daemon
https://bugzilla.wikimedia.org/show_bug.cgi?id=54617

Gabriel Wicke  changed:

   What       |Removed |Added
   Depends on |        |53784



[Bug 54617] Replace Tidy with a library that doesn't suck

2013-11-08 Thread bugzilla-daemon
https://bugzilla.wikimedia.org/show_bug.cgi?id=54617

--- Comment #10 from Gabriel Wicke  ---
(In reply to comment #9)
> Using Parsoid for page view is just not workable in short or mid term, no
> matter how much we would want it.

Which issues do you see apart from rendering quality / compatibility?



[Bug 54617] Replace Tidy with a library that doesn't suck

2013-10-09 Thread bugzilla-daemon
https://bugzilla.wikimedia.org/show_bug.cgi?id=54617

--- Comment #9 from Bartosz Dziewoński  ---
(In reply to comment #8)
> VE is an HTML editor, so can be used without Parsoid.

Well yeah, okay, this could work. VE, however, has certain software and
hardware requirements not all computers meet. And there's the entire issue of
"templating" which you dismissed with a single word, which I assume is
currently not implemented without wikitext backing it.

VE also currently doesn't work for, say, talk pages (and please don't mention
Flow, it will not be ready by next summer) or edit summaries, and there are
certain pieces of the interface which show raw source code like diffs (I don't
think anybody has implemented rich text diffs yet in MediaWiki, but this is
something I'd really like to see).

Using Parsoid for page view is just not workable in short or mid term, no
matter how much we would want it.

/offtopic



[Bug 54617] Replace Tidy with a library that doesn't suck

2013-10-08 Thread bugzilla-daemon
https://bugzilla.wikimedia.org/show_bug.cgi?id=54617

--- Comment #8 from Gabriel Wicke  ---
(In reply to comment #7)
> You can't possibly want to require every MediaWiki installation everywhere to
> switch to editing raw HTML by hand (VE depends on Parsoid…).

VE is an HTML editor, so can be used without Parsoid.



[Bug 54617] Replace Tidy with a library that doesn't suck

2013-10-08 Thread bugzilla-daemon
https://bugzilla.wikimedia.org/show_bug.cgi?id=54617

--- Comment #7 from Bartosz Dziewoński  ---
(In reply to comment #6)
> Parsoid would only be needed for wikitext editing and -templating. HTML-only
> wikis would basically serve XHTML straight from storage.

You can't possibly want to require every MediaWiki installation everywhere to
switch to editing raw HTML by hand (VE depends on Parsoid…).



[Bug 54617] Replace Tidy with a library that doesn't suck

2013-10-08 Thread bugzilla-daemon
https://bugzilla.wikimedia.org/show_bug.cgi?id=54617

--- Comment #6 from Gabriel Wicke  ---
Parsoid would only be needed for wikitext editing and -templating. HTML-only
wikis would basically serve XHTML straight from storage.



[Bug 54617] Replace Tidy with a library that doesn't suck

2013-10-08 Thread bugzilla-daemon
https://bugzilla.wikimedia.org/show_bug.cgi?id=54617

Krinkle  changed:

   What    |Removed                                  |Added
   CC      |                                         |krinklem...@gmail.com
   Summary |Tidy sucks, needs to die, let's kill it  |Replace Tidy with a library that doesn't suck

--- Comment #5 from Krinkle  ---
Rephrasing summary to reflect that we don't intend to get rid of fixing
unclosed tags, only of Tidy specifically (we shouldn't kill Tidy without adding
something else, so that makes the bug more "atomic").
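
For readers unfamiliar with what "fixing unclosed tags" involves, a minimal
Python sketch follows. It is an illustration under stated assumptions, not
Tidy's algorithm and not a replacement proposed in this bug: it uses the
standard library's html.parser rather than a real HTML5 tree builder, it does
not apply HTML5 implied-end-tag rules (such as a new p closing an open p), and
TagCloser and close_unclosed_tags are hypothetical names.

```python
from html.parser import HTMLParser

# Void elements never take a closing tag.
VOID = {"area", "base", "br", "col", "embed", "hr", "img", "input",
        "link", "meta", "source", "track", "wbr"}


class TagCloser(HTMLParser):
    """Re-emit a fragment, adding close tags for elements left open."""

    def __init__(self):
        # Keep entities as written instead of decoding them.
        super().__init__(convert_charrefs=False)
        self.out = []    # output pieces, joined at the end
        self.stack = []  # names of currently open elements

    def handle_starttag(self, tag, attrs):
        self.out.append(self.get_starttag_text())
        if tag not in VOID:
            self.stack.append(tag)

    def handle_startendtag(self, tag, attrs):
        # Self-closing syntax such as <br/>: emit as-is, nothing to track.
        self.out.append(self.get_starttag_text())

    def handle_endtag(self, tag):
        if tag not in self.stack:
            return  # stray close tag: drop it
        # Close everything opened after the matching start tag, then the tag.
        while self.stack:
            open_tag = self.stack.pop()
            self.out.append(f"</{open_tag}>")
            if open_tag == tag:
                break

    def handle_data(self, data):
        self.out.append(data)

    def handle_entityref(self, name):
        self.out.append(f"&{name};")

    def handle_charref(self, name):
        self.out.append(f"&#{name};")

    def handle_comment(self, data):
        self.out.append(f"<!--{data}-->")


def close_unclosed_tags(fragment):
    parser = TagCloser()
    parser.feed(fragment)
    parser.close()
    # Anything still open at the end of input gets closed, innermost first.
    while parser.stack:
        parser.out.append(f"</{parser.stack.pop()}>")
    return "".join(parser.out)
```

With this sketch, close_unclosed_tags("<b>bold <i>both") closes the i and then
the b, and a stray close tag with no matching open tag is dropped.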
