[Bug 26563] Add characters changed per revision for stub and full article dumps

2012-06-03 Thread bugzilla-daemon
https://bugzilla.wikimedia.org/show_bug.cgi?id=26563

Bug 26563 depends on bug 22750, which changed state.

Bug 22750 Summary: update export-0.4.xsd
https://bugzilla.wikimedia.org/show_bug.cgi?id=22750

   What|Old Value   |New Value

 Status|REOPENED|RESOLVED
 Resolution||FIXED

-- 
Configure bugmail: https://bugzilla.wikimedia.org/userprefs.cgi?tab=email
--- You are receiving this mail because: ---
You are on the CC list for the bug.

___
Wikibugs-l mailing list
Wikibugs-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikibugs-l


[Bug 26563] Add characters changed per revision for stub and full article dumps

2012-05-31 Thread bugzilla-daemon
https://bugzilla.wikimedia.org/show_bug.cgi?id=26563

Bug 26563 depends on bug 22750, which changed state.

Bug 22750 Summary: update export-0.4.xsd
https://bugzilla.wikimedia.org/show_bug.cgi?id=22750

   What|Old Value   |New Value

 Status|RESOLVED|REOPENED
 Resolution|WORKSFORME  |

-- 
Configure bugmail: https://bugzilla.wikimedia.org/userprefs.cgi?tab=email
--- You are receiving this mail because: ---
You are on the CC list for the bug.

___
Wikibugs-l mailing list
Wikibugs-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikibugs-l


[Bug 26563] Add characters changed per revision for stub and full article dumps

2011-11-05 Thread bugzilla-daemon
https://bugzilla.wikimedia.org/show_bug.cgi?id=26563

Bug 26563 depends on bug 22750, which changed state.

Bug 22750 Summary: update export-0.4.xsd
https://bugzilla.wikimedia.org/show_bug.cgi?id=22750

   What|Old Value   |New Value

 Status|NEW |RESOLVED
 Resolution||WORKSFORME

-- 
Configure bugmail: https://bugzilla.wikimedia.org/userprefs.cgi?tab=email
--- You are receiving this mail because: ---
You are on the CC list for the bug.

___
Wikibugs-l mailing list
Wikibugs-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikibugs-l


[Bug 26563] Add characters changed per revision for stub and full article dumps

2011-08-29 Thread bugzilla-daemon
https://bugzilla.wikimedia.org/show_bug.cgi?id=26563

Ariel T. Glenn ar...@wikimedia.org changed:

   What|Removed |Added

 Status|REOPENED|RESOLVED
 Resolution||FIXED

--- Comment #11 from Ariel T. Glenn ar...@wikimedia.org 2011-08-29 16:41:09 
UTC ---
I'm closing it now.

-- 
Configure bugmail: https://bugzilla.wikimedia.org/userprefs.cgi?tab=email
--- You are receiving this mail because: ---
You are on the CC list for the bug.

___
Wikibugs-l mailing list
Wikibugs-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikibugs-l


[Bug 26563] Add characters changed per revision for stub and full article dumps

2011-08-12 Thread bugzilla-daemon
https://bugzilla.wikimedia.org/show_bug.cgi?id=26563

Diederik van Liere dvanli...@gmail.com changed:

   What|Removed |Added

   Keywords||analytics

-- 
Configure bugmail: https://bugzilla.wikimedia.org/userprefs.cgi?tab=email
--- You are receiving this mail because: ---
You are on the CC list for the bug.

___
Wikibugs-l mailing list
Wikibugs-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikibugs-l


[Bug 26563] Add characters changed per revision for stub and full article dumps

2011-08-12 Thread bugzilla-daemon
https://bugzilla.wikimedia.org/show_bug.cgi?id=26563

--- Comment #10 from Diederik van Liere dvanli...@gmail.com 2011-08-12 
21:16:57 UTC ---
I think we can close this bug, or not?

-- 
Configure bugmail: https://bugzilla.wikimedia.org/userprefs.cgi?tab=email
--- You are receiving this mail because: ---
You are on the CC list for the bug.

___
Wikibugs-l mailing list
Wikibugs-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikibugs-l


[Bug 26563] Add characters changed per revision for stub and full article dumps

2011-07-11 Thread bugzilla-daemon
https://bugzilla.wikimedia.org/show_bug.cgi?id=26563

Brion Vibber br...@wikimedia.org changed:

   What|Removed |Added

 Status|RESOLVED|REOPENED
 Depends on||22750
 Resolution|FIXED   |

--- Comment #8 from Brion Vibber br...@wikimedia.org 2011-07-11 23:10:48 UTC 
---
The updated schema never got published on mediawiki.org: bug 22750

This will break anything trying to automatically run XSD validation due to
being unable to fetch the schema file.

-- 
Configure bugmail: https://bugzilla.wikimedia.org/userprefs.cgi?tab=email
--- You are receiving this mail because: ---
You are on the CC list for the bug.

___
Wikibugs-l mailing list
Wikibugs-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikibugs-l


[Bug 26563] Add characters changed per revision for stub and full article dumps

2011-07-11 Thread bugzilla-daemon
https://bugzilla.wikimedia.org/show_bug.cgi?id=26563

Brion Vibber br...@wikimedia.org changed:

   What|Removed |Added

 Blocks||29819

--- Comment #9 from Brion Vibber br...@wikimedia.org 2011-07-11 23:15:47 UTC 
---
that's bug 29819  rather. bah!

-- 
Configure bugmail: https://bugzilla.wikimedia.org/userprefs.cgi?tab=email
--- You are receiving this mail because: ---
You are on the CC list for the bug.

___
Wikibugs-l mailing list
Wikibugs-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikibugs-l


[Bug 26563] Add characters changed per revision for stub and full article dumps

2011-02-06 Thread bugzilla-daemon
https://bugzilla.wikimedia.org/show_bug.cgi?id=26563

--- Comment #7 from Diederik van Liere dvanli...@gmail.com 2011-02-06 
08:10:44 UTC ---
Maybe we should include the delta byte count or cumulative number of bytes in
the database to enable feature requests such as: 
* Show size of current text in edit form
(https://bugzilla.wikimedia.org/show_bug.cgi?id=3890)
* Sorting language pane by article size
(https://bugzilla.wikimedia.org/show_bug.cgi?id=6559)
* Page character counts: denote simple vs. complex changes
(https://bugzilla.wikimedia.org/show_bug.cgi?id=8571)
* Special page for statistics about specific articles
(https://bugzilla.wikimedia.org/show_bug.cgi?id=547)

-- 
Configure bugmail: https://bugzilla.wikimedia.org/userprefs.cgi?tab=email
--- You are receiving this mail because: ---
You are on the CC list for the bug.

___
Wikibugs-l mailing list
Wikibugs-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikibugs-l


[Bug 26563] Add characters changed per revision for stub and full article dumps

2011-01-27 Thread bugzilla-daemon
https://bugzilla.wikimedia.org/show_bug.cgi?id=26563

Rob Lanphier ro...@wikimedia.org changed:

   What|Removed |Added

 Status|ASSIGNED|RESOLVED
 Resolution||FIXED

--- Comment #6 from Rob Lanphier ro...@wikimedia.org 2011-01-27 20:09:20 UTC 
---
This is fixed in r79856, and will be deployed as part of 1.17

-- 
Configure bugmail: https://bugzilla.wikimedia.org/userprefs.cgi?tab=email
--- You are receiving this mail because: ---
You are on the CC list for the bug.

___
Wikibugs-l mailing list
Wikibugs-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikibugs-l


[Bug 26563] Add characters changed per revision for stub and full article dumps

2011-01-08 Thread bugzilla-daemon
https://bugzilla.wikimedia.org/show_bug.cgi?id=26563

Roan Kattouw roan.katt...@gmail.com changed:

   What|Removed |Added

 CC||roan.katt...@gmail.com

--- Comment #3 from Roan Kattouw roan.katt...@gmail.com 2011-01-08 11:57:37 
UTC ---
(In reply to comment #2)
 I'll document how I'd go about characters, just in case anyone wants to tackle
 it.  The JOIN of the text table in WikiExporter::dumpFrom would have to be
 performed even in the case of a stub dump.  WikiExporter()-text would need to
 be passed as a new parameter into XMLDumpWriter::writeRevision().  The stub
 logic in XMLDumpWriter::writeRevision() would need to be changed to use the 
 new
 parameter to see if we're dealing with a stub dump, rather than inferring it
 from the absence of text.  Finally, mb_strlen($foo, 'UTF-8') could be called. 
 It's not a ton of code (probably 10-15 lines of code change, tops) but that's
 less likely to get fast-tracked to production.
Wouldn't this cause stub dumps to load the text of each revision, significantly
slowing down their generation?

-- 
Configure bugmail: https://bugzilla.wikimedia.org/userprefs.cgi?tab=email
--- You are receiving this mail because: ---
You are the assignee for the bug.
You are on the CC list for the bug.

___
Wikibugs-l mailing list
Wikibugs-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikibugs-l


[Bug 26563] Add characters changed per revision for stub and full article dumps

2011-01-08 Thread bugzilla-daemon
https://bugzilla.wikimedia.org/show_bug.cgi?id=26563

--- Comment #4 from Ariel T. Glenn ar...@wikimedia.org 2011-01-08 12:11:59 
UTC ---
Exactly. What we want to do is follow the same procedure we did for bytes: add
a field in the revision table, automatically populate it for new revs, run a
job to populate for old revs.

-- 
Configure bugmail: https://bugzilla.wikimedia.org/userprefs.cgi?tab=email
--- You are receiving this mail because: ---
You are the assignee for the bug.
You are on the CC list for the bug.

___
Wikibugs-l mailing list
Wikibugs-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikibugs-l


[Bug 26563] Add characters changed per revision for stub and full article dumps

2011-01-08 Thread bugzilla-daemon
https://bugzilla.wikimedia.org/show_bug.cgi?id=26563

Rob Lanphier ro...@wikimedia.org changed:

   What|Removed |Added

 Status|NEW |ASSIGNED
 AssignedTo|wikibug...@lists.wikimedia. |ro...@wikimedia.org
   |org |

--- Comment #5 from Rob Lanphier ro...@wikimedia.org 2011-01-09 03:03:19 UTC 
---
Even more reason to punt on character count.  :)  If we ever add character
count to the database, we really ought to address bug 21860 (checksum per rev)
while we're at it.

-- 
Configure bugmail: https://bugzilla.wikimedia.org/userprefs.cgi?tab=email
--- You are receiving this mail because: ---
You are the assignee for the bug.
You are on the CC list for the bug.

___
Wikibugs-l mailing list
Wikibugs-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikibugs-l


[Bug 26563] Add characters changed per revision for stub and full article dumps

2011-01-07 Thread bugzilla-daemon
https://bugzilla.wikimedia.org/show_bug.cgi?id=26563

Rob Lanphier ro...@wikimedia.org changed:

   What|Removed |Added

 AssignedTo|ar...@wikimedia.org |wikibug...@lists.wikimedia.
   ||org

--- Comment #1 from Rob Lanphier ro...@wikimedia.org 2011-01-07 20:00:24 UTC 
---
Byte count will be way easier, and might happen sooner than character count,
since we already have revision length in the database.  Ariel asks that we 
update the version number of the dumps if that happens, so users of the dumps
can correlate contents to versions.

The code to modify is here:
http://svn.wikimedia.org/viewvc/mediawiki/trunk/phase3/includes/Export.php?view=markup

To update the version, we need to update schemaVersion().

In order for this to get into production, it of course needs to get deployed to
the production branch.

Ariel doesn't have time to implement this right now, so an interested volunteer
would be appreciated.

-- 
Configure bugmail: https://bugzilla.wikimedia.org/userprefs.cgi?tab=email
--- You are receiving this mail because: ---
You are the assignee for the bug.
You are on the CC list for the bug.

___
Wikibugs-l mailing list
Wikibugs-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikibugs-l


[Bug 26563] Add characters changed per revision for stub and full article dumps

2011-01-07 Thread bugzilla-daemon
https://bugzilla.wikimedia.org/show_bug.cgi?id=26563

--- Comment #2 from Rob Lanphier ro...@wikimedia.org 2011-01-08 03:31:37 UTC 
---
Committed r79856 into trunk.  I did bytes because characters was a little more
involved.  I added byte counts to both stub and full dumps.  

I thought about not including the byte count in the full dump because it's
pretty trivial to get that count from most XML parsers.  However, it is nice to
have the byte count that doesn't include any XML escaping introduced by the
dump, so I left it in.

I'll document how I'd go about characters, just in case anyone wants to tackle
it.  The JOIN of the text table in WikiExporter::dumpFrom would have to be
performed even in the case of a stub dump.  WikiExporter()-text would need to
be passed as a new parameter into XMLDumpWriter::writeRevision().  The stub
logic in XMLDumpWriter::writeRevision() would need to be changed to use the new
parameter to see if we're dealing with a stub dump, rather than inferring it
from the absence of text.  Finally, mb_strlen($foo, 'UTF-8') could be called. 
It's not a ton of code (probably 10-15 lines of code change, tops) but that's
less likely to get fast-tracked to production.

-- 
Configure bugmail: https://bugzilla.wikimedia.org/userprefs.cgi?tab=email
--- You are receiving this mail because: ---
You are the assignee for the bug.
You are on the CC list for the bug.

___
Wikibugs-l mailing list
Wikibugs-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikibugs-l