[Bug 26563] Add characters changed per revision for stub and full article dumps
https://bugzilla.wikimedia.org/show_bug.cgi?id=26563 Bug 26563 depends on bug 22750, which changed state. Bug 22750 Summary: update export-0.4.xsd https://bugzilla.wikimedia.org/show_bug.cgi?id=22750 What|Old Value |New Value Status|REOPENED|RESOLVED Resolution||FIXED -- Configure bugmail: https://bugzilla.wikimedia.org/userprefs.cgi?tab=email --- You are receiving this mail because: --- You are on the CC list for the bug. ___ Wikibugs-l mailing list Wikibugs-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikibugs-l
[Bug 26563] Add characters changed per revision for stub and full article dumps
https://bugzilla.wikimedia.org/show_bug.cgi?id=26563 Bug 26563 depends on bug 22750, which changed state. Bug 22750 Summary: update export-0.4.xsd https://bugzilla.wikimedia.org/show_bug.cgi?id=22750 What|Old Value |New Value Status|RESOLVED|REOPENED Resolution|WORKSFORME | -- Configure bugmail: https://bugzilla.wikimedia.org/userprefs.cgi?tab=email --- You are receiving this mail because: --- You are on the CC list for the bug. ___ Wikibugs-l mailing list Wikibugs-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikibugs-l
[Bug 26563] Add characters changed per revision for stub and full article dumps
https://bugzilla.wikimedia.org/show_bug.cgi?id=26563 Bug 26563 depends on bug 22750, which changed state. Bug 22750 Summary: update export-0.4.xsd https://bugzilla.wikimedia.org/show_bug.cgi?id=22750 What|Old Value |New Value Status|NEW |RESOLVED Resolution||WORKSFORME -- Configure bugmail: https://bugzilla.wikimedia.org/userprefs.cgi?tab=email --- You are receiving this mail because: --- You are on the CC list for the bug. ___ Wikibugs-l mailing list Wikibugs-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikibugs-l
[Bug 26563] Add characters changed per revision for stub and full article dumps
https://bugzilla.wikimedia.org/show_bug.cgi?id=26563 Ariel T. Glenn ar...@wikimedia.org changed: What|Removed |Added Status|REOPENED|RESOLVED Resolution||FIXED --- Comment #11 from Ariel T. Glenn ar...@wikimedia.org 2011-08-29 16:41:09 UTC --- I'm closing it now. -- Configure bugmail: https://bugzilla.wikimedia.org/userprefs.cgi?tab=email --- You are receiving this mail because: --- You are on the CC list for the bug. ___ Wikibugs-l mailing list Wikibugs-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikibugs-l
[Bug 26563] Add characters changed per revision for stub and full article dumps
https://bugzilla.wikimedia.org/show_bug.cgi?id=26563 Diederik van Liere dvanli...@gmail.com changed: What|Removed |Added Keywords||analytics -- Configure bugmail: https://bugzilla.wikimedia.org/userprefs.cgi?tab=email --- You are receiving this mail because: --- You are on the CC list for the bug. ___ Wikibugs-l mailing list Wikibugs-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikibugs-l
[Bug 26563] Add characters changed per revision for stub and full article dumps
https://bugzilla.wikimedia.org/show_bug.cgi?id=26563 --- Comment #10 from Diederik van Liere dvanli...@gmail.com 2011-08-12 21:16:57 UTC --- I think we can close this bug, or not? -- Configure bugmail: https://bugzilla.wikimedia.org/userprefs.cgi?tab=email --- You are receiving this mail because: --- You are on the CC list for the bug. ___ Wikibugs-l mailing list Wikibugs-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikibugs-l
[Bug 26563] Add characters changed per revision for stub and full article dumps
https://bugzilla.wikimedia.org/show_bug.cgi?id=26563 Brion Vibber br...@wikimedia.org changed: What|Removed |Added Status|RESOLVED|REOPENED Depends on||22750 Resolution|FIXED | --- Comment #8 from Brion Vibber br...@wikimedia.org 2011-07-11 23:10:48 UTC --- The updated schema never got published on mediawiki.org: bug 22750 This will break anything trying to automatically run XSD validation due to being unable to fetch the schema file. -- Configure bugmail: https://bugzilla.wikimedia.org/userprefs.cgi?tab=email --- You are receiving this mail because: --- You are on the CC list for the bug. ___ Wikibugs-l mailing list Wikibugs-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikibugs-l
[Bug 26563] Add characters changed per revision for stub and full article dumps
https://bugzilla.wikimedia.org/show_bug.cgi?id=26563 Brion Vibber br...@wikimedia.org changed: What|Removed |Added Blocks||29819 --- Comment #9 from Brion Vibber br...@wikimedia.org 2011-07-11 23:15:47 UTC --- that's bug 29819 rather. bah! -- Configure bugmail: https://bugzilla.wikimedia.org/userprefs.cgi?tab=email --- You are receiving this mail because: --- You are on the CC list for the bug. ___ Wikibugs-l mailing list Wikibugs-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikibugs-l
[Bug 26563] Add characters changed per revision for stub and full article dumps
https://bugzilla.wikimedia.org/show_bug.cgi?id=26563 --- Comment #7 from Diederik van Liere dvanli...@gmail.com 2011-02-06 08:10:44 UTC --- Maybe we should include the delta byte count or cumulative number of bytes in the database to enable feature requests such as: * Show size of current text in edit form (https://bugzilla.wikimedia.org/show_bug.cgi?id=3890) * Sorting language pane by article size (https://bugzilla.wikimedia.org/show_bug.cgi?id=6559) * Page character counts: denote simple vs. complex changes (https://bugzilla.wikimedia.org/show_bug.cgi?id=8571) * Special page for statistics about specific articles (https://bugzilla.wikimedia.org/show_bug.cgi?id=547) -- Configure bugmail: https://bugzilla.wikimedia.org/userprefs.cgi?tab=email --- You are receiving this mail because: --- You are on the CC list for the bug. ___ Wikibugs-l mailing list Wikibugs-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikibugs-l
[Bug 26563] Add characters changed per revision for stub and full article dumps
https://bugzilla.wikimedia.org/show_bug.cgi?id=26563 Rob Lanphier ro...@wikimedia.org changed: What|Removed |Added Status|ASSIGNED|RESOLVED Resolution||FIXED --- Comment #6 from Rob Lanphier ro...@wikimedia.org 2011-01-27 20:09:20 UTC --- This is fixed in r79856, and will be deployed as part of 1.17 -- Configure bugmail: https://bugzilla.wikimedia.org/userprefs.cgi?tab=email --- You are receiving this mail because: --- You are on the CC list for the bug. ___ Wikibugs-l mailing list Wikibugs-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikibugs-l
[Bug 26563] Add characters changed per revision for stub and full article dumps
https://bugzilla.wikimedia.org/show_bug.cgi?id=26563 Roan Kattouw roan.katt...@gmail.com changed: What|Removed |Added CC||roan.katt...@gmail.com --- Comment #3 from Roan Kattouw roan.katt...@gmail.com 2011-01-08 11:57:37 UTC --- (In reply to comment #2) I'll document how I'd go about characters, just in case anyone wants to tackle it. The JOIN of the text table in WikiExporter::dumpFrom would have to be performed even in the case of a stub dump. WikiExporter()-text would need to be passed as a new parameter into XMLDumpWriter::writeRevision(). The stub logic in XMLDumpWriter::writeRevision() would need to be changed to use the new parameter to see if we're dealing with a stub dump, rather than inferring it from the absence of text. Finally, mb_strlen($foo, 'UTF-8') could be called. It's not a ton of code (probably 10-15 lines of code change, tops) but that's less likely to get fast-tracked to production. Wouldn't this cause stub dumps to load the text of each revision, significantly slowing down their generation? -- Configure bugmail: https://bugzilla.wikimedia.org/userprefs.cgi?tab=email --- You are receiving this mail because: --- You are the assignee for the bug. You are on the CC list for the bug. ___ Wikibugs-l mailing list Wikibugs-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikibugs-l
[Bug 26563] Add characters changed per revision for stub and full article dumps
https://bugzilla.wikimedia.org/show_bug.cgi?id=26563 --- Comment #4 from Ariel T. Glenn ar...@wikimedia.org 2011-01-08 12:11:59 UTC --- Exactly. What we want to do is follow the same procedure we did for bytes: add a field in the revision table, automatically populate it for new revs, run a job to populate for old revs. -- Configure bugmail: https://bugzilla.wikimedia.org/userprefs.cgi?tab=email --- You are receiving this mail because: --- You are the assignee for the bug. You are on the CC list for the bug. ___ Wikibugs-l mailing list Wikibugs-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikibugs-l
[Bug 26563] Add characters changed per revision for stub and full article dumps
https://bugzilla.wikimedia.org/show_bug.cgi?id=26563 Rob Lanphier ro...@wikimedia.org changed: What|Removed |Added Status|NEW |ASSIGNED AssignedTo|wikibug...@lists.wikimedia. |ro...@wikimedia.org |org | --- Comment #5 from Rob Lanphier ro...@wikimedia.org 2011-01-09 03:03:19 UTC --- Even more reason to punt on character count. :) If we ever add character count to the database, we really ought to address bug 21860 (checksum per rev) while we're at it. -- Configure bugmail: https://bugzilla.wikimedia.org/userprefs.cgi?tab=email --- You are receiving this mail because: --- You are the assignee for the bug. You are on the CC list for the bug. ___ Wikibugs-l mailing list Wikibugs-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikibugs-l
[Bug 26563] Add characters changed per revision for stub and full article dumps
https://bugzilla.wikimedia.org/show_bug.cgi?id=26563 Rob Lanphier ro...@wikimedia.org changed: What|Removed |Added AssignedTo|ar...@wikimedia.org |wikibug...@lists.wikimedia. ||org --- Comment #1 from Rob Lanphier ro...@wikimedia.org 2011-01-07 20:00:24 UTC --- Byte count will be way easier, and might happen sooner than character count, since we already have revision length in the database. Ariel asks that we update the version number of the dumps if that happens, so users of the dumps can correlate contents to versions. The code to modify is here: http://svn.wikimedia.org/viewvc/mediawiki/trunk/phase3/includes/Export.php?view=markup To update the version, we need to update schemaVersion(). In order for this to get into production, it of course needs to get deployed to the production branch. Ariel doesn't have time to implement this right now, so an interested volunteer would be appreciated. -- Configure bugmail: https://bugzilla.wikimedia.org/userprefs.cgi?tab=email --- You are receiving this mail because: --- You are the assignee for the bug. You are on the CC list for the bug. ___ Wikibugs-l mailing list Wikibugs-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikibugs-l
[Bug 26563] Add characters changed per revision for stub and full article dumps
https://bugzilla.wikimedia.org/show_bug.cgi?id=26563 --- Comment #2 from Rob Lanphier ro...@wikimedia.org 2011-01-08 03:31:37 UTC --- Committed r79856 into trunk. I did bytes because characters was a little more involved. I added byte counts to both stub and full dumps. I thought about not including the byte count in the full dump because it's pretty trivial to get that count from most XML parsers. However, it is nice to have the byte count that doesn't include any XML escaping introduced by the dump, so I left it in. I'll document how I'd go about characters, just in case anyone wants to tackle it. The JOIN of the text table in WikiExporter::dumpFrom would have to be performed even in the case of a stub dump. WikiExporter()-text would need to be passed as a new parameter into XMLDumpWriter::writeRevision(). The stub logic in XMLDumpWriter::writeRevision() would need to be changed to use the new parameter to see if we're dealing with a stub dump, rather than inferring it from the absence of text. Finally, mb_strlen($foo, 'UTF-8') could be called. It's not a ton of code (probably 10-15 lines of code change, tops) but that's less likely to get fast-tracked to production. -- Configure bugmail: https://bugzilla.wikimedia.org/userprefs.cgi?tab=email --- You are receiving this mail because: --- You are the assignee for the bug. You are on the CC list for the bug. ___ Wikibugs-l mailing list Wikibugs-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikibugs-l