On Mon, Aug 2, 2010 at 5:48 PM, Ariel T. Glenn wrote:
> Στις 02-08-2010, ημέρα Δευ, και ώρα 17:36 -0400, ο/η Aryeh Gregor
> έγραψε:
>> On Mon, Aug 2, 2010 at 5:28 PM, Jason A. Spiro wrote:
>> > Does the difference really matter so much that we must really use the
>> > more-obscure and more-techni
On Wed, Aug 4, 2010 at 12:17 AM, MZMcBride wrote:
> This is problematic logic for a few reasons. I see a change to the rev_len
> logic
> as being similar to a change in article count logic. The same arguments work
> in
> both places, specifically the "step problem" that will cause nasty jumps in
On Tue, Aug 3, 2010 at 8:12 PM, soxred93 wrote:
> (just remember that it's 1.5 to 5 times slower, like I said earlier.
> Whether or not that's an issue will have to be decided by higher powers)
This is not some question that has to be decided by
specially-appointed performance gurus -- just do so
On Wed, Aug 4, 2010 at 7:38 AM, MZMcBride wrote:
> Ariel T. Glenn wrote:
>> I"m all for the change, but it would have to be announced well
>> in
> advance of rollout and coordinated with other folks. For example,
>> I
> have a check against rev_len (in bytes) when writing out XML dumps,
>> in
> o
Ariel T. Glenn wrote:
> I"m all for the change, but it would have to be announced well
> in
advance of rollout and coordinated with other folks. For example,
> I
have a check against rev_len (in bytes) when writing out XML dumps,
> in
order to avoid rev id and rev content out of sync errors that w
Στις 04-08-2010, ημέρα Τετ, και ώρα 04:17 +, ο/η MZMcBride έγραψε:
> Aryeh Gregor wrote:
> > The same could be said of practically any user-visible change. I
> > mean, maybe if we add a new special page we'll break some script that
> > was screen-scraping Special:SpecialPages. We can either f
(just remember that it's 1.5 to 5 times slower, like I said earlier.
Whether or not that's an issue will have to be decided by higher powers)
On Aug 3, 2010, at 5:54 PM, Aryeh Gregor wrote:
> On Tue, Aug 3, 2010 at 5:09 PM, Daniel Friesen
> wrote:
>> Yup, though we might as well remember that
On Tue, Aug 3, 2010 at 5:09 PM, Daniel Friesen
wrote:
> Yup, though we might as well remember that not everyone has mb_
> functions installed.
if ( !function_exists( 'mb_strlen' ) ) {
/**
* Fallback implementation of mb_strlen, hardcoded to UTF-8.
* @param string $str
* @param
Aryeh Gregor wrote:
> On Tue, Aug 3, 2010 at 10:59 AM, soxred93 wrote:
>
>> Just butting in here, if I recall correctly, both the PHP-native
>> mb_strlen() and the MediaWiki fallback mb_strlen() functions are
>> considerably slower (1.5 to 5 times as slow).
>>
>
> They only have to be run
On Tue, Aug 3, 2010 at 10:59 AM, soxred93 wrote:
> Just butting in here, if I recall correctly, both the PHP-native
> mb_strlen() and the MediaWiki fallback mb_strlen() functions are
> considerably slower (1.5 to 5 times as slow).
They only have to be run once, when the revision is saved. It's n
Just butting in here, if I recall correctly, both the PHP-native
mb_strlen() and the MediaWiki fallback mb_strlen() functions are
considerably slower (1.5 to 5 times as slow). Unless there's another
way to count characters for multibyte UTF strings, this would not be
a feasible idea.
-X!
On Tue, Aug 3, 2010 at 1:14 AM, Liangent wrote:
> Byte count is used. For example in Chinese Wikipedia, one of the
> criteria of "Did you know" articles is ">= 3000 bytes".
I mean, is byte count used for anything where character count couldn't
be used just about as well? Like is there some code
Ahem.
The revision size (and page size, meaning that of last revision) in
bytes, is available in the API. If you change the definition there is
no telling what you will break. Essentially you can't.
A character count would have to be another field.
best,
Robert
On Tue, Aug 3, 2010 at 9:53 AM, C
On 8/3/10, ChrisiPK wrote:
> This is a policy requirement, not a technical requirement, and can surely be
> adjusted.
It seems 1 zh char = 3 bytes gives a kind of proper weight among
characters. Obviously, zh chars look more important (when counting the
amount of content) than en chars, which are
This is a policy requirement, not a technical requirement, and can surely be
adjusted.
Am 03.08.2010 07:14, schrieb Liangent:
> On 8/3/10, Aryeh Gregor wrote:
>> No, we'd just have to repurpose rev_len to mean "characters" instead
>> of "bytes", and update all the old rows. We don't actually nee
On 8/3/10, Aryeh Gregor wrote:
> No, we'd just have to repurpose rev_len to mean "characters" instead
> of "bytes", and update all the old rows. We don't actually need the
> byte count for anything, do we?
Byte count is used. For example in Chinese Wikipedia, one of the
criteria of "Did you know
On Tue, Aug 3, 2010 at 2:53 PM, Jason A. Spiro wrote:
> A question for the non-English wiki contributors out there: Do you
> honestly care that MediaWiki shows byte counts and not character
> counts? If so, why do you care?
If the count itself is useful (I don't think it is), then it is
probabl
On Mon, Aug 2, 2010 at 5:36 PM, Aryeh Gregor
wrote:
>
> On Mon, Aug 2, 2010 at 5:28 PM, Jason A. Spiro wrote:
>>
>> Does the difference really matter so much that we must really use the
>> more-obscure and more-technical term "bytes"?
>
> In English, maybe not. In a lot of languages, they'll dif
On Mon, Aug 2, 2010 at 6:45 PM, Victor Vasiliev wrote:
> That would require introduction of another field to revision table,
> since byte count is not convertible to characher count in UTF-8.
No, we'd just have to repurpose rev_len to mean "characters" instead
of "bytes", and update all the old r
2010/8/2 Aryeh Gregor :
> That I don't know. I don't know if descriptions of the Usability
> Initiative's studies are all public, or what. Maybe one of them could
> fill us in.
There are videos around, yes, but I'm not sure we have reports.
Digging around on usabilitywiki should turn stuff up, or
On 08/03/2010 01:48 AM, Ariel T. Glenn wrote:
> I would love it if the indicator was in characters instead of bytes.
> That's more meaningful for almost every project. Readers are looking at
> text after all, not at raw strings.
>
> Ariel
>
That would require introduction of another field to revi
Στις 02-08-2010, ημέρα Δευ, και ώρα 17:36 -0400, ο/η Aryeh Gregor
έγραψε:
> On Mon, Aug 2, 2010 at 5:28 PM, Jason A. Spiro wrote:
> > Does the difference really matter so much that we must really use the
> > more-obscure and more-technical term "bytes"?
>
> In English, maybe not. In a lot of lan
On Mon, Aug 2, 2010 at 5:28 PM, Jason A. Spiro wrote:
> Does the difference really matter so much that we must really use the
> more-obscure and more-technical term "bytes"?
In English, maybe not. In a lot of languages, they'll differ by a
somewhat unpredictable factor that can be as high as thr
On Mon, Aug 2, 2010 at 5:18 PM, Aryeh Gregor
wrote:
>
> On Mon, Aug 2, 2010 at 4:59 PM, Jason A. Spiro wrote:
>
>> Has anyone ever done usability studies of newbies -- new Internet
>> users, experienced Internet users who are non-editors, or new editors?
>
> Yep, that's what the Usability Initiat
On Mon, Aug 2, 2010 at 4:59 PM, Jason A. Spiro wrote:
> Has anyone ever done usability studies of newbies -- new Internet
> users, experienced Internet users who are non-editors, or new editors?
Yep, that's what the Usability Initiative does.
> Have the study conductors watched how they play wi
On Wed, Jul 28, 2010 at 2:37 AM, church.of.emacs.ml
wrote:
>
> On 07/28/2010 04:57 AM, Jason Spiro wrote:
>>
>> I think it would be nice for both "View history"[1] and "User contributions"
>> to
>> show bytes added/removed. This would make it easier to distinguish between
>> small contributions
On 07/28/2010 04:57 AM, Jason Spiro wrote:
> I think it would be nice for both "View history"[1] and "User contributions"
> to
> show bytes added/removed. This would make it easier to distinguish between
> small contributions from big ones: between multiple-sentence additions and
> small typo fi
Hi all.
"Recent changes" shows bytes added/removed in green/red. But "View history"
only shows revision length in bytes, and "User contributions" shows no byte
counts at all.
I think it would be nice for both "View history"[1] and "User contributions" to
show bytes added/removed. This would mak
28 matches
Mail list logo