Re: [Wikimedia-l] Commons' frontpage probably shouldn't prominently feature a decontextualised stack of corpses.

2014-05-16 Thread Kevin Gorman
I should state that I haven't fully kept up to date on this thread; I
managed to give myself a concussion this week and have been limiting the
amount of time spent on activities that are mentally intensive.  I decided
to pop in and check on how it was going though, and after seeing Russavia's
friendly there-are-no-problems-with-commons-culture tone in his last post,
I figured I'd pop in and ask a question of Russavia (which he refused to
answer on-wiki.)

Russavia: one of your recent comments at a deletion discussion (here: [1])
looks an awful lot like you think that labelling a group of identifiable,
living people as engaged in prostitution related activities with no
evidence other than their location is perfectly acceptable and not a
violation of [[COM:IDENT]] or the board BLP resolution.  To me, labelling a
group of identifiable, living people as engaged in prostitution related
activities with no evidence other than their location is a strong violation
of both COM:IDENT, and the board's BLP resolution.  Most of the other
Wikimedians I've shown your comment to have interpreted it in the same way
I have. (You said that labelling a group of identifiable, living people as
engaged in prostitution related activities was apt because they were in
an area known to have a lot of sex work.  Given my understanding of the
word, that means you thought it was appropriate/suitable - apt is
definitely not a word I'd use to describe a BLP violation.  Just in case
I'd misunderstood what the word meant, I took a look at MW and the OED,
both of which indicated that no,I really hadnt.)

a) Have I accurately interpreted your comment?  If I haven't, would you
mind clarifying your intent as you refused to do on-wiki so those confused
about what you meant are able to understand what you meant?
b) Do you believe that walking away from an active on-wiki discussion
refusing to explain what you meant by a comment some took as highly
questionable is in line with what should be expected of sysops on major
projects?

Here's the comment in question reproduced for those who don't want to click
through:
Carrer de Sant Ramon in El Raval, Barcelona is notorious for being one of
the worst places in Barcelona for street prostitution, and this is also
acknowledged in local Catalan press. The uploader has done the right thing
in applying cover to the eyes of the people, and no individual person is
being named as a prostitute, but given that this street is known for
daylight prostitution, the name of the file and its description is apt.
However, prostitutes in the street do not generally like their photographs
to be taken. I am not opining delete on the basis of the nomination but
rather for very different reasons. Inline
withCommons:Country_specific_consent_requirements, in Spain one requires
permission to both take a photograph and publish the photograph when it is
taken in a public place, and there is no evidence that this was obtained by
the uploader. So on that basis, and that basis alone, it's a firm delete.

-
Kevin Gorman
(Since I cut/pasted Russavia's comment directly, one of the links to
Youtube he posted may attach to this email, and I don't currently see the
right button to turn that off.)


On Thu, May 15, 2014 at 10:58 PM, Erik Moeller e...@wikimedia.org wrote:

 On Thu, May 15, 2014 at 10:03 PM, John Mark Vandenberg jay...@gmail.com
 wrote:

  We're getting a long way off topic of the still frame on MOTD, but I
  agree, and wish that the WMF would make this a priority for their
  multimedia and search team.
  Many improvements have been suggested by the community, and both sides
  of the fence have even agreed on some of them, such as clustered
  search results:
 
 
 https://meta.wikimedia.org/wiki/Controversial_content/Brainstorming#Clustering_for_search_results_on_Commons
  https://bugzilla.wikimedia.org/show_bug.cgi?id=35701

 First, as general background, WMF recently started migrating its
 search infrastructure over to ElasticSearch. See:

 https://www.mediawiki.org/wiki/Search
 https://www.mediawiki.org/wiki/Help:CirrusSearch

 The new search is available on Commons as a BetaFeature. It's worth
 looking at search results that are viewed as problematic through the
 new search and compare. For example, the results for Asian are
 markedly different in the new search.

 I would caution against a simplistic characterization of technology as
 a solution for what's inherently a complex socio-technical problem.
 That was a core issue with the image filter proposal and it's a
 similar issue here. If people insist on uploading pictures of
 masturbation with toothbrushes, those pictures will come up in
 searches. If we insist on not having a distinction between explicit
 and non-explicit materials in file metadata, search results won't have
 it either. We can point the finger at technology because that's easy,
 but it's not magical pixie dust.

 To get a feel for ElasticSearch's capabilities, please see the help
 page above, as 

Re: [Wikimedia-l] Commons' frontpage probably shouldn't prominently feature a decontextualised stack of corpses.

2014-05-16 Thread Pete Forsyth
On Thu, May 15, 2014 at 8:20 PM, Andreas Kolbe jayen...@gmail.com wrote:


 Don't put words in my mouth, Pete. Commons is broken is a Jimmy Wales
 quote.


It's something many people have said, and I do apologize for my mistake --
I thought you were one of them. I am very happy to learn that I was wrong
about that, and appreciate the clarification.


 I do think Commons has some ways to go, though,


Agree 100%. This is something I work on, if not every day, probably every
week. I do think that progress is being made, on adult material and in many
other areas. But you're right, there's lots of work to do in developing a a
shared understanding of what issues are at play and how to go about
addressing them. The point of I'm trying to make in this discussion is, we
do a lot more good by focusing on what's working, and then expanding on
that, than we do by getting all accusatory about the things that are *not*
working. (And such accusations often seem to be accompanied by an
unjustified assumption that the bad somehow outweighs the good.)

Pete
[[User:Peteforsyth]]
___
Wikimedia-l mailing list, guidelines at: 
https://meta.wikimedia.org/wiki/Mailing_lists/Guidelines
Wikimedia-l@lists.wikimedia.org
Unsubscribe: https://lists.wikimedia.org/mailman/listinfo/wikimedia-l, 
mailto:wikimedia-l-requ...@lists.wikimedia.org?subject=unsubscribe

Re: [Wikimedia-l] Commons' frontpage probably shouldn't prominently feature a decontextualised stack of corpses.

2014-05-16 Thread Russavia
Kevin,

Let me know when you have recovered from the concussion you have
incurred, which I hope is soon and I hope you are getting better, and
then I would urge you to re-look at the issues and re-present them,
and I would be more than happy to discuss publicly right here.

It would be unfair of you to expect me to be able to comment frankly
(which is my preferred way) right now, when what you are writing here,
on Commons, and in the comments section of Pete's blog post may not be
Kevin Gorman, Wikipedian-in-Residence at UC Berkeley, but Kevin
Gorman, the guy who has received a nasty knock to the head.

Until then, I would prefer not to comment further.

Cheers,

Russavia

___
Wikimedia-l mailing list, guidelines at: 
https://meta.wikimedia.org/wiki/Mailing_lists/Guidelines
Wikimedia-l@lists.wikimedia.org
Unsubscribe: https://lists.wikimedia.org/mailman/listinfo/wikimedia-l, 
mailto:wikimedia-l-requ...@lists.wikimedia.org?subject=unsubscribe

Re: [Wikimedia-l] [Wikitech-l] Request for comments: How to deal with open datasets?

2014-05-16 Thread David Cuenca
On Thu, May 15, 2014 at 11:53 PM, Andrew Gray andrew.g...@dunelm.org.ukwrote:

 Definitely agree that we needed something like this. There's a lot of
 confusion about what Wikidata is for, and what is and isn't appropriate for
 it - both from outsiders and from within the Wikimedia community. I've seen
 vague it's data, it'll go on Wikidata a few times, which is a bit like
 saying it's text, it'll go on Wikipedia ;-)


Actually during the Hackathon somebody said: We need a new project... a
sort of Wiki...Data...Source? :D And it is great that you make the analogy
of Wikipedia-and-Text because when you think about it, Wikipedia went
through exactly the same process until Project Sourceberg was
characterized ;-)

Micru
___
Wikimedia-l mailing list, guidelines at: 
https://meta.wikimedia.org/wiki/Mailing_lists/Guidelines
Wikimedia-l@lists.wikimedia.org
Unsubscribe: https://lists.wikimedia.org/mailman/listinfo/wikimedia-l, 
mailto:wikimedia-l-requ...@lists.wikimedia.org?subject=unsubscribe

Re: [Wikimedia-l] Commons' frontpage probably shouldn't prominently feature a decontextualised stack of corpses.

2014-05-16 Thread Pete Forsyth
On Thu, May 15, 2014 at 10:58 PM, Erik Moeller e...@wikimedia.org wrote:

 I would caution against a simplistic characterization of technology as
 a solution for what's inherently a complex socio-technical problem.


Please forgive a sentimental moment -- I am so happy to hear this clearly
acknowledged by somebody in a position like yours. It is amazing to me how
little attention this simple and obvious point has gotten in the years of
discussion we have had on these topics! It seems that somehow, our
discussions often seem to ignore that free speech, varying cultural values
and editorial objectives, and individual rights have ALWAYS led to drama
and problem, starting many centuries before Wikimedia ever came on the
scene. It is really astonishing to me how much currency the XYZ group is
ethically challenged frame gains in our discussion -- whether that group
is those guys at Commons, or those guys at WMF, or whatever.

These problems are fundamentally difficult. Yes. That is the appropriate
starting point for these discussions, and we should all keep our shared
value to assume good faith in mind as we dig into the next layer.

All that said, I have long argued that there is a very simple solution to
90+% of the problem, and I'm curious whether this has ever been pursued.

A quick visit to stats.grok.se indicates that the Search feature of
English Wikipedia was used 63 million times last month, while the Search
feature of Commons was used 18,000 times in the same month. Of course, most
of the Wikipedia searches were for text, but at least *some* of them would
have involved the multimedia tab, which returns almost exactly the same
results whether you are on Wikipedia or Wikimedia Commons. (If there's a
way to quantify this, I don't know it.)

So my question is this: does a reader searching for images on Wikipedia
automatically want the same kind of results as a reader searching for
images on Wikimedia Commons? Does the Wikipedia reader *expect* to find
images from a *related* site? I think the clear answer is no. (Of course
this could be tested, but to do so is beyond my resources.)

I think it is much more likely that a Wikipedia reader would expect to find
those images *used in Wikipedia articles* than a massive collection of
stuff that is somehow tangentially related to Wikipedia in a way that they
don't fully understand.

So why on earth does the main multimedia search link on Wikipedia
automatically return unused results from Commons to begin with? Is that
really the right way to go?

I think a better way for the Wikipedia multimedia Search feature to work
would be if it returned for electric toothbrush ONLY the images (from
Wikipedia and from Commons) that are *actually* used in Wikipedia articles.
There could also be a link there, Would you like to search the media
repository Wikimedia Commons, a sister site to Wikipedia? But that could
be a secondary step, thereby preventing lots of people (who found the
electric toothbrush image they wanted already) from ever seeing the
horrific masturbation photo.

Is there a good reason not to have the Search feature work this way? I'm
not the most technical person, but this doesn't seem like a hugely
challenging project. Could it be accomplished for (say) $10,000? And if so,
is there a good reason not to do so, and eliminate a large portion of the
astonished users who get surprised by something like a toothbrush being put
to an unexpected use?

Pete
[[User:Peteforsyth]]
___
Wikimedia-l mailing list, guidelines at: 
https://meta.wikimedia.org/wiki/Mailing_lists/Guidelines
Wikimedia-l@lists.wikimedia.org
Unsubscribe: https://lists.wikimedia.org/mailman/listinfo/wikimedia-l, 
mailto:wikimedia-l-requ...@lists.wikimedia.org?subject=unsubscribe

Re: [Wikimedia-l] Commons' frontpage probably shouldn't prominently feature a decontextualised stack of corpses.

2014-05-16 Thread Kevin Gorman
Russavia -

It would be pretty easy to go No, I didn't mean to imply that it was okay
to label a group of identifiable living people as prostitutes based on the
street they happened to be standing on, which, for some odd reason, you've
refused to do both on and off-wiki.  This has obviously gotten
significantly away from the topic of the original thread, but would it
comfort you if I asked one of the non-concussed people who interpreted your
comment in the exact same way I did to pose the same question to you here?

It doesn't exactly look awesome to make what is, charitably, an easy to
misinterpret statement, and then refuse to clarify the intent of your
statement on a project you're a sysop on.


Kevin Gorman


On Fri, May 16, 2014 at 1:12 AM, Russavia russavia.wikipe...@gmail.comwrote:

 Kevin,

 Let me know when you have recovered from the concussion you have
 incurred, which I hope is soon and I hope you are getting better, and
 then I would urge you to re-look at the issues and re-present them,
 and I would be more than happy to discuss publicly right here.

 It would be unfair of you to expect me to be able to comment frankly
 (which is my preferred way) right now, when what you are writing here,
 on Commons, and in the comments section of Pete's blog post may not be
 Kevin Gorman, Wikipedian-in-Residence at UC Berkeley, but Kevin
 Gorman, the guy who has received a nasty knock to the head.

 Until then, I would prefer not to comment further.

 Cheers,

 Russavia

 ___
 Wikimedia-l mailing list, guidelines at:
 https://meta.wikimedia.org/wiki/Mailing_lists/Guidelines
 Wikimedia-l@lists.wikimedia.org
 Unsubscribe: https://lists.wikimedia.org/mailman/listinfo/wikimedia-l,
 mailto:wikimedia-l-requ...@lists.wikimedia.org?subject=unsubscribe

___
Wikimedia-l mailing list, guidelines at: 
https://meta.wikimedia.org/wiki/Mailing_lists/Guidelines
Wikimedia-l@lists.wikimedia.org
Unsubscribe: https://lists.wikimedia.org/mailman/listinfo/wikimedia-l, 
mailto:wikimedia-l-requ...@lists.wikimedia.org?subject=unsubscribe

Re: [Wikimedia-l] Commons' frontpage probably shouldn't prominently feature a decontextualised stack of corpses.

2014-05-16 Thread Andreas Kolbe
On Fri, May 16, 2014 at 9:05 AM, Pete Forsyth petefors...@gmail.com wrote:

 The point of I'm trying to make in this discussion is, we

do a lot more good by focusing on what's working, and then expanding on
 that, than we do by getting all accusatory about the things that are *not*
 working.



Think of a surgeon who's done thousands of successful routine operations.
But every once in a while, he does a gastric bypass, and those patients
more often than not end up harmed.

It isn't appropriate in such a case to focus on what's working.



 (And such accusations often seem to be accompanied by an
 unjustified assumption that the bad somehow outweighs the good.)



The question isn't whether the bad outweighs the good.
___
Wikimedia-l mailing list, guidelines at: 
https://meta.wikimedia.org/wiki/Mailing_lists/Guidelines
Wikimedia-l@lists.wikimedia.org
Unsubscribe: https://lists.wikimedia.org/mailman/listinfo/wikimedia-l, 
mailto:wikimedia-l-requ...@lists.wikimedia.org?subject=unsubscribe

Re: [Wikimedia-l] Commons' frontpage probably shouldn't prominently feature a decontextualised stack of corpses.

2014-05-16 Thread Russavia
Kevin,

Feel free to have one of the people who don't have a nasty head injury
ask me the question. That would be fine, and I would actually prefer
it. Given your head injury, I'm actually a little surprised that your
friends did think of asking me themselves under the circumstances.

Cheers

Russavia

___
Wikimedia-l mailing list, guidelines at: 
https://meta.wikimedia.org/wiki/Mailing_lists/Guidelines
Wikimedia-l@lists.wikimedia.org
Unsubscribe: https://lists.wikimedia.org/mailman/listinfo/wikimedia-l, 
mailto:wikimedia-l-requ...@lists.wikimedia.org?subject=unsubscribe

Re: [Wikimedia-l] Commons' frontpage probably shouldn't prominently feature a decontextualised stack of corpses.

2014-05-16 Thread Pete Forsyth
On Fri, May 16, 2014 at 1:39 AM, Andreas Kolbe jayen...@gmail.com wrote:

 On Fri, May 16, 2014 at 9:05 AM, Pete Forsyth petefors...@gmail.com
 wrote:

  The point of I'm trying to make in this discussion is, we
 
 do a lot more good by focusing on what's working, and then expanding on
  that, than we do by getting all accusatory about the things that are
 *not*
  working.



 Think of a surgeon who's done thousands of successful routine operations.
 But every once in a while, he does a gastric bypass, and those patients
 more often than not end up harmed.

 It isn't appropriate in such a case to focus on what's working.



  (And such accusations often seem to be accompanied by an
  unjustified assumption that the bad somehow outweighs the good.)
 


 The question isn't whether the bad outweighs the good.


Andreas,

I agree with your more sophisticated concerns about what is going on.
However, I think it's really important to put them in context. If Wikimedia
Commons had existed in 1985, this would be a very compelling line of
criticism. But in 2014, the same kind of issues -- occasionally
encountering shockingly inappropriate images on occasion -- happens whether
you are using Wikimedia Commons, Google search, Flickr, Instagram, or any
number of other sites -- not to mention spam that arrives unbidden in your
email box. If there are studies that quantify how often this happens in
different contexts, I'm not aware of them (and would be very happy to learn
about them). Until we can look at that kind of study, I refuse to accept as
a premise that Commons is categorically worse than other broad collections
of media on the Internet.

This is a problem to be addressed, yes. And it is a problem that the
Commons community works to address (at least in incremental fashion) every
single day. But in my opinion, it is not a problem to panic about.

Pete
[[User:Peteforsyth]]
___
Wikimedia-l mailing list, guidelines at: 
https://meta.wikimedia.org/wiki/Mailing_lists/Guidelines
Wikimedia-l@lists.wikimedia.org
Unsubscribe: https://lists.wikimedia.org/mailman/listinfo/wikimedia-l, 
mailto:wikimedia-l-requ...@lists.wikimedia.org?subject=unsubscribe

Re: [Wikimedia-l] Commons' frontpage probably shouldn't prominently feature a decontextualised stack of corpses.

2014-05-16 Thread Craig Franklin
On 16 May 2014 15:09, Russavia russavia.wikipe...@gmail.com wrote:

 But of course, you, with a grand total of 303 edits on Commons going
 back to 2007 (most of which comprises of voting on Picture of the
 Year) are speaking from a position of experience when you say you
 understand Commons and its culture. So you'll excuse me, but it is a
 bit rich you saying that, and see your comments as insanely out of
 touch with the reality.


Out of curiosity, how many edits to Commons must one have for their opinion
to be valid?

Cheers,
Craig
___
Wikimedia-l mailing list, guidelines at: 
https://meta.wikimedia.org/wiki/Mailing_lists/Guidelines
Wikimedia-l@lists.wikimedia.org
Unsubscribe: https://lists.wikimedia.org/mailman/listinfo/wikimedia-l, 
mailto:wikimedia-l-requ...@lists.wikimedia.org?subject=unsubscribe

[Wikimedia-l] Wikimedia Search (it was Commons' frontpage...)

2014-05-16 Thread David Cuenca
On Fri, May 16, 2014 at 10:28 AM, Pete Forsyth petefors...@gmail.comwrote:

 A quick visit to stats.grok.se indicates that the Search feature of
  English Wikipedia was used 63 million times last month [...]


Long time ago I played around with the idea that our endeavor for free
knowledge would result in spin-off movements, some of which have already
happened. Like easing the access to current primary sources (Open Access),
guaranteeing the distribution of knowledge (Wikipedia Zero, movement for
net neutrality), and making sure that it can be found with global open
search tools (?).

This last part is mostly uncharted territory, and as the diversity of our
open knowledge repository expands, it seems that we can offer much more
than just wikipedia text search. Commercial search engines are moving in
the direction of search result mash-ups, mixing text , images, videos,
maps, data... so the user has an idea of which facets of the subject he or
she can explore further.
Even mash-ups generated from Wikimedia sites will never be able to compete
with web search engines, but it can become the primary choice when looking
for free knowledge, specially if synergies with other free knowledge
organizations were sought.

Perhaps there will be gaps, but I am convinced that with the time they will
become less. Maps were missing and OSM has covered that void, there were no
free scores, and then IMSLP appeared. Even for products there is Open
Product Data in the making. And regarding local business seems that
Wikivoyage and OSM have something to offer too, and who knows what will
come next.

I wonder what are your thoughts about the exciting topic of joining forces
with other organizations in the search front to become more than the sum of
the parts.

Micru
___
Wikimedia-l mailing list, guidelines at: 
https://meta.wikimedia.org/wiki/Mailing_lists/Guidelines
Wikimedia-l@lists.wikimedia.org
Unsubscribe: https://lists.wikimedia.org/mailman/listinfo/wikimedia-l, 
mailto:wikimedia-l-requ...@lists.wikimedia.org?subject=unsubscribe

Re: [Wikimedia-l] Commons' frontpage probably shouldn't prominently feature a decontextualised stack of corpses.

2014-05-16 Thread Andreas Kolbe
On Fri, May 16, 2014 at 9:46 AM, Pete Forsyth petefors...@gmail.com wrote:

 I agree with your more sophisticated concerns about what is going on.
 However, I think it's really important to put them in context. If Wikimedia
 Commons had existed in 1985, this would be a very compelling line of
 criticism. But in 2014, the same kind of issues -- occasionally
 encountering shockingly inappropriate images on occasion -- happens whether
 you are using Wikimedia Commons, Google search, Flickr, Instagram, or any
 number of other sites -- not to mention spam that arrives unbidden in your
 email box. If there are studies that quantify how often this happens in
 different contexts, I'm not aware of them (and would be very happy to learn
 about them). Until we can look at that kind of study, I refuse to accept as
 a premise that Commons is categorically worse than other broad collections
 of media on the Internet.



Commons is fundamentally different from Google, Flickr and other image
repositories in that it doesn't have safe search, neither as default nor as
an option.

If you enter human male, forefinger, Asian, Caucasian, or Black
as search terms in –


1.

Google Images

http://images.google.com/


2.

Flickr

https://www.flickr.com/


3.

Wikipedia Multimedia

https://en.wikipedia.org/w/index.php?title=Special:Searchsearch=fulltext=Searchprofile=images



– the results are strikingly different, with the Wikimedia image repository
the only one returning NSFW results (this applies even if you switch Google
Safe Search off).

You can philosophically debate, applaud or excoriate that fact, as many
have done, but it remains a fact.
___
Wikimedia-l mailing list, guidelines at: 
https://meta.wikimedia.org/wiki/Mailing_lists/Guidelines
Wikimedia-l@lists.wikimedia.org
Unsubscribe: https://lists.wikimedia.org/mailman/listinfo/wikimedia-l, 
mailto:wikimedia-l-requ...@lists.wikimedia.org?subject=unsubscribe

Re: [Wikimedia-l] Wikimedia Search (it was Commons' frontpage...)

2014-05-16 Thread Gerard Meijssen
Hoi,
We already have this in WD-Search.

Try it for instance on the Telugu or Tamil Wikipedia. Search for any
subject that is covered in a Wikidata supported Wiki. You can use any
script. When the label exists, it will find it for you.

Thanks to automated descriptions you will get to see what the item is
about. You will be able to find articles in a Wikipedia and a link to
Commons.

PS Your project can have this enabled by adding one line to common.js
Thanks,
 GerardM


On 16 May 2014 11:46, David Cuenca dacu...@gmail.com wrote:

 On Fri, May 16, 2014 at 10:28 AM, Pete Forsyth petefors...@gmail.com
 wrote:

  A quick visit to stats.grok.se indicates that the Search feature of
   English Wikipedia was used 63 million times last month [...]
 

 Long time ago I played around with the idea that our endeavor for free
 knowledge would result in spin-off movements, some of which have already
 happened. Like easing the access to current primary sources (Open Access),
 guaranteeing the distribution of knowledge (Wikipedia Zero, movement for
 net neutrality), and making sure that it can be found with global open
 search tools (?).

 This last part is mostly uncharted territory, and as the diversity of our
 open knowledge repository expands, it seems that we can offer much more
 than just wikipedia text search. Commercial search engines are moving in
 the direction of search result mash-ups, mixing text , images, videos,
 maps, data... so the user has an idea of which facets of the subject he or
 she can explore further.
 Even mash-ups generated from Wikimedia sites will never be able to compete
 with web search engines, but it can become the primary choice when looking
 for free knowledge, specially if synergies with other free knowledge
 organizations were sought.

 Perhaps there will be gaps, but I am convinced that with the time they will
 become less. Maps were missing and OSM has covered that void, there were no
 free scores, and then IMSLP appeared. Even for products there is Open
 Product Data in the making. And regarding local business seems that
 Wikivoyage and OSM have something to offer too, and who knows what will
 come next.

 I wonder what are your thoughts about the exciting topic of joining forces
 with other organizations in the search front to become more than the sum of
 the parts.

 Micru
 ___
 Wikimedia-l mailing list, guidelines at:
 https://meta.wikimedia.org/wiki/Mailing_lists/Guidelines
 Wikimedia-l@lists.wikimedia.org
 Unsubscribe: https://lists.wikimedia.org/mailman/listinfo/wikimedia-l,
 mailto:wikimedia-l-requ...@lists.wikimedia.org?subject=unsubscribe
___
Wikimedia-l mailing list, guidelines at: 
https://meta.wikimedia.org/wiki/Mailing_lists/Guidelines
Wikimedia-l@lists.wikimedia.org
Unsubscribe: https://lists.wikimedia.org/mailman/listinfo/wikimedia-l, 
mailto:wikimedia-l-requ...@lists.wikimedia.org?subject=unsubscribe

Re: [Wikimedia-l] Wikimedia Search (it was Commons' frontpage...)

2014-05-16 Thread David Cuenca
Gerard, I was referring to search integration with external projects.

Does WD-Search do that?


On Fri, May 16, 2014 at 12:29 PM, Gerard Meijssen gerard.meijs...@gmail.com
 wrote:

 Hoi,
 We already have this in WD-Search.

 Try it for instance on the Telugu or Tamil Wikipedia. Search for any
 subject that is covered in a Wikidata supported Wiki. You can use any
 script. When the label exists, it will find it for you.

 Thanks to automated descriptions you will get to see what the item is
 about. You will be able to find articles in a Wikipedia and a link to
 Commons.

 PS Your project can have this enabled by adding one line to common.js
 Thanks,
  GerardM


 On 16 May 2014 11:46, David Cuenca dacu...@gmail.com wrote:

  On Fri, May 16, 2014 at 10:28 AM, Pete Forsyth petefors...@gmail.com
  wrote:
 
   A quick visit to stats.grok.se indicates that the Search feature of
English Wikipedia was used 63 million times last month [...]
  
 
  Long time ago I played around with the idea that our endeavor for free
  knowledge would result in spin-off movements, some of which have already
  happened. Like easing the access to current primary sources (Open
 Access),
  guaranteeing the distribution of knowledge (Wikipedia Zero, movement for
  net neutrality), and making sure that it can be found with global open
  search tools (?).
 
  This last part is mostly uncharted territory, and as the diversity of our
  open knowledge repository expands, it seems that we can offer much more
  than just wikipedia text search. Commercial search engines are moving
 in
  the direction of search result mash-ups, mixing text , images, videos,
  maps, data... so the user has an idea of which facets of the subject he
 or
  she can explore further.
  Even mash-ups generated from Wikimedia sites will never be able to
 compete
  with web search engines, but it can become the primary choice when
 looking
  for free knowledge, specially if synergies with other free knowledge
  organizations were sought.
 
  Perhaps there will be gaps, but I am convinced that with the time they
 will
  become less. Maps were missing and OSM has covered that void, there were
 no
  free scores, and then IMSLP appeared. Even for products there is Open
  Product Data in the making. And regarding local business seems that
  Wikivoyage and OSM have something to offer too, and who knows what will
  come next.
 
  I wonder what are your thoughts about the exciting topic of joining
 forces
  with other organizations in the search front to become more than the sum
 of
  the parts.
 
  Micru
  ___
  Wikimedia-l mailing list, guidelines at:
  https://meta.wikimedia.org/wiki/Mailing_lists/Guidelines
  Wikimedia-l@lists.wikimedia.org
  Unsubscribe: https://lists.wikimedia.org/mailman/listinfo/wikimedia-l,
  mailto:wikimedia-l-requ...@lists.wikimedia.org?subject=unsubscribe
 ___
 Wikimedia-l mailing list, guidelines at:
 https://meta.wikimedia.org/wiki/Mailing_lists/Guidelines
 Wikimedia-l@lists.wikimedia.org
 Unsubscribe: https://lists.wikimedia.org/mailman/listinfo/wikimedia-l,
 mailto:wikimedia-l-requ...@lists.wikimedia.org?subject=unsubscribe




-- 
Etiamsi omnes, ego non
___
Wikimedia-l mailing list, guidelines at: 
https://meta.wikimedia.org/wiki/Mailing_lists/Guidelines
Wikimedia-l@lists.wikimedia.org
Unsubscribe: https://lists.wikimedia.org/mailman/listinfo/wikimedia-l, 
mailto:wikimedia-l-requ...@lists.wikimedia.org?subject=unsubscribe

Re: [Wikimedia-l] [Wikitech-l] Request for comments: How to deal with open datasets?

2014-05-16 Thread David Cuenca
Thanks for sharing your frustrations. I also notice that the word dataset
has many interpretations that we should delimit first its meaning, or at
least add some clarification. For instance dataset can be interpreted as
a:
- static data table: just a table containing some information in each
fields and that gets updated/versioned as a whole
- collections of files and their associated metadata: what the GLAMs release
- an object and its representations

My intention was that the RFC only refers to the first interpretation of
the word dataset, that is static data tables. Sorry if my use of the
word caused some misunderstanding, I will try to make it more clear in the
RFC.

The issues that you mention in your email are not related to the RFC scope,
however they deserve attention. I have identified the following subtopics:
1. Change a page of a djvu
2. Dirty metadata from GLAMs
3. Not reconizing the good job done by Commons
4. Files associated with a concept
5. Users not classifying their data in proper subcategories
6. Showing gaps in our coverage
7. Files as parts of a real object item

To keep this thread on-topic, I will address them in a separate email that
I will call it Dealing with GLAM data. I hope you don't mind.

Micru


On Fri, May 16, 2014 at 11:35 AM, Jane Darnell jane...@gmail.com wrote:

 David,

 I would strongly prefer a system that keeps the parts together, while
 at the same time, keeping all the parts separate and interchangeable.
 I hate that the .djvu files are blobs now, because if I find a better
 scan of an engraving from a book, I would like to replace the crappy
 scan that is in the .djvu file. I suppose you need to keep the version
 you uploaded, but you always want to present the best you have to the
 reader.

 I have looked at problems with datasets for a small GLAM, and have
 seen just how bad the data can be. I am mostly a web-surfer of
 poorly-designed GLAM datasets, which is why I have spent many hours
 thinking about these things. I have since given up trying to preach
 the evangelism of open data to GLAMs and started thinking more about
 what Wikipedia can do to curate the world's art. Many GLAMs are
 willing to share their data, but believe me when I say we may not want
 it. The backlog in batch uploads to Commons is not the technical
 upload queue, it's all the data massaging by hand that Wikipedians
 need to do beforehand. That work, which is done by Commons wizards,
 goes largely unrecognized today.

 Theoretically, a specific artwork is both a data item and a dataset.
 If you look at our artwork template on Commons you may have noticed
 how it has grown in the past 4 years and is fast becoming a fairly
 comprehensive standard dataset for certain items. The next step is to
 create a way to index these per object (yes we have categories - is
 that really the best we can do?).

 For popular artworks that are architectural features, Wiki Loves
 Monuments has harvested so many images of these from all different
 angles that you could probably make the case that Wikimedia Commons
 has more images than any other publication about that specific item.
 If you browse the various language versions and their representation
 of the object, you will notice that individual Wikipedians have
 selected different images, but these are rarely linked to each other
 and the casual Wikipedia reader has no idea that they can probably
 view the object in 3-D if they want to, or see a short movie about how
 it was made. Indeed, let's face it, most casual readers have only
 heard of Wikipedia and are completely unaware of Wikimedia Commons and
 have never heard of Wikimedia Commons categories.

 Take the case for the Sagrada Familia:
 https://commons.wikimedia.org/wiki/Category:Sagrada_Fam%C3%ADlia

 This category is augmented by a gallery page, with the helpful text
 The Sagrada Família is an unfinished church in the Catalan city of
 Barcelona, considered to be architect Antoni Gaudí's masterpiece. For
 images of the Holy Family (Jesus, Mary, and Joseph), see Category:Holy
 Family. :
 https://commons.wikimedia.org/wiki/Sagrada_Fam%C3%ADlia

 Is this really the best we can do? Has anyone ever stopped and counted
 the rate at which we accumulate photos of the Sagrada Familia each
 year? We don't want to deter people from uploading, because we are
 probably still missing important photos of various internal features.
 But how do we show the gaps in our coverage of this object, while
 presenting an encyclopedic view? The English Wikipedia page includes
 about 40 images with a link to the category, but no other hints for
 media navigation.

 This is just one example, there are many more. I would like to see a
 system by which the normal Wiki-collaboration process can be used to
 slowly integrate all of the Commons files into datasets per item, and
 then include these into datasets per city or artist or GLAM or
 whatever. I suppose it should be lists of categories, gallery pages,
 and templates, 

[Wikimedia-l] Dealing with GLAM data (it was: How to deal with open datasets?)

2014-05-16 Thread David Cuenca
Hi Jane,

As a continuation of the other thread:

1. Change a page of a djvu

The usual procedure is to download the file, perform changes with tools
like Djvu Solo, and overwrite the previous file with the new one adding a
description of the cannges (Upload a new version of this file). This
doesn't delete the old file, it is still accessible, and it can be restored
any time.

2. Dirty metadata from GLAMs

That is a problem I heard many times and there is no easy solution, however
there are better tools now that some years ago. Have you heard of
OpenRefine?
https://code.google.com/p/google-refine/

Commons needs something like that, but to annotate metadata with Wikidata
concepts. Maybe you could write a description on what is needed on the
IdeaLab?https://meta.wikimedia.org/wiki/Grants:IdeaLab

3. Not recognizing the good job done by Commons

This is something not addressed in current designs of the Media Viewer. You
can only thank the person who uploaded the file, but not the one who
curated the metadata, added categories, etc. If you have any idea about how
to send good karma to these users, please do share them in the MediaViewer
feedback page:
https://www.mediawiki.org/wiki/Talk:Multimedia/About_Media_Viewer

I also think that many Wikipedians have the mindset that since English
Wikipedia is the biggest project, all other projects should be subordinated
to their wishes. That creates some tensions (as you probably have seen the
last days). There is no easy solution to this en-wp centric mentality, I
just wish that more shared international projects (like Wikidata) will add
perspective and a better understanding in the long run.

4. Files associated with a concept

Hopefully this will be addressed by new tools like Wikidata for media info
https://commons.wikimedia.org/wiki/Commons:Wikidata_for_media_info

5. Users not classifying their data in proper subcategories

It is hard to educate casual users, I guess you could propose a new
notification when a file gets re-categorized. That way users will learn
which category would have been better.

6. Showing gaps in our coverage

Again this depends on wikidata for commons. When that tool is in place
then it will be possible to create concept trees and signal which branch
of the tree doesn't have any file. It is doable, but not trivial for now.

7. Files as parts of a real object item

We are going in that direction, however it is a long ride... again the
Wikidata for media info is part of the solution. We'll see...

Cheers,
Micru

On Fri, May 16, 2014 at 11:35 AM, Jane Darnell jane...@gmail.com wrote:

 David,

 I would strongly prefer a system that keeps the parts together, while
 at the same time, keeping all the parts separate and interchangeable.
 I hate that the .djvu files are blobs now, because if I find a better
 scan of an engraving from a book, I would like to replace the crappy
 scan that is in the .djvu file. I suppose you need to keep the version
 you uploaded, but you always want to present the best you have to the
 reader.

 I have looked at problems with datasets for a small GLAM, and have
 seen just how bad the data can be. I am mostly a web-surfer of
 poorly-designed GLAM datasets, which is why I have spent many hours
 thinking about these things. I have since given up trying to preach
 the evangelism of open data to GLAMs and started thinking more about
 what Wikipedia can do to curate the world's art. Many GLAMs are
 willing to share their data, but believe me when I say we may not want
 it. The backlog in batch uploads to Commons is not the technical
 upload queue, it's all the data massaging by hand that Wikipedians
 need to do beforehand. That work, which is done by Commons wizards,
 goes largely unrecognized today.

 Theoretically, a specific artwork is both a data item and a dataset.
 If you look at our artwork template on Commons you may have noticed
 how it has grown in the past 4 years and is fast becoming a fairly
 comprehensive standard dataset for certain items. The next step is to
 create a way to index these per object (yes we have categories - is
 that really the best we can do?).

 For popular artworks that are architectural features, Wiki Loves
 Monuments has harvested so many images of these from all different
 angles that you could probably make the case that Wikimedia Commons
 has more images than any other publication about that specific item.
 If you browse the various language versions and their representation
 of the object, you will notice that individual Wikipedians have
 selected different images, but these are rarely linked to each other
 and the casual Wikipedia reader has no idea that they can probably
 view the object in 3-D if they want to, or see a short movie about how
 it was made. Indeed, let's face it, most casual readers have only
 heard of Wikipedia and are completely unaware of Wikimedia Commons and
 have never heard of Wikimedia Commons categories.

 Take the case for the Sagrada Familia:
 

Re: [Wikimedia-l] Dealing with GLAM data (it was: How to deal with open datasets?)

2014-05-16 Thread Jane Darnell
David,
Thanks for your thoughtful reply. Yes I think if I sat down I could
probably type up about 20 things to go into the Idea Lab, but it's
quite a small group of people that ever reads anything in there, and
most of the readers are just quickly checking other ideas to make sure
they have filled in their own idea correctly.

One problem is that there is very little human interaction at all on
Commons. I was personally very sad to see this user [1] leave the
project, but I can't help but agree with him/her. Sometimes I wonder
what I am doing there myself! I just checked, and I have only left
messages on 21 other Commons user pages since 2006. Of those 21 users,
I have personally met only 4 of them. I can safely say I do not use
Commons as a social media website.

On the subject of dirty metadata from GLAMS, I am afraid that we
concentrate too much on metadata from GLAMs and I would like to see
some project work on our own metadata. Lots of it is dirty, but there
are some featured items in there that can be used to clean lots of the
dirty stuff to a higher level of quality and we should do that. I am
worried though when I read things from experienced Wikipedians along
the lines of getting rid of the creator and institution templates, or
even categories. [2] The whole power of Wikipedia lies in the low
threshold of technical knowledge necessary for participation. We need
to keep the entrance gate as wide open as possible. I see it as a sign
of failure on the part of Commons that I rarely use the newimproved
default upload wizard because it almost never fits my needs. I find it
annoying that I need to use my own link to use the old uploader. When
searching for new solutions, let's not throw away what we have, but
try to incorporate innovative approaches on top of the technical
landscape we already created.

I don't expect newbies to understand and use categories, but it should
be easier to identify and fix uncategorized items, and we need to
lower thresholds of participation. I think your suggestion of
uploading a new version of a djvu file shows an example of a high
threshold (the participant must have djvu software to be able to
update the file). In the case of the Sagrada Familia, there is a lot
more necessary to consolidate our knowledge about it than just media
files and text. We need improved search methods and associated
navigational tooling, as well as a way to aggregate the number of
projects involved in a quick overview. It's more useful if you can see
that more than 100 projects carry information about it, than if it was
in just 1 or two projects. Even Google just references the article in
your default language. I would also like to see redirects and
disambiguation pages exported to a central place, so search will
improve across language boundaries. I've noticed that it's often
spelling differences that keeps Google search better than Wikipedia at
locating Wikipedia stuff.

As far as the English-centric vision goes, I don't feel one way or
another about it. English is as good a lingua franca as any other,
as long as it's one basic language. I have said before I am cool with
this being Chinese, as long as it's consistent. My main problem with
Wikidata is the spotty coverage of labels, and if German was the
default language and was completely filled in for all data item
labels, I would be totally happy to put German in my Google settings
from now on. I get frustrated when people get into arguments about the
database-ification of Wikipedia, as if that is a bad thing. It is
unrealistic to think we can make everything machine-readable, just as
it is unrealistic to think we can keep the reading machines outside
the gates. We need to satisfy both our power users and our power
editors, and the only way to do that is to keep low-thresholds of
participation for both groups.  That low threshold of participation is
first and foremost for our casual mobile readers who become
Wiki(p/m)edians when they upload one thing and go away for ever. We
need them too, and though I don't want to force them into learning
about our category structures, I do want to serve them a few tips so
they can leave the default uploader and take a look around the
landscape. The main thing is, we need to share more about where we are
going as we get there, so users don't leave because they get tired of
waiting.

[1] 
https://commons.wikimedia.org/w/index.php?title=User:Boo-Boo_Baroooldid=90832110
[2] https://commons.wikimedia.org/wiki/Commons_talk:Wikidata_for_media_info

Jane

2014-05-16 14:20 GMT+02:00, David Cuenca dacu...@gmail.com:
 Hi Jane,

 As a continuation of the other thread:

 1. Change a page of a djvu

 The usual procedure is to download the file, perform changes with tools
 like Djvu Solo, and overwrite the previous file with the new one adding a
 description of the cannges (Upload a new version of this file). This
 doesn't delete the old file, it is still accessible, and it can be restored
 any time.

 2. Dirty metadata 

Re: [Wikimedia-l] Wikimedia Search (it was Commons' frontpage...)

2014-05-16 Thread Federico Leva (Nemo)

David Cuenca, 16/05/2014 12:55:

Gerard, I was referring to search integration with external projects.

Does WD-Search do that?


No, but it's an example of what can be already be done with API calls.
http://magnusmanske.de/wordpress/?p=108

Potentially even better, interwiki search was restored: 
https://bugzilla.wikimedia.org/show_bug.cgi?id=44420
You can try it on Italian language projects: 
https://it.wikiquote.org/w/index.php?title=Speciale%3ARicercasearch=gattofulltext=Ricerca


Portals like wikiwix.com and http://search.creativecommons.org have 
existed for a long time, but soon they'll be able to get much more from 
Wikimedia projects (after some more CirrusSearch deploys and Wikidata 
data imports).


Nemo

___
Wikimedia-l mailing list, guidelines at: 
https://meta.wikimedia.org/wiki/Mailing_lists/Guidelines
Wikimedia-l@lists.wikimedia.org
Unsubscribe: https://lists.wikimedia.org/mailman/listinfo/wikimedia-l, 
mailto:wikimedia-l-requ...@lists.wikimedia.org?subject=unsubscribe

Re: [Wikimedia-l] Commons' frontpage probably shouldn't prominently feature a decontextualised stack of corpses.

2014-05-16 Thread Pete Forsyth
On Fri, May 16, 2014 at 2:54 AM, Andreas Kolbe jayen...@gmail.com wrote:

 Commons is fundamentally different from Google, Flickr and other image
 repositories in that it doesn't have safe search, neither as default nor as
 an option.


Have you never had Safe Search features fail? It seems to happen regularly
for me.

Overall though -- I don't disagree with you, this is stuff that should be
fixed. But as Erik pointed out, the fix is not obvious.

The thing that bothers me is when people (especially movement leaders)
falsely accuse entire communities of standing in the way of progress.

Pete
[[User:Peteforsyth]]
___
Wikimedia-l mailing list, guidelines at: 
https://meta.wikimedia.org/wiki/Mailing_lists/Guidelines
Wikimedia-l@lists.wikimedia.org
Unsubscribe: https://lists.wikimedia.org/mailman/listinfo/wikimedia-l, 
mailto:wikimedia-l-requ...@lists.wikimedia.org?subject=unsubscribe

Re: [Wikimedia-l] How to Criticize with Kindness

2014-05-16 Thread Romaine Wiki
Hello all,

I had a discussion with Fabrice about how a culture of Kindness and Fabrice 
also made a submission for Wikimania about it: 
https://wikimania2014.wikimedia.org/wiki/Submissions/A_Culture_of_Kindness

In the past years I notice very much how easy discussions can go in the wrong 
direction as all the facial expressions and intonation are lost when users 
write a message on a talk page. Many many times this goes wrong, and users have 
a different interpretation of what someone else said what causes a fight on the 
wiki. If users are smart they find out that in fact the difference between them 
is very small with (usually) only a very slight difference in focus, but in 
general they agree with each other, but they don't realize that on the moment 
of the discussion. (If users with good will aren't that smart to discover that, 
such can grow out to a fighting situation for many years.) If I estimate I 
would say at least 50% of all troubled discussions are causes by 
miscommunication as the result of words being read differently as result of 
missing facial expressions and intonation what most people are used to have in 
the communication with people around them.

If certain users are deaf, autistic or dyslectic, or have such background, this 
is even worsened. 

For some years I say that if I can follow a training to improve textual 
communication to better understand how things are perceived, I really like to 
follow such training. As I don't know of any, I started to figure out and 
collect what communication mistakes are made what cause troubles between users 
with the intention of creating a guide for users, to let them understand why 
some communication gives worse results.

Romaine

___
Wikimedia-l mailing list, guidelines at: 
https://meta.wikimedia.org/wiki/Mailing_lists/Guidelines
Wikimedia-l@lists.wikimedia.org
Unsubscribe: https://lists.wikimedia.org/mailman/listinfo/wikimedia-l, 
mailto:wikimedia-l-requ...@lists.wikimedia.org?subject=unsubscribe

[Wikimedia-l] Commons vs. local media search

2014-05-16 Thread Erik Moeller
On Fri, May 16, 2014 at 1:28 AM, Pete Forsyth petefors...@gmail.com wrote:

 I think it is much more likely that a Wikipedia reader would expect to find
 those images *used in Wikipedia articles* than a massive collection of
 stuff that is somehow tangentially related to Wikipedia in a way that they
 don't fully understand.

 So why on earth does the main multimedia search link on Wikipedia
 automatically return unused results from Commons to begin with? Is that
 really the right way to go?

I'm breaking out this question since it's a concrete technical
proposal; it should probably also be raised on the multimedia list.
But we should answer it from the perspective What's best for the
user, rather than have it be driven solely by the NSFW corner case
(which may also appear when searching images used on a project like
en.wp alone).

As a user, I might want to find images to add to an article. Having
results from the central repository presented locally makes it easier
to do so without visiting a separate site. (Consider this from the
perspective of smaller projects especially, where the local search
would be pretty much useless.) This is why VisualEditor presents
Commons search results, as well.

As a user, I might be interested in multimedia about a certain topic I
just read about in Wikipedia. Showing only the results already in the
Wikipedia article(s) about the topic would make it harder to find such
media. Simple example: Let's say I read an article about a city, and I
want to find other historic maps of that city. In many cases, these
maps do exist on Commons but not locally. Should we therefore force
people to visit Commons to find them?

I would argue that from the ordinary user's perspective, the
distinction between Wikipedia/Commons is less important than what they
have in common, i.e. being large repositories of useful educational
content (and hyperbole aside, 99% of Commons is pretty boring stuff).

We could default to displaying locally used media and offer to search
Commons with an extra click. From a usability perspective, you want to
minimize the steps a user has to take, so good UX design would likely
disclose results from Commons either a) always, clearly labeled or b)
when no local results are available.

There's no question that search UX, both on Commons and on Wikipedia,
can be improved. I'm just skeptical that an unbiased evaluation of the
user experience using standard UX heuristics would lead to a design
that hides explicit content from initial search results. Distinguish
different types of content more clearly and make it easier to find the
stuff you want - sure, that's doable.

Erik
-- 
Erik Möller
VP of Engineering and Product Development, Wikimedia Foundation

___
Wikimedia-l mailing list, guidelines at: 
https://meta.wikimedia.org/wiki/Mailing_lists/Guidelines
Wikimedia-l@lists.wikimedia.org
Unsubscribe: https://lists.wikimedia.org/mailman/listinfo/wikimedia-l, 
mailto:wikimedia-l-requ...@lists.wikimedia.org?subject=unsubscribe

Re: [Wikimedia-l] Commons vs. local media search

2014-05-16 Thread Nikolas Everett
On Fri, May 16, 2014 at 1:25 PM, Erik Moeller e...@wikimedia.org wrote:

 On Fri, May 16, 2014 at 1:28 AM, Pete Forsyth petefors...@gmail.com
 wrote:

  I think it is much more likely that a Wikipedia reader would expect to
 find
  those images *used in Wikipedia articles* than a massive collection of
  stuff that is somehow tangentially related to Wikipedia in a way that
 they
  don't fully understand.
 
  So why on earth does the main multimedia search link on Wikipedia
  automatically return unused results from Commons to begin with? Is that
  really the right way to go?

 I'm breaking out this question since it's a concrete technical
 proposal; it should probably also be raised on the multimedia list.
 But we should answer it from the perspective What's best for the
 user, rather than have it be driven solely by the NSFW corner case
 (which may also appear when searching images used on a project like
 en.wp alone).

 As a user, I might want to find images to add to an article. Having
 results from the central repository presented locally makes it easier
 to do so without visiting a separate site. (Consider this from the
 perspective of smaller projects especially, where the local search
 would be pretty much useless.) This is why VisualEditor presents
 Commons search results, as well.

 As a user, I might be interested in multimedia about a certain topic I
 just read about in Wikipedia. Showing only the results already in the
 Wikipedia article(s) about the topic would make it harder to find such
 media. Simple example: Let's say I read an article about a city, and I
 want to find other historic maps of that city. In many cases, these
 maps do exist on Commons but not locally. Should we therefore force
 people to visit Commons to find them?

 I would argue that from the ordinary user's perspective, the
 distinction between Wikipedia/Commons is less important than what they
 have in common, i.e. being large repositories of useful educational
 content (and hyperbole aside, 99% of Commons is pretty boring stuff).

 We could default to displaying locally used media and offer to search
 Commons with an extra click. From a usability perspective, you want to
 minimize the steps a user has to take, so good UX design would likely
 disclose results from Commons either a) always, clearly labeled or b)
 when no local results are available.

 There's no question that search UX, both on Commons and on Wikipedia,
 can be improved. I'm just skeptical that an unbiased evaluation of the
 user experience using standard UX heuristics would lead to a design
 that hides explicit content from initial search results. Distinguish
 different types of content more clearly and make it easier to find the
 stuff you want - sure, that's doable.


With Cirrus (the New Seach beta feature) both marking stuff from commons
as from commons and adding a tick box to remove stuff from commons would
also be possible.  Marking would be easier but both wouldn't be too hard.
We could even artificially push results that are on the local wiki higher
then those on commons.

I know its a horrible excuse, but we're spinning up a project to rework the
search page's user interface.  It hasn't something like that since tables
in table in tables was normal.  We've been talking about this.  The project
is still in the those a pretty mock ups stage and it has been moving
slowly.

Nik
___
Wikimedia-l mailing list, guidelines at: 
https://meta.wikimedia.org/wiki/Mailing_lists/Guidelines
Wikimedia-l@lists.wikimedia.org
Unsubscribe: https://lists.wikimedia.org/mailman/listinfo/wikimedia-l, 
mailto:wikimedia-l-requ...@lists.wikimedia.org?subject=unsubscribe

Re: [Wikimedia-l] Commons vs. local media search

2014-05-16 Thread Pete Forsyth
On Fri, May 16, 2014 at 10:25 AM, Erik Moeller e...@wikimedia.org wrote:

 On Fri, May 16, 2014 at 1:28 AM, Pete Forsyth petefors...@gmail.com
 wrote:

  I think it is much more likely that a Wikipedia reader would expect to
 find
  those images *used in Wikipedia articles* than a massive collection of
  stuff that is somehow tangentially related to Wikipedia in a way that
 they
  don't fully understand.
 
  So why on earth does the main multimedia search link on Wikipedia
  automatically return unused results from Commons to begin with? Is that
  really the right way to go?

 I'm breaking out this question since it's a concrete technical
 proposal; it should probably also be raised on the multimedia list.
 But we should answer it from the perspective What's best for the
 user, rather than have it be driven solely by the NSFW corner case
 (which may also appear when searching images used on a project like
 en.wp alone).

 As a user, I might want to find images to add to an article. Having
 results from the central repository presented locally makes it easier
 to do so without visiting a separate site. (Consider this from the
 perspective of smaller projects especially, where the local search
 would be pretty much useless.) This is why VisualEditor presents
 Commons search results, as well.

 As a user, I might be interested in multimedia about a certain topic I
 just read about in Wikipedia. Showing only the results already in the
 Wikipedia article(s) about the topic would make it harder to find such
 media. Simple example: Let's say I read an article about a city, and I
 want to find other historic maps of that city. In many cases, these
 maps do exist on Commons but not locally. Should we therefore force
 people to visit Commons to find them?

 I would argue that from the ordinary user's perspective, the
 distinction between Wikipedia/Commons is less important than what they
 have in common, i.e. being large repositories of useful educational
 content (and hyperbole aside, 99% of Commons is pretty boring stuff).

 We could default to displaying locally used media and offer to search
 Commons with an extra click. From a usability perspective, you want to
 minimize the steps a user has to take, so good UX design would likely
 disclose results from Commons either a) always, clearly labeled or b)
 when no local results are available.

 There's no question that search UX, both on Commons and on Wikipedia,
 can be improved. I'm just skeptical that an unbiased evaluation of the
 user experience using standard UX heuristics would lead to a design
 that hides explicit content from initial search results. Distinguish
 different types of content more clearly and make it easier to find the
 stuff you want - sure, that's doable.

 Erik


Thanks Erik, this all sounds like a very reasonable and welcome approach. I
certainly don't think my proposal is complete, and as I indicated I don't
have the capability to do the required user research, but would hope this
idea could be a useful prompt for charting of a path forward.

Pete
[[User:Peteforsyth]]
___
Wikimedia-l mailing list, guidelines at: 
https://meta.wikimedia.org/wiki/Mailing_lists/Guidelines
Wikimedia-l@lists.wikimedia.org
Unsubscribe: https://lists.wikimedia.org/mailman/listinfo/wikimedia-l, 
mailto:wikimedia-l-requ...@lists.wikimedia.org?subject=unsubscribe

Re: [Wikimedia-l] Wikimedia Search (it was Commons' frontpage...)

2014-05-16 Thread David Cuenca
On Fri, May 16, 2014 at 4:23 PM, Federico Leva (Nemo) nemow...@gmail.comwrote:

 Portals like wikiwix.com and http://search.creativecommons.org have
 existed for a long time, but soon they'll be able to get much more from
 Wikimedia projects (after some more CirrusSearch deploys and Wikidata data
 imports).


Wikiwix doesn't combine results from other non-wikimedia sites. And the CC
portal just directs to each different search engine.
Those are really poor search solutions when looking for results from
multiple free knowledge sites.

Thanks anyhow for pointing me to them, I didn't know about their existence
before.

Micru
___
Wikimedia-l mailing list, guidelines at: 
https://meta.wikimedia.org/wiki/Mailing_lists/Guidelines
Wikimedia-l@lists.wikimedia.org
Unsubscribe: https://lists.wikimedia.org/mailman/listinfo/wikimedia-l, 
mailto:wikimedia-l-requ...@lists.wikimedia.org?subject=unsubscribe

Re: [Wikimedia-l] Commons' frontpage probably shouldn't prominently feature a decontextualised stack of corpses.

2014-05-16 Thread Andreas Kolbe
On Fri, May 16, 2014 at 9:43 AM, Russavia russavia.wikipe...@gmail.comwrote:

 Kevin,

 Feel free to have one of the people who don't have a nasty head injury
 ask me the question. That would be fine, and I would actually prefer
 it. Given your head injury, I'm actually a little surprised that your
 friends did think of asking me themselves under the circumstances.

 Cheers

 Russavia



Cutting to the chase, bearing in mind the location and other visual cues, I
personally would also assume that the description was indeed apt. In other
words, if I saw those women standing there, I'd assume they were
prostitutes too.

However, assumptions can be wrong. It would be wise for Commons to err on
the side of caution, and not label potentially identifiable women as
prostitutes on the basis of an unknown individual's upload to Commons.

This is a good example:

https://commons.wikimedia.org/wiki/File:Street_prostitute_EP_Blvd_02_Memphis_TN.jpg

She might well be a prostitute. She might also (for example) just have had
a tiff with her ex-boyfriend, who snapped this picture. To be wrong in one
out of a hundred cases like that is one time too many.

In topic areas like that, I'd be far more comfortable relying on an image
from a verifiable source like the one you mentioned in the deletion
discussion:

https://commons.wikimedia.org/wiki/File:9.000919_Pattaya_streetscene5.jpg
___
Wikimedia-l mailing list, guidelines at: 
https://meta.wikimedia.org/wiki/Mailing_lists/Guidelines
Wikimedia-l@lists.wikimedia.org
Unsubscribe: https://lists.wikimedia.org/mailman/listinfo/wikimedia-l, 
mailto:wikimedia-l-requ...@lists.wikimedia.org?subject=unsubscribe

Re: [Wikimedia-l] Wikimedia Search (it was Commons' frontpage...)

2014-05-16 Thread Federico Leva (Nemo)

David Cuenca, 16/05/2014 20:41:

Wikiwix doesn't combine results from other non-wikimedia sites. And the CC
portal just directs to each different search engine.
Those are really poor search solutions when looking for results from
multiple free knowledge sites.

Thanks anyhow for pointing me to them, I didn't know about their existence
before.


Sure, I mentioned them as two long-time loosely related partners of ours 
who could be (in theory) contacted for an idea like yours, now that 
there are more options. No idea who else could be interested: the last 
Ubuntu, and Firefox since forever, have their own search portals, but 
with their own priorities.


Otherwise, we can just wait for the miracles of APIs and free licenses 
to happen, at most with some prodding, unrelatedly from any effort of 
ours. For instance, Yandex seems to have some interest in our stuff, 
maybe they'll do something surprising at some point. :)


Nemo

___
Wikimedia-l mailing list, guidelines at: 
https://meta.wikimedia.org/wiki/Mailing_lists/Guidelines
Wikimedia-l@lists.wikimedia.org
Unsubscribe: https://lists.wikimedia.org/mailman/listinfo/wikimedia-l, 
mailto:wikimedia-l-requ...@lists.wikimedia.org?subject=unsubscribe

Re: [Wikimedia-l] Commons' frontpage probably shouldn't prominently feature a decontextualised stack of corpses.

2014-05-16 Thread Erik Moeller
On Thu, May 15, 2014 at 10:58 PM, Erik Moeller e...@wikimedia.org wrote:
 Capabilities that exist today with the new search include
 template-based boosting of results, a feature that's already enabled
 on Commons and which will boost quality content in search results:
 https://commons.wikimedia.org/w/index.php?title=MediaWiki:Cirrussearch-boost-templatesaction=edit

For the record, negative boosting is possible as well. So if folks
wanted to add {{NSFW}} to media files that should appear lower in
search results and then apply a 100% boost to that template, that
would put those results further down. Of course that would likely have
unintended consequences, and also take us down the familiar road of
having to figure out what to label / not label.

-- 
Erik Möller
VP of Engineering and Product Development, Wikimedia Foundation

___
Wikimedia-l mailing list, guidelines at: 
https://meta.wikimedia.org/wiki/Mailing_lists/Guidelines
Wikimedia-l@lists.wikimedia.org
Unsubscribe: https://lists.wikimedia.org/mailman/listinfo/wikimedia-l, 
mailto:wikimedia-l-requ...@lists.wikimedia.org?subject=unsubscribe