Re: [Wikimedia-l] [Commons-l] Data mining for media archives

2014-02-06 Thread Fæ
On 7 February 2014 04:04, Samuel Klein wrote: > That's just beautiful. Thank you, Fae & Faebot. > > I see that job filtered for mobile uploads without EXIF data. > What obstacles do you envision for running such a service for all images? >> https://commons.m.wikimedia.org/wiki/User:Faebot/Sandbox

Re: [Wikimedia-l] [Commons-l] Data mining for media archives

2014-02-06 Thread Samuel Klein
That's just beautiful. Thank you, Fae & Faebot. I see that job filtered for mobile uploads without EXIF data. What obstacles do you envision for running such a service for all images? On Thu, Feb 6, 2014 at 7:59 PM, Fæ wrote: > On 6 Feb 2014 22:40, "Samuel Klein" wrote: > ... >> Are we doing a

Re: [Wikimedia-l] [Commons-l] Data mining for media archives

2014-02-06 Thread Fæ
On 6 Feb 2014 22:40, "Samuel Klein" wrote: ... > Are we doing any commons analysis like this at the moment? > Is any similarity-analysis done on upload to help uploaders identify > copies of the same image that already exist online? Or to flag > potential copyvios for reviewers Yes O:-) Check out

Re: [Wikimedia-l] [Textbook-l] Textbooks Which Borrow Heavily from Wikipedia

2014-02-06 Thread Federico Leva (Nemo)
Samuel Klein, 06/02/2014 22:41: How could they improve attribution? What Phoebe said. A link to each history page *might* be enough but, especially if they're ebooks, a full list of names costs little (even though it can be ugly). What download formats or APIs would we like to see to enabl

[Wikimedia-l] Data mining for media archives

2014-02-06 Thread Samuel Klein
John Resig has just published some excellent data analysis combining TinEye, image archives, and image clustering and deduplication to identify identical and similar images across a large corpus. http://ejohn.org/research/computer-vision-photo-archives/ Are we doing any commons analysis like this

Re: [Wikimedia-l] Textbooks Which Borrow Heavily from Wikipedia

2014-02-06 Thread phoebe ayers
On Thu, Feb 6, 2014 at 1:41 PM, Samuel Klein wrote: > I'm meeting with the Boundless team tomorrow. > > How could they improve attribution? Looking at the g-book that James linked (and without paying for a download etc) I don't see any particular attribution at all in the book itself. The insid

Re: [Wikimedia-l] Textbooks Which Borrow Heavily from Wikipedia

2014-02-06 Thread David Gerard
On 6 February 2014 21:41, Samuel Klein wrote: > I'm meeting with the Boundless team tomorrow. Excellent! > How could they improve attribution? > What download formats or APIs would we like to see to enable reposting > to Wikibooks, or better cross-platform collaboration? Yeah, this is it. O

Re: [Wikimedia-l] Textbooks Which Borrow Heavily from Wikipedia

2014-02-06 Thread Samuel Klein
I'm meeting with the Boundless team tomorrow. How could they improve attribution? What download formats or APIs would we like to see to enable reposting to Wikibooks, or better cross-platform collaboration? Is anyone on wikibooks currently working on importing such materials, in Tamil or English

Re: [Wikimedia-l] Botopedia?

2014-02-06 Thread Anders Wennersten
Thanks Sam, your answer warms my soul! And you summarize my key points excellently (and more clearly than I managed myself). @Gerard: Our visions are very close and I support yours in general. On a more concrete level it seems we have some different views; it could be misunderstandings from my side,

Re: [Wikimedia-l] Botopedia?

2014-02-06 Thread Samuel Klein
@Anders: I seem to have unintentionally derailed your excellent thread. My apologies; I've taken responses to that subthread offline. To return to your main point: we do need 'A strategy for semi-automated article generation; and inclusion of Wikidata'. Anders Wennersten writes: < [we] will no

Re: [Wikimedia-l] Invitation to WMF January 2014 Metrics & Activities Meeting: Thursday, February 6, 19:00 UTC

2014-02-06 Thread Praveena Maharaj
REMINDER: This meeting starts in 30 minutes. On Thu, Jan 30, 2014 at 3:29 PM, Praveena Maharaj wrote: > Dear all, > > The next WMF metrics and activities meeting will take place on Thursday, > February 6, 2014 at 7:00 PM UTC (11 AM PST). The IRC channel is > #wikimedia-office on irc.freenode.net

[Wikimedia-l] Language Engineering IRC Office Hour on February 12, 2014 (Wednesday) at 1700 UTC

2014-02-06 Thread Runa Bhattacharjee
[x-posted] Hello, The Wikimedia Language Engineering team will be hosting the monthly IRC office hour on February 12, 2014 (Wednesday) at 1700 UTC / 0900 PST on #wikimedia-office. This time we will be talking about the recent changes made to the Universal Language Selector (ULS) - the MediaWiki