Re: [Foundation-l] Wikipedia meets git

2009-10-23 Thread Milos Rancic
For a couple of years I have an idea with another approach: Use MediaWiki as a software repository. Actually, I am already doing so on my local MediaWiki installation. I even had some python scripts (based on pywikipediabot) which are dealing with importing and exporting source code (but I don't kn

Re: [Foundation-l] Wikipedia meets git

2009-10-23 Thread Domas Mituzas
> A distributed backend is a nice idea anyway - imagine a meteor hitting > the Florida data centres ... give me that stuff you all just had, I want it too. Domas ___ foundation-l mailing list foundation-l@lists.wikimedia.org Unsubscribe: https://lists.

Re: [Foundation-l] Wikipedia meets git

2009-10-21 Thread jamesmikedup...@googlemail.com
Ok, I have started a google group called mediawiki-vcs http://groups.google.com/group/mediawiki-vcs We should just move the discussion there. Additionaly, I did not name it git, but vcs, for the reason that we should support multiple backends via a plugin. I am interested in using

Re: [Foundation-l] Wikipedia meets git

2009-10-21 Thread David Gerard
2009/10/21 jamesmikedup...@googlemail.com : > most people are working on very small subsets of the data. Very few > people will want to have all the data, think about getting all the > versions from all the git repos, it would be the same. > My idea is for smaller chapters who want to get started

Re: [Foundation-l] Wikipedia meets git

2009-10-21 Thread jamesmikedup...@googlemail.com
Wow, I am impressed. Let me remind you of one thing, most people are working on very small subsets of the data. Very few people will want to have all the data, think about getting all the versions from all the git repos, it would be the same. My idea is for smaller chapters who want to get started

Re: [Foundation-l] Wikipedia meets git

2009-10-21 Thread jamesmikedup...@googlemail.com
On Mon, Oct 19, 2009 at 5:30 PM, Joshua Gay wrote: > Since there are other people out there, perhaps we can start a mediawiki-git > discussion list and/or wiki discussion page? I'd love to post the work I'm > doing, too as it starts to come together. Sounds great Josh, lets collaborate. We can se

Re: [Foundation-l] Wikipedia meets git

2009-10-21 Thread Samuel Klein
That sounds like a great idea. I know a few other people who have worked on git-based wikis and toyed with making them compatible with mediawiki (copying bernie innocenti, one of the most eloquent :). SJ On Mon, Oct 19, 2009 at 11:30 AM, Joshua Gay wrote: > > I will apply for an account when I

Re: [Foundation-l] Wikipedia meets git

2009-10-19 Thread Joshua Gay
> I will apply for an account when It is ready for integration. > > this is still in experimentation mode. > The git replaces the mysql database. > > But there is alot more work to do to make this viable. > > thanks for all your encouragement and support. > > Since there are other people out there,

Re: [Foundation-l] Wikipedia meets git

2009-10-17 Thread Luca de Alfaro
Whoops, sorry, due to a glitch of a DNS setting, the demo on pt.wikipedia.org will be up later today or tomorrow. The demo on it.wikipedia.org is up now. Luca On Sat, Oct 17, 2009 at 4:48 PM, Luca de Alfaro wrote: > Dear James, > > you are absolutely right that we were lacking demos: we worked

Re: [Foundation-l] Wikipedia meets git

2009-10-17 Thread Luca de Alfaro
Dear James, you are absolutely right that we were lacking demos: we worked flat out to produce some, and if you visit http://wikitrust.soe.ucsc.edu/ , you can see that there are now a couple of Wikipedias on which you can try this. We wrote our own text analysis engine. The reason is that the ty

Re: [Foundation-l] Wikipedia meets git

2009-10-17 Thread Luca de Alfaro
I am very sorry. We needed to reconfigure a server, so we moved out the WikiTrust home page, and we put it on google sites, so we could redo the server configuration. There is a CNAME, but if you are caching the old name, the DNS change may not have propagated to you. In that case, please go to ht

Re: [Foundation-l] Wikipedia meets git

2009-10-17 Thread Luca de Alfaro
Yes, I just sent a message to the quality mailing list. Install the WikiTrust add-on, and visit it.wikipedia.org for instance. pt.wikipedia.org should also work. Please see the message I just sent to the Quality mailing list for more information. Luca On Fri, Oct 16, 2009 at 4:40 AM, jamesmikedu

Re: [Foundation-l] Wikipedia meets git

2009-10-17 Thread jamesmikedup...@googlemail.com
Thanks, I will apply for an account when It is ready for integration. this is still in experimentation mode. The git replaces the mysql database. But there is alot more work to do to make this viable. thanks for all your encouragement and support. mike On Sat, Oct 17, 2009 at 6:53 PM, John Va

Re: [Foundation-l] Wikipedia meets git

2009-10-17 Thread John Vandenberg
On Sun, Oct 18, 2009 at 3:39 AM, jamesmikedup...@googlemail.com wrote: > see my new blogpost word leve blaming for wikipedia via git and perl ... > http://fmtyewtk.blogspot.com/2009/10/mediawiki-git-word-level-blaming-one.html > ... > Problem is that github does not like this amount of processor p

Re: [Foundation-l] Wikipedia meets git

2009-10-17 Thread jamesmikedup...@googlemail.com
see my new blogpost word leve blaming for wikipedia via git and perl ... http://fmtyewtk.blogspot.com/2009/10/mediawiki-git-word-level-blaming-one.html Next step is ready : 1. I have a single script that will pull a given article and check in the revisions into git, it is not perfect, but works.

Re: [Foundation-l] Wikipedia meets git

2009-10-17 Thread Anthony
On Sat, Oct 17, 2009 at 11:04 AM, John Vandenberg wrote: > On Sun, Oct 18, 2009 at 1:05 AM, Anthony wrote: >> I asked whether or not you tried svn, because svn already uses skip deltas. > > svn would be daft, for so many reasons. Doesn't mean you can't learn from it. ___

Re: [Foundation-l] Wikipedia meets git

2009-10-17 Thread John Vandenberg
On Sun, Oct 18, 2009 at 1:05 AM, Anthony wrote: > On Sat, Oct 17, 2009 at 4:40 AM, jamesmikedup...@googlemail.com > wrote: >>> It would be nice if the git archival format was more efficient for the >>> kinds of changes made in Wikipedia articles: Source code changes tends >>> to have short lines

Re: [Foundation-l] Wikipedia meets git

2009-10-17 Thread Anthony
On Sat, Oct 17, 2009 at 4:40 AM, jamesmikedup...@googlemail.com wrote: >> It would be nice if the git archival format was more efficient for the >> kinds of changes made in Wikipedia articles: Source code changes tends >> to have short lines and changes tend to change a significant portion >> of t

Re: [Foundation-l] Wikipedia meets git

2009-10-17 Thread jamesmikedup...@googlemail.com
I have On Sat, Oct 17, 2009 at 10:18 AM, Gregory Maxwell wrote: > On Fri, Oct 16, 2009 at 10:31 AM, Anthony wrote: >> On Fri, Oct 16, 2009 at 12:45 AM, jamesmikedup...@googlemail.com >>> if you want only the last 3 revisions checked out , it takes about 10 >>> seconds and produces 300k of data.

Re: [Foundation-l] Wikipedia meets git

2009-10-17 Thread Gregory Maxwell
On Fri, Oct 16, 2009 at 10:31 AM, Anthony wrote: > On Fri, Oct 16, 2009 at 12:45 AM, jamesmikedup...@googlemail.com >> if you want only the last 3 revisions checked out , it takes about 10 >> seconds and produces 300k of data. > > 10 seconds?  That's horrible.  Have you tried using svn? On a reas

Re: [Foundation-l] Wikipedia meets git

2009-10-16 Thread jamesmikedup...@googlemail.com
I did not mean that literally, let me check the exact time for you : 1.258s time git clone --depth 3 git://github.com/h4ck3rm1k3/KosovoWikipedia.git Initialized empty Git repository in /home_data2/2009/10/KosovoWikipedia/gittest2/KosovoWikipedia/.git/ remote: Counting objects: 21, done. remote: Co

Re: [Foundation-l] Wikipedia meets git

2009-10-16 Thread Anthony
On Fri, Oct 16, 2009 at 12:45 AM, jamesmikedup...@googlemail.com > if you want only the last 3 revisions checked out , it takes about 10 > seconds and produces 300k of data. 10 seconds? That's horrible. Have you tried using svn? ___ foundation-l maili

Re: [Foundation-l] Wikipedia meets git

2009-10-16 Thread jamesmikedup...@googlemail.com
On Fri, Oct 16, 2009 at 2:08 PM, Gerard Meijssen wrote: > Hoi, > After a minute of googling I find http://wikitrust.soe.ucsc.edu/home .. I am > sure it is there for you as well. Yes the page is there, it seems to be a good idea. only I am missing some html pages so that we can see what it looks

Re: [Foundation-l] Wikipedia meets git

2009-10-16 Thread Gerard Meijssen
Hoi, After a minute of googling I find http://wikitrust.soe.ucsc.edu/home .. I am sure it is there for you as well. Thanks, GerardM 2009/10/16 jamesmikedup...@googlemail.com > On Fri, Oct 16, 2009 at 9:45 AM, Denny Vrandecic > wrote: > > That is pretty cool. But wouldn't it make more sense

Re: [Foundation-l] Wikipedia meets git

2009-10-16 Thread jamesmikedup...@googlemail.com
On Fri, Oct 16, 2009 at 9:45 AM, Denny Vrandecic wrote: > That is pretty cool. But wouldn't it make more sense to have a more- > fine grained blame, like the one in wikitrust, down to the character > level? Can you please provide some example pages of wikitrust? they seem to be AWOL: In the

Re: [Foundation-l] Wikipedia meets git

2009-10-16 Thread jamesmikedup...@googlemail.com
> On Oct 16, 2009, at 10:30, jamesmikedup...@googlemail.com wrote: I have make two simple vlogs about what and why i did this http://www.youtube.com/watch?v=jc9jo1ZFLqk http://www.youtube.com/watch?v=7WfRuEuvIso Mike ___ foundation-l mailing list fou

Re: [Foundation-l] Wikipedia meets git

2009-10-16 Thread Denny Vrandecic
Just another pointer, here is a distributed MediaWiki system developed at INRIA. I haven't looked into it yet too deep, but their evaluation looked very promising. Best, denny On Oct 16, 2009, at 10:30, jamesmikedup...@googlemail.com wrote: > O

Re: [Foundation-l] Wikipedia meets git

2009-10-16 Thread jamesmikedup...@googlemail.com
On Fri, Oct 16, 2009 at 9:45 AM, Denny Vrandecic wrote: > That is pretty cool. But wouldn't it make more sense to have a more- > fine grained blame, like the one in wikitrust, down to the character > level? I don't know all these wikitools, but if the feature is missing from git, then it will ben

Re: [Foundation-l] Wikipedia meets git

2009-10-16 Thread Denny Vrandecic
That is pretty cool. But wouldn't it make more sense to have a more- fine grained blame, like the one in wikitrust, down to the character level? cheers, denny On Oct 15, 2009, at 20:55, jamesmikedup...@googlemail.com wrote: > Hallo, > I have gotten the wikipedia article for Kosovo in git. >

Re: [Foundation-l] Wikipedia meets git

2009-10-15 Thread Joshua Gay
This is very awesome. I am in the early stages of trying to scope out a small side project to do a mediawiki <-> git bridge; it is very challenging. Being able to download the complete edit history in this fashion is extremely useful. Thank you very much for sharing this work. -Josh On Fri, Oct

Re: [Foundation-l] Wikipedia meets git

2009-10-15 Thread jamesmikedup...@googlemail.com
>> On Thu, Oct 15, 2009 at 11:33 PM, Gregory Maxwell wrote: >>> Then why are you using git? It turns out there are a few wikis built on top of git : 1. the git-wiki : http://atonie.org/2008/02/git-wiki http://github.com/jeffbski/git-wiki git-wiki is a wiki that relies on git to keep pages' histo

Re: [Foundation-l] Wikipedia meets git

2009-10-15 Thread jamesmikedup...@googlemail.com
On Fri, Oct 16, 2009 at 6:40 AM, jamesmikedup...@googlemail.com wrote: > On Thu, Oct 15, 2009 at 11:33 PM, Gregory Maxwell wrote: >> On Thu, Oct 15, 2009 at 4:38 PM, jamesmikedup...@googlemail.com >> wrote: >>> There are ways to optimize all of this. Most users will not want to >>> download the

Re: [Foundation-l] Wikipedia meets git

2009-10-15 Thread jamesmikedup...@googlemail.com
On Thu, Oct 15, 2009 at 11:33 PM, Gregory Maxwell wrote: > On Thu, Oct 15, 2009 at 4:38 PM, jamesmikedup...@googlemail.com > wrote: >> There are ways to optimize all of this. Most users will not want to >> download the full history. > > Then why are you using git? I am not most users. I am using

Re: [Foundation-l] Wikipedia meets git

2009-10-15 Thread Gregory Maxwell
On Thu, Oct 15, 2009 at 4:38 PM, jamesmikedup...@googlemail.com wrote: > There are ways to optimize all of this. Most users will not want to > download the full history. Then why are you using git? ___ foundation-l mailing list foundation-l@lists.wikim

Re: [Foundation-l] Wikipedia meets git

2009-10-15 Thread jamesmikedup...@googlemail.com
On Thu, Oct 15, 2009 at 10:16 PM, Gregory Maxwell wrote: > It is cool that you get the complete history. > > But— it's a bit uncool that its about 14mbytes when the article is > 100k; understandable given that the expanded uncompressed history is > about 337mbytes... I have the uncompressed histo

Re: [Foundation-l] Wikipedia meets git

2009-10-15 Thread Gregory Maxwell
On Thu, Oct 15, 2009 at 2:55 PM, jamesmikedup...@googlemail.com wrote: > Hallo, > I have gotten the wikipedia article for Kosovo in git. > It is fast, distributed, highly compressed, redundant, branchable and usable. > > The blame function will show you who edited what version. > > Here Blame on t

[Foundation-l] Wikipedia meets git

2009-10-15 Thread jamesmikedup...@googlemail.com
Hallo, I have gotten the wikipedia article for Kosovo in git. It is fast, distributed, highly compressed, redundant, branchable and usable. The blame function will show you who edited what version. Here Blame on the up to date kosovo article! http://github.com/h4ck3rm1k3/KosovoWikipedia/blame/mas