Re: [Rd] SVN vs DVCS

2010-05-26 Thread Martin Maechler
 Felix Andrews fe...@nfrac.org
 on Wed, 26 May 2010 11:20:12 +1000 writes:

 On second thoughts it is really none of my business how the R sources
 are managed.
 But I would encourage package developers and/or r-forge maintainers to
 consider these systems.

Thank you, Felix, for the compilation of these alternatives.

One very relevant piece of information that you've not added,
is, how easily one could *move* from svn to such a system
(including the full history of every file with revision numbers,
 log messages, etc),
and .. for R-forge, e.g., which of these provide nice and
flexible tools (as svn does) for an automatic web interface to
inspect file histories, differences, etc.

Regards,
Martin  ( maintainer of svn.r-project.org )

 Regards
 -Felix

 On 26 May 2010 10:29, Felix Andrews fe...@nfrac.org wrote:
 Hi,
 
 Just wondering whether anyone had thought about moving the R sources
 to a distributed version control system such as Bazaar, Git or
 Mercurial. These new generation systems make it easier to work on
 feature branches, allow working offline, are very fast, etc.
 
 Some projects that have moved to Git are
 Linux Kernel
 Perl
 Ruby on Rails
 ...
 http://en.wikipedia.org/wiki/Git_(software)
 
 Some projects that have moved to Bazaar (bzr) are
 Ubuntu
 MySQL
 Inkscape
 ...
 http://en.wikipedia.org/wiki/Bazaar_(software)
 
 Some projects that have moved to Mercurial (hg) are
 Mozilla
 Octave
 Python
 ...
 http://en.wikipedia.org/wiki/Mercurial_(software)
 
 Joel Spolky's take on it:
 http://www.joelonsoftware.com/items/2010/03/17.html
 
 Regards
 -Felix
 
 --
 Felix Andrews / 安福立
 Postdoctoral Fellow
 Integrated Catchment Assessment and Management (iCAM) Centre
 Fenner School of Environment and Society [Bldg 48a]
 The Australian National University
 Canberra ACT 0200 Australia
 M: +61 410 400 963
 T: + 61 2 6125 4670
 E: felix.andr...@anu.edu.au
 CRICOS Provider No. 00120C
 --
 http://www.neurofractal.org/felix/
 



 -- 
 Felix Andrews / 安福立
 Postdoctoral Fellow
 Integrated Catchment Assessment and Management (iCAM) Centre
 Fenner School of Environment and Society [Bldg 48a]
 The Australian National University
 Canberra ACT 0200 Australia
 M: +61 410 400 963
 T: + 61 2 6125 4670
 E: felix.andr...@anu.edu.au
 CRICOS Provider No. 00120C
 -- 
 http://www.neurofractal.org/felix/

 __
 R-devel@r-project.org mailing list
 https://stat.ethz.ch/mailman/listinfo/r-devel

__
R-devel@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-devel


[Rd] Indexing bug?

2010-05-26 Thread Duncan Murdoch

Is this expected behaviour?

x - factor(c(c, b, a,c))
results - c(c=4, b=5)
results[x]

giving

 results[x]
NAbc NA
 NA54   NA

(i.e. it appears to give results[levels(x)]


whereas results[as.character(x)] does what I expected:

as.character(x)
results[as.character(x)]

 as.character(x)
[1] c b a c
 results[as.character(x)]
  cb NAc
  45   NA4

Duncan Murdoch

__
R-devel@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-devel


Re: [Rd] Indexing bug?

2010-05-26 Thread Deepayan Sarkar
On Wed, May 26, 2010 at 3:27 PM, Duncan Murdoch
murdoch.dun...@gmail.com wrote:
 Is this expected behaviour?

 x - factor(c(c, b, a,c))
 results - c(c=4, b=5)
 results[x]

 giving

 results[x]
 NA    b    c NA
  NA    5    4   NA

 (i.e. it appears to give results[levels(x)]

I would say it gives

results[as.numeric(x)]

and it's not clear if that is less expected. E.g., what happens if
results has no names? results[as.numeric(x)] would still work, but not
results[as.character(x)].

Factors have a dual nature (character labels vs numeric codes), and
when forced to choose, I think it's reasonable to choose the solution
which works more generally.

-Deepayan




 whereas results[as.character(x)] does what I expected:

 as.character(x)
 results[as.character(x)]

 as.character(x)
 [1] c b a c
 results[as.character(x)]
  c    b NA    c
  4    5   NA    4

 Duncan Murdoch

 __
 R-devel@r-project.org mailing list
 https://stat.ethz.ch/mailman/listinfo/r-devel


__
R-devel@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-devel


Re: [Rd] Indexing bug?

2010-05-26 Thread Berwin A Turlach
G'day Duncan,

On Wed, 26 May 2010 05:57:38 -0400
Duncan Murdoch murdoch.dun...@gmail.com wrote:

 Is this expected behaviour?

Yes, according to the answer that this poster

https://stat.ethz.ch/pipermail/r-devel/2008-March/048674.html

got.

Indeed, the help page of '[' states:

The index object i can be numeric, logical, character or empty.
Indexing by factors is allowed and is equivalent to indexing by the
numeric codes (see factor) and not by the character values which are
printed (for which use [as.character(i)]). 

Cheers,

Berwin

__
R-devel@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-devel


Re: [Rd] SVN vs DVCS

2010-05-26 Thread Gabor Grothendieck
Note that one can also use any of the dvcs systems without actually
moving from svn by using the dvcs (or associated extension/addon) as
an svn client or by using it on an svn checkout.

On Wed, May 26, 2010 at 5:44 AM, Martin Maechler
maech...@stat.math.ethz.ch wrote:
 Felix Andrews fe...@nfrac.org
     on Wed, 26 May 2010 11:20:12 +1000 writes:

     On second thoughts it is really none of my business how the R sources
     are managed.
     But I would encourage package developers and/or r-forge maintainers to
     consider these systems.

 Thank you, Felix, for the compilation of these alternatives.

 One very relevant piece of information that you've not added,
 is, how easily one could *move* from svn to such a system
 (including the full history of every file with revision numbers,
  log messages, etc),
 and .. for R-forge, e.g., which of these provide nice and
 flexible tools (as svn does) for an automatic web interface to
 inspect file histories, differences, etc.

 Regards,
 Martin  ( maintainer of svn.r-project.org )

     Regards
     -Felix

     On 26 May 2010 10:29, Felix Andrews fe...@nfrac.org wrote:
     Hi,
    
     Just wondering whether anyone had thought about moving the R sources
     to a distributed version control system such as Bazaar, Git or
     Mercurial. These new generation systems make it easier to work on
     feature branches, allow working offline, are very fast, etc.
    
     Some projects that have moved to Git are
     Linux Kernel
     Perl
     Ruby on Rails
     ...
     http://en.wikipedia.org/wiki/Git_(software)
    
     Some projects that have moved to Bazaar (bzr) are
     Ubuntu
     MySQL
     Inkscape
     ...
     http://en.wikipedia.org/wiki/Bazaar_(software)
    
     Some projects that have moved to Mercurial (hg) are
     Mozilla
     Octave
     Python
     ...
     http://en.wikipedia.org/wiki/Mercurial_(software)
    
     Joel Spolky's take on it:
     http://www.joelonsoftware.com/items/2010/03/17.html
    
     Regards
     -Felix

__
R-devel@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-devel


[Rd] segfault on 2.11.0 with large POSIXct vector using as.character

2010-05-26 Thread Jeff Ryan
Running as.character on a large POSIXct causes a segfault on my 2.11
(2010-04-22) install.  Seems to crash at around 9e4 ... on OSX and Ubuntu at
least.

 invisible(as.character(Sys.time()+1:7e4))
 invisible(as.character(Sys.time()+1:8e4))
 invisible(as.character(Sys.time()+1:9e4))
Error: segfault from C stack overflow


 invisible(as.character(Sys.time()+1:5e5))
Error: segfault from C stack overflow

Thanks,
Jeff

 sessionInfo()
R version 2.11.0 (2010-04-22)
x86_64-apple-darwin10.2.0

locale:
[1] en_US.UTF-8/en_US.UTF-8/C/C/en_US.UTF-8/en_US.UTF-8

attached base packages:
[1] stats graphics  grDevices utils datasets  methods   base


 sessionInfo()
R version 2.11.0 (2010-04-22)
i486-pc-linux-gnu

locale:
 [1] LC_CTYPE=en_US.utf8   LC_NUMERIC=C
 [3] LC_TIME=en_US.utf8LC_COLLATE=en_US.utf8
 [5] LC_MONETARY=C LC_MESSAGES=en_US.utf8
 [7] LC_PAPER=en_US.utf8   LC_NAME=C
 [9] LC_ADDRESS=C  LC_TELEPHONE=C
[11] LC_MEASUREMENT=en_US.utf8 LC_IDENTIFICATION=C

attached base packages:
[1] stats graphics  grDevices utils datasets  methods   base


-- 
Jeffrey Ryan
jeffrey.r...@insightalgo.com

ia: insight algorithmics
www.insightalgo.com

[[alternative HTML version deleted]]

__
R-devel@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-devel


Re: [Rd] segfault on 2.11.0 with large POSIXct vector using as.character

2010-05-26 Thread Gabor Grothendieck
On my Vista system there is no seg fault:

 invisible(as.character(Sys.time()+1:5e5))
 win.version()
[1] Windows Vista (build 6002) Service Pack 2
 R.version.string
[1] R version 2.11.0 Patched (2010-04-26 r51822)


On Wed, May 26, 2010 at 7:29 AM, Jeff Ryan jeff.a.r...@gmail.com wrote:
 Running as.character on a large POSIXct causes a segfault on my 2.11
 (2010-04-22) install.  Seems to crash at around 9e4 ... on OSX and Ubuntu at
 least.

 invisible(as.character(Sys.time()+1:7e4))
 invisible(as.character(Sys.time()+1:8e4))
 invisible(as.character(Sys.time()+1:9e4))
 Error: segfault from C stack overflow


 invisible(as.character(Sys.time()+1:5e5))
 Error: segfault from C stack overflow

 Thanks,
 Jeff

 sessionInfo()
 R version 2.11.0 (2010-04-22)
 x86_64-apple-darwin10.2.0

 locale:
 [1] en_US.UTF-8/en_US.UTF-8/C/C/en_US.UTF-8/en_US.UTF-8

 attached base packages:
 [1] stats     graphics  grDevices utils     datasets  methods   base


 sessionInfo()
 R version 2.11.0 (2010-04-22)
 i486-pc-linux-gnu

 locale:
  [1] LC_CTYPE=en_US.utf8       LC_NUMERIC=C
  [3] LC_TIME=en_US.utf8        LC_COLLATE=en_US.utf8
  [5] LC_MONETARY=C             LC_MESSAGES=en_US.utf8
  [7] LC_PAPER=en_US.utf8       LC_NAME=C
  [9] LC_ADDRESS=C              LC_TELEPHONE=C
 [11] LC_MEASUREMENT=en_US.utf8 LC_IDENTIFICATION=C

 attached base packages:
 [1] stats     graphics  grDevices utils     datasets  methods   base


 --
 Jeffrey Ryan
 jeffrey.r...@insightalgo.com

 ia: insight algorithmics
 www.insightalgo.com

        [[alternative HTML version deleted]]

 __
 R-devel@r-project.org mailing list
 https://stat.ethz.ch/mailman/listinfo/r-devel


__
R-devel@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-devel


Re: [Rd] Indexing bug?

2010-05-26 Thread Duncan Murdoch

Duncan Murdoch wrote:

Is this expected behaviour?

x - factor(c(c, b, a,c))
results - c(c=4, b=5)
results[x]

giving

  results[x]
NAbc NA
  NA54   NA

(i.e. it appears to give results[levels(x)]
  


Thanks to all for pointing out my misinterpretation.It's clearly not 
a bug.


Duncan Murdoch


whereas results[as.character(x)] does what I expected:

as.character(x)
results[as.character(x)]

  as.character(x)
[1] c b a c
  results[as.character(x)]
   cb NAc
   45   NA4

Duncan Murdoch



__
R-devel@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-devel


Re: [Rd] segfault on 2.11.0 with large POSIXct vector using as.character

2010-05-26 Thread Duncan Murdoch

Jeff Ryan wrote:

Running as.character on a large POSIXct causes a segfault on my 2.11
(2010-04-22) install.  Seems to crash at around 9e4 ... on OSX and Ubuntu at
least.
  


This has been fixed for a while in R-patched.  The 2.11.1 release on 
Monday should be fine. 

Apparently people aren't running the betas/release candidates.  You 
really should run the test versions to flush out bugs.  If you'd run the 
pre-release versions of 2.11.0, this bug would likely have been found 
before release.  The standard tests can miss things; the advantage of 
open source is there are many eyes looking for bugs.  But if those eyes 
are closed, the model doesn't work.


Duncan Murdoch
  

invisible(as.character(Sys.time()+1:7e4))
invisible(as.character(Sys.time()+1:8e4))
invisible(as.character(Sys.time()+1:9e4))


Error: segfault from C stack overflow


  

invisible(as.character(Sys.time()+1:5e5))


Error: segfault from C stack overflow

Thanks,
Jeff

  

sessionInfo()


R version 2.11.0 (2010-04-22)
x86_64-apple-darwin10.2.0

locale:
[1] en_US.UTF-8/en_US.UTF-8/C/C/en_US.UTF-8/en_US.UTF-8

attached base packages:
[1] stats graphics  grDevices utils datasets  methods   base


  

sessionInfo()


R version 2.11.0 (2010-04-22)
i486-pc-linux-gnu

locale:
 [1] LC_CTYPE=en_US.utf8   LC_NUMERIC=C
 [3] LC_TIME=en_US.utf8LC_COLLATE=en_US.utf8
 [5] LC_MONETARY=C LC_MESSAGES=en_US.utf8
 [7] LC_PAPER=en_US.utf8   LC_NAME=C
 [9] LC_ADDRESS=C  LC_TELEPHONE=C
[11] LC_MEASUREMENT=en_US.utf8 LC_IDENTIFICATION=C

attached base packages:
[1] stats graphics  grDevices utils datasets  methods   base
  





__
R-devel@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-devel


Re: [Rd] segfault on 2.11.0 with large POSIXct vector using as.character

2010-05-26 Thread Jeff Ryan


 This has been fixed for a while in R-patched.  The 2.11.1 release on Monday
 should be fine.
 Apparently people aren't running the betas/release candidates.  You really
 should run the test versions to flush out bugs.  If you'd run the
 pre-release versions of 2.11.0, this bug would likely have been found before
 release.  The standard tests can miss things; the advantage of open source
 is there are many eyes looking for bugs.  But if those eyes are closed, the
 model doesn't work.


I'd say it does work, and did.  As far as catching before release, that
sounds great, as great as having no bugs.

I am not too sure that my tests could have caught a bug that the got passed
the R tests though, so passing along a segfault issue to those who might
know the source better seemed prudent -- even if I could only have the time
to test against the released version.

Thanks for the fix!

Jeff


 Duncan Murdoch



 invisible(as.character(Sys.time()+1:7e4))
 invisible(as.character(Sys.time()+1:8e4))
 invisible(as.character(Sys.time()+1:9e4))


 Error: segfault from C stack overflow




 invisible(as.character(Sys.time()+1:5e5))


 Error: segfault from C stack overflow

 Thanks,
 Jeff



 sessionInfo()


 R version 2.11.0 (2010-04-22)
 x86_64-apple-darwin10.2.0

 locale:
 [1] en_US.UTF-8/en_US.UTF-8/C/C/en_US.UTF-8/en_US.UTF-8

 attached base packages:
 [1] stats graphics  grDevices utils datasets  methods   base




 sessionInfo()


 R version 2.11.0 (2010-04-22)
 i486-pc-linux-gnu

 locale:
  [1] LC_CTYPE=en_US.utf8   LC_NUMERIC=C
  [3] LC_TIME=en_US.utf8LC_COLLATE=en_US.utf8
  [5] LC_MONETARY=C LC_MESSAGES=en_US.utf8
  [7] LC_PAPER=en_US.utf8   LC_NAME=C
  [9] LC_ADDRESS=C  LC_TELEPHONE=C
 [11] LC_MEASUREMENT=en_US.utf8 LC_IDENTIFICATION=C

 attached base packages:
 [1] stats graphics  grDevices utils datasets  methods   base







-- 
Jeffrey Ryan
jeffrey.r...@insightalgo.com

ia: insight algorithmics
www.insightalgo.com

[[alternative HTML version deleted]]

__
R-devel@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-devel


Re: [Rd] SVN vs DVCS

2010-05-26 Thread Felix Andrews
I'm not necessarily advocating a migration; probably an administrative
nightmare, and everyone involved would be forced to learn new stuff...
I was just enthusing because I recently started using a DVCS for the
first time.


On 26 May 2010 21:16, Gabor Grothendieck ggrothendi...@gmail.com wrote:
 Note that one can also use any of the dvcs systems without actually
 moving from svn by using the dvcs (or associated extension/addon) as
 an svn client or by using it on an svn checkout.

Yes, that's a very good point (although in my experience it takes a
very long time to do the initial download of the SVN repository). I'm
not an expert on these systems, but I imagine the main downside (other
than speed) of having SVN upstream is that you have to keep the
history linear, and so e.g can't collaborate on feature branches this
way. But yeah, worth a go.


 On Wed, May 26, 2010 at 5:44 AM, Martin Maechler
 maech...@stat.math.ethz.ch wrote:
 Felix Andrews fe...@nfrac.org
     on Wed, 26 May 2010 11:20:12 +1000 writes:

     On second thoughts it is really none of my business how the R sources
     are managed.
     But I would encourage package developers and/or r-forge maintainers to
     consider these systems.

 Thank you, Felix, for the compilation of these alternatives.

 One very relevant piece of information that you've not added,
 is, how easily one could *move* from svn to such a system
 (including the full history of every file with revision numbers,
  log messages, etc),

Indeed... here is the basic process for migrating to Git
http://www.jonmaddox.com/2008/03/05/cleanly-migrate-your-subversion-repository-to-a-git-repository/
This will keep the branches, tags, full history, with SVN revision
numbers added to the log messages (if you leave off the --no-metadata
argument). However, the actual commit ids you would use in git log /
git diff / etc will not be the same as the old SVN ids. In fact git
uses hash strings rather than numbers, and bazaar uses sequential
numbering in each branch (rather than sequential numbers globally as
SVN does). Not sure about Mercurial.

 and .. for R-forge, e.g., which of these provide nice and
 flexible tools (as svn does) for an automatic web interface to
 inspect file histories, differences, etc.

All have web interfaces. In fact FusionForge, which is the new name
for G-Forge, apparently supports Git, Bzr and Hg
http://fusionforge.org/
https://alioth.debian.org/scm/?group_id=30261
http://wiki.debian.org/Alioth/Hg

Other examples of web interfaces can be seen on the hosting services
GitHub.com e.g. http://github.com/hadley/ggplot2
Canonical's launchpad.net (bzr) e.g. https://launchpad.net/igraph



 Regards,
 Martin  ( maintainer of svn.r-project.org )

     Regards
     -Felix

     On 26 May 2010 10:29, Felix Andrews fe...@nfrac.org wrote:
     Hi,
    
     Just wondering whether anyone had thought about moving the R sources
     to a distributed version control system such as Bazaar, Git or
     Mercurial. These new generation systems make it easier to work on
     feature branches, allow working offline, are very fast, etc.
    
     Some projects that have moved to Git are
     Linux Kernel
     Perl
     Ruby on Rails
     ...
     http://en.wikipedia.org/wiki/Git_(software)
    
     Some projects that have moved to Bazaar (bzr) are
     Ubuntu
     MySQL
     Inkscape
     ...
     http://en.wikipedia.org/wiki/Bazaar_(software)
    
     Some projects that have moved to Mercurial (hg) are
     Mozilla
     Octave
     Python
     ...
     http://en.wikipedia.org/wiki/Mercurial_(software)
    
     Joel Spolky's take on it:
     http://www.joelonsoftware.com/items/2010/03/17.html
    
     Regards
     -Felix




-- 
Felix Andrews / 安福立
Postdoctoral Fellow
Integrated Catchment Assessment and Management (iCAM) Centre
Fenner School of Environment and Society [Bldg 48a]
The Australian National University
Canberra ACT 0200 Australia
M: +61 410 400 963
T: + 61 2 6125 4670
E: felix.andr...@anu.edu.au
CRICOS Provider No. 00120C
-- 
http://www.neurofractal.org/felix/

__
R-devel@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-devel


Re: [Rd] SVN vs DVCS

2010-05-26 Thread Simon Urbanek

On May 26, 2010, at 10:01 AM, Felix Andrews wrote:

 I'm not necessarily advocating a migration; probably an administrative
 nightmare, and everyone involved would be forced to learn new stuff...
 I was just enthusing because I recently started using a DVCS for the
 first time.
 
 
 On 26 May 2010 21:16, Gabor Grothendieck ggrothendi...@gmail.com wrote:
 Note that one can also use any of the dvcs systems without actually
 moving from svn by using the dvcs (or associated extension/addon) as
 an svn client or by using it on an svn checkout.
 
 Yes, that's a very good point (although in my experience it takes a
 very long time to do the initial download of the SVN repository). I'm
 not an expert on these systems, but I imagine the main downside (other
 than speed) of having SVN upstream is that you have to keep the
 history linear,

That (non-linear history) is IMHO the biggest drawback of DVCS because that 
means there is no way to link a particular build to the source status and you 
cannot use globally valid build numbers.


 and so e.g can't collaborate on feature branches this way.

But feature branches are as easily (IMHO even more easily since you can closely 
monitor what others are contributing) worked on with SVN (routinely used with 
R) so I'm not sure what DVCS would buy you.

AFAICS the only benefit of DVCS is that if you are on a remote island without 
any internet connection you can accumulate multiple commits before merging them 
back. I can't say that I desperately need that functionality ;).

Cheers,
Simon



 
 On Wed, May 26, 2010 at 5:44 AM, Martin Maechler
 maech...@stat.math.ethz.ch wrote:
 Felix Andrews fe...@nfrac.org
 on Wed, 26 May 2010 11:20:12 +1000 writes:
 
 On second thoughts it is really none of my business how the R sources
 are managed.
 But I would encourage package developers and/or r-forge maintainers to
 consider these systems.
 
 Thank you, Felix, for the compilation of these alternatives.
 
 One very relevant piece of information that you've not added,
 is, how easily one could *move* from svn to such a system
 (including the full history of every file with revision numbers,
  log messages, etc),
 
 Indeed... here is the basic process for migrating to Git
 http://www.jonmaddox.com/2008/03/05/cleanly-migrate-your-subversion-repository-to-a-git-repository/
 This will keep the branches, tags, full history, with SVN revision
 numbers added to the log messages (if you leave off the --no-metadata
 argument). However, the actual commit ids you would use in git log /
 git diff / etc will not be the same as the old SVN ids. In fact git
 uses hash strings rather than numbers, and bazaar uses sequential
 numbering in each branch (rather than sequential numbers globally as
 SVN does). Not sure about Mercurial.
 
 and .. for R-forge, e.g., which of these provide nice and
 flexible tools (as svn does) for an automatic web interface to
 inspect file histories, differences, etc.
 
 All have web interfaces. In fact FusionForge, which is the new name
 for G-Forge, apparently supports Git, Bzr and Hg
 http://fusionforge.org/
 https://alioth.debian.org/scm/?group_id=30261
 http://wiki.debian.org/Alioth/Hg
 
 Other examples of web interfaces can be seen on the hosting services
 GitHub.com e.g. http://github.com/hadley/ggplot2
 Canonical's launchpad.net (bzr) e.g. https://launchpad.net/igraph
 
 
 
 Regards,
 Martin  ( maintainer of svn.r-project.org )
 
 Regards
 -Felix
 
 On 26 May 2010 10:29, Felix Andrews fe...@nfrac.org wrote:
 Hi,

 Just wondering whether anyone had thought about moving the R sources
 to a distributed version control system such as Bazaar, Git or
 Mercurial. These new generation systems make it easier to work on
 feature branches, allow working offline, are very fast, etc.

 Some projects that have moved to Git are
 Linux Kernel
 Perl
 Ruby on Rails
 ...
 http://en.wikipedia.org/wiki/Git_(software)

 Some projects that have moved to Bazaar (bzr) are
 Ubuntu
 MySQL
 Inkscape
 ...
 http://en.wikipedia.org/wiki/Bazaar_(software)

 Some projects that have moved to Mercurial (hg) are
 Mozilla
 Octave
 Python
 ...
 http://en.wikipedia.org/wiki/Mercurial_(software)

 Joel Spolky's take on it:
 http://www.joelonsoftware.com/items/2010/03/17.html

 Regards
 -Felix
 
 
 
 
 -- 
 Felix Andrews / 安福立
 Postdoctoral Fellow
 Integrated Catchment Assessment and Management (iCAM) Centre
 Fenner School of Environment and Society [Bldg 48a]
 The Australian National University
 Canberra ACT 0200 Australia
 M: +61 410 400 963
 T: + 61 2 6125 4670
 E: felix.andr...@anu.edu.au
 CRICOS Provider No. 00120C
 -- 
 http://www.neurofractal.org/felix/
 
 __
 R-devel@r-project.org mailing list
 https://stat.ethz.ch/mailman/listinfo/r-devel
 

__

Re: [Rd] SVN vs DVCS

2010-05-26 Thread Hadley Wickham
 Yes, that's a very good point (although in my experience it takes a
 very long time to do the initial download of the SVN repository). I'm
 not an expert on these systems, but I imagine the main downside (other
 than speed) of having SVN upstream is that you have to keep the
 history linear,

 That (non-linear history) is IMHO the biggest drawback of DVCS because that 
 means there is no way to link a particular build to the source status and you 
 cannot use globally valid build numbers.

Git (and I'm sure the others) provides a globally unique id for each
revision.  Isn't that sufficient?

 But feature branches are as easily (IMHO even more easily since you can 
 closely monitor what others are contributing) worked on with SVN (routinely 
 used with R) so I'm not sure what DVCS would buy you.

Feature branches are _much_ easier with git - to the point where some
people suggest using a separate feature branch for every feature you
develop.

 AFAICS the only benefit of DVCS is that if you are on a remote island without 
 any internet connection you can accumulate multiple commits before merging 
 them back. I can't say that I desperately need that functionality ;).

You have never worked on an airplane or other location without
internet access?  You must have lived a very privileged life ;)

Hadley


-- 
Assistant Professor / Dobelman Family Junior Chair
Department of Statistics / Rice University
http://had.co.nz/

__
R-devel@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-devel


Re: [Rd] SVN vs DVCS

2010-05-26 Thread Hadley Wickham
 and .. for R-forge, e.g., which of these provide nice and
 flexible tools (as svn does) for an automatic web interface to
 inspect file histories, differences, etc.

Every svn alternative provides tools that are as good as or better
than R-forge, with the exception of package building.  It's a real
shame that this unique component of R-forge is so closely connected to
the tools that many other sites provide.

See http://github.com/hadley/plyr for an example of the development
experience that other sites provide.  Note the absence of broken
images and broken https.

Hadley

-- 
Assistant Professor / Dobelman Family Junior Chair
Department of Statistics / Rice University
http://had.co.nz/

__
R-devel@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-devel


Re: [Rd] SVN vs DVCS

2010-05-26 Thread Antonio, Fabio Di Narzo
2010/5/26 Hadley Wickham had...@rice.edu:
 Yes, that's a very good point (although in my experience it takes a
 very long time to do the initial download of the SVN repository). I'm
 not an expert on these systems, but I imagine the main downside (other
 than speed) of having SVN upstream is that you have to keep the
 history linear,

 That (non-linear history) is IMHO the biggest drawback of DVCS because that 
 means there is no way to link a particular build to the source status and 
 you cannot use globally valid build numbers.

 Git (and I'm sure the others) provides a globally unique id for each
 revision.  Isn't that sufficient?

 But feature branches are as easily (IMHO even more easily since you can 
 closely monitor what others are contributing) worked on with SVN (routinely 
 used with R) so I'm not sure what DVCS would buy you.

 Feature branches are _much_ easier with git - to the point where some
 people suggest using a separate feature branch for every feature you
 develop.

 AFAICS the only benefit of DVCS is that if you are on a remote island 
 without any internet connection you can accumulate multiple commits before 
 merging them back. I can't say that I desperately need that functionality ;).

 You have never worked on an airplane or other location without
 internet access?  You must have lived a very privileged life ;)

Some people just have decent web access only at work, and if you work
on your R project like at home or on the train, you're already having
some difficulties. But please, not the airplane argument! (just
joking...).

Moreover, 'local' commits are way faster than network-based commits. I
can testify: 1microsecond vs 1second delay (or more, depending on how
crappy is your net access) *is* a big difference. On your local
machine, you end up committing much more often, with smaller and
self-contained commits, generally producing a cleaner history.

fabio.


 Hadley


 --
 Assistant Professor / Dobelman Family Junior Chair
 Department of Statistics / Rice University
 http://had.co.nz/

 __
 R-devel@r-project.org mailing list
 https://stat.ethz.ch/mailman/listinfo/r-devel




-- 
Antonio Fabio Di Narzo, PhD.
Swiss Institute for Bioinformatics - Bioinformatics Core Facility
Office 2029, Génopode, Quartier Sorge
CH-1015 Lausanne, Switzerland
Tel: +41 21 692 4087
Fax: +41 21 692 4065

__
R-devel@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-devel


Re: [Rd] SVN vs DVCS

2010-05-26 Thread Simon Urbanek

On May 26, 2010, at 11:35 AM, Hadley Wickham wrote:

 Yes, that's a very good point (although in my experience it takes a
 very long time to do the initial download of the SVN repository). I'm
 not an expert on these systems, but I imagine the main downside (other
 than speed) of having SVN upstream is that you have to keep the
 history linear,
 
 That (non-linear history) is IMHO the biggest drawback of DVCS because that 
 means there is no way to link a particular build to the source status and 
 you cannot use globally valid build numbers.
 
 Git (and I'm sure the others) provides a globally unique id for each
 revision.  Isn't that sufficient?
 

No in that you cannot follow revisions. What you get are those horrible UUIDs 
that you can't seriously use other that in some autogenerated form (that's one 
of the main reasons I abandoned GIt after giving it a try).


 But feature branches are as easily (IMHO even more easily since you can 
 closely monitor what others are contributing) worked on with SVN (routinely 
 used with R) so I'm not sure what DVCS would buy you.
 
 Feature branches are _much_ easier with git - to the point where some
 people suggest using a separate feature branch for every feature you
 develop.
 

Ok, what's different? It's trivial to create branches in SVN and trivial to 
merge - how is it easier in git? That may be the part I don't understand..


 AFAICS the only benefit of DVCS is that if you are on a remote island 
 without any internet connection you can accumulate multiple commits before 
 merging them back. I can't say that I desperately need that functionality ;).
 
 You have never worked on an airplane or other location without internet 
 access?  You must have lived a very privileged life ;)
 

Oh, you don't have internet in your airplane? ;) But seriously, yes, I have 
hacked stuff on airplanes but in general I'm able to get an access soon enough 
to have reasonable commit granularity. But yes, I do agree that it can be 
useful at very limited number of times (maybe once or twice so far for me), but 
that doesn't convince me to give up revisions and central conflict resolution 
which I use daily. [Note: this is my personal preference]

Cheers,
Simon

__
R-devel@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-devel


Re: [Rd] SVN vs DVCS

2010-05-26 Thread Gabor Grothendieck
On Wed, May 26, 2010 at 11:38 AM, Hadley Wickham had...@rice.edu wrote:
 It's a real
 shame that this unique component of R-forge is so closely connected to
 the tools that many other sites provide.

R-Forge does have the capability of mirroring an external subversion
repository according to section 4.2 of the R-Forge manual:
http://r-forge.r-project.org/R-Forge_Manual.pdf

__
R-devel@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-devel



Re: [Rd] SVN vs DVCS

2010-05-26 Thread Simon Urbanek

On May 26, 2010, at 12:26 PM, Antonio, Fabio Di Narzo wrote:

 2010/5/26 Hadley Wickham had...@rice.edu:
 Yes, that's a very good point (although in my experience it takes a
 very long time to do the initial download of the SVN repository). I'm
 not an expert on these systems, but I imagine the main downside (other
 than speed) of having SVN upstream is that you have to keep the
 history linear,
 
 That (non-linear history) is IMHO the biggest drawback of DVCS because that 
 means there is no way to link a particular build to the source status and 
 you cannot use globally valid build numbers.
 
 Git (and I'm sure the others) provides a globally unique id for each
 revision.  Isn't that sufficient?
 
 But feature branches are as easily (IMHO even more easily since you can 
 closely monitor what others are contributing) worked on with SVN (routinely 
 used with R) so I'm not sure what DVCS would buy you.
 
 Feature branches are _much_ easier with git - to the point where some
 people suggest using a separate feature branch for every feature you
 develop.
 
 AFAICS the only benefit of DVCS is that if you are on a remote island 
 without any internet connection you can accumulate multiple commits before 
 merging them back. I can't say that I desperately need that functionality 
 ;).
 
 You have never worked on an airplane or other location without
 internet access?  You must have lived a very privileged life ;)
 
 Some people just have decent web access only at work, and if you work
 on your R project like at home or on the train, you're already having
 some difficulties. But please, not the airplane argument! (just
 joking...).
 
 Moreover, 'local' commits are way faster than network-based commits. I
 can testify: 1microsecond vs 1second delay (or more, depending on how
 crappy is your net access) *is* a big difference. On your local
 machine, you end up committing much more often, with smaller and
 self-contained commits, generally producing a cleaner history.
 

I disagree - I don't find commit time having any impact on what I commit. It's 
always a logical chunk (which is why SVN was such a great step forward from 
CVS). My RForge does check on commit so I don't even bother waiting for the 
commit to finish (waiting is just useful if I want the check result - the 
actual commit is pretty much instantaneous). However, with SVN you'll know 
immediately if someone else was working on the same issue in the meantime - 
with DVCS you won't (this happens in R more often that you would think). [Note: 
again, this is rather about personal preferences I suspect]

Cheers,
Simon

__
R-devel@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-devel


Re: [Rd] SVN vs DVCS

2010-05-26 Thread Seth Falcon

On 5/26/10 4:16 AM, Gabor Grothendieck wrote:

Note that one can also use any of the dvcs systems without actually
moving from svn by using the dvcs (or associated extension/addon) as
an svn client or by using it on an svn checkout.


FWIW, I have been using git for several years now as my vsc of choice 
and use it for all svn-backed projects (R included) via git-svn.


Some of the things I like:

- Being able to organize changes in local commits that can be revised, 
reordered, rebased prior to publishing.  Once I got in the habit of 
working this way, I simply can't imagine going back.


- Having quick access to full repository history without network 
access/delay.  Features for searching change history are more powerful 
(or easier for me to use) and I have found that useful as well.


- This may not be true any longer with more recent svn servers/clients, 
but aside form the initial repo clone, working via git-svn was 
noticeably faster than straight svn client (!) -- I think related to how 
the tools organize the working copy and how many fstat calls they make.


- I find the log reviewing functionality much better suited to reviewing 
changes.



+ seth

--
Seth Falcon | @sfalcon | http://userprimary.net/

__
R-devel@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-devel


[Rd] R 2.10 and help

2010-05-26 Thread Jamie Love
Hi all,

I've been developing a little project for the past few months on R2.9.
The project utilises RServe to access R over at tcp/ip connection.

One feature of the project is to take R-Help and display it to the
user through my own UI. In 2.9 this worked well as all the html help
was pre-generated - I could ask R to find the help (e.g. help(plot)
would return the rd file), and then I could look up the .html file
from that.

In R 2.10 things changed dramatically for help, and now html help is
generated when requested from the rd files.

I've tried various ways to access the help as HTML since then, but
I've had no luck. I was wondering if anyone would be able to suggest
an approach where I could run 'help(plot)', and then take the
resulting .rd file and generate the necessary html help from it.

Thanks,

-- 
Jamie Love

__
R-devel@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-devel


Re: [Rd] SVN vs DVCS

2010-05-26 Thread Antonio, Fabio Di Narzo
2010/5/26 Simon Urbanek simon.urba...@r-project.org:

 On May 26, 2010, at 12:26 PM, Antonio, Fabio Di Narzo wrote:

 2010/5/26 Hadley Wickham had...@rice.edu:
 Yes, that's a very good point (although in my experience it takes a
 very long time to do the initial download of the SVN repository). I'm
 not an expert on these systems, but I imagine the main downside (other
 than speed) of having SVN upstream is that you have to keep the
 history linear,

 That (non-linear history) is IMHO the biggest drawback of DVCS because 
 that means there is no way to link a particular build to the source status 
 and you cannot use globally valid build numbers.

 Git (and I'm sure the others) provides a globally unique id for each
 revision.  Isn't that sufficient?

 But feature branches are as easily (IMHO even more easily since you can 
 closely monitor what others are contributing) worked on with SVN 
 (routinely used with R) so I'm not sure what DVCS would buy you.

 Feature branches are _much_ easier with git - to the point where some
 people suggest using a separate feature branch for every feature you
 develop.

 AFAICS the only benefit of DVCS is that if you are on a remote island 
 without any internet connection you can accumulate multiple commits before 
 merging them back. I can't say that I desperately need that functionality 
 ;).

 You have never worked on an airplane or other location without
 internet access?  You must have lived a very privileged life ;)

 Some people just have decent web access only at work, and if you work
 on your R project like at home or on the train, you're already having
 some difficulties. But please, not the airplane argument! (just
 joking...).

 Moreover, 'local' commits are way faster than network-based commits. I
 can testify: 1microsecond vs 1second delay (or more, depending on how
 crappy is your net access) *is* a big difference. On your local
 machine, you end up committing much more often, with smaller and
 self-contained commits, generally producing a cleaner history.


 I disagree - I don't find commit time having any impact on what I commit. 
 It's always a logical chunk (which is why SVN was such a great step forward 
 from CVS). My RForge does check on commit so I don't even bother waiting for 
 the commit to finish (waiting is just useful if I want the check result - the 
 actual commit is pretty much instantaneous).
 However, with SVN you'll know immediately if someone else was working on the 
 same issue in the meantime - with DVCS you won't (this happens in R more 
 often that you would think).

You'll know immediately as long as you're connected, and that holds
for DVCS too.
Beside, people working simultaneously on the same files and needing
svn to tell them of that? And that happening often? I would hope on
better human interaction and work division, rather than svn conflicts
checks. But...
 [Note: again, this is rather about personal preferences I suspect]
indeed.

cheers,
fabio.


 Cheers,
 Simon





-- 
Antonio Fabio Di Narzo, PhD.
Swiss Institute for Bioinformatics - Bioinformatics Core Facility
Office 2029, Génopode, Quartier Sorge
CH-1015 Lausanne, Switzerland
Tel: +41 21 692 4087
Fax: +41 21 692 4065

__
R-devel@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-devel


Re: [Rd] R 2.10 and help

2010-05-26 Thread Simon Urbanek

On May 26, 2010, at 4:51 PM, Jamie Love wrote:

 Hi all,
 
 I've been developing a little project for the past few months on R2.9.
 The project utilises RServe to access R over at tcp/ip connection.
 
 One feature of the project is to take R-Help and display it to the
 user through my own UI. In 2.9 this worked well as all the html help
 was pre-generated - I could ask R to find the help (e.g. help(plot)
 would return the rd file), and then I could look up the .html file
 from that.
 
 In R 2.10 things changed dramatically for help, and now html help is
 generated when requested from the rd files.
 
 I've tried various ways to access the help as HTML since then, but
 I've had no luck. I was wondering if anyone would be able to suggest
 an approach where I could run 'help(plot)', and then take the
 resulting .rd file and generate the necessary html help from it.
 

It's much more simple since 2.10 as you can simply use the URL (same URL as the 
help system uses). For examples see JGR or the Mac GUI. Or you can do the same 
thing that the Rhttpd server does  actually generating the html - see 
tools:::httpd 

Cheers,
Simon

__
R-devel@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-devel


Re: [Rd] SVN vs DVCS

2010-05-26 Thread Peter Dalgaard
Antonio, Fabio Di Narzo wrote:

 Beside, people working simultaneously on the same files and needing
 svn to tell them of that? And that happening often? I would hope on
 better human interaction and work division, rather than svn conflicts
 checks. But...
 [Note: again, this is rather about personal preferences I suspect]
 indeed.

It actually happens quite often. Sometimes because two people in, say,
Canada and Oxford fix the same bug report (usually in nearly the same
way), but more typically because ALL our changes are recorded in the
same NEWS file.

Besides, SVN conflict are almost always trivial to resolve: Either a
matter of selecting one of two changes or keeping both.

-- 
Peter Dalgaard
Center for Statistics, Copenhagen Business School
Phone: (+45)38153501
Email: pd@cbs.dk  Priv: pda...@gmail.com

__
R-devel@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-devel


Re: [Rd] SVN vs DVCS

2010-05-26 Thread Hadley Wickham
On Wednesday, May 26, 2010, Peter Dalgaard pda...@gmail.com wrote:
 Antonio, Fabio Di Narzo wrote:

 Beside, people working simultaneously on the same files and needing
 svn to tell them of that? And that happening often? I would hope on
 better human interaction and work division, rather than svn conflicts
 checks. But...
 [Note: again, this is rather about personal preferences I suspect]
 indeed.

 It actually happens quite often. Sometimes because two people in, say,
 Canada and Oxford fix the same bug report (usually in nearly the same
 way), but more typically because ALL our changes are recorded in the
 same NEWS file.

 Besides, SVN conflict are almost always trivial to resolve: Either a
 matter of selecting one of two changes or keeping both.

And that's no more difficult in git or any other dvcs.

Hadley
 --
 Peter Dalgaard
 Center for Statistics, Copenhagen Business School
 Phone: (+45)38153501
 Email: pd@cbs.dk  Priv: pda...@gmail.com

 __
 r-de...@r-project.org mailing list
 https://stat.ethz.ch/mailman/listinfo/r-devel


-- 
Assistant Professor / Dobelman Family Junior Chair
Department of Statistics / Rice University
http://had.co.nz/

__
R-devel@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-devel


Re: [Rd] R 2.10 and help

2010-05-26 Thread Hadley Wickham
 I've tried various ways to access the help as HTML since then, but
 I've had no luck. I was wondering if anyone would be able to suggest
 an approach where I could run 'help(plot)', and then take the
 resulting .rd file and generate the necessary html help from it.


 It's much more simple since 2.10 as you can simply use the URL (same URL as 
 the help system uses). For examples see JGR or the Mac GUI. Or you can do the 
 same thing that the Rhttpd server does  actually generating the html - see 
 tools:::httpd

Another approach along the same lines is the in progress helpr
package: http://github.com/hadley/helpr

Hadley

-- 
Assistant Professor / Dobelman Family Junior Chair
Department of Statistics / Rice University
http://had.co.nz/

__
R-devel@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-devel