Re: [OT - or may be not] The case for open computer programs

2012-05-31 Thread Michael Banck
On Tue, May 29, 2012 at 08:29:21PM +0200, Michael Hanke wrote:
 On Tue, May 29, 2012 at 08:00:57AM +0200, Andreas Tille wrote:
  http://www.nature.com/nature/journal/v482/n7386/full/nature10836.html
 
 In case you don't want to pay Nature to read about this, you can
 alternatively pay Science...
 
 http://www.sciencemag.org/content/336/6078/159

Another editorial on this, this time in an OA journal:

http://www.scfbm.org/content/7/1/2/abstract


Michael


-- 
To UNSUBSCRIBE, email to debian-med-requ...@lists.debian.org
with a subject of unsubscribe. Trouble? Contact listmas...@lists.debian.org
Archive: 
http://lists.debian.org/20120531104834.gm10...@nighthawk.chemicalconnection.dyndns.org



Re: [OT - or may be not] The case for open computer programs

2012-05-31 Thread Luis Ibanez
On Tue, May 29, 2012 at 11:14 PM, Yaroslav Halchenko
deb...@onerussian.comwrote:


 Exactly!   And there is more to it.  Someone bold could event exaggerate
 that requiring open code on its own is **useless** besides for being
 an ideal description of the method implementation.  Why useless?
 Because in majority of the cases open code will hardly be usable by a
 considerable part of scientific community for one reason (e.g. as you
 pointed out commercial base) or another.  Quite often simply
 because that code was not created to be used by others.



+1  Very good point.

Open Code is just a Beginning.

I have indeed seen a lot of research code
that I'll be afraid to use...:-)

and a lot of code, that just plain doesn't work...

We need to promote a culture of scientific programming in which
researchers adopt sound software development practices, and
learn about software development practices that are rooted in
quality assurance.

Researchers tend to make the mistake of:

Not taking the time to go fast

As it is nicely put in the Clean Code book by Robert Martin.
http://www.informit.com/articles/article.aspx?p=1235624seqNum=3

When it comes to chose our daily practices of software development
it is too common to write code: just for the next paper.

A more modern culture of sound software development practices
needs to be grown in the field of research. In particular, Unit testing
(which is indeed basic reproducibility), revision control, tutorials and
documentation.

Great work in this front is done by the Software Carpentry:

   http://software-carpentry.org/

We need to figure out how to further scale this type of initiatives.



   Luis


[OT - or may be not] The case for open computer programs

2012-05-29 Thread Andreas Tille
Hi,

you might like to read:

The case for open computer programs
Darrel C. Ince, Leslie Hatton  John Graham-Cumming

Scientific communication relies on evidence that cannot be entirely
included in publications, but the rise of computational science has
added a new layer of inaccessibility. Although it is now accepted that
data should be made available on request, the current regulations
regarding the availability of software are inconsistent. We argue that,
with some exceptions, anything less than the release of source programs
is intolerable for results that depend on computation. The vagaries of
hardware, software and natural language will always ensure that exact
reproducibility remains uncertain, but withholding code increases the
chances that efforts to reproduce results will fail.

http://www.nature.com/nature/journal/v482/n7386/full/nature10836.html


Kind regards

   Andreas.

-- 
http://fam-tille.de


-- 
To UNSUBSCRIBE, email to debian-med-requ...@lists.debian.org
with a subject of unsubscribe. Trouble? Contact listmas...@lists.debian.org
Archive: http://lists.debian.org/20120529060057.gc18...@an3as.eu



Re: [OT - or may be not] The case for open computer programs

2012-05-29 Thread lina
On Tue, May 29, 2012 at 2:00 PM, Andreas Tille andr...@an3as.eu wrote:
 Hi,

 you might like to read:

 The case for open computer programs
 Darrel C. Ince, Leslie Hatton  John Graham-Cumming

Seems it's been recommended before in the list.


 Scientific communication relies on evidence that cannot be entirely
 included in publications, but the rise of computational science has
 added a new layer of inaccessibility. Although it is now accepted that
 data should be made available on request, the current regulations
 regarding the availability of software are inconsistent. We argue that,
 with some exceptions, anything less than the release of source programs
 is intolerable for results that depend on computation. The vagaries of
 hardware, software and natural language will always ensure that exact
 reproducibility remains uncertain, but withholding code increases the
 chances that efforts to reproduce results will fail.

 http://www.nature.com/nature/journal/v482/n7386/full/nature10836.html

Thanks,


 Kind regards

       Andreas.

 --
 http://fam-tille.de


 --
 To UNSUBSCRIBE, email to debian-science-requ...@lists.debian.org
 with a subject of unsubscribe. Trouble? Contact listmas...@lists.debian.org
 Archive: http://lists.debian.org/20120529060057.gc18...@an3as.eu



--
To UNSUBSCRIBE, email to debian-med-requ...@lists.debian.org
with a subject of unsubscribe. Trouble? Contact listmas...@lists.debian.org
Archive: 
http://lists.debian.org/CAG9cJmkxiAQQQi1utS5Z=en56n0obpmY6i=br1cf_7h-ofg...@mail.gmail.com



Re: [OT - or may be not] The case for open computer programs

2012-05-29 Thread Oz Nahum Tiram
Hi All,
Indeed strong words, published in Nature where you need to pay $32 to
read what we all know already.

Regards,
Oz

On Tue, May 29, 2012 at 8:00 AM, Andreas Tille andr...@an3as.eu wrote:
 Hi,

 you might like to read:

 The case for open computer programs
 Darrel C. Ince, Leslie Hatton  John Graham-Cumming

 Scientific communication relies on evidence that cannot be entirely
 included in publications, but the rise of computational science has
 added a new layer of inaccessibility. Although it is now accepted that
 data should be made available on request, the current regulations
 regarding the availability of software are inconsistent. We argue that,
 with some exceptions, anything less than the release of source programs
 is intolerable for results that depend on computation. The vagaries of
 hardware, software and natural language will always ensure that exact
 reproducibility remains uncertain, but withholding code increases the
 chances that efforts to reproduce results will fail.

 http://www.nature.com/nature/journal/v482/n7386/full/nature10836.html


 Kind regards

       Andreas.

 --
 http://fam-tille.de


 --
 To UNSUBSCRIBE, email to debian-science-requ...@lists.debian.org
 with a subject of unsubscribe. Trouble? Contact listmas...@lists.debian.org
 Archive: http://lists.debian.org/20120529060057.gc18...@an3as.eu



--
To UNSUBSCRIBE, email to debian-med-requ...@lists.debian.org
with a subject of unsubscribe. Trouble? Contact listmas...@lists.debian.org
Archive: 
http://lists.debian.org/cadkmuwm_ijlk15dlak142u8si2fpqe4ptrx6oumutzthkr7...@mail.gmail.com



Re: [OT - or may be not] The case for open computer programs

2012-05-29 Thread Luis Ibanez
What is more interesting is the reaction that
followed in serious scientific journals:


1)  PLoS ONE (the Open Access Mega Journal that currently
 publishes 3% of all the STM literature) now requires
 software papers to include the source code under an
 Open Source license:

http://www.plosone.org/static/guidelines.action#software
http://www.plosone.org/static/policies.action


quote

*Software.* PLoS supports the development of open source software and
believes that, for submissions in which software is the central part of the
paper, adherence to appropriate open source standards will ensure that the
submission conforms to (1) our requirements that methods be described in
sufficient detail that another researcher can reproduce the experiments
described, (2) our aim to promote openness in research, and (3) our
intention that all work published in PLoS journals can be built upon by
future researchers. Therefore, if new software or a new algorithm is
central to a PLoS paper, the authors must confirm that the software
conforms to the Open Source Definition http://www.opensource.org/docs/osd,
have deposited the following three items in an open software archive, and
included in the submission as Supporting Information:

   - *The associated source code of the software described by the
paper.*This should, as far as possible, follow accepted community
standards and be
   licensed under a suitable license such as BSD, LGPL, or MIT (see
   http://www.opensource.org/licenses/alphabetical for a full list).
   Dependency on commercial software such as Mathematica and MATLAB does not
   preclude a paper from consideration, although complete open source
   solutions are preferred.
   - *Documentation for running and installing the software.* For end-user
   applications, instructions for installing and using the software are
   prerequisite; for software libraries, instructions for using the
   application program interface are prerequisite.
   - *A test dataset with associated control parameter settings.* Where
   feasible, results from standard test sets should be included. Where
   possible, test data should not have any dependencies — for example, a
   database dump.

/quote


One of the new Journals in BiomedCentral,

   http://www.openresearchcomputation.com/about

which is also Open Access,

developed a similar policy:


quote

   - Reproducibility Verificationhttp://www.openresearchcomputation.com/about
  - *Software: Open Research Computation* differs from other journals
  with a software focus in i*ts requirement for the software source
  code to be made available* under an *Open Source* Initiative
  compliant license, and in its assessment of the quality of
documentation and
  * testing* of the software.
  - *Data*: *Open Research Computation* has very high standards for *data
  availability* and *reproducibility*. It is expected that all the *data
  *, *code*, and *software required to reproduce *any examples in the
  paper* will be made freely available for download* from an
  appropriate recognized repository or the journal website.
   - Review criteria for Source Code
   - Code and Licensehttp://www.openresearchcomputation.com/about/reviewers
 - Is the source code as well as executables and/or an instance of
 the service (of a clearly defined version) available on appropriate
 * public repository*?
 - Is the source code made available under an *Open Source *Initiative
 (http://www.opensource.org/licenses/category) compliant license?
 Specifically users must have the right to examine, compile,
run and modify
 the code for any purpose.
 - Are project authors and contributors clearly defined, ideally
 through a Description of a Project [
 http://en.wikipedia.org/wiki/DOAP, http://trac.usefulinc.com/doap]
 document? We recommend the use of the automatic DOAP generator such as
 those linked here:
http://trac.usefulinc.com/doap/wiki/Generatorhttp://trac.usefulinc.com/doap/wiki/Generators

/quote



It has been said that Open Source was the application
of the Scientific Method to the process of Software
development.  These recent developments show that
Open Source has a lot to give back to the scientific
community where the practice of Reproducibility
Verification has been lost and substituted by the
inferior and quite defective practice of peer-reviews
based on simple opinions instead of reproducible
experiments.

For one thing, the simple practice of doing revision
control, and implementing unit testing frameworks
that can be executed over and over again, will
already revolutionize the way software is managed
in many research institutions. It is sadly too common
that nobody in a lab can replicate a computational
experiment even days after it has been performed.


More on this by Victoria Stodden:
http://www.stanford.edu/~vcs/talks/CaltechMay122011-STODDEN.pdf

Re: [OT - or may be not] The case for open computer programs

2012-05-29 Thread Yaroslav Halchenko
If you have 15$ left have you read the Nature paper, then you could also
read less particular about details version of the same thing from Science:

http://www.sciencemag.org/content/336/6078/159.full
Research Priorities
Shining Light into Black Boxes
A. Morin, J. Urban, P. D. Adams, I. Foster, A. Sali, D. Baker, P. Sliz,* 

The publication and open exchange of knowledge and material form the backbone
of scientific progress and reproducibility and are obligatory for publicly
funded research. Despite increasing reliance on computing in every domain of
scientific endeavor, the computer source code critical to understanding and
evaluating computer programs is commonly withheld, effectively rendering these
programs “black boxes” in the research work flow. Exempting from basic
publication and disclosure standards such a ubiquitous category of research
tool carries substantial negative consequences. Eliminating this disparity will
require concerted policy action by funding agencies and journal publishers, as
well as changes in the way research institutions receiving public funds manage
their intellectual property (IP). 

and publicly available press-release for it from Scientific American:
Secret Computer Code Threatens Science
http://www.scientificamerican.com/article.cfm?id=secret-computer-code-threatens-science


Meanwhile we can just keep going forward making it all possible ;)

Cheers,

On Tue, 29 May 2012, Oz Nahum Tiram wrote:

 Hi All,
 Indeed strong words, published in Nature where you need to pay $32 to
 read what we all know already.

 Regards,
 Oz

 On Tue, May 29, 2012 at 8:00 AM, Andreas Tille andr...@an3as.eu wrote:
  Hi,

  you might like to read:

  The case for open computer programs
  Darrel C. Ince, Leslie Hatton  John Graham-Cumming

  Scientific communication relies on evidence that cannot be entirely
  included in publications, but the rise of computational science has
  added a new layer of inaccessibility. Although it is now accepted that
  data should be made available on request, the current regulations
  regarding the availability of software are inconsistent. We argue that,
  with some exceptions, anything less than the release of source programs
  is intolerable for results that depend on computation. The vagaries of
  hardware, software and natural language will always ensure that exact
  reproducibility remains uncertain, but withholding code increases the
  chances that efforts to reproduce results will fail.

  http://www.nature.com/nature/journal/v482/n7386/full/nature10836.html


  Kind regards

        Andreas.

  --
  http://fam-tille.de


  --
  To UNSUBSCRIBE, email to debian-science-requ...@lists.debian.org
  with a subject of unsubscribe. Trouble? Contact 
  listmas...@lists.debian.org
  Archive: http://lists.debian.org/20120529060057.gc18...@an3as.eu
-- 
Yaroslav O. Halchenko
Postdoctoral Fellow,   Department of Psychological and Brain Sciences
Dartmouth College, 419 Moore Hall, Hinman Box 6207, Hanover, NH 03755
Phone: +1 (603) 646-9834   Fax: +1 (603) 646-1419
WWW:   http://www.linkedin.com/in/yarik


-- 
To UNSUBSCRIBE, email to debian-med-requ...@lists.debian.org
with a subject of unsubscribe. Trouble? Contact listmas...@lists.debian.org
Archive: http://lists.debian.org/20120529180732.go11...@onerussian.com



Re: [OT - or may be not] The case for open computer programs

2012-05-29 Thread Michael Hanke
On Tue, May 29, 2012 at 08:00:57AM +0200, Andreas Tille wrote:
 http://www.nature.com/nature/journal/v482/n7386/full/nature10836.html

In case you don't want to pay Nature to read about this, you can
alternatively pay Science...

http://www.sciencemag.org/content/336/6078/159

-- 
Michael Hanke
http://mih.voxindeserto.de


-- 
To UNSUBSCRIBE, email to debian-med-requ...@lists.debian.org
with a subject of unsubscribe. Trouble? Contact listmas...@lists.debian.org
Archive: http://lists.debian.org/20120529182921.GA15178@meiner



Re: [OT - or may be not] The case for open computer programs

2012-05-29 Thread Brendon Higgins
Hi,

Yaroslav Halchenko wrote (May 29, 2012):
 If you have 15$ left have you read the Nature paper, then you could also
 read less particular about details version of the same thing from
 Science:
 
 http://www.sciencemag.org/content/336/6078/159.full
 [snip]
 
 Meanwhile we can just keep going forward making it all possible ;)

Actually, this brings to mind an interesting point. The articles mentioned in 
this thread, from what I can see, do not address open operating systems, but 
rather they specifically focus on open access to the specialized code that is 
used to back scientific conclusions. So it seems there is a distinction between 
what we are making possible with Debian, and what is being called for here. In 
this context, open _code_ and open _platforms_ are two different (albeit 
related) beasts.

The articles really don't go far enough. Given the prevalence of closed 
proprietary platforms (e.g. Windows, Matlab, Mathematica, LabVIEW), the fact 
that the foundations of science that relies on computation in various forms 
(which, I assert, is probably most of science these days) are not necessarily 
open seems to me to be a rather significant elephant in the room. How can one 
be certain their results are valid and true if the platform on which those 
results are acquired cannot similarly be scrutinized?

Ideally the whole system should be open, not just the chunk of code unique to 
each experiment.

Peace,
Brendon


-- 
To UNSUBSCRIBE, email to debian-med-requ...@lists.debian.org
with a subject of unsubscribe. Trouble? Contact listmas...@lists.debian.org
Archive: http://lists.debian.org/201205291455.18362.blhigg...@gmail.com



Re: [OT - or may be not] The case for open computer programs

2012-05-29 Thread Luis Ibanez
On Tue, May 29, 2012 at 2:55 PM, Brendon Higgins blhigg...@gmail.comwrote:

 Ideally the whole system should be open, not just the chunk of code unique
 to
 each experiment.


+1 Luis


Re: [OT - or may be not] The case for open computer programs

2012-05-29 Thread Yaroslav Halchenko

On Tue, 29 May 2012, Brendon Higgins wrote:
  http://www.sciencemag.org/content/336/6078/159.full
  [snip]
  Meanwhile we can just keep going forward making it all possible ;)
 ...
 In 
 this context, open _code_ and open _platforms_ are two different (albeit 
 related) beasts.
 ...
 Ideally the whole system should be open, not just the chunk of code unique to 
 each experiment.

Exactly!   And there is more to it.  Someone bold could event exaggerate
that requiring open code on its own is **useless** besides for being
an ideal description of the method implementation.  Why useless?
Because in majority of the cases open code will hardly be usable by a
considerable part of scientific community for one reason (e.g. as you
pointed out commercial base) or another.  Quite often simply
because that code was not created to be used by others.

Moreover, we all know, that even providing usable binaries accompanied
by FOSS code, without formalized build procedures and clearly specified
dependencies would complicate any extension of the code, thus often
significantly reducing the benefit of having that code under FOSS
license to begin with.   As the result, mandating open code to
accompany research papers would be of limited practical importance to
the science due to difficulty of its adoption and extension.

And that is where a platform which addresses those demands would be
indispensable; but neither of those papers indeed goes that far.
Overall all these recent trends are only of benefit for us to
promote Debian because of its unique organization and wealth ;)

-- 
Yaroslav O. Halchenko
Postdoctoral Fellow,   Department of Psychological and Brain Sciences
Dartmouth College, 419 Moore Hall, Hinman Box 6207, Hanover, NH 03755
Phone: +1 (603) 646-9834   Fax: +1 (603) 646-1419
WWW:   http://www.linkedin.com/in/yarik


-- 
To UNSUBSCRIBE, email to debian-med-requ...@lists.debian.org
with a subject of unsubscribe. Trouble? Contact listmas...@lists.debian.org
Archive: http://lists.debian.org/20120530031432.gw11...@onerussian.com