Re: [R] Measurements of 3000 criminals

2006-11-04 Thread Ted Harding
On 31-Oct-06 Martin Maechler wrote:
 Jean == Jean lobry [EMAIL PROTECTED]
 on Sat, 28 Oct 2006 13:07:04 +0200 writes:
  Hallo everyone,
  
  excuse me if this is not a genuine R question but I do
  not know where to ask else.
  
  Referring to e.g.
  
  https://stat.ethz.ch/pipermail/r-help/2004-December/062114.html
  
  I wonder if these measurements of 3000 criminals (raw
  data) are available anywhere.
  [...]
 
 Jean Dietrich,
 
 Jean I'm not sure, but this is perhaps what you want:
 
 Jean crim -
 Jean
 read.table(http://pbil.univ-lyon1.fr/R/donnees/criminals1902.txt;)
 Jean [...]
 
 Jean, this is really very nice, and given the historic
 importance of the data, I'd like to add this to the official
 R 'datasets'  (the standard data set package), with tribute to
 you and Anne-Béatrice, and the references.
 [...]

A further historical note. No doubt many R-helpreaders will
have recently received (e.g. via allstat) the announcement:

  RSS Highlands local group meeting, December 5:
  100 years of statistics in Aberdeen

  Dear all!

  This is a first announcement for a special meeting of the
  Highlands local group of the RSS celebrating the centenary
  of the first lecturer in Statistics at the University of
  Aberdeen (Dr William Robert Macdonell), one of the first
  in the UK.

The same W. R. Macdonell as published the data in question.

Being a centenary takes it back to 1906. Such biographic
information as I have been able to find says:

W. R. MACDONELL. Born October 16, 1852. Died May 15, 1916
  KER Biometrika.1917; 11: 281-283

So he was well into his 50s when so honoured, and had only
few years to go.

Somewhat to my surpise, the excellent Index of Biographies
of mathematicians and statisticians at the University of
St Andrews:

http://www-groups.dcs.st-and.ac.uk/~history/BiogIndex.html

has nothing for him, despite his being a First for Scotland!

Best wishes to all,
Ted.


E-Mail: (Ted Harding) [EMAIL PROTECTED]
Fax-to-email: +44 (0)870 094 0861
Date: 04-Nov-06   Time: 19:32:50
-- XFMail --

__
R-help@stat.math.ethz.ch mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.


Re: [R] Measurements of 3000 criminals

2006-10-31 Thread Martin Maechler
 Jean == Jean lobry [EMAIL PROTECTED]
 on Sat, 28 Oct 2006 13:07:04 +0200 writes:

 Hallo everyone,
 
 excuse me if this is not a genuine R question but I do
 not know where to ask else.
 
 Referring to e.g.
 
 https://stat.ethz.ch/pipermail/r-help/2004-December/062114.html
 
 I wonder if these measurements of 3000 criminals (raw
 data) are available anywhere.  At least I didn't find
 them in the R datasets package or by means of Google.
 What I did find was a table of frequencies of the central
 values for *grouped* classifications (finger lenghts) in
 the Handbook of Small Data Sets.
 
 Thank you in advance.
 
 D. Trenkler

Jean Dietrich,

Jean I'm not sure, but this is perhaps what you want:

Jean crim -
Jean read.table(http://pbil.univ-lyon1.fr/R/donnees/criminals1902.txt;)

May I propose to enhance this to

  cfile - http://pbil.univ-lyon1.fr/R/donnees/criminals1902.txt;
  crimtab - as.table(data.matrix(read.table(cfile, check.names=FALSE)))

  ## and e.g.,
  print(crimtab, zero=.)


Jean, this is really very nice, and given the historic
importance of the data, I'd like to add this to the official R
'datasets'  (the standard data set package),
with tribute to you and Anne-Béatrice, and the references.

If you agree, I'd be happy if you could help me (in private
e-mail exchanges) to easily complete the relevant help page.

Martin Maechler, ETH Zurich

Jean For some R code playing with this dataset, open this
Jean (draft) document:

Jean http://pbil.univ-lyon1.fr/members/lobry/R/convergencet.pdf

Jean and jump to section 4.

Jean HTH,

Jean Jean -- Jean R. Lobry ([EMAIL PROTECTED])
Jean Laboratoire BBE-CNRS-UMR-5558, Univ. C. Bernard - LYON
Jean I, 43 Bd 11/11/1918, F-69622 VILLEURBANNE CEDEX,
Jean FRANCE allo : +33 472 43 27 56 fax : +33 472 43 13 88
Jean http://pbil.univ-lyon1.fr/members/lobry/

Jean __
Jean R-help@stat.math.ethz.ch mailing list
Jean https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do
Jean read the posting guide
Jean http://www.R-project.org/posting-guide.html and
Jean provide commented, minimal, self-contained,
Jean reproducible code.

__
R-help@stat.math.ethz.ch mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.


Re: [R] Measurements of 3000 criminals

2006-10-29 Thread Jean lobry

On that basis, and having looked at Jean's PDF

http://pbil.univ-lyon1.fr/members/lobry/R/convergencet.pdf

the table on the 11th page thereof (Section 4) seems to be
a facsimile reproduction of the corresponding page in the
Biometrika article by Macdonell. (I do not have access at
the moment to the original Biometrika, so cannot verify this),
and that table gives the data as originally published by
Macdonell.


Ted,

Yes, indeed, this is a screen-copy from the Macdonell paper
scanned in JSTOR. As you have guessed we (Anne-Béatrice
Dufour and mylself) have entered the data, checked that
margins were consistent, and converted height in cm.

We were planning to use this for teaching: a practical under
R to reproduce Student's original experiment, but we didn't
have the time up to now to write the handout.

Best,

Jean
-- 
Jean R. Lobry([EMAIL PROTECTED])
Laboratoire BBE-CNRS-UMR-5558, Univ. C. Bernard - LYON I,
43 Bd 11/11/1918, F-69622 VILLEURBANNE CEDEX, FRANCE
allo  : +33 472 43 27 56 fax: +33 472 43 13 88
http://pbil.univ-lyon1.fr/members/lobry/

__
R-help@stat.math.ethz.ch mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.


Re: [R] Measurements of 3000 criminals

2006-10-28 Thread Jean lobry
  Hallo everyone,

  excuse me if this is not a genuine R question but I do not know where to
  ask else.

  Referring  to e.g.

  https://stat.ethz.ch/pipermail/r-help/2004-December/062114.html

  I wonder if these measurements of 3000 criminals (raw data) are
  available anywhere.  At least I didn't find them in the R datasets
  package or by means of Google.  What I did find was a table of
  frequencies of the central values for *grouped* classifications (finger
  lenghts) in the Handbook of Small Data Sets.

  Thank you in advance.

  D. Trenkler

Dietrich,

I'm not sure, but this is perhaps what you want:

crim - read.table(http://pbil.univ-lyon1.fr/R/donnees/criminals1902.txt;)

For some R code playing with this dataset, open this (draft) document:

http://pbil.univ-lyon1.fr/members/lobry/R/convergencet.pdf

and jump to section 4.

HTH,

Jean
-- 
Jean R. Lobry([EMAIL PROTECTED])
Laboratoire BBE-CNRS-UMR-5558, Univ. C. Bernard - LYON I,
43 Bd 11/11/1918, F-69622 VILLEURBANNE CEDEX, FRANCE
allo  : +33 472 43 27 56 fax: +33 472 43 13 88
http://pbil.univ-lyon1.fr/members/lobry/

__
R-help@stat.math.ethz.ch mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.


Re: [R] Measurements of 3000 criminals

2006-10-28 Thread Ted Harding
On 28-Oct-06 Jean lobry wrote:
  Hallo everyone,

  excuse me if this is not a genuine R question but I do not
  know where to ask else.

  Referring  to e.g.

  https://stat.ethz.ch/pipermail/r-help/2004-December/062114.html

  I wonder if these measurements of 3000 criminals (raw data)
  are available anywhere. At least I didn't find them in the
  R datasets package or by means of Google. What I did find
  was a table of frequencies of the central values for *grouped*
  classifications (finger lenghts) in the Handbook of Small Data
  Sets.

  Thank you in advance.

  D. Trenkler
 
 Dietrich,
 
 I'm not sure, but this is perhaps what you want:
 
 crim -
 read.table(http://pbil.univ-lyon1.fr/R/donnees/criminals1902.txt;)
 
 For some R code playing with this dataset, open this (draft)
 document:
 
 http://pbil.univ-lyon1.fr/members/lobry/R/convergencet.pdf
 
 and jump to section 4.
 
 HTH,
 
 Jean

Following up Dietrich's original URL

https://stat.ethz.ch/pipermail/r-help/2004-December/062114.html

I find that the seed for this thread was originally planted
by myself!

If you go back to that posting, you will read in the quotation
from Student (1908):

  Before I had succeeded in solving my problem analytically,
   I had endeavoured to do so empirically. The material used
   was a correlation table containing the height and left
   middle finger measurements of 3000 criminals, from a paper
   by W. R. Macdonnell (Biometrika, I, p. 219). ... 

[NB Typo: Macdonnell should be Macdonell; for 219 see
 below]

The crucial phrase is correlation table, i.e. a 2-way table
of counts in intervals of one variable by intervals of another
variable.

On that basis, and having looked at Jean's PDF

http://pbil.univ-lyon1.fr/members/lobry/R/convergencet.pdf

the table on the 11th page thereof (Section 4) seems to be
a facsimile reproduction of the corresponding page in the
Biometrika article by Macdonell. (I do not have access at
the moment to the original Biometrika, so cannot verify this),
and that table gives the data as originally published by
Macdonell.

This is not, of course, the raw data which would have been
3000 records each with the measurements of each of the 3000
individuals.

But I think that it is as close as one can get!

The references to Student's and Macdonell's articles are
given in Jean's PDF, including the fact that the table in
question was found on Macdonell's p. 216, not 219.

Cross-checking Jean's data file

http://pbil.univ-lyon1.fr/R/donnees/criminals1902.txt

against the Macdonell reproduction shows that the counts
are the same, the left-hand margins (finger length in mm)
agree, and the top margins also agree on the basis that
the heights are given by Jean in cm corresponding to the
midpoints of Macdonell's intervals in feet/inches.

Thus where Macdonell has 4' 79/16 -- 89/16, Jean has
142.24 which is 2.54*56 = 2.54*(4' 8).

Hoping that this helps!
Ted.


E-Mail: (Ted Harding) [EMAIL PROTECTED]
Fax-to-email: +44 (0)870 094 0861
Date: 28-Oct-06   Time: 13:55:28
-- XFMail --

__
R-help@stat.math.ethz.ch mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.


[R] Measurements of 3000 criminals

2006-10-26 Thread Dietrich Trenkler
Hallo everyone,

excuse me if this is not a genuine R question but I do not know where to
ask else.

Referring  to e.g.

https://stat.ethz.ch/pipermail/r-help/2004-December/062114.html

I wonder if these measurements of 3000 criminals (raw data) are
available anywhere.  At least I didn't find them in the R datasets
package or by means of Google.  What I did find was a table of
frequencies of the central values for *grouped* classifications (finger
lenghts) in the Handbook of Small Data Sets.

Thank you in advance.

D. Trenkler  

-- 
Dietrich Trenkler c/o Universitaet Osnabrueck 
Rolandstr. 8; D-49069 Osnabrueck, Germany
email: [EMAIL PROTECTED]

__
R-help@stat.math.ethz.ch mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.


Re: [R] Measurements of 3000 criminals

2006-10-26 Thread Ben Bolker
Dietrich Trenkler Dietrich.Trenkler at uni-osnabrueck.de writes:

 
 Hallo everyone,
 
 excuse me if this is not a genuine R question but I do not know where to
 ask else.
 
 Referring  to e.g.
 
 https://stat.ethz.ch/pipermail/r-help/2004-December/062114.html
 
 I wonder if these measurements of 3000 criminals (raw data) are
 available anywhere.  At least I didn't find them in the R datasets
 package or by means of Google.  What I did find was a table of
 frequencies of the central values for *grouped* classifications (finger
 lenghts) in the Handbook of Small Data Sets.
 
 Thank you in advance.
 
 D. Trenkler  
 

  The original paper on JSTOR
(Biometrika Vol I, no 2, 1902, W. R. Macdonell)
gives tables of bivariate distributions (head length
vs head breadth, height vs head breadth, height
vs left middle finger length) for the 3000-criminal
sample, a 1306-criminal subsample, and for a sample
of 1000 Cambridge men.
  It would be an interesting challenge for a
historian of science ...

   Ben Bolker

__
R-help@stat.math.ethz.ch mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.