Re: [R-pkg-devel] package CatDataAnalysis

2020-06-28 Thread Duncan Murdoch
It's easier to install a package from CRAN than from GitHub, but not all 
that much easier.  If it's too much trouble to satisfy CRAN, then don't 
bother.  Just post instructions on how to do a GitHub install of it and 
move on to other things.  (I'd post those instructions in a README.md 
file on the GitHub site, so when people follow your link they'll find 
the description.)
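
As a rough sketch, such README instructions could be as short as the
following.  It assumes the remotes package, which nobody in this thread
has named.  The repo holds foo.R outside the package, so the package may
sit in a subdirectory; that case is shown as a commented alternative
with a placeholder path.

```r
# Sketch only: 'remotes' is an assumption, not something named in the thread.
install.packages("remotes")

# Install straight from the GitHub repository given in the original post.
remotes::install_github("cjgeyer/CatDataAnalysis")

# If the package sits in a subdirectory of the repo, pass that path instead
# ("<path>" is a hypothetical placeholder, not a checked location):
# remotes::install_github("cjgeyer/CatDataAnalysis", subdir = "<path>")

# Load the package once it is installed.
library(CatDataAnalysis)
```

Whether the plain call or the subdir variant is the right one depends on
the repo layout, so it should be checked before going into a README.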


Duncan Murdoch

On 28/06/2020 12:07 p.m., Charles Geyer wrote:

I have a package that has the datasets for Categorical Data Analysis by
Agresti that do not appear in the book.  The whole package is a github repo
https://github.com/cjgeyer/CatDataAnalysis.  All of the data were
translated mechanically using the R script foo.R included in the repo (but
not in the package) from Agresti's web site
http://www.stat.ufl.edu/~aa/cda/data.html.

This package seems to be a useful service to students and teachers.  The
data are much simpler to use with this package than trying to get the data
from Agresti's web page (foo.R has 277 lines of code).

When I submitted the package to CRAN, I got the following response.


> The Description field of the DESCRIPTION file is intended to be a (one
> paragraph) description of what the package does and why it may be
> useful. Please elaborate. Tell the users what the datasets are about and
> what they contain so they can use them even when they haven't read your
> book.
>
> Please fix and resubmit, and document what was changed in the submission
> comments.


In an alternate universe without copyright law this seems a reasonable
request.  In this universe it seems to be asking for trouble.  I know about
fair use, but I am not a lawyer and do not want to walk the borderline
between fair use and copyright violation.

The package as it is seems OK because it comes from the author's public web
site and these data were never in the book.

Please note that I made Alan Agresti (with his acquiescence) the author of
the package because it is his book and his data, but I (or rather foo.R)
did all the work.

I replied to cran.r-project.org, but that was apparently sent to /dev/null.

This book is IMHO the authoritative textbook on the subject.  Amazon sales
rank agrees.  The book is used for many courses.  So this package would be
very helpful as is to many students and teachers.

So what to do?  Is there any way to get this package on CRAN?



__
R-package-devel@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-package-devel


Re: [R-pkg-devel] package CatDataAnalysis

2020-06-28 Thread Jeff Newmiller
Data are not copyrightable. As long as you do not use the exact phrases from 
the book you should be okay. But if you don't agree, then you should not 
proceed, and complaining here is not going to fix this.

On June 28, 2020 11:22:41 AM PDT, Charles Geyer  wrote:
>That's where the copyright violation comes in.  Copying descriptions
>from
>the book is clear copyright violation unless it is so minimal as to be
>"fair use" and I have no idea where that line is.  Even Alan may not be
>free to add such descriptions.  I have no idea what his contract with
>Wiley
>says.
>
>So like I said in my original posting, good idea in an alternative
>universe
>where copyright law does not exist.
>

Re: [R-pkg-devel] package CatDataAnalysis

2020-06-28 Thread Charles Geyer
That's where the copyright violation comes in.  Copying descriptions from
the book is clear copyright violation unless it is so minimal as to be
"fair use" and I have no idea where that line is.  Even Alan may not be
free to add such descriptions.  I have no idea what his contract with Wiley
says.

So like I said in my original posting, good idea in an alternative universe
where copyright law does not exist.

On Sun, Jun 28, 2020 at 1:13 PM Jeff Newmiller 
wrote:

> Just describe the nature of the data sets literally, as though the book were
> inaccessible. They are not asking you to describe how one should analyze
> the data, so there really shouldn't be any conflict with the book
> content that your agreement with the author has not already resolved.
>
> If you feel you are being held to a higher standard than others have
> been... that is life. As a user of packages I agree with CRAN that
> package documentation should be usable on its own.
>

Re: [R-pkg-devel] package CatDataAnalysis

2020-06-28 Thread Charles Geyer
That's a good idea about adding some licence info to the web page, but note
that Agresti is not the original source of any of this data.  Some of it
was published in subject matter papers by the scientists involved.  For
some perhaps (I don't know) Alan got the whole data set via consulting and
the whole data (as opposed to summary statistics) was never published.
This is the case for almost all real data that statisticians re-analyze for
examples.  There is no "licence" and AFAIK (IANAL) none is needed.  If
there were, most textbooks and most papers in statistics would be in
violation of law.

On Sun, Jun 28, 2020 at 1:08 PM Ivan Krylov  wrote:

> On Sun, 28 Jun 2020 11:07:46 -0500
> Charles Geyer  wrote:
>
> >Please note that I made Alan Agresti (with his acquiescence) the
> >author of the package
>
> Sorry to derail this, but is it possible for Alan Agresti to add a line
> to the page [*] allowing redistribution of the data, preferably under
> the terms of a well-known license such as CC BY-NC [**] or ODbL [***]?
> As it stands, CRAN only has your word (and the fact that this whole
> thread is Cc: to Prof. Agresti) that Alan Agresti agreed to have the
> data published as an R package. It might be needed to allow creating
> derivative works to make creating such a package feasible, though
> (otherwise I would assume that only literal redistribution is allowed).
>
> With that done, you could be much more comfortable providing the
> requested description for the package, no?
>
> --
> Best regards,
> Ivan
>
> [*] http://www.stat.ufl.edu/~aa/cda/data.html
> [**] https://creativecommons.org/licenses/
> [***] https://opendatacommons.org/licenses/
>


-- 
Charles Geyer
Professor, School of Statistics
Resident Fellow, Minnesota Center for Philosophy of Science
University of Minnesota
char...@stat.umn.edu



Re: [R-pkg-devel] package CatDataAnalysis

2020-06-28 Thread Jeff Newmiller
Just describe the nature of the data sets literally, as though the book were 
inaccessible. They are not asking you to describe how one should analyze the 
data, so there really shouldn't be any conflict with the book content that 
your agreement with the author has not already resolved.

If you feel you are being held to a higher standard than others have been... 
that is life. As a user of packages I agree with CRAN that package 
documentation should be usable on its own.

On June 28, 2020 10:58:15 AM PDT, Charles Geyer  wrote:
>CRAN did not just ask for an expanded Description field.  They
>instructed
>"Tell the users what the datasets are about and what they contain so
>they
>can use them even when they haven't read your book".  AFAIK no CRAN
>package
>that goes with a book satisfies that.
>

Re: [R-pkg-devel] package CatDataAnalysis

2020-06-28 Thread Steven Scott
Charlie,
The folks at CRAN try hard to serve everyone, but sometimes they wind up
making up rules as they go along.  I'd add another sentence or two about
the data, perhaps with an eye towards the wide world of "citizen data
scientists" who don't know about Alan's book.  Then resubmit the package
with a note that you've addressed their concerns the best you can, and see
what happens.
Steve

On Sun, Jun 28, 2020, 10:58 AM Charles Geyer  wrote:

> CRAN did not just ask for an expanded Description field.  They instructed
> "Tell the users what the datasets are about and what they contain so they
> can use them even when they haven't read your book".  AFAIK no CRAN package
> that goes with a book satisfies that.
>
> On Sun, Jun 28, 2020 at 12:52 PM Max Turgeon 
> wrote:
>
> > Fair enough. But CRAN is clearly asking for a more detailed Description
> > field. I simply offered one suggestion for expanding it. Keep in mind
> > that users will typically see the DESCRIPTION file first, and not the
> help
> > pages.
> >
> >
> > Max Turgeon
> > Assistant Professor
> > Department of Statistics
> > Department of Computer Science
> > University of Manitoba
> > maxturgeon.ca
> >
> >
> > --------------
> > *From:* Charles Geyer 
> > *Sent:* June 28, 2020 12:48:06 PM
> > *To:* Max Turgeon
> > *Cc:* R Package Development
> > *Subject:* Re: [R-pkg-devel] package CatDataAnalysis
> >
> > *Caution:* This message was sent from outside the University of Manitoba.
> > The link to Alan's web site is on every help page (in the source
> > section).  That's where the source is supposed to be.
> >
> > I have no problem with adding the source to the DESCRIPTION file, but
> that
> > is not what CRAN asked me to do.
> >
> > On Sun, Jun 28, 2020 at 12:16 PM Max Turgeon 
> > wrote:
> >
> >> For what it's worth, I'd be inclined to interpreting CRAN's response
> >> *very* literally, i.e. your Description field is not descriptive enough.
> >> According to what I can see in the Github repo, you only have
> >>
> >>
> >> "Datasets used in the book Categorical Data Analysis by Agresti but not
> >> printed in the book."
> >>
> >>
> >> Which is not much more than what the Title field says. One glaring
> >> omission (IMO) from the Description field is any mention of Agresti's
> >> website, where the data comes from.
> >>
> >>
> >> In contrast, looking at the "woolridge" package, I can see from the
> >> Description field that it contains 111 datasets (well, that's in the
> Title
> >> field), it's about econometrics, and the purpose of the package is to
> make
> >> it easier for students to work with these datasets.
> >>
> >>
> >> Max Turgeon
> >> Assistant Professor
> >> Department of Statistics
> >> Department of Computer Science
> >> University of Manitoba
> >> maxturgeon.ca
> >>
> >>
> >>
> >>
> >> --
> >> *From:* R-package-devel  on
> >> behalf of Charles Geyer 
> >> *Sent:* June 28, 2020 11:38 AM
> >> *To:* Neal Fultz
> >> *Cc:* R Package Development
> >> *Subject:* Re: [R-pkg-devel] package CatDataAnalysis
> >>
> >> 
> >> Caution: This message was sent from outside the University of Manitoba.
> >> 
> >>
> >> Actually the wooldridge package does not seem to satisfy any of the
> >> specific requests CRAN asked me for.  I have checked several other CRAN
> >> packages for textbooks and they don't seem to satisfy those requirements
> >> either.  So this seems to be a new idea from CRAN.
> >>
> >> On Sun, Jun 28, 2020 at 11:32 AM Neal Fultz  wrote:
> >>
> >> > I'm not sure exactly what cran is asking for, but the wooldridge
> >> > package is a good example of a text book data set package, so maybe
> >> > you can use the same format they did.
> >> >
> >> > https://cran.r-project.org/web/packages/wooldridge/index.html
> >> >
> >> > Best,
> >> >
> >> > Neal
> >> >
> >> > On Sun, Jun 28, 2020 at 9:08 AM Charles Geyer 
> >> > wrote:
> >> > >
> >> > > I have a pa

Re: [R-pkg-devel] package CatDataAnalysis

2020-06-28 Thread Ivan Krylov
On Sun, 28 Jun 2020 11:07:46 -0500
Charles Geyer  wrote:

>Please note that I made Alan Agresti (with his acquiescence) the
>author of the package

Sorry to derail this, but is it possible for Alan Agresti to add a line
to the page [*] allowing redistribution of the data, preferably under
the terms of a well-known license such as CC BY-NC [**] or ODbL [***]?
As it stands, CRAN only has your word (and the fact that this whole
thread is Cc: to Prof. Agresti) that Alan Agresti agreed to have the
data published as an R package. The license might also need to allow
derivative works for such a package to be feasible, though (otherwise
I would assume that only literal redistribution is allowed).

With that done, you could be much more comfortable providing the
requested description for the package, no?

-- 
Best regards,
Ivan

[*] http://www.stat.ufl.edu/~aa/cda/data.html
[**] https://creativecommons.org/licenses/
[***] https://opendatacommons.org/licenses/



Re: [R-pkg-devel] package CatDataAnalysis

2020-06-28 Thread Charles Geyer
CRAN did not just ask for an expanded Description field.  They instructed
"Tell the users what the datasets are about and what they contain so they
can use them even when they haven't read your book".  AFAIK no CRAN package
that goes with a book satisfies that.

On Sun, Jun 28, 2020 at 12:52 PM Max Turgeon 
wrote:

> Fair enough. But CRAN is clearly asking for a more detailed Description
> field. I simply offered one suggestion for expanding it. Keep in mind
> that users will typically see the DESCRIPTION file first, and not the help
> pages.
>
>
> Max Turgeon
> Assistant Professor
> Department of Statistics
> Department of Computer Science
> University of Manitoba
> maxturgeon.ca
>
>
> --
> *From:* Charles Geyer 
> *Sent:* June 28, 2020 12:48:06 PM
> *To:* Max Turgeon
> *Cc:* R Package Development
> *Subject:* Re: [R-pkg-devel] package CatDataAnalysis
>
> *Caution:* This message was sent from outside the University of Manitoba.
> The link to Alan's web site is on every help page (in the source
> section).  That's where the source is supposed to be.
>
> I have no problem with adding the source to the DESCRIPTION file, but that
> is not what CRAN asked me to do.
>
> On Sun, Jun 28, 2020 at 12:16 PM Max Turgeon 
> wrote:
>
>> For what it's worth, I'd be inclined to interpreting CRAN's response
>> *very* literally, i.e. your Description field is not descriptive enough.
>> According to what I can see in the Github repo, you only have
>>
>>
>> "Datasets used in the book Categorical Data Analysis by Agresti but not
>> printed in the book."
>>
>>
>> Which is not much more than what the Title field says. One glaring
>> omission (IMO) from the Description field is any mention of Agresti's
>> website, where the data comes from.
>>
>>
>> In contrast, looking at the "woolridge" package, I can see from the
>> Description field that it contains 111 datasets (well, that's in the Title
>> field), it's about econometrics, and the purpose of the package is to make
>> it easier for students to work with these datasets.
>>
>>
>> Max Turgeon
>> Assistant Professor
>> Department of Statistics
>> Department of Computer Science
>> University of Manitoba
>> maxturgeon.ca
>>
>>
>>
>>
>> --
>> *From:* R-package-devel  on
>> behalf of Charles Geyer 
>> *Sent:* June 28, 2020 11:38 AM
>> *To:* Neal Fultz
>> *Cc:* R Package Development
>> *Subject:* Re: [R-pkg-devel] package CatDataAnalysis
>>
>> 
>> Caution: This message was sent from outside the University of Manitoba.
>> 
>>
>> Actually the wooldridge package does not seem to satisfy any of the
>> specific requests CRAN asked me for.  I have checked several other CRAN
>> packages for textbooks and they don't seem to satisfy those requirements
>> either.  So this seems to be a new idea from CRAN.
>>
>> On Sun, Jun 28, 2020 at 11:32 AM Neal Fultz  wrote:
>>
>> > I'm not sure exactly what cran is asking for, but the wooldridge
>> > package is a good example of a text book data set package, so maybe
>> > you can use the same format they did.
>> >
>> > https://cran.r-project.org/web/packages/wooldridge/index.html
>> >
>> > Best,
>> >
>> > Neal
>> >
>> > On Sun, Jun 28, 2020 at 9:08 AM Charles Geyer 
>> > wrote:
>> > >
>> > > I have a package that has the datasets for Categorical Data Analysis
>> by
>> > > Agresti that do not appear in the book.  The whole package is a github
>> > repo
>> > > https://github.com/cjgeyer/CatDataAnalysis.  All of the data were
>> > > translated mechanically using the R script foo.R included in the repo
>> > (but
>> > > not in the package) from Agresti's web site
>> > > http://www.stat.ufl.edu/~aa/cda/data.html.
>> > >
>> > > This package seems to be a useful service to students and teachers.
>> The
>> > > data
>> > > are much simpler to use with this package than trying to get the data
>> > from
>> > > Agresti's web page (foo.R has 277 lines of code).
>> > >
>> > > When I submitted the package to CRAN, I got the following response.
>> > >
>> > > > The Description field of the DESCRIPT

Re: [R-pkg-devel] package CatDataAnalysis

2020-06-28 Thread Charles Geyer
The link to Alan's web site is on every help page (in the source section).
That's where the source is supposed to be.

I have no problem with adding the source to the DESCRIPTION file, but that
is not what CRAN asked me to do.

On Sun, Jun 28, 2020 at 12:16 PM Max Turgeon 
wrote:

> For what it's worth, I'd be inclined to interpreting CRAN's response
> *very* literally, i.e. your Description field is not descriptive enough.
> According to what I can see in the Github repo, you only have
>
>
> "Datasets used in the book Categorical Data Analysis by Agresti but not
> printed in the book."
>
>
> Which is not much more than what the Title field says. One glaring
> omission (IMO) from the Description field is any mention of Agresti's
> website, where the data comes from.
>
>
> In contrast, looking at the "woolridge" package, I can see from the
> Description field that it contains 111 datasets (well, that's in the Title
> field), it's about econometrics, and the purpose of the package is to make
> it easier for students to work with these datasets.
>
>
> Max Turgeon
> Assistant Professor
> Department of Statistics
> Department of Computer Science
> University of Manitoba
> maxturgeon.ca
>
>
>
>
> --
> *From:* R-package-devel  on behalf
> of Charles Geyer 
> *Sent:* June 28, 2020 11:38 AM
> *To:* Neal Fultz
> *Cc:* R Package Development
> *Subject:* Re: [R-pkg-devel] package CatDataAnalysis
>
> 
> Caution: This message was sent from outside the University of Manitoba.
> 
>
> Actually the wooldridge package does not seem to satisfy any of the
> specific requests CRAN asked me for.  I have checked several other CRAN
> packages for textbooks and they don't seem to satisfy those requirements
> either.  So this seems to be a new idea from CRAN.
>
> On Sun, Jun 28, 2020 at 11:32 AM Neal Fultz  wrote:
>
> > I'm not sure exactly what cran is asking for, but the wooldridge
> > package is a good example of a text book data set package, so maybe
> > you can use the same format they did.
> >
> > https://cran.r-project.org/web/packages/wooldridge/index.html
> >
> > Best,
> >
> > Neal
> >
> > On Sun, Jun 28, 2020 at 9:08 AM Charles Geyer 
> > wrote:
> > >
> > > I have a package that has the datasets for Categorical Data Analysis by
> > > Agresti that do not appear in the book.  The whole package is a github
> > repo
> > > https://github.com/cjgeyer/CatDataAnalysis.  All of the data were
> > > translated mechanically using the R script foo.R included in the repo
> > (but
> > > not in the package) from Agresti's web site
> > > http://www.stat.ufl.edu/~aa/cda/data.html.
> > >
> > > This package seems to be a useful service to students and teachers.
> The
> > > data
> > > are much simpler to use with this package than trying to get the data
> > from
> > > Agresti's web page (foo.R has 277 lines of code).
> > >
> > > When I submitted the package to CRAN, I got the following response.
> > >
> > > > The Description field of the DESCRIPTION file is intended to be a
> (one
> > > > paragraph) description of what the package does and why it may be
> > > > useful. Please elaborate. Tell the users what the datasets are about
> > and
> > > > what they contain so they can use them even when they haven't read
> your
> > > > book.
> > >
> > > > Please fix and resubmit, and document what was changed in the
> > submission
> > > > comments.
> > >
> > > In an alternate universe without copyright law this seems a reasonable
> > > request.  In this universe it seems to be asking for trouble.  I know
> > about
> > > fair use, but I am not a lawyer and do not want to walk the borderline
> > > between fair use and copyright violation.
> > >
> > > The package as it is seems OK because it comes from the author's public
> > web
> > > site and these data were never in the book.
> > >
> > > Please note that I made Alan Agresti (with his acquiescence) the author
> > of
> > > the package because it is his book and his data, but I (or rather
> foo.R)
> > > did all the work.
> > >
> > > I replied to cran.r-project.org, but that was apparently sent to
> > /dev/null.
> > >
> > > This boo

Re: [R-pkg-devel] package CatDataAnalysis

2020-06-28 Thread Max Turgeon
Fair enough. But CRAN is clearly asking for a more detailed Description field. 
I simply offered one suggestion for expanding it. Keep in mind that users will 
typically see the DESCRIPTION file first, and not the help pages.
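
To make that suggestion concrete, an expanded Description field might
look something like the sketch below.  The first sentence is the current
text, and the URL and the "translated mechanically" wording come from
this thread (CRAN asks for URLs in angle brackets).  The bracketed part
is a placeholder for wording only the maintainer can supply, not actual
content.

```
Description: Datasets used in the book Categorical Data Analysis by
    Agresti but not printed in the book.  The data were translated
    mechanically from the author's web page
    <http://www.stat.ufl.edu/~aa/cda/data.html>.  [Placeholder: one or
    two sentences, in the maintainer's own words, on what the datasets
    are about and what they contain.]
```

Continuation lines are indented, as the DESCRIPTION format requires, and
nothing here is copied from the book itself.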


Max Turgeon
Assistant Professor
Department of Statistics
Department of Computer Science
University of Manitoba
maxturgeon.ca



From: Charles Geyer 
Sent: June 28, 2020 12:48:06 PM
To: Max Turgeon
Cc: R Package Development
Subject: Re: [R-pkg-devel] package CatDataAnalysis

The link to Alan's web site is on every help page (in the source section).  
That's where the source is supposed to be.

I have no problem with adding the source to the DESCRIPTION file, but that is 
not what CRAN asked me to do.

On Sun, Jun 28, 2020 at 12:16 PM Max Turgeon wrote:

For what it's worth, I'd be inclined to interpret CRAN's response *very* 
literally, i.e. your Description field is not descriptive enough. According to 
what I can see in the GitHub repo, you only have


"Datasets used in the book Categorical Data Analysis by Agresti but not printed 
in the book."


Which is not much more than what the Title field says. One glaring omission 
(IMO) from the Description field is any mention of Agresti's website, where the 
data comes from.


In contrast, looking at the "wooldridge" package, I can see from the Description 
field that it contains 111 datasets (well, that's in the Title field), it's 
about econometrics, and the purpose of the package is to make it easier for 
students to work with these datasets.


Max Turgeon
Assistant Professor
Department of Statistics
Department of Computer Science
University of Manitoba
maxturgeon.ca




From: R-package-devel on behalf of Charles Geyer
Sent: June 28, 2020 11:38 AM
To: Neal Fultz
Cc: R Package Development
Subject: Re: [R-pkg-devel] package CatDataAnalysis



Actually the wooldridge package does not seem to satisfy any of the
specific requests CRAN asked me for.  I have checked several other CRAN
packages for textbooks and they don't seem to satisfy those requirements
either.  So this seems to be a new idea from CRAN.

On Sun, Jun 28, 2020 at 11:32 AM Neal Fultz 
mailto:nfu...@gmail.com>> wrote:

> I'm not sure exactly what cran is asking for, but the wooldridge
> package is a good example of a text book data set package, so maybe
> you can use the same format they did.
>
> https://cran.r-project.org/web/packages/wooldridge/index.html
>
> Best,
>
> Neal
>
> On Sun, Jun 28, 2020 at 9:08 AM Charles Geyer 
> mailto:char...@stat.umn.edu>>
> wrote:
> >
> > I have a package that has the datasets for Categorical Data Analysis by
> > Agresti that do not appear in the book.  The whole package is a github
> repo
> > https://github.com/cjgeyer/CatDataAnalysis.  All of the data were
> > translated mechanically using the R script foo.R included in the repo
> (but
> > not in the package) from Agresti's web site
> > http://www.stat.ufl.edu/~aa/cda/data.html.
> >
> > This package seems to be a useful service to students and teachers.  The
> > data
> > are much simpler to use with this package than trying to get the data
> from
> > Agresti's web page (foo.R has 277 lines of code).
> >
> > When I submitted the package to CRAN, I got the following response.
> >
> > > The Description field of the DESCRIPTION file is intended to be a (one
> > > paragraph) description of what the package does and why it may be
> > > useful. Please elaborate. Tell the users what the datasets are about
> and
> > > what they contain so they can use them even when they haven't read your
> > > book.
> >
> > > Please fix and resubmit, and document what was changed in the
> submission
> > > comments.
> >
> > In an alternate universe without copyright law this seems a reasonable
> > request.  In this universe it seems to be asking for trouble.  I know
> about
> > fair use, but I am not a lawyer and do not want to walk the borderline
> > between fair use and copyright violation.
> >
> > The package as it is seems OK because it comes from the author's public
> web
> > site and these data were never in the book.
> >
> > Pl

Re: [R-pkg-devel] package CatDataAnalysis

2020-06-28 Thread Charles Geyer
As I said in my original post, I have asked Alan Agresti and he agreed.
There is no problem about authorship.

On Sun, Jun 28, 2020 at 12:12 PM Hugh Parsonage 
wrote:

> If you’re not the author of the data, you can’t submit it as such. The
> website has a copyright message on it which, while not definitive, doesn’t
> suggest the use you’re proposing would be allowed.
>
> Bear in mind that you may expose CRAN, not just yourself, to liability if
> you try to conceal true authorship.
>
> I’d recommend asking the author of that website for permission. Either he
> says that’s fine in which case you have permission, or he doesn’t and you
> dodge a bullet.

-- 
Charles Geyer
Professor, School of Statistics
Resident Fellow, Minnesota Center for Philosophy of Science
University of Minnesota
char...@stat.umn.edu

[[alternative HTML version deleted]]

__
R-package-devel@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-package-devel


Re: [R-pkg-devel] package CatDataAnalysis

2020-06-28 Thread Max Turgeon
For what it's worth, I'd be inclined to interpret CRAN's response *very* 
literally, i.e. your Description field is not descriptive enough. According to 
what I can see in the Github repo, you only have


"Datasets used in the book Categorical Data Analysis by Agresti but not printed 
in the book."


Which is not much more than what the Title field says. One glaring omission 
(IMO) from the Description field is any mention of Agresti's website, where the 
data comes from.


In contrast, looking at the "wooldridge" package, I can see from the Description 
field that it contains 111 datasets (well, that's in the Title field), it's 
about econometrics, and the purpose of the package is to make it easier for 
students to work with these datasets.
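
As a rough illustration of the point above, here is a minimal sketch of what a 
fuller Description field might look like. The wording is hypothetical and is 
not taken from the actual package. It uses only facts stated in this thread 
(the book, the author's web site, the purpose of the package) and says nothing 
about the content of individual datasets. The phrase "R data objects" is an 
assumption about how the data are stored. The angle brackets around the URL 
follow the usual CRAN convention for URLs in this field.

```
Description: Datasets used in the book "Categorical Data Analysis" by
    Agresti but not printed in the book. The datasets are taken from the
    data page of the author's web site
    <http://www.stat.ufl.edu/~aa/cda/data.html> and are provided as R data
    objects so that students and teachers can load them directly instead
    of converting them by hand.
```

Continuation lines in a DESCRIPTION file must start with whitespace, which is 
why the second and later lines are indented.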


Max Turgeon
Assistant Professor
Department of Statistics
Department of Computer Science
University of Manitoba
maxturgeon.ca<http://maxturgeon.ca>




From: R-package-devel on behalf of Charles Geyer
Sent: June 28, 2020 11:38 AM
To: Neal Fultz
Cc: R Package Development
Subject: Re: [R-pkg-devel] package CatDataAnalysis


Actually the wooldridge package does not seem to satisfy any of the
specific requests CRAN asked me for.  I have checked several other CRAN
packages for textbooks and they don't seem to satisfy those requirements
either.  So this seems to be a new idea from CRAN.



Re: [R-pkg-devel] package CatDataAnalysis

2020-06-28 Thread Hugh Parsonage
If you’re not the author of the data, you can’t submit it as such. The
website has a copyright message on it which, while not definitive, doesn’t
suggest the use you’re proposing would be allowed.

Bear in mind that you may expose CRAN, not just yourself, to liability if you
try to conceal true authorship.

I’d recommend asking the author of that website for permission. Either he
says that’s fine in which case you have permission, or he doesn’t and you
dodge a bullet.


On Mon, 29 Jun 2020 at 2:45 am, Charles Geyer  wrote:

> Actually the wooldridge package does not seem to satisfy any of the
> specific requests CRAN asked me for.  I have checked several other CRAN
> packages for textbooks and they don't seem to satisfy those requirements
> either.  So this seems to be a new idea from CRAN.


Re: [R-pkg-devel] package CatDataAnalysis

2020-06-28 Thread Charles Geyer
Actually the wooldridge package does not seem to satisfy any of the
specific requests CRAN asked me for.  I have checked several other CRAN
packages for textbooks and they don't seem to satisfy those requirements
either.  So this seems to be a new idea from CRAN.

On Sun, Jun 28, 2020 at 11:32 AM Neal Fultz  wrote:

> I'm not sure exactly what cran is asking for, but the wooldridge
> package is a good example of a text book data set package, so maybe
> you can use the same format they did.
>
> https://cran.r-project.org/web/packages/wooldridge/index.html
>
> Best,
>
> Neal


-- 
Charles Geyer
Professor, School of Statistics
Resident Fellow, Minnesota Center for Philosophy of Science
University of Minnesota
char...@stat.umn.edu



Re: [R-pkg-devel] package CatDataAnalysis

2020-06-28 Thread Neal Fultz
I'm not sure exactly what cran is asking for, but the wooldridge
package is a good example of a text book data set package, so maybe
you can use the same format they did.

https://cran.r-project.org/web/packages/wooldridge/index.html

Best,

Neal

On Sun, Jun 28, 2020 at 9:08 AM Charles Geyer  wrote:
>
> I have a package that has the datasets for Categorical Data Analysis by
> Agresti that do not appear in the book.  The whole package is a github repo
> https://github.com/cjgeyer/CatDataAnalysis.  All of the data were
> translated mechanically using the R script foo.R included in the repo (but
> not in the package) from Agresti's web site
> http://www.stat.ufl.edu/~aa/cda/data.html.
>
> This package seems to be a useful service to students and teachers.  The
> data are much simpler to use with this package than trying to get the data
> from Agresti's web page (foo.R has 277 lines of code).
>
> When I submitted the package to CRAN, I got the following response.
>
> > The Description field of the DESCRIPTION file is intended to be a (one
> > paragraph) description of what the package does and why it may be
> > useful. Please elaborate. Tell the users what the datasets are about and
> > what they contain so they can use them even when they haven't read your
> > book.
>
> > Please fix and resubmit, and document what was changed in the submission
> > comments.
>
> In an alternate universe without copyright law this seems a reasonable
> request.  In this universe it seems to be asking for trouble.  I know about
> fair use, but I am not a lawyer and do not want to walk the borderline
> between fair use and copyright violation.
>
> The package as it is seems OK because it comes from the author's public web
> site and these data were never in the book.
>
> Please note that I made Alan Agresti (with his acquiescence) the author of
> the package because it is his book and his data, but I (or rather foo.R)
> did all the work.
>
> I replied to cran.r-project.org, but that was apparently sent to /dev/null.
>
> This book is IMHO the authoritative textbook on the subject.  Amazon sales
> rank agrees.  The book is used for many courses.  So this package would be
> very helpful as is to many students and teachers.
>
> So what to do?  Is there any way to get this package on CRAN?


[R-pkg-devel] package CatDataAnalysis

2020-06-28 Thread Charles Geyer
I have a package that has the datasets for Categorical Data Analysis by
Agresti that do not appear in the book.  The whole package is a github repo
https://github.com/cjgeyer/CatDataAnalysis.  All of the data were
translated mechanically using the R script foo.R included in the repo (but
not in the package) from Agresti's web site
http://www.stat.ufl.edu/~aa/cda/data.html.

This package seems to be a useful service to students and teachers.  The
data are much simpler to use with this package than trying to get the data
from Agresti's web page (foo.R has 277 lines of code).

When I submitted the package to CRAN, I got the following response.

> The Description field of the DESCRIPTION file is intended to be a (one
> paragraph) description of what the package does and why it may be
> useful. Please elaborate. Tell the users what the datasets are about and
> what they contain so they can use them even when they haven't read your
> book.

> Please fix and resubmit, and document what was changed in the submission
> comments.

In an alternate universe without copyright law this seems a reasonable
request.  In this universe it seems to be asking for trouble.  I know about
fair use, but I am not a lawyer and do not want to walk the borderline
between fair use and copyright violation.

The package as it is seems OK because it comes from the author's public web
site and these data were never in the book.

Please note that I made Alan Agresti (with his acquiescence) the author of
the package because it is his book and his data, but I (or rather foo.R)
did all the work.

I replied to cran.r-project.org, but that was apparently sent to /dev/null.

This book is IMHO the authoritative textbook on the subject.  Amazon sales
rank agrees.  The book is used for many courses.  So this package would be
very helpful as is to many students and teachers.

So what to do?  Is there any way to get this package on CRAN?

-- 
Charles Geyer
Professor, School of Statistics
Resident Fellow, Minnesota Center for Philosophy of Science
University of Minnesota
char...@stat.umn.edu
