On Thu, May 21, 2015 at 3:10 PM Andreas Tille <[email protected]> wrote:

> Hi Michael,
>
> On Thu, May 21, 2015 at 06:14:48PM +0000, Michael Crusoe wrote:
> > >
> http://anonscm.debian.org/cgit/debian-med/prokka.git/commit/?id=91b39ca656f979b3c2c704d8756d9e04e54ae5f9
> > > >
> > > > `prokka-tigrfams_to_hmm` & `prokka-make_tarball` are not for end
> users
> > > > which is why I didn't ship them.
> > >
> > > ?  The code and the description do not match.  According to the code
> you
> > > ship all files in bin/*, right?
> >
> > I believe your commit switched to shipping all files in bin/*. I had the
> > preselected list prior.
>
> Cool, changing something and blaming somebody else about it. ;-)
>
> I think I suspected "new" binaries and considered it simpler to use "*"
> which would cover all new things.  That's the result if somebody pokes
> around without deeper knowledge.


> I reverted my change to your version - sorry for this.


:-) No worries!


> However, if you
> say it is not for end users is it possibly for root (my colleague told
> me that root needed to call a db generation script).  If this would be
> the case I'd consider:
>


>
> $ git diff
> diff --git a/debian/prokka.install b/debian/prokka.install
> index c06d46a..0aa4238 100644
> --- a/debian/prokka.install
> +++ b/debian/prokka.install
> @@ -7,3 +7,6 @@ bin/prokka-genbank_to_fasta_db  usr/bin
>  bin/prokka-genpept_to_fasta_db usr/bin
>  bin/prokka-hamap_to_hmm                usr/bin
>  bin/prokka-uniprot_to_fasta_db usr/bin
> +bin/prokka-build_kingdom_dbs   usr/sbin
> +bin/prokka-make_tarball        usr/sbin
> +bin/prokka-tigrfams_to_hmm     usr/sbin
>

Theoretically an end user might use some of these scripts to expand the
program but most won't and those that do I would consider to be developers
who can roll their own environment from source. Therefore I'd like to keep
them out of the Debian package.


>
>
> ... assuming that the new prokka-build_kingdom_dbs is also of this
> nature.  If it is for end users it probably needs to be added to the
> usual set of installed files.
>
> > > BTW, without diving into the code and checking myself I can just quote
> a
> > > colleague of mine who installed prokka manually that you need to
> > > generate something as root (prokka --setupdb).  I have mixed feelings
> > > about this why this needs to be done by root and whether the resulting
> > > files should not rather go to /var/lib/prokka.  We should not bloat
> /usr
> > > by files out of control of dpkg.
> > >
> >
> > It generates 2.1GiB of files and one only needs to be root to write them
> in
> > the correct place. With the extra files the upstream tarball compress
> down
> > to 270MiB using XZ and its default compression level (-6)
> >
> > Is that too large to ship as an arch=all data package?
>
> We should probably fill the debian mirrors with good reasons but I have
> not heard that there is some upper limit for package space.  There is a
> several years existing plan about a data area which for instance is not
> mirrored to so many mirrors world wide.  Since this has not been
> realised we might simply increase the pressure to realise the plan by
> starting to upload such packages.  May be some mirror admin will start
> implementing things and provide code to ftpmaster?
>
> But to be clear about this root-issue:  We know that root can write
> files where ever needed.  However, root *should* *not* write manually to
> /usr.  Locations like /var, may be /usr/local or /srv are fine but /usr
> should be clean from files that are not
>
>    a) inside the package and
>    b) created in {pre,post}inst and deleted in {pre,post}rm
>

FYI: In my packaging I had the --setupdb command run during postinst.

Regardless, since it compress decently we should figure out a way to run
prokka --setupdb during package building so that its output is included in
a prokka-data arch=all package.



>
>
> > > Seems you are refering to README.Debian
> > >
> > >   Prokka's databases are installed to /usr/share/prokka
> > >
> > >   HAMAP.hmm is 224M, took 21 hours and 10 minutes, 15M of memory,
> single
> > > threaded?
> > >
> > >   Shipped HAMAP.hmm is 88M ??
> > >
> > > right?  Am I understand you correctly that the base data are free and
> > > just the processing of 21hours should be CC-BY-ND?  I can not believe
> > > this since this would be insane.  Sorry for my naïve question.
> > >
> >
> > The original databases (not shipped by upstream) are CC-BY-ND. It is now
> my
> > understanding that the derived files (using the uncopyrightable facts
> from
> > the CC-BY-ND databases but not their structure or organization) are not
> > subject to the CC-BY-ND license.
> >
> > So I think we are okay to distribute after all.
>
> Sounds logical to me.
>

Huzzah!


>
> > > Definitely.  However, I'm not sure whether my personal opinion is
> > > helpful here.  I remember I was sending a series of e-mails to authors
> > > of databases shipped with emboss and in the end we were able to clear
> > > out all licenses.  No idea in how far this will lead here.  In any case
> > > I'm sure that ftpmaster will refuse a package that contains some
> > > CC-BY-ND data (and will probably not dive into a discussion whether
> > > facts of nature are copyrightable or not).
> >
> > Hmm.. There is no data copied from any CC-BY-ND databases. The
> > uncopyrightable facts that were retrieved from those databases were
> > transformed into a new work (the HMMs).
>
> This should be made pretty clear in a Comment field in debian/copyright.
>

Yep.


> Perhaps in this case also the License field CC-BY-ND for these files is
> not correct in this case.
>

Indeed, they need updating for our new understanding.


>
> Kind regards
>

Thanks as always!

Reply via email to