On Tue, Apr 24, 2012 at 09:51:33AM +0200, Andreas Tille wrote:
> On Tue, Apr 24, 2012 at 03:44:12PM +0900, Charles Plessy wrote:
> > Much of the references I entered in the debian/upstream files are pasted
> > from the publisher's site (after removing the abstract, that is
> > copyrighted work).
> > I see a lot of heterogeneity there.
>
> Yes, there is currently heterogenicity in our data set which is most
> probably caused by the (at this time) reasonable approach to copy the
> data.
Right, I have normalized the author data to "A. First, B. Second and C. D. Third" for the obsolete metadata in the task meta-packages.

http://wiki.debian.org/UpstreamMetadata says "please use full names" in the template, which I think will make the author information too verbose, especially for articles with lots of authors (as can be the case for general-purpose software-description articles). If the web sentinel (or other places where the information is displayed) shortens the first name to the initial through some processing, that would be fine as well. However, I think requiring the full first names might be too harsh, as it can be quite cumbersome to research them if they are not provided in the citation (one usually needs to at least check the publication URL, or even the full PDF).

So I think we should not scramble to normalize the data, but if we do, we should agree on some form.

Michael

