Thanks Anand.

On Fri, Mar 8, 2013 at 12:12 AM, Anand Chitipothu <[email protected]> wrote:
> This is only a part of the data. It only shows the imports during the first
> 1 or 2 years of the project. For recent imports, we've been using the
> source records field. Combining both of these would give more accurate
> results.

That's good feedback. Can I assume that editions with source records are disjoint from the editions in your file with machine comments, or do I need to apply some type of additional heuristic?

> Also, one thing to remember is that there could be repetitions. There is
> a good chance that 2 records from different sources are mapped to the
> same edition.

For the purposes of this investigation, I think we're really trying to figure out how many sources need to be investigated. If two different sources contributed to a single edition, theoretically either could "pollute" it from an intellectual property standpoint, so both need to be included in the count.

Tom
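
To make the counting rule concrete, here is a minimal sketch in Python. It assumes each dataset (the machine-comment file and the source records field) has already been reduced to (edition key, source name) pairs; that pair format, the function names, and the sample keys are all hypothetical and not taken from the actual dump. It takes the union of sources per edition, so an edition fed by two sources counts both, and it also reports which editions appear in both datasets, which is one way to check the disjointness question empirically.

```python
from collections import defaultdict


def merge_sources(*datasets):
    """Map each edition key to the set of all sources that contributed to it.

    Each dataset is an iterable of (edition_key, source_name) pairs
    (a hypothetical, pre-parsed format).
    """
    merged = defaultdict(set)
    for dataset in datasets:
        for edition_key, source_name in dataset:
            merged[edition_key].add(source_name)
    return dict(merged)


def sources_to_investigate(merged):
    """Every distinct source that touched at least one edition."""
    sources = set()
    for edition_sources in merged.values():
        sources |= edition_sources
    return sources


def overlapping_editions(dataset_a, dataset_b):
    """Edition keys present in both datasets (empty if they are disjoint)."""
    keys_a = {edition_key for edition_key, _ in dataset_a}
    keys_b = {edition_key for edition_key, _ in dataset_b}
    return keys_a & keys_b


if __name__ == "__main__":
    # Hypothetical sample data, for illustration only.
    from_comments = [("/books/OL1M", "source_a"), ("/books/OL2M", "source_b")]
    from_records = [("/books/OL1M", "source_c"), ("/books/OL3M", "source_a")]

    merged = merge_sources(from_comments, from_records)
    print(sorted(sources_to_investigate(merged)))
    print(sorted(overlapping_editions(from_comments, from_records)))
```

Because sources are kept in a set per edition, a record that is repeated or mapped to the same edition from the same source is only counted once, while distinct sources on one edition are all retained.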
_______________________________________________
Ol-tech mailing list
[email protected]
http://mail.archive.org/cgi-bin/mailman/listinfo/ol-tech
To unsubscribe from this mailing list, send email to [email protected]
