On 08-Dec-2010, at 6:53 AM, Jeulin-L Michael wrote:
> Hi,
>
> Actually, it doesn't work like that in my data.
> I have :
> - "ol_dump_2010-11-30.txt" (http://openlibrary.org/data)
>
> My book is related to a work and they both have the same author key.
> But the author key doesn't correspond to any author.
>
> You can see as a research in the full bulk file the following results :
>
> "Search "OL414281A" (2 hits in 1 files)
> ol_dump_2010-11-30.txt (2 hits)
>
> Line 107198: /type/edition /books/OL6837588M 2
> 2009-12-14T23:32:31.751114 {"publishers": ["RADA Ediciones"],
> "pagination": "50 p. :", "languages": [{"key": "/languages/spa"}],
> "lc_classifications": ["MLCS 2002/05180 (P)"], "title": "Arena", "lccn":
> ["00334746"], "series": ["Serie Creacio\u0301n ;", "4", "Serie Creacio\u0301n
> (Trujillo, La Libertad, Peru) ;", "4."], "number_of_pages": 50,
> "edition_name": "1. ed.", "last_modified": {"type": "/type/datetime",
> "value": "2009-12-14T23:32:31.751114"}, "latest_revision": 2,
> "publish_country": "pe ", "key": "/books/OL6837588M", "authors": [{"key":
> "/authors/OL414281A"}], "publish_date": "1999", "publish_places": ["Trujillo,
> Peru\u0301"], "works": [{"key": "/works/OL2796358W"}], "type": {"key":
> "/type/edition"}, "by_statement": "Gladys Benko Angulo ; [ilustraciones
> interiores, Manlio Holgui\u0301n Go\u0301mez].", "revision": 2}
>
> Line 327392: /type/work /works/OL2796358W 2
> 2010-02-06T16:32:17.806241 {"lc_classifications": ["MLCS 2002/05180 (P)"],
> "key": "/works/OL2796358W", "created": {"type": "/type/datetime", "value":
> "2009-12-10T00:26:56.990080"}, "title": "Arena", "first_publish_date":
> "1999", "latest_revision": 2, "last_modified": {"type": "/type/datetime",
> "value": "2010-02-06T16:32:17.806241"}, "authors": [{"type":
> "/type/author_role", "author": {"key": "/authors/OL414281A"}}], "type":
> {"key": "/type/work"}, "revision": 2}"
>
> As you can see there is the relation between book and work but absolutely
> none between book and author and/or between work and author.
>
> Am I missing something ?
Looks like you don't have complete data. Partial download?
I'm seeing 5 matchs.
$ zcat ol_dump_2010-11-30.txt.gz | grep OL414281A
/type/edition /books/OL6837588M 2 2009-12-14T23:32:31.751114
{"publishers": ["RADA Ediciones"], "pagination": "50 p. :", "languages":
[{"key": "/languages/spa"}], "lc_classifications": ["MLCS 2002/05180 (P)"],
"title": "Arena", "lccn": ["00334746"], "series": ["Serie Creacio\u0301n ;",
"4", "Serie Creacio\u0301n (Trujillo, La Libertad, Peru) ;", "4."],
"number_of_pages": 50, "edition_name": "1. ed.", "last_modified": {"type":
"/type/datetime", "value": "2009-12-14T23:32:31.751114"}, "latest_revision": 2,
"publish_country": "pe ", "key": "/books/OL6837588M", "authors": [{"key":
"/authors/OL414281A"}], "publish_date": "1999", "publish_places": ["Trujillo,
Peru\u0301"], "works": [{"key": "/works/OL2796358W"}], "type": {"key":
"/type/edition"}, "by_statement": "Gladys Benko Angulo ; [ilustraciones
interiores, Manlio Holgui\u0301n Go\u0301mez].", "revision": 2}
/type/work /works/OL2796358W 2 2010-02-06T16:32:17.806241
{"lc_classifications": ["MLCS 2002/05180 (P)"], "key": "/works/OL2796358W",
"created": {"type": "/type/datetime", "value": "2009-12-10T00:26:56.990080"},
"title": "Arena", "first_publish_date": "1999", "latest_revision": 2,
"last_modified": {"type": "/type/datetime", "value":
"2010-02-06T16:32:17.806241"}, "authors": [{"type": "/type/author_role",
"author": {"key": "/authors/OL414281A"}}], "type": {"key": "/type/work"},
"revision": 2}
/type/author /authors/OL414281A 2 2008-09-02T21:34:21.624464
{"name": "Gladys Benko", "personal_name": "Gladys Benko", "last_modified":
{"type": "/type/datetime", "value": "2008-09-02T21:34:21.624464"}, "key":
"/authors/OL414281A", "birth_date": "1949", "type": {"key": "/type/author"},
"revision": 2}
/type/work /works/OL2796359W 2 2010-02-06T16:32:17.806241
{"lc_classifications": ["MLCM 97/08524 (P)"], "key": "/works/OL2796359W",
"created": {"type": "/type/datetime", "value": "2009-12-10T00:26:56.990080"},
"title": "Cada pa\u0301gina un recuerdo y un sentir", "first_publish_date":
"1967", "latest_revision": 2, "last_modified": {"type": "/type/datetime",
"value": "2010-02-06T16:32:17.806241"}, "authors": [{"type":
"/type/author_role", "author": {"key": "/authors/OL414281A"}}], "type": {"key":
"/type/work"}, "revision": 2}
/type/edition /books/OL722600M 2 2009-12-11T23:22:30.156119
{"publishers": ["[s.n."], "pagination": "46 leaves ;", "lc_classifications":
["MLCM 97/08524 (P)"], "title": "Cada pa\u0301gina un recuerdo y un sentir",
"lccn": ["97109885"], "number_of_pages": 46, "languages": [{"key":
"/languages/spa"}], "last_modified": {"type": "/type/datetime", "value":
"2009-12-11T23:22:30.156119"}, "latest_revision": 2, "publish_country": "pe ",
"key": "/books/OL722600M", "authors": [{"key": "/authors/OL414281A"}],
"publish_date": "1967", "publish_places": ["Trujillo, Peru\u0301"], "works":
[{"key": "/works/OL2796359W"}], "type": {"key": "/type/edition"},
"by_statement": "Gladys Benko.", "revision": 2}
Please verify the md5sum of the file that you have. Here is the correct value:
$ md5sum ol_dump_2010-11-30.txt.gz
451537405c7494f7d85f5eddca8bccfd ol_dump_2010-11-30.txt.gz
Anand
_______________________________________________
Ol-tech mailing list
[email protected]
http://mail.archive.org/cgi-bin/mailman/listinfo/ol-tech
To unsubscribe from this mailing list, send email to
[email protected]