Just in case it helps, I'm also unable to modify any records using bibupload. Please see below for details. Any suggestions for diagnosing or fixing our DB would be greatly appreciated.

Many thanks,
Devin

------
[root@server]# wget -O 2136.xml https://documents.classe.cornell.edu/record/2136?of=xm
[root@server]# cat 2136.xml
<?xml version="1.0" encoding="UTF-8"?>
<collection xmlns="http://www.loc.gov/MARC21/slim">
<record>
  <controlfield tag="001">2136</controlfield>
  <datafield tag="037" ind1=" " ind2=" ">
    <subfield code="a">DEMO-ARTICLE-2010-002</subfield>
  </datafield>
  <datafield tag="041" ind1=" " ind2=" ">
    <subfield code="a">eng</subfield>
  </datafield>
  <datafield tag="088" ind1=" " ind2=" ">
    <subfield code="a">DEMO-ARTICLE-2009-001</subfield>
  </datafield>
  <datafield tag="100" ind1=" " ind2=" ">
    <subfield code="a">Devin Bougie</subfield>
  </datafield>
  <datafield tag="245" ind1=" " ind2=" ">
    <subfield code="a">test title</subfield>
  </datafield>
  <datafield tag="260" ind1=" " ind2=" ">
    <subfield code="c">2009-01-01</subfield>
  </datafield>
  <datafield tag="300" ind1=" " ind2=" ">
    <subfield code="a">1</subfield>
  </datafield>
  <datafield tag="500" ind1=" " ind2=" ">
    <subfield code="a">test comments</subfield>
  </datafield>
  <datafield tag="856" ind1="0" ind2=" ">
    <subfield code="f">[email protected]</subfield>
  </datafield>
  <datafield tag="980" ind1=" " ind2=" ">
    <subfield code="a">ARTICLE</subfield>
  </datafield>
  <datafield tag="520" ind1=" " ind2=" ">
    <subfield code="a">test abstract & test again</subfield>
  </datafield>
  <datafield tag="653" ind1="1" ind2=" ">
    <subfield code="a">test keyword</subfield>
  </datafield>
  <datafield tag="653" ind1="1" ind2=" ">
    <subfield code="a">test</subfield>
  </datafield>
  <datafield tag="653" ind1="1" ind2=" ">
    <subfield code="a">test2</subfield>
  </datafield>
  <datafield tag="856" ind1="4" ind2=" ">
    <subfield code="u">https://edms.classe.cornell.edu/record/2136/files/DEMO-ARTICLE-2010-002.pdf</subfield>
    <subfield code="z">Access to files</subfield>
  </datafield>
  <datafield tag="856" ind1="4" ind2=" ">
    <subfield code="u">https://edms.classe.cornell.edu/record/2136/files/DEMO-ARTICLE-2010-002.ps.gz</subfield>
  </datafield>
  <datafield tag="856" ind1="4" ind2=" ">
    <subfield code="u">https://edms.classe.cornell.edu/record/2136/files/test.pdf</subfield>
  </datafield>
  <datafield tag="856" ind1="4" ind2=" ">
    <subfield code="u">https://edms.classe.cornell.edu/record/2136/files/test.ps.gz</subfield>
  </datafield>
</record>
</collection>

[root@server]# su -s /bin/bash apache -c "/mnt/documents/invenio/bin/bibupload -c --pretend -v 9 2136.xml"

[root@server]# cat /mnt/documents/invenio/var/log/bibsched/1/bibsched_task_10971.log
2017-01-05 12:01:51 --> argv=['/mnt/documents/invenio/bin/bibupload', '-c', '--pretend', '-v', '9', '/tmp/2136.xml']
2017-01-05 12:01:51 --> _OPTIONS={'force': False, 'verbose': 1, 'pretend': True, 'notimechange': 0, 'tag': None, 'mode': 'correct', 'stage_to_start_from': 1, 'file_path': '/tmp/2136.xml'}
2017-01-05 12:01:51 --> _TASK_PARAMS={'profile': [], 'stop_queue_on_error': False, 'post-process': [], 'task_stop_helper_fnc': None, 'verbose': 9, 'task_id': 10971, 'sleeptime': '', 'fixed_time': False, 'runtime_limit': None, 'priority': 0, 'task_specific_name': '', 'version': '$Id$', 'sequence-id': None, 'user': '', 'task_name': 'bibupload', 'host': '', 'runtime': '2017-01-05 12:01:51'}
2017-01-05 12:01:51 --> Task #10971 started.
2017-01-05 12:01:51 --> Updating task status to RUNNING.
2017-01-05 12:01:51 --> Input file '/tmp/2136.xml', input mode 'correct'.
2017-01-05 12:01:51 --> STAGE 0:
2017-01-05 12:01:51 --> start preocessing
2017-01-05 12:01:51 --> Updating task progress to Reading XML input.
2017-01-05 12:01:51 --> -Open XML marc: DONE
2017-01-05 12:01:51 --> Entering task_sleep_now_if_required with status=RUNNING
2017-01-05 12:01:51 --> Entering records loop
2017-01-05 12:01:51 --> Entering task_sleep_now_if_required with status=RUNNING
2017-01-05 12:01:51 --> Inserting into main database
2017-01-05 12:01:51 --> -Tag 001 not found in the xml marc file.
2017-01-05 12:01:51 --> -Tag SYSNO not found in the xml marc file.
2017-01-05 12:01:51 --> -Tag EXTOAIID not found in the xml marc file.
2017-01-05 12:01:51 --> -Tag SYSNO not found in the xml marc file.
2017-01-05 12:01:51 --> -Tag DOI not found in the xml marc file.
2017-01-05 12:01:51 --> Failed: Record not found in the database. Please insert the file before updating it. (-h for help)
2017-01-05 12:01:51 --> Failed: either the record already exists and insert was requested or the record does not exists and replace/correct/append has been used
2017-01-05 12:01:51 --> Record could not have been parsed
2017-01-05 12:01:51 --> Updating task progress to Done 0 out of 1..
2017-01-05 12:01:51 --> Identifiers table after processing: {} versions: {}
2017-01-05 12:01:51 --> Uploading BDR and BDM fields
2017-01-05 12:01:51 --> -Tag 001 not found in the xml marc file.
2017-01-05 12:01:51 --> -Tag SYSNO not found in the xml marc file.
2017-01-05 12:01:51 --> -Tag EXTOAIID not found in the xml marc file.
2017-01-05 12:01:51 --> -Tag SYSNO not found in the xml marc file.
2017-01-05 12:01:51 --> -Tag DOI not found in the xml marc file.
2017-01-05 12:01:51 --> Failed: Record not found in the database. Please insert the file before updating it. (-h for help)
2017-01-05 12:01:51 --> -Stage NOT NEEDED
2017-01-05 12:01:51 --> -Stage NOT NEEDED
2017-01-05 12:01:51 --> Task stats: 1 input records, 0 updated, 0 inserted, 1 errors, 0 inserted to holding pen. Time 0.99 sec.
2017-01-05 12:01:51 --> Updating task status to DONE WITH ERRORS.
2017-01-05 12:01:51 --> Task #10971 finished. [DONE WITH ERRORS]

[root@server]# cat /mnt/documents/invenio/var/log/bibsched/1/bibsched_task_10971.err
2017-01-05 12:01:51 --> Failed: Record not found in the database. Please insert the file before updating it. (-h for help)
2017-01-05 12:01:51 --> Failed: either the record already exists and insert was requested or the record does not exists and replace/correct/append has been used
2017-01-05 12:01:51 --> Record could not have been parsed
2017-01-05 12:01:51 --> Failed: Record not found in the database. Please insert the file before updating it. (-h for help)
------

On Dec 20, 2016, at 3:27 PM, Devin A. Bougie <[email protected]> wrote:
> We recently migrated our invenio installation from 1.0.1 on 32-bit SL5 to
> 1.2.2 on 64-bit SL6. For the most part, everything seems to be working
> properly. However, when browsing to the Files tab for some records, we get:
> ------
> Warning
> Requested record does not seem to have been integrated.
> ------
>
> The XM output format for these records looks fine, as far as I can tell.
>
> Any suggestions for fixing the broken records would be greatly appreciated.
> It would be great if there were a way to programmatically find and fix all
> records that have this problem.
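
The log above reports "Tag 001 not found in the xml marc file" even though the downloaded file shows a 001 controlfield. One way to narrow this down independently of Invenio might be to check whether a standard XML parser can read the file and see that field. The following is only a minimal sketch using the Python standard library; it is not how bibupload itself parses input, and the function name find_record_ids is made up for illustration.

```python
import xml.etree.ElementTree as ET

# Namespace declared on the <collection> element in the downloaded file.
MARC_NS = "http://www.loc.gov/MARC21/slim"


def find_record_ids(marcxml):
    """Return the 001 control numbers found in a MARCXML document.

    `marcxml` is the file content as bytes or str. Raises
    xml.etree.ElementTree.ParseError if the content is not well-formed
    XML (for example, if it contains an unescaped ampersand).
    Hypothetical helper, not part of Invenio.
    """
    root = ET.fromstring(marcxml)
    ids = []
    # iter() also yields the root itself, so a bare <record> document works.
    for record in root.iter("{%s}record" % MARC_NS):
        for field in record.findall("{%s}controlfield" % MARC_NS):
            if field.get("tag") == "001" and field.text:
                ids.append(field.text.strip())
    return ids
```

Calling find_record_ids(open("2136.xml", "rb").read()) should return the record ID if the file is well-formed and the 001 field sits in the MARC21 slim namespace. An empty list or a ParseError would suggest the problem is in the file rather than in the database, but that is only a guess on my part.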

