Hello everyone, I'm experiencing the following 'issue' with bibauthorid. (I'm using the latest master branch and the demo site with only the demo records).
# sudo -u apache /opt/invenio/bin/bibauthorid -u admin --disambiguate --from-scratch Bibauthorid Task Submission =========================== Username: admin 2015-02-06 11:04:56 --> Task #15 submitted. [root@droopy invenio]# sudo -u apache /opt/invenio/bin/bibauthorid 15 2015-02-06 11:05:02 --> Task #15 started. Process Process-185: Traceback (most recent call last): File "/usr/lib64/python2.7/multiprocessing/process.py", line 258, in _bootstrap Process Process-186: Traceback (most recent call last): File "/usr/lib64/python2.7/multiprocessing/process.py", line 258, in _bootstrap self.run() self.run() File "/usr/lib64/python2.7/multiprocessing/process.py", line 114, in run File "/usr/lib64/python2.7/multiprocessing/process.py", line 114, in run self._target(*self._args, **self._kwargs) self._target(*self._args, **self._kwargs) File "/usr/lib64/python2.7/site-packages/invenio/bibauthorid_wedge.py", line 70, in wedge File "/usr/lib64/python2.7/site-packages/invenio/bibauthorid_wedge.py", line 70, in wedge matr = ProbabilityMatrix(cluster_set.last_name) matr = ProbabilityMatrix(cluster_set.last_name) AttributeError: 'function' object has no attribute 'last_name' AttributeError: 'function' object has no attribute 'last_name' Process Process-187: [...] (lots of similar errors) [...] Process Process-367: Traceback (most recent call last): File "/usr/lib64/python2.7/multiprocessing/process.py", line 258, in _bootstrap self.run() File "/usr/lib64/python2.7/multiprocessing/process.py", line 114, in run self._target(*self._args, **self._kwargs) File "/usr/lib64/python2.7/site-packages/invenio/bibauthorid_wedge.py", line 70, in wedge matr = ProbabilityMatrix(cluster_set.last_name) AttributeError: 'function' object has no attribute 'last_name' Process Process-368: Traceback (most recent call last): File "/usr/lib64/python2.7/multiprocessing/process.py", line 258, in _bootstrap self.run() File "/usr/lib64/python2.7/multiprocessing/process.py", line 114, in run self._target(*self._args, **self._kwargs) File "/usr/lib64/python2.7/site-packages/invenio/bibauthorid_wedge.py", line 70, in wedge matr = ProbabilityMatrix(cluster_set.last_name) AttributeError: 'function' object has no attribute 'last_name' 2015-02-06 11:08:15 --> Task #15 finished. [RUNNING] 2015-02-06 11:08:15 --> Unexpected error occurred: local variable 'tortoise_db_name' referenced before assignment. 2015-02-06 11:08:15 --> Unexpected error occurred: local variable 'tortoise_db_name' referenced before assignment. 2015-02-06 11:08:15 --> Traceback is: 2015-02-06 11:08:15 --> Traceback is: 2015-02-06 11:08:15 --> File "/usr/lib64/python2.7/site-packages/invenio/bibtask.py", line 610, in task_init 2015-02-06 11:08:15 --> File "/usr/lib64/python2.7/site-packages/invenio/bibtask.py", line 610, in task_init 2015-02-06 11:08:15 --> ret = _task_run(task_run_fnc) 2015-02-06 11:08:15 --> ret = _task_run(task_run_fnc) 2015-02-06 11:08:15 --> File "/usr/lib64/python2.7/site-packages/invenio/bibtask.py", line 1173, in _task_run 2015-02-06 11:08:15 --> File "/usr/lib64/python2.7/site-packages/invenio/bibtask.py", line 1173, in _task_run 2015-02-06 11:08:15 --> if callable(task_run_fnc) and task_run_fnc(): 2015-02-06 11:08:15 --> if callable(task_run_fnc) and task_run_fnc(): 2015-02-06 11:08:15 --> File "/usr/lib64/python2.7/site-packages/invenio/bibauthorid_daemon.py", line 157, in _ta sk_run_core 2015-02-06 11:08:15 --> File "/usr/lib64/python2.7/site-packages/invenio/bibauthorid_daemon.py", line 157, in _ta sk_run_core 2015-02-06 11:08:15 --> run_tortoise(bool(bibtask.task_get_option("from_scratch"))) 2015-02-06 11:08:15 --> run_tortoise(bool(bibtask.task_get_option("from_scratch"))) 2015-02-06 11:08:15 --> File "/usr/lib64/python2.7/site-packages/invenio/bibauthorid_daemon.py", line 319, in run _tortoise 2015-02-06 11:08:15 --> File "/usr/lib64/python2.7/site-packages/invenio/bibauthorid_daemon.py", line 319, in run _tortoise 2015-02-06 11:08:15 --> insert_user_log(tortoise_db_name, '-1', '', '', '', timestamp=start_time) 2015-02-06 11:08:15 --> insert_user_log(tortoise_db_name, '-1', '', '', '', timestamp=start_time) 2015-02-06 11:08:15 --> Exiting. 2015-02-06 11:08:15 --> Exiting. - One issue seems to be: AttributeError: 'function' object has no attribute 'last_name' . Seems important, but this doesn't break things. - The Unexpected error occurred: local variable 'tortoise_db_name' referenced before assignment does break the process and happens only with '--from-scratch' is used. Looking at bibauthorid_daemon.py it seems that moving tortoise_db_name and start_time definitions outside the else block solves the problem. Having said that, I should also mention that I cannot seem to get any meaningful results in the aid% tables, probably(?) in relation to the first issue. Can you verify my findings in your local installations? Best regards, Theodoros ps. apologies if there is a fix already available and waiting to be merged to master :)