Hi,

The document stores its XMP metadata in XML attributes instead of element text. The script did not handle this well, and there were a few other issues too.
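
To illustrate the difference between the two forms, here is a minimal sketch. It is not the code from the attached script. The function name xmp_simple_values and the sample packets are hypothetical, and it only assumes that simple XMP properties can appear either as attributes of rdf:Description or as child elements with text:

```python
import xml.etree.ElementTree as ET

RDF_NS = "http://www.w3.org/1999/02/22-rdf-syntax-ns#"

def xmp_simple_values(xmp_xml):
    """Collect simple XMP properties from attributes and from element text.

    Hypothetical helper for illustration only. Structured values
    (rdf:Alt, rdf:Seq, rdf:Bag) are ignored.
    """
    values = {}
    root = ET.fromstring(xmp_xml)
    for desc in root.iter("{%s}Description" % RDF_NS):
        # Attribute form: properties written as attributes of rdf:Description.
        for name, val in desc.attrib.items():
            if name.startswith("{%s}" % RDF_NS):
                continue  # skip rdf:about and other RDF bookkeeping
            values[name] = val
        # Element form: properties written as child elements with text.
        for child in desc:
            text = (child.text or "").strip()
            if text:
                values[child.tag] = text
    return values

# Made-up sample packets: same property, two serializations.
ATTR_FORM = """<x:xmpmeta xmlns:x="adobe:ns:meta/">
<rdf:RDF xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#">
<rdf:Description rdf:about="" xmlns:pdf="http://ns.adobe.com/pdf/1.3/"
    pdf:Producer="ExampleProducer"/>
</rdf:RDF></x:xmpmeta>"""

ELEM_FORM = """<x:xmpmeta xmlns:x="adobe:ns:meta/">
<rdf:RDF xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#">
<rdf:Description rdf:about="" xmlns:pdf="http://ns.adobe.com/pdf/1.3/">
<pdf:Producer>ExampleProducer</pdf:Producer>
</rdf:Description>
</rdf:RDF></x:xmpmeta>"""

attr_values = xmp_simple_values(ATTR_FORM)
elem_values = xmp_simple_values(ELEM_FORM)
```

A parser that only looks at element text would find nothing in the first packet, which is the kind of failure described above.
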
I am attaching a fixed script for you to test. It should replace /usr/share/recoll/rclpdf.py

J.F. Dockes
Attachment: rclpdf.py