I've thought about it many times, and I don't think it can be done,
especially not with good performance.
Those diffs are just normal, Unix-style patches. Often the
reference to the movie/person/character/company that a changed line
belongs to is completely missing, so you would have to apply the patch
to the complete file and analyze the result just to know which entry
to update. From there, you would have to find the matching entry in the
db, which is another complex task, and then decide whether to
add, update or delete the data.
If you just want to download and apply the diff files to an existing set
of files, you may try the applydiffs.sh and download_applydiffs.py
scripts in the docs/goodies directory.
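
To illustrate the missing-context problem, here is a rough Python sketch.
It is not taken from IMDbPY: the file layout is a simplified, made-up
stand-in for the plain text data files (entry name only on the first line,
tab-indented continuation lines), and the helper names are hypothetical.
It only shows that a hunk gives you line numbers, so the owning entry has
to be recovered from the full file.

```python
import re

# Hypothetical, simplified data file: the entry name appears only on the
# first line of each entry; continuation lines start with a tab.
OLD_FILE = [
    "Doe, Jane\tFirst Movie (2001)",
    "\tSecond Movie (2005)",
    "Roe, Richard\tFirst Movie (2001)",
    "\tThird Movie (2010)",
]

# A "normal" format diff hunk that changes line 4 of the old file.
# Note that the entry name ("Roe, Richard") is nowhere in the hunk.
DIFF = [
    "4c4",
    "< \tThird Movie (2010)",
    "---",
    "> \tThird Movie (2011)",
]

# Hunk headers look like "4c4", "3a4,5" or "7,8d6".
HUNK_RE = re.compile(r"^(\d+)(?:,(\d+))?([acd])(\d+)(?:,(\d+))?$")


def changed_old_lines(diff_lines):
    """Yield (command, first_old_line, last_old_line) for each hunk header."""
    for line in diff_lines:
        m = HUNK_RE.match(line)
        if m:
            start = int(m.group(1))
            end = int(m.group(2) or start)
            yield m.group(3), start, end


def owning_entry(file_lines, line_no):
    """Walk back from a 1-based line number to the line that names the entry."""
    for idx in range(line_no - 1, -1, -1):
        if not file_lines[idx].startswith("\t"):
            return file_lines[idx].split("\t", 1)[0]
    return None


affected = [
    (cmd, owning_entry(OLD_FILE, start))
    for cmd, start, _end in changed_old_lines(DIFF)
]
print(affected)
```

Even in this toy case the full file is needed to learn which entry
changed; matching that entry against rows already in the db is a separate
step that this sketch does not attempt.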
On Sat, May 27, 2017 at 9:15 PM, Philip Earvolino wrote:
> IMDB creates weekly updates in the “diffs” folder on the imdb host ftp
> site(s). Has anyone tried running this script on these files and merging the
> created SQL db with the existing db? If “diffs” is not only adding new
> records but updating existing ones will a merge properly key on the
> as-created imdb ids?
> Check out the vibrant tech community on one of the world's most
> engaging tech sites, Slashdot.org! http://sdm.link/slashdot
> Imdbpy-help mailing list
Davide Alberani [PGP KeyID: 0x3845A3D4AC9B61AD]