Hi,

Could it help to use fuzzy string matching on the local tsv files?

https://github.com/seatgeek/fuzzywuzzy
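
A rough sketch of what I mean, not a finished solution. Since the tsv files already contain the year and the runtime, you can filter on those locally and only fuzzy-match the remaining titles, which avoids calling ia.update() per candidate. To keep it free of extra packages, the sketch uses difflib from the Python standard library as a stand-in for fuzzywuzzy; fuzzywuzzy's fuzz.ratio (which scores 0-100 rather than 0-1) could replace the scoring line. I am assuming you have title.basics.tsv.gz with its tconst, primaryTitle, startYear and runtimeMinutes columns. The function names, the suffix list, the 2-minute runtime tolerance and the 0.8 threshold are all made up for illustration.

```python
import csv
import re
from collections import defaultdict
from difflib import SequenceMatcher

# Hypothetical list of trailing format tags; extend it to fit your data.
SUFFIX_RE = re.compile(r"(\s+(4k|3d))+$")


def normalize(title):
    """Lowercase the title and drop trailing tags such as '4k' or '3D'."""
    return SUFFIX_RE.sub("", title.strip().lower())


def load_index(tsv_file):
    """Group (tconst, normalized title, runtime) tuples by start year."""
    index = defaultdict(list)
    reader = csv.DictReader(tsv_file, delimiter="\t", quoting=csv.QUOTE_NONE)
    for row in reader:
        year, minutes = row["startYear"], row["runtimeMinutes"]
        # Missing values are written as \N; skip anything non-numeric.
        if not (year.isdigit() and minutes.isdigit()):
            continue
        index[int(year)].append(
            (row["tconst"], normalize(row["primaryTitle"]), int(minutes))
        )
    return index


def best_match(index, title, year, runtime, runtime_tolerance=2, min_score=0.8):
    """Return the tconst of the closest title for that year and runtime, or None."""
    wanted = normalize(title)
    best_id, best_score = None, min_score
    for tconst, candidate, minutes in index.get(year, []):
        # Cheap numeric filter first, fuzzy comparison only on what is left.
        if abs(minutes - runtime) > runtime_tolerance:
            continue
        score = SequenceMatcher(None, wanted, candidate).ratio()
        if score >= best_score:
            best_id, best_score = tconst, score
    return best_id
```

For the real file, something like load_index(gzip.open("title.basics.tsv.gz", "rt", encoding="utf-8", newline="")) should build the index once, after which each of the 16k lookups is a call to best_match(index, title, year, runtime). Titles that come back as None could still go through the slower online search.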


--
Turgut


On 27-09-2018 18:33, Kesselheim, David DK - NOH wrote:
> Hi,
> 
> I need to look up quite a large number of titles (16k). I have the title
> (more or less, sometimes ‘4k’, ‘3D’, etc is appended to it), the year
> and the runtime. With that info I can get the correct title-ID from IMDB
> but it is very slow because after having searched for the title and
> filtered by year I need to run ia.update(title) on each of my initial
> matches to compare the runtime.
> 
> Is there a way to speed up the process?
> 
> I insourced the tsv.gz files from IMDB but since the title isn’t an
> exact match I am not sure how well SQL Server is suited for finding the
> right title.
> 
> Thanks for the help!
> 
> David
> 
> _______________________________________________
> Imdbpy-help mailing list
> Imdbpy-help@lists.sourceforge.net
> https://lists.sourceforge.net/lists/listinfo/imdbpy-help
> 

