Re: [Imdbpy-help] SQL vs HTTP access for make a search_movie()
On Thu, Nov 20, 2014 at 1:08 PM, Gustavo Duarte wrote: > > My doubt is, when I make a movie search by title in a another different > language than English, why the search is more precise if use HTTP access > than SQL ? They just have a different set of information. The plain text data files (used by SQL) is somehow limited; on the IMDb many more information came from third-party companies and are licensed in a way that prevents them to be included in the plain text data files. In your case, if you open the aka-titles.list file, you will see that "El gran robo" is not associated to Flypaper at all (it's, instead, correctly associated to 7 uomini e un cervello, but that's obviously another movie). So, it has no way to know that your El gran robo is in fact Flypaper. Plus, even having all the data, I do a limited kind of search on titles and AKAs: they are first matched (with/without articles like The, A, An, ...) to the entries in the database using a soundex algorithm (to match things that "sounds similar"), then the matching subset is compared and sorted using a distance algorithm. It works quite well, in fact, but on the IMDb site they can use much more sophysticated algorithms and also meta-information like how many people are looking for a given title, how much popular a movie is and so on. So, in your specific case: no way. In general: it's not that bad, but I'm obviously open to receive patches that improve the search. :-) -- Davide Alberani [PGP KeyID: 0x465BFD47] http://www.mimante.net/ -- Download BIRT iHub F-Type - The Free Enterprise-Grade BIRT Server from Actuate! Instantly Supercharge Your Business Reports and Dashboards with Interactivity, Sharing, Native Excel Exports, App Integration & more Get technology previously reserved for billion-dollar corporations, FREE http://pubads.g.doubleclick.net/gampad/clk?id=157005751&iu=/4140/ostg.clktrk ___ Imdbpy-help mailing list Imdbpy-help@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/imdbpy-help
[Imdbpy-help] SQL vs HTTP access for make a search_movie()
Hi guys, I'm posting here a question also posted recently on stackoverflow, http://stackoverflow.com/questions/27038949/searching-movie-by-titlle-sql-vs-http-acess . My doubt is, when I make a movie search by title in a another different language than English, why the search is more precise if use HTTP access than SQL ? For example i want to retrieve info about the Movie "El gran Robo" (spanish title) original title "Flypaper". case 1) using SQL access ia = IMDb(accessSystem='sql', uri = "mysql://devel:password@localhost/imdb") movies = ia.search_movie("el gran robo") for m in movies: print m['titles'] OUTPUT: 7 uomini e un cervello El gran rabo List item Mathias Sandorf La gran revista Green River Green River The Grim Reaper The Grim Reaper La commare secca Anthropophagus La verte moisson Granny R.I.P. Gehirnreflexe Rosso sangue Green River Killer Grammar Revolution German Whorefare 4 Cuori in campo Grim Reaper Grim Reaping case 2) using HTTP access ia = IMDb() movies = ia.search_movie("el gran robo") for m in movies: print m['titles'] OUTPUT: Flypaper The Big Steal Robbery El gran robo del tren El gran robo de Lorenza The Bank Job The First Great Train Robbery Grand Theft Auto The Great Train Robbery Hot Money 7 uomini e un cervello The Great Missouri Raid Le Great Dane Robbery El gran río Después del gran robo Gran-Robo tonda As you can see, the search with HTTP returned the right Movie, Flypaper. Is it possible get the same results using SQL ? Thanks in advance. -- Download BIRT iHub F-Type - The Free Enterprise-Grade BIRT Server from Actuate! Instantly Supercharge Your Business Reports and Dashboards with Interactivity, Sharing, Native Excel Exports, App Integration & more Get technology previously reserved for billion-dollar corporations, FREE http://pubads.g.doubleclick.net/gampad/clk?id=157005751&iu=/4140/ostg.clktrk___ Imdbpy-help mailing list Imdbpy-help@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/imdbpy-help