Re: [Imdbpy-help] SQL vs HTTP access for make a search_movie()

2014-11-21 Thread Davide Alberani
On Thu, Nov 20, 2014 at 1:08 PM, Gustavo Duarte  wrote:
>
> My doubt is, when I make a movie search by title in a another different
> language than English, why the search is more precise if use HTTP access
> than SQL ?

They just have a different set of information.
The plain text data files (used by SQL) is somehow limited; on the
IMDb many more information came from third-party companies
and are licensed in a way that prevents them to be included in
the plain text data files.

In your case, if you open the aka-titles.list file, you will see that
"El gran robo"
is not associated to Flypaper at all (it's, instead, correctly
associated to 7 uomini e un cervello, but that's obviously
another movie).

So, it has no way to know that your El gran robo is in fact
Flypaper.

Plus, even having all the data, I do a limited kind of search on
titles and AKAs:
they are first matched (with/without articles like The, A, An, ...) to
the entries
in the database using a soundex algorithm (to match things that
"sounds similar"),
then the matching subset is compared and sorted using a distance algorithm.
It works quite well, in fact, but on the IMDb site they can use much more
sophysticated algorithms and also meta-information like how many people
are looking for a given title, how much popular a movie is and so on.

So, in your specific case: no way.  In general: it's not that bad, but
I'm obviously
open to receive patches that improve the search. :-)


-- 
Davide Alberani   [PGP KeyID: 0x465BFD47]
http://www.mimante.net/

--
Download BIRT iHub F-Type - The Free Enterprise-Grade BIRT Server
from Actuate! Instantly Supercharge Your Business Reports and Dashboards
with Interactivity, Sharing, Native Excel Exports, App Integration & more
Get technology previously reserved for billion-dollar corporations, FREE
http://pubads.g.doubleclick.net/gampad/clk?id=157005751&iu=/4140/ostg.clktrk
___
Imdbpy-help mailing list
Imdbpy-help@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/imdbpy-help


[Imdbpy-help] SQL vs HTTP access for make a search_movie()

2014-11-20 Thread Gustavo Duarte
Hi guys,

I'm posting here a question also posted recently on stackoverflow,
http://stackoverflow.com/questions/27038949/searching-movie-by-titlle-sql-vs-http-acess
.

My doubt is, when I make a movie search by title in a another different
language than English, why the search is more precise if use HTTP access
than SQL ?

For example i want to retrieve info about the Movie "El gran Robo" (spanish
title) original title "Flypaper".

case 1) using SQL access

ia = IMDb(accessSystem='sql', uri = "mysql://devel:password@localhost/imdb")
movies = ia.search_movie("el gran robo")
for m in movies:
  print m['titles']
OUTPUT:

7 uomini e un cervello
El gran rabo
List item
Mathias Sandorf
La gran revista
Green River
Green River
The Grim Reaper
The Grim Reaper
La commare secca
Anthropophagus
La verte moisson
Granny R.I.P.
Gehirnreflexe
Rosso sangue
Green River Killer
Grammar Revolution
German Whorefare 4
Cuori in campo
Grim Reaper
Grim Reaping
case 2) using HTTP access

ia = IMDb()
movies = ia.search_movie("el gran robo")
for m in movies:
  print m['titles']
OUTPUT:

Flypaper
The Big Steal
Robbery
El gran robo del tren
El gran robo de Lorenza
The Bank Job
The First Great Train Robbery
Grand Theft Auto
The Great Train Robbery
Hot Money
7 uomini e un cervello
The Great Missouri Raid
Le Great Dane Robbery
El gran río
Después del gran robo
Gran-Robo tonda
As you can see, the search with HTTP returned the right Movie, Flypaper.

Is it possible get the same results using SQL ?

Thanks in advance.
--
Download BIRT iHub F-Type - The Free Enterprise-Grade BIRT Server
from Actuate! Instantly Supercharge Your Business Reports and Dashboards
with Interactivity, Sharing, Native Excel Exports, App Integration & more
Get technology previously reserved for billion-dollar corporations, FREE
http://pubads.g.doubleclick.net/gampad/clk?id=157005751&iu=/4140/ostg.clktrk___
Imdbpy-help mailing list
Imdbpy-help@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/imdbpy-help