Re: [Imdbpy-devel] Unable to get IMDb for specific entry

Davide Alberani Sun, 06 May 2007 23:17:01 -0700

On May 06, Jesper Noehr <[EMAIL PROTECTED]> wrote:

[please, reply on the mailing list if possible]


> >I'm about to do some tests with other "+" titles and using
> >urllib.quote instead of urllib.quote_plus.
> 
> Ah. Seems like I might've overfixed it then.

No, these were just some ideas: I've done it and the problem
persists, so it looks like you've spotted the bug correctly.
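
For reference, a minimal sketch of how the two encodings differ, written
for Python 3, where these functions live in urllib.parse (the thread
itself refers to the Python 2 urllib module). Both escape a literal "+"
as %2B and differ only in how they encode spaces, which fits the
observation that switching between them does not change the result:

```python
from urllib.parse import quote, quote_plus

title = "Tristan + Isolde (2006)"

# quote() escapes each space as %20 and the literal "+" as %2B.
print(quote(title))       # Tristan%20%2B%20Isolde%20%282006%29

# quote_plus() turns spaces into "+" but still escapes the literal "+" as %2B.
print(quote_plus(title))  # Tristan+%2B+Isolde+%282006%29
```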

> >>--- /root/IMDbPY-3.0/build/lib.linux-i686-2.4/imdb/__init__.py
> >
> >Is this part of the patch really necessary?
> 
> I haven't made lots of patches so it was just diff -ru. Feel free
> to correct, and tell me how to make them properly :-)

No no, the patch was perfect (I usually create them with diff -Naur,
but yours is ok).
I was talking about the changes to the get_imdbID method - but it
seems that you were right (see below).

> >Are you sure that a search with 's=all&q=%s' could return the main
> >page of a movie, while the same parameters used with the query
> >'q=%s;s=pt' has just returned a list of movies?
> >
> >I'll do some tests, but if you have some examples, it would help.
> 
> I've tested this, and it does. Try with "Tristan + Isolde (2006)".

Done; you're correct, and I have an explanation for this behavior:
with the s=pt query the server expects a _perfectly correct_ title
(title, kind, year, imdbIndex); for some reason titles with "+" are
not handled correctly. I suspect an error in our (or their) code
handling URL encoding, but so far I haven't found a way to encode
it correctly (if that's possible at all...).
On the other hand, a search with s=all (I suspect s=tt would be
better) does a normal search, but the "IMDbPYnew" account is set to
return a movie's main page if there's just one "good enough" result.
I'm wondering what the default is for accesses without an account
(IMDbPY sends a cookie: it has its own account on IMDb so that it
can search for adult titles; but if adultSearch is off, it doesn't
use any account [1]).
I'll check it later.
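
To make the two query shapes concrete, here is a rough sketch in
Python 3. The helper name build_query and its exact parameter are
hypothetical (they are not part of IMDbPY), and the use of quote_plus is
an assumption; only the two parameter strings, 'q=%s;s=pt' and
's=all&q=%s', come from the discussion above:

```python
from urllib.parse import quote_plus


def build_query(title, exact=True):
    """Return the query string for a title search (hypothetical helper).

    exact=True  -> 'q=...;s=pt', which expects a perfectly correct title
    exact=False -> 's=all&q=...', a normal search over everything
    """
    encoded = quote_plus(title)  # assumption: form-style encoding
    if exact:
        return 'q=%s;s=pt' % encoded
    return 's=all&q=%s' % encoded


print(build_query("Tristan + Isolde (2006)"))
print(build_query("Tristan + Isolde (2006)", exact=False))
```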

> I had to mess around with various HTTP headers and telnet to figure
> this one out.

Sounds funny! :-)

> >and in at least one case (in retrieve_unicode) the IOError
> >exception (raised by urllib) is trapped, but urllib2 raises the
> >urllib2.HTTPError exception, which doesn't have the same attributes.
> 
> Hmyeah, but it seems like urllib vs. urllib2 is *basically* the
> same with regards to error handling, with the exception of the
> Exceptions being raised.

That's good news: I hope to have time to read the documentation soon.

> It's definitely possible to have IMDbURLopener subclass a module
> of our own that utilizes urllib2, so no code change would be needed.

That could be a solution, at least as a first step.
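
As an illustration of the error-handling point, a small sketch in
Python 3, where urllib2 has become urllib.request and urllib.error.
The helper describe_error is hypothetical and is not IMDbPY code; it
only shows that HTTPError is still caught by an IOError/OSError handler
while exposing different attributes (code, msg) from a plain I/O error:

```python
import urllib.error


def describe_error(exc):
    """Normalize an error into (status_code, message); hypothetical helper."""
    if isinstance(exc, urllib.error.HTTPError):
        # HTTP-level failure: carries a numeric status code and a message.
        return exc.code, exc.msg
    if isinstance(exc, urllib.error.URLError):
        # Connection-level failure: only a reason, no HTTP status.
        return None, str(exc.reason)
    # Any other I/O error: fall back to its string form.
    return None, str(exc)


# HTTPError derives from URLError, which derives from OSError (IOError),
# so an existing "except IOError" clause still traps it.
print(issubclass(urllib.error.HTTPError, OSError))

err = urllib.error.HTTPError("http://example.invalid/", 404, "Not Found", None, None)
print(describe_error(err))
```

Code that trapped IOError would keep working under such a scheme, but
any handler that reads error details would need a normalizing step
along these lines.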

> >Oh, by the way: if you're interested in doing some (even very little)
> >work on IMDbPY just let me know: I will be glad to add you to the
> >list of developers and grant you access to the CVS repository.
> 
> As I'm messing with IMDbPY excessively these days (as you might've
> noticed from all my emails), I'd love to be able to contribute
> directly. Of course I'll put a little more thought into the patches
> before I submit them -- hadn't had my coffee this morning ;-)

Ok, welcome aboard! :-)
Let me know what your username on sourceforge is, and I'll add you
to the developers list.


Thanks!


+++
[1] that makes me think we should set another account for queries
    with adultSearch set to off...  Mmmm...
-- 
Davide Alberani <[EMAIL PROTECTED]> [PGP KeyID: 0x465BFD47]
http://erlug.linux.it/~da/

_______________________________________________
Imdbpy-devel mailing list
Imdbpy-devel@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/imdbpy-devel
