Re: [Imdbpy-help] Timing SQLITE creation

2009-01-14 Thread Davide Alberani
On Jan 13, Mike Castle dalg...@gmail.com wrote:

 I also just tested with PRAGMA journal_mode = OFF;

Good - I'll update the code and documentation ASAP (and submit it to you,
to check that I've understood everything).
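
For reference, a minimal sketch of what disabling the journal looks like
from Python 3's standard sqlite3 module.  The table layout and the
bulk_insert name are made up for illustration; this is not the code
used by imdbpy2sql.py.

```python
import os
import sqlite3
import tempfile


def bulk_insert(db_path, titles):
    """Insert titles with the rollback journal disabled.

    Returns (journal_mode reported by SQLite, number of rows stored).
    """
    conn = sqlite3.connect(db_path)
    try:
        # The PRAGMA must run outside a transaction, so issue it first.
        mode = conn.execute("PRAGMA journal_mode = OFF").fetchone()[0]
        conn.execute(
            "CREATE TABLE IF NOT EXISTS title (id INTEGER PRIMARY KEY, title TEXT)"
        )
        conn.executemany(
            "INSERT INTO title (title) VALUES (?)", [(t,) for t in titles]
        )
        conn.commit()
        count = conn.execute("SELECT COUNT(*) FROM title").fetchone()[0]
        return mode, count
    finally:
        conn.close()


if __name__ == "__main__":
    tmp_dir = tempfile.mkdtemp()
    print(bulk_insert(os.path.join(tmp_dir, "demo.db"), ["Title A", "Title B"]))
```

Keep in mind that without a journal a crash in the middle of the import
can leave the database file corrupted, so this only makes sense for a
database that can be rebuilt from scratch.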

 But, are you doing something at the beginning with preserving current
 ids?

Not exactly (not at the moment, at least); the only information saved and
restored between two runs is the set of imdbIDs (collected when IMDbPY has
to retrieve from the web the real imdbID for a movie/person/...,
and stored in the database for faster future access).
They are not matched against the 'id' columns of their respective
tables, but against the whole set of information (title, imdbIndex, year, ...),
so this would not be a problem.


Thanks!
-- 
Davide Alberani davide.alber...@gmail.com [PGP KeyID: 0x465BFD47]
http://erlug.linux.it/~da/

___
Imdbpy-help mailing list
Imdbpy-help@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/imdbpy-help


Re: [Imdbpy-help] BUG: 4.0-dev - Newline Missing after Genres and incorrect plot summary

2009-02-22 Thread Davide Alberani
On Feb 22, Chris Thompson chris.thomp...@stormfolio.net wrote:

 a) the output of get_first_movie appears to be missing a newline at
 the end of the genres and before Director:
 b) the plot summary is incorrect

Hi!
Thank you very much for your bug reports - I've fixed the
Movie.summary method (it wasn't updated after some recent
changes to IMDbPY).

 There are some warnings, maybe this is related?

No, you don't have the lxml package installed, and so the html
is parsed using BeautifulSoup (pure python).
It's just a warning and not a real problem (except that everything
will be a lot slower).
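
A quick way to check which situation applies on a given machine is to
test whether lxml is importable.  This is only a sketch for Python 3:
the function name is made up, and it does not reproduce how IMDbPY
itself selects its parser.

```python
import importlib.util


def pick_html_parser():
    """Return 'lxml' when the package is importable, else 'beautifulsoup'."""
    if importlib.util.find_spec("lxml") is not None:
        return "lxml"
    # Pure Python fallback: works everywhere, but parsing is slower.
    return "beautifulsoup"


print(pick_html_parser())
```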

I'm still trying to understand what can cause the problem you
reported in your other email.


Thank you again - I've added your name to the credits.
-- 
Davide Alberani davide.alber...@gmail.com [PGP KeyID: 0x465BFD47]
http://erlug.linux.it/~da/



Re: [Imdbpy-help] imdbpy and imdb licences/access terms

2009-03-02 Thread Davide Alberani
On Mar 02, Andre Eliatamby andre.eliata...@gmail.com wrote:

 What I would like to know is:
  - are my assumptions above correct?

Mostly; see also the other recent (and old) threads in imdbpy-devel.

  - Does imdbpy have a special license from imdb to provide this package?

No.  I honestly can't even remember if I ever asked (maybe, maybe not:
I started the project in 2004).
For sure at some point (years ago) I asked the developers of
other related libraries; many (most) never asked, some had tried.
The results can be summarized as:
- no reply at all.
- no, you can't: buy this (license for X thousands of data, missing
  the point of the request).
- yes you can (very few).

As far as I can remember there was a discussion about this kind of
software in debian-devel, and their opinion was that they are
ok for inclusion in Debian (it goes without saying that they
are more interested in the license of the code, so this may not
be conclusive).

The fact is (as usual: I'm not a lawyer) that similar terms,
_as far as I know_, aren't clearly enforceable (I'm not sure
about US law, however).
IMDbPY fetches the data the same way your browser (or your cache,
or your crap-filtering proxy) does.  After that, _for your own usage_
I'm pretty sure you can do whatever you like (your own usage completely
excludes redistribution and making any kind of profit, no matter how
indirect, that's for sure).

Moreover: keep in mind that IMDbPY is not (and never will be)
tailored as a mass-scraper bot: it can handle only single (and
serial) requests, and because of the way it's structured, using it
on the whole database is impossible.
In fact, it's as nice as it can be to the IMDb server.

  - Do I need to get special permission for imdb to actually use this
 package (or at least the query functions)?

You can use IMDbPY to access the plain text data files (putting
them in a SQL database: see README.sqldb).
If you plan to use 'http'/'mobile', obviously asking for permission won't
hurt, but as long as you use it for your own personal non-commercial
usage, I can't see any problem.

It goes without saying that if you plan to create a public service
with it (or make money out of it in any other way), you can stop
right now. :-)

Again: as I've said before, I'm genuinely convinced that using it
non-commercially is legal, but... I'm not a lawyer.


-- 
Davide Alberani davide.alber...@gmail.com [PGP KeyID: 0x465BFD47]
http://erlug.linux.it/~da/



Re: [Imdbpy-help] [Imdbpy-devel] IMDbPY 4.0 and IMDbPYKit 1.0

2009-03-13 Thread Davide Alberani
On Mar 13, Jean-Yves Avenard jyaven...@gmail.com wrote:

 Doesn't build on any of my machines (Ubuntu 8.10 and CentOS 5.3) :(

The kind of things you don't want to hear 12 hours after
a release. ;-)

 imdb/parser/common/cutils.c:63:20: error: Python.h: No such file or
 directory

Looks like you're missing the python-dev package, used to compile
an (optional, but suggested) C module.

Let me know if it solves the problem.

As an extreme measure, you can add the --without-cutils command
line argument to exclude the cutils.c module, but...
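
If you want to verify whether the headers are there before building,
something like the following sketch (Python 3, standard library only;
the function name is made up) looks for Python.h in the include
directory reported by the interpreter.

```python
import os
import sysconfig


def python_headers_available():
    """Return True when Python.h exists in the interpreter's include dir."""
    include_dir = sysconfig.get_paths()["include"]
    return os.path.isfile(os.path.join(include_dir, "Python.h"))


print(python_headers_available())
```

If it prints False, installing the development package of your
distribution should provide the missing header.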


Bye,
-- 
Davide Alberani davide.alber...@gmail.com [PGP KeyID: 0x465BFD47]
http://erlug.linux.it/~da/



Re: [Imdbpy-help] Get a movie's keyword

2009-03-15 Thread Davide Alberani
On Mar 15, Friedrich Weber f...@samurai-x.org wrote:

 Hi all!

Hi!

 is it possible to obtain a movie's keywords
 (just a list of the keywords listed at, for example,
 http://www.imdb.com/title/tt0087544/keywords)? I couldn't find
 anything about this in the sources, but maybe I am overlooking
 something :)

Maybe the documentation? ;-)
See README.package, especially the 'INFORMATION SETS' section.

Example:
  from imdb import IMDb
  i = IMDb('http')
  nausicaa = i.get_movie('0087544')
  # The update method is the key for what you need.
  i.update(nausicaa, 'keywords')
  print nausicaa['keywords']


The same information is available in every data access system ('sql',
'local', 'mobile', ...)

By the way, if you really need _only_ this information and you
already have the movieID (maybe from an i.search_movie call), you can
save some time/bandwidth by asking only for this information:
  from imdb import IMDb
  i = IMDb('http')
  nausicaa = i.get_movie('0087544', info='keywords')
  # Here the nausicaa Movie instance doesn't contain much information:
  # the movie doesn't even have a title!
  print nausicaa['keywords']


HTH,
-- 
Davide Alberani davide.alber...@gmail.com [PGP KeyID: 0x465BFD47]
http://erlug.linux.it/~da/



Re: [Imdbpy-help] MPAA Ratings

2009-03-16 Thread Davide Alberani
On Mar 15, Chris Spencer chriss...@gmail.com wrote:

 I had looked for the key in Movie.py and get_movie_infoset() in
 __init__.py, but I didn't see mpaa.

There isn't, because it's not a separate info set: it's one
of the keys you get with the 'main' info set (the one with 'cast',
'director' and so on).  It's not guaranteed that retrieving an
info set will add _a single_ key to the instance: there can be more
than one key in the returned information.
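
To picture this, here is a toy model in Python 3.  It is not IMDbPY
code: the INFOSET_KEYS mapping and the find_infoset function are
invented for illustration, and the key lists are abbreviated.

```python
# Hypothetical, abbreviated mapping: one info set can provide many keys.
INFOSET_KEYS = {
    "main": ["title", "cast", "director", "genres", "mpaa"],
    "keywords": ["keywords"],
}


def find_infoset(key, infoset_keys=INFOSET_KEYS):
    """Return the name of the info set that provides `key`, or None."""
    for infoset, keys in infoset_keys.items():
        if key in keys:
            return infoset
    return None


print(find_infoset("mpaa"))      # main
print(find_infoset("keywords"))  # keywords
```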


Bye!
-- 
Davide Alberani davide.alber...@gmail.com [PGP KeyID: 0x465BFD47]
http://erlug.linux.it/~da/



Re: [Imdbpy-help] BUG: 4.0-dev - Newline Missing after Genres and incorrect plot summary

2009-03-25 Thread Davide Alberani
On Mar 25, H. Turgut Uyar u...@itu.edu.tr wrote:

 I was browsing the archives and noticed that this was not answered.

Completely missed that part. :-)

 The 1.3.6 version of python-lxml does not seem to have an html
 module.

It's pretty old, it was released in October 2007.

 Maybe we should set the dependency info in the setup.py file so
 that a release with version greater than 2.1 will be selected.

For sure it won't hurt. :-)

PS: in a matter of days, I hope to add top 250/bottom 100 links
to imdbpykit.
What about i18n?  I see a file for English and one for Turkish;
I can add Italian.  Is there an easy way to translate the XML
generated by the XSL file?


Thanks!
-- 
Davide Alberani davide.alber...@gmail.com [PGP KeyID: 0x465BFD47]
http://erlug.linux.it/~da/



Re: [Imdbpy-help] With importing to MySQL I get these and other warnings and further.

2009-04-13 Thread Davide Alberani
On Apr 13, Atul Kulkarni atulskulka...@gmail.com wrote:

 /usr/bin/imdbpy2sql.py:1278: Warning: Incorrect string value: '\xC2\x92s
 ch...' for column 'info' at row 1164
 /usr/bin/imdbpy2sql.py:1006: Warning: Incorrect string value: '\xC2\x80$'
 for column 'title' at row 35842
 
 Will they have any impact on my query?

I can't tell, since I can't figure out the strange errors you got.
Or rather: it looks like a problem with garbage in the plain text
data files or a misconfiguration of the charset/collation of your
MySQL, but... who knows?  Those chars don't make much sense. :-/
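
One possible reading of those byte sequences (an assumption, not
something confirmed in this thread): the data files may contain a few
Windows-1252 characters, such as a curly apostrophe (byte 0x92) or the
euro sign (byte 0x80).  Read as iso-8859-1 and re-encoded as utf-8,
they become exactly '\xC2\x92' and '\xC2\x80'.  A small Python 3
sketch, with a made-up function name:

```python
def explain_byte(raw):
    """Decode one byte two ways and show the utf-8 form that gets stored."""
    intended = raw.decode("cp1252")       # what the writer probably meant
    as_latin1 = raw.decode("iso-8859-1")  # what an iso-8859-1 reader sees
    stored = as_latin1.encode("utf-8")    # what is sent to the database
    return intended, stored


print(explain_byte(b"\x92"))  # curly apostrophe, stored as b'\xc2\x92'
print(explain_byte(b"\x80"))  # euro sign, stored as b'\xc2\x80'
```

If this is what happens, the rows are still inserted, but with a
control character in place of the apostrophe or the euro sign.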


 I am getting some errors like
  [...]
 TypeError: int() argument must be a string or a number, not 'NoneType'

That's really bad and should never happen.
It's as if the 'id' column in the 'title' table contains some
NULL value, but this can't be.

If possible, you can try this:
- update to 4.0 (I guess you're using 3.9 or older).
- run imdbpy2sql.py appending this:
    2>&1 | tee OUTPUT.txt
  so that you can record its output in OUTPUT.txt.
- show me the code you're running (or a short example useful to
  reproduce the problem), if possible using movie titles and not
  movieIDs.
  I.e.: i.get_movie(1234) is not that useful to debug the sql
  data access system; this is much better:
    movie = i.search_movie('The Movie Title')[0]
    i.update(movie)


Thank you very much!

-- 
Davide Alberani davide.alber...@gmail.com [PGP KeyID: 0x465BFD47]
http://erlug.linux.it/~da/



Re: [Imdbpy-help] Upcoming US Air Times.

2009-04-19 Thread Davide Alberani
On Apr 19, Andre LeBlanc andreplebl...@gmail.com wrote:

 Thats actually not the data that I am looking for. I need the date
 AND TIME that the episode is airing, not the original air date.
 on the page that I linked to you can see that 13x07 is airing at
 10PM on wednesday.

Hmmm... no: no sign of it in my page. :-)
You can see what I got here:
  http://imdbpy.sourceforge.net/tt0121955_episodes.html

Are you from the United States?  I suppose IMDb customizes the
pages based on the country of the viewer.

By the way, remember that IMDbPY accesses the IMDb server using its own
user (it sends its own cookies) and uses the 'akas' web server; i.e.:
  http://akas.imdb.com/title/tt0121955/tvschedule

 but when I do this:
 
 i.update(ep, 'airing')
 ep['airing']
 I get a KeyError, there is no 'airing' info for that episode.

I see... this is due to the fact that, for episodes, the table
containing the 'tv schedule' is different. :-/
You can look at:
  http://akas.imdb.com/title/tt0121955/tvschedule
  http://akas.imdb.com/title/tt1397945/tvschedule

respectively for the series and the episode.

So you are right; using the previous example, this won't work:
import imdb
i = imdb.IMDb('http') # or 'mobile'
m = i.get_movie('0121955')
i.update(m, 'episodes')
episode = m['episodes'][13][7]
i.update(episode, 'airing')
print episode['airing']


I'll see if it's possible to support both formats, but it could
take some time.
In the meantime, maybe you can parse the list of aired episodes
(taking it from the series) and look specifically at the episode
you're interested in.


Thanks for the bug report, I'll add your name to the credits. :-)

-- 
Davide Alberani davide.alber...@gmail.com [PGP KeyID: 0x465BFD47]
http://erlug.linux.it/~da/



Re: [Imdbpy-help] problems with the plain text data files

2009-04-21 Thread Davide Alberani
On Apr 21, Davide Alberani davide.alber...@gmail.com wrote:

 So, let's wait: usually they reply in a short time.

Aaargh!  I didn't notice it before; from:
  http://imdb.com/help/search?domain=helpdesk_faq&index=1&file=titleformat

  "Articles now remain where they occur in the title. This was a policy
  change in April 2009."

Can I say it?  A really awful choice.  It's one thing to _display_ the
title in "The Title" format on the web page, but storing titles in a way
that loses the notion of what the article is looks really bad to me. :-(

This will affect 'http' and 'mobile', too.
Probably nothing major (the code to handle both formats is already
present, after all), but it will require some time to fix it
everywhere.
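
To show why losing the article is a problem, here is a toy sketch in
Python 3.  It is not the IMDbPY implementation: the tiny ARTICLES list
and both function names are invented for illustration.

```python
# Tiny illustrative list; a real list would cover many more languages.
ARTICLES = ("the", "a", "an", "die", "der", "das", "le", "la")


def to_canonical(title, articles=ARTICLES):
    """Guess the article: 'The Matrix' -> 'Matrix, The'."""
    first, sep, rest = title.partition(" ")
    if sep and first.lower() in articles:
        return "%s, %s" % (rest, first)
    return title


def to_display(title, articles=ARTICLES):
    """Undo the canonical form: 'Matrix, The' -> 'The Matrix'."""
    head, sep, last = title.rpartition(", ")
    if sep and last.lower() in articles:
        return "%s %s" % (last, head)
    return title


print(to_display("Matrix, The"))  # The Matrix
print(to_canonical("The Matrix"))  # Matrix, The
print(to_canonical("Die Hard"))    # Hard, Die  (wrong guess!)
```

Going from "Matrix, The" to "The Matrix" is always safe, while the
opposite direction needs a guess: "Die" is an article in German but
not in the English title "Die Hard".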


-- 
Davide Alberani davide.alber...@gmail.com [PGP KeyID: 0x465BFD47]
http://erlug.linux.it/~da/



Re: [Imdbpy-help] RuntimeError: dictionary changed size during iteration

2009-05-02 Thread Davide Alberani
On May 01, Andre LeBlanc andreplebl...@gmail.com wrote:

 I got this error using the latest SVN, seems to only affect that one movie
 but I haven't tried many.
   [...]
 RuntimeError: dictionary changed size during iteration

My fault: I made a bad assumption iterating over a dictionary. :-/
It's fixed in the SVN.
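
For the curious, this kind of error can be reproduced with a few lines.
The sketch below is for Python 3 and uses made-up function names and
data; it is not the code that was fixed in IMDbPY.

```python
def drop_empty_unsafe(info):
    """Buggy: removes keys from the dict while iterating over it."""
    for key in info:
        if not info[key]:
            del info[key]  # the dict changes size during the loop
    return info


def drop_empty_safe(info):
    """Fixed: iterates over a snapshot of the keys."""
    for key in list(info):
        if not info[key]:
            del info[key]
    return info


try:
    drop_empty_unsafe({"title": "", "year": 1984})
except RuntimeError as e:
    print("RuntimeError:", e)

print(drop_empty_safe({"title": "", "year": 1984}))
```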

Thank you very much for the bug report.


-- 
Davide Alberani davide.alber...@gmail.com [PGP KeyID: 0x465BFD47]
http://erlug.linux.it/~da/



Re: [Imdbpy-help] installer problems (WAS: IMDbPY 4.1 and IMDbPYKit 1.1)

2009-05-04 Thread Davide Alberani
On May 03, Davide Alberani davide.alber...@gmail.com wrote:

 There are problems with the installer of both IMDbPY and IMDbPYKit.

IMDbPYKit 1.1.1 was released today to fix the problem.

As said, IMDbPY 4.1 localization won't work out-of-the-box.
You can use the SVN version, or follow these simple steps,
courtesy of H. Turgut Uyar:

--
cd PATH-TO-YOUR/site-packages
mv IMDbPY-4.1-py2.5-linux-i686.egg imdbpy.zip
mkdir IMDbPY-4.1-py2.5-linux-i686.egg
cd IMDbPY-4.1-py2.5-linux-i686.egg/
unzip ../imdbpy.zip
rm ../imdbpy.zip
cd imdb/locale/
cp PATH-TO-IMDBPY-SOURCES/imdb/locale/*po .
python rebuildmo.py
--


Sorry for all the trouble,
-- 
Davide Alberani davide.alber...@gmail.com [GPG KeyID: 0x465BFD47]
http://erlug.linux.it/~da/



Re: [Imdbpy-help] Cast - lead or supporting role?

2009-05-04 Thread Davide Alberani
On May 04, Roman Gaufman hacke...@gmail.com wrote:

 Is there no way to get a list of just actors mentioned on the main
 page for the movie?

If you're fetching the data from the web, you can use other
data access systems (beyond the default 'http', which uses the
combined page as its main source of information) to fetch
less data: both 'mobile' and 'httpThin' use the maindetails
page.
'httpThin' uses the same parser as 'http' (it just fetches the
main details page) while 'mobile' is a different parser, tailored
for systems with limited CPU/bandwidth resources.

E.g.:
  from imdb import IMDb
  ia = IMDb('httpThin') # or 'mobile'
  movie = ia.search_movie('12 Angry Men')[0] # gets the first result.
  ia.update(movie) # updates the information.
  print len(movie['cast']) # will print 12, and not 16 as with 'http'.

It goes without saying that some information present in combined
is not present in maindetails, so it's up to you to decide if
you need it or not.


HTH,
-- 
Davide Alberani davide.alber...@gmail.com [GPG KeyID: 0x465BFD47]
http://erlug.linux.it/~da/



Re: [Imdbpy-help] Data too long for column 'phonetic_code' at row 88

2009-05-10 Thread Davide Alberani
On May 10, Davide Alberani davide.alber...@gmail.com wrote:

 In the next days, I'll try with MySQL 5.1 and the latest IMDb data.

Everything is fine for me, after a fast test with a partial set of
data (5% of every plain text data file) and MySQL 5.1.34 with MyISAM
tables (standard configuration from Debian/sid).

I'm almost sure there is a problem in your configuration.


HTH,
-- 
Davide Alberani davide.alber...@gmail.com [GPG KeyID: 0x465BFD47]
http://erlug.linux.it/~da/



Re: [Imdbpy-help] imdbpy2sql.py, PostgreSQL, UTF8, invalid byte sequence

2009-10-06 Thread Davide Alberani
On Oct 06, Vitaly Pashkov ad...@fluda.net wrote:

 You can get my copy here: http://fluda.net/personal/movies.list.gz

It's identical to the one I have.

 imdb= select id, title from title order by random() limit 5;
    id    |    title
 ---------+--------------
  1147146 | (#3.11)
  1010589 | (2007-05-21)

The above ones are not symptoms of problems: many episodes of
tv series are identified only by their #SEASON.EPISODE number
or by their airing date (when a title is missing, obviously).

 Got it!
  [...]
 Auf der grünen Wiese

Not exactly the nasty title I expected - whatever it means. :-)
There's nothing wrong with it, and there are many more umlauts
in that list - this is probably the first occurrence that creates
problems when the data are flushed to the database (the data are
temporarily stored in a Python dictionary, and so they are not
flushed to the db in order).

 /usr/lib/python2.5/site-packages/IMDbPY-4.2-py2.5.egg/EGG-INFO/scripts/imdbpy2sql.py:628:
  UnicodeWarning: Unicode equal comparison failed to convert both
 arguments to Unicode - interpreting them as being unequal
   if ts[-1].lower() in _articles:

Try this simple script, to see if it creates any problem:

#!/usr/bin/env python

import imdb

utf8_title = 'Auf der gr\xc3\xbcnen Wiese'

print utf8_title in imdb.utils._articles



If it prints False without raising warnings or exceptions,
I think the problem is not in Python itself but in the psycopg2
module or in the configuration of PostgreSQL.

 BUT! I also tried to run it without my modification at line 1427 and with
 modified title_soundex. It stopped at the other line:
  [...]
 /usr/lib/python2.5/site-packages/IMDbPY-4.2-py2.5.egg/EGG-INFO/scripts/imdbpy2sql.py,
 line 1030, in _runCommand
 CURS.executemany(self.sqlstr, self.converter(dataList))
 psycopg2.DataError: byte sequence invalid for encoding UTF8: 0xc333
 HINT:  This error can also happen if the byte sequence does not match the
 encoding expected by the server, which is controlled by client_encoding.

It's more or less expected: the data is processed without glitches,
but everything explodes when CURS.executemany tries to dump it into
the database.

To summarize how imdbpy2sql.py works: it reads the plain text data
files (which are mostly in iso-8859-1 encoding), converts them to
utf-8 for internal usage (for a series of more or less good reasons)
and uses a cursor provided by the db access module (psycopg2, in this
case) to store them (again, passing the strings as utf-8).
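
The conversion step can be sketched like this (Python 3, made-up
function names; it only mirrors the idea, not the actual code of
imdbpy2sql.py).  The second function shows why a sequence like the
0xc333 reported above is rejected: 0xc3 must be followed by a
continuation byte, and 0x33 (the digit '3') is not one.

```python
def latin1_line_to_utf8(raw_line):
    """Decode an iso-8859-1 line and re-encode it as utf-8 bytes."""
    return raw_line.decode("iso-8859-1").encode("utf-8")


def is_valid_utf8(data):
    """Return True when the bytes can be decoded as utf-8."""
    try:
        data.decode("utf-8")
        return True
    except UnicodeDecodeError:
        return False


raw = b"Auf der gr\xfcnen Wiese"      # the u-umlaut is byte 0xFC in iso-8859-1
print(latin1_line_to_utf8(raw))       # b'Auf der gr\xc3\xbcnen Wiese'
print(is_valid_utf8(b"\xc3\xbc"))     # True
print(is_valid_utf8(b"\xc3\x33"))     # False
```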

Your change forces imdbpy2sql.py to use _unicode_ representation of
titles; the UnicodeWarning you get is because it compares a unicode
string to a list of utf-8 encoded strings (imdb.utils._articles).
As a temporary solution you can convert _articles to a list of unicode strings,
but I can't consider this a real fix.  Put this line somewhere at the
top of the script - hoping it will not break something else ;-) :
  _articles = [x.decode('utf8') for x in _articles]

Why psycopg2 or your PostgreSQL don't play nicely with utf-8 strings
is beyond my imagination. :-)
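
The mismatch between the two kinds of strings can be seen in isolation.
This sketch is written for Python 3, where comparing text with bytes
silently yields False; in the Python 2 used in this thread the same
comparison is attempted with an implicit conversion, which is where the
UnicodeWarning comes from.  The article list is an illustrative subset,
not the real imdb.utils._articles.

```python
# utf-8 encoded byte strings, as in the list used by the script.
utf8_articles = [b"the", b"a", b"die", b"l\xc3\xa0"]
word = "die"  # a text (unicode) string

# Text never compares equal to bytes, so the lookup fails.
print(word in utf8_articles)  # False

# Decoding the list makes both sides text, and the lookup works.
decoded = [a.decode("utf8") for a in utf8_articles]
print(word in decoded)        # True
```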

Another small test (this _could_ exclude psycopg2, even if it can
always be a matter of how it's initialized by SQLObject/SQLAlchemy):

===
#!/usr/bin/env python

import psycopg2

utf8_title = 'Auf der gr\xc3\xbcnen Wiese'

connection = psycopg2.connect(database='imdb', user='UNAME', password='PWD')
curs = connection.cursor()

curs.execute("INSERT INTO title (title, kind_id) VALUES (%s, 1);",
             (utf8_title,))

connection.commit()

===

In my installation, it works.

As you can see, debugging these strange interactions between
charsets/encodings and various modules, libraries and database
engines is a real pain. :-(


Thanks for your time!

-- 
Davide Alberani davide.alber...@gmail.com [GPG KeyID: 0x465BFD47]
http://erlug.linux.it/~da/



Re: [Imdbpy-help] [Imdbpy-devel] changes to imdb.com

2009-11-17 Thread Davide Alberani
On Nov 17, H. Turgut Uyar u...@itu.edu.tr wrote:

 Yes, it returns this:

Good - even if it's driving me crazy. ;-)
You get what I see on my local system.

Please, can you try it again in an hour or so (or tomorrow) and
see if it's still ok (I've made some last-minute changes to the
code and the account)?

And if possible, can you try to install it (I tried it with Python 2.6,
and it works, on my system)?

 I've tested it in clean virtual environments (2.6) both with lxml
 and without lxml and got the same result.

It's the same on my system (Python 2.5) where it works.
On a remote system, where IMDbPY is updated from the SVN and
installed locally (on a clean directory), it still doesn't work. :-(

The funny thing is that both my system and the remote one fetch
pages that are served for the IMDbPYweb user account (I can see it in
the HTML), and so I can't figure out why the HTML served to the remote
system contains the sortable lists.

  It should, since I've set up the IMDbPYweb IMDb account to _not_
  use the sortable lists, showing information about people.
 
 Do I need to set something in imdbpy.cfg for this or is it already
 handled by the code?

No - it should be fine without any changes.

I suspect there's something veeery wrong somewhere - but I can't
find what - keep in mind that the remote system I'm talking about
is clean, with no transparent proxies or strange things like these,
and hosted on a more-than-trusted provider.


Thanks!
-- 
Davide Alberani davide.alber...@gmail.com [GPG KeyID: 0x465BFD47]
http://erlug.linux.it/~da/



[Imdbpy-help] IMDbPY 4.3

2009-11-18 Thread Davide Alberani
IMDbPY 4.3 is out, to fix the last problems with some changes to
the IMDb website.
There're also some cool new features, like the ability to guess
the language of a movie title.

As usual, it can be downloaded from here:
  http://imdbpy.sf.net/


Since IMDb is introducing a new style for pages about people,
soon we'll change the parsers to support it.

-- 
Davide Alberani davide.alber...@gmail.com [GPG KeyID: 0x465BFD47]
http://erlug.linux.it/~da/



Re: [Imdbpy-help] Ending with error code 17 after traceback.

2009-11-18 Thread Davide Alberani
On Nov 19, jaw knee johnn...@gmail.com wrote:

 joh...@ears:/750/Code/Projects/MySQL/IMDB$ sudo imdbpy2sql.py -o sqlobject
 -d /750/Code/Projects/MySQL/IMDB/ -u mysql://root:passw...@localhost/imdb -c
 .
  [...]
 raise InternalError(ErrorMessage(e))
 sqlobject.dberrors.InternalError: Can't create/write to file
 '/750/tmp/mysql/#sql_46a1_0.MYI' (Errcode: 17)

While I'm a little confused by the fact that the error seems to be
generated by SQLObject, it looks like this is a permission problem
on your filesystem.

Do you have read/write access to /750/tmp/mysql/ and to your working
directory (for the -c . argument)?

Does your MySQL work correctly (e.g., using phpmyadmin)?
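
As a side check, and assuming the Errcode printed by MySQL is the
operating system's error number (that's how I read it), you can ask
Python what a given number means on your platform.  The function name
below is made up; the sketch is for Python 3.

```python
import errno
import os


def describe_errcode(code):
    """Return (symbolic name, message) for an OS error number."""
    return errno.errorcode.get(code, "UNKNOWN"), os.strerror(code)


print(describe_errcode(17))  # on Linux: ('EEXIST', 'File exists')
print(describe_errcode(13))  # on Linux: ('EACCES', 'Permission denied')
```

Since 17 is "file exists" rather than "permission denied", it may also
be worth looking for stale temporary files left in /750/tmp/mysql/.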


HTH,
-- 
Davide Alberani davide.alber...@gmail.com [GPG KeyID: 0x465BFD47]
http://erlug.linux.it/~da/



Re: [Imdbpy-help] get_imdbMovieId implementation ?

2009-11-26 Thread Davide Alberani
On Nov 26, Jonas Geiregat jo...@geiregat.org wrote:

 When trying out the method get_imdbMovieId() it doesn't return an
 object. It does return the same string I entered as param.
 
  ai.get_imdbMovieID('Snatch')
 'Snatch'
 
 Is this method not implemented yet ?

It is, but it serves another purpose. :-)
See documentation about it in README.package, README.devel and
in the code itself.

Basically, it takes a movieID specific to the data access system
in use (i.e. 'http', 'httpThin', 'mobile' or 'sql') and converts
it to the unique imdbID used on the imdb.com web site.
For 'http'/'httpThin' and 'mobile', it does nothing since they
use the same ID (i.e., the movieID _is_ the imdbID).
But 'sql' internally uses IDs that are valid only locally (to your
installation); this means that a query to the imdb.com site must
be issued, to convert it to the real imdbID (beware that it can
return None).

Generally speaking, it's better to use the get_imdbID method, which
takes a Movie/Person/Character/Company instance, and not a
movieID/personID/characterID/companyID.

E.g.:
  from imdb import IMDb
  ia = IMDb('sql', uri='mysql://USER:p...@localhost/imdb') # my local DB.
  tu = ia.search_movie(u'The Untouchables (1987)')[0] # First item.
  print tu.movieID # will print 558195, valid only for my database.
  print ia.get_imdbID(tu) # will print '0094226', that is the ID used
  # by imdb.com
  # The same thing, with the less flexible/smart get_imdbMovieID:
  print ia.get_imdbMovieID(tu.movieID)


HTH,
-- 
Davide Alberani davide.alber...@gmail.com [GPG KeyID: 0x465BFD47]
http://erlug.linux.it/~da/



[Imdbpy-help] IMDbPY 4.5

2010-02-28 Thread Davide Alberani
Released in a little rush, here it is 4.5:
  http://imdbpy.sourceforge.net/

This fixes (I really do hope...) the recent issues with the imdb.com
web server and contains other fixes and improvements (most notably,
the ability to decouple the creation of CSV files from their insertion
in a SQL database, using the imdbpy2sql.py script).

As usual, report any bug and share your ideas for improvements!


Enjoy,
-- 
Davide Alberani davide.alber...@gmail.com [GPG KeyID: 0x465BFD47]
http://www.mimante.net/



Re: [Imdbpy-help] IMDbPY 4.5

2010-03-02 Thread Davide Alberani
On Feb 28, Davide Alberani davide.alber...@gmail.com wrote:

 Released in a little rush, here it is 4.5:
   http://imdbpy.sourceforge.net/

You may have noticed that 4.5.1 is already out. :-)


-- 
Davide Alberani davide.alber...@gmail.com [GPG KeyID: 0x465BFD47]
http://www.mimante.net/



Re: [Imdbpy-help] imdb column questions

2010-03-19 Thread Davide Alberani
On Mar 19, Michael Liu mikel...@gmail.com wrote:

 So in short, if i understand it all correctly, the IMDB has in
 their own database: the real IMDB id, whether something is adult,
 whether something is on Amazon/Blockbuster.

More or less: the imdbIDs are used only on the site, and the
amazon/blockbuster information is not in the plain text data files.
If a movie is for mature viewers, you can guess it by searching for
'Adult' in its genres, and this can be done in 'sql', too (in fact, using
the do_adult_search method you can exclude adult movies from your
movie searches - if you already have a Movie instance, all you
need is a simple "'Adult' in movie.get('genres')" check).
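
A minimal sketch of that check, written for Python 3 and using plain
dictionaries as stand-ins for Movie instances (an assumption made only to
keep it self-contained; the is_adult helper is hypothetical, not part of
IMDbPY).  It passes an empty list as default, so a movie without a
'genres' key doesn't raise an error:

```python
def is_adult(movie):
    """Return True if 'Adult' is among the movie's genres."""
    # An empty list as default: a missing 'genres' key simply means "no".
    return 'Adult' in movie.get('genres', [])


# Plain dicts standing in for Movie instances (made-up titles).
family_movie = {'title': 'Some Cartoon', 'genres': ['Animation', 'Family']}
adult_movie = {'title': 'Some Other Title', 'genres': ['Adult', 'Drama']}
no_genres = {'title': 'Genres Not Fetched Yet'}

for movie in (family_movie, adult_movie, no_genres):
    print(movie['title'], is_adult(movie))
```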

 However, none of this is shared by them, and the only way to get any
 of them is to use a script to do a title search and scrape the pages?

You don't need to write it: IMDbPY already does everything you need.

Just use something like:
  from imdb import IMDb
  ia = IMDb('http') # or 'mobile' for a faster but less complete parser.
  ... here you do your searches and you fetch movies, persons, ...

In the example above, the movieIDs you'll use will be _the_ imdbIDs;
no need to mention that now you're NOT accessing your local database. :-)

There can be a little mess if you start mixing information from the
web with info from the sql database, but that's all.


Oh, by the way, information about products on Amazon can also be
found in 'sales' and 'dvd' info sets, I think (see README.package,
section 'INFORMATION SETS') and maybe other pages, I don't recall
for sure.


Bye,
-- 
Davide Alberani davide.alber...@gmail.com [GPG KeyID: 0x465BFD47]
http://www.mimante.net/



[Imdbpy-help] IMDbPY: build objects from XML dumps

2010-03-22 Thread Davide Alberani
I love Mercurial. :-)
Have fun with this fork (soon it will be merged):
  http://bitbucket.org/alberanid/imdbpy_import_xml/overview/

New feature: for some time now IMDbPY has been able to dump its objects
into XML strings/files; now you can use this XML to recreate IMDbPY
objects (instances of the Movie, Person, Character and Company classes).

I've not tested it extensively, and for sure it will fail here
and there (and the code is not exactly nice to look at).

You can test it this way (no output is a good sign ;-):
from imdb import IMDb
from imdb.helpers import parseXML

ia = IMDb('http')
matrix = ia.get_movie('0133093')
xml = matrix.asXML()
matrix_rebuilt = parseXML(xml)

for key in sorted(matrix.keys()):
    origValue = matrix[key]
    rebuiltValue = matrix_rebuilt.get(key)
    if origValue != rebuiltValue:
        print 'ERROR', key
        print origValue
        print rebuiltValue
        print ''


-- 
Davide Alberani davide.alber...@gmail.com [GPG KeyID: 0x465BFD47]
http://www.mimante.net/



Re: [Imdbpy-help] Retrieving a List of Movies in a Given Year

2010-03-27 Thread Davide Alberani
On Mar 27, Chris Spencer chriss...@gmail.com wrote:

 How would you use IMDBPy to retrieve a list of movies within a given year?

It's not possible, at this time (and I'm not sure it's a feature I
want to add).

Anyway, if you're using the plain text data files (if you're not, see
README.sqldb), you can query your database on the 'production_year' column
of the 'title' table; after that you can take the 'id's and use IMDbPY
to retrieve all the information about these movies.
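
A rough sketch of that query, written for Python 3 with the standard
sqlite3 module.  The in-memory table is only a toy stand-in for the
database created by imdbpy2sql.py: it has just the columns needed here,
and the rows and the movie_ids_for_year helper are made up for the
example.

```python
import sqlite3

# Toy stand-in for the real database: only the columns used below.
conn = sqlite3.connect(':memory:')
conn.execute('CREATE TABLE title '
             '(id INTEGER PRIMARY KEY, title TEXT, production_year INTEGER)')
conn.executemany(
    'INSERT INTO title (id, title, production_year) VALUES (?, ?, ?)',
    [(1, 'The Untouchables', 1987),
     (2, 'The Matrix', 1999),
     (3, 'Snatch', 2000),
     (4, 'Fight Club', 1999)])


def movie_ids_for_year(connection, year):
    """Return the local ids of every title produced in the given year."""
    rows = connection.execute(
        'SELECT id FROM title WHERE production_year = ? ORDER BY id', (year,))
    return [row[0] for row in rows]


ids_1999 = movie_ids_for_year(conn, 1999)
print(ids_1999)
# Each id is a movieID valid for the local database only; it could then be
# passed to the get_movie method of an IMDb('sql', ...) instance.
```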


HTH,
-- 
Davide Alberani davide.alber...@gmail.com [GPG KeyID: 0x465BFD47]
http://www.mimante.net/



Re: [Imdbpy-help] imdbpy2sql.py and 8-bit bytestrings

2010-05-13 Thread Davide Alberani
On May 13, Dustin Wyatt dustin.wy...@gmail.com wrote:

Hi!

 imdbpy2sql.py -d C:\test\imdb_db\text_files -u
 sqlite:///C:\test\imdb_db\db.db --sqlite-transactions -o sqlalchemy

I think it's a better idea to use the UNIX path separator (/),
even on Windows.  See the 'Microsoft Windows paths' section in
the README.sqldb file of the documentation (one of these days
I'll fix all the troubles with the Windows path separator...)
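
If the paths are built by a script, the separator can be converted with
the standard pathlib module (Python 3).  This is only a sketch for the
directory passed to -d: the unix_style helper is hypothetical, and the
exact form of the database URI on Windows is the one described in
README.sqldb, not something this snippet decides.

```python
from pathlib import PureWindowsPath


def unix_style(windows_path):
    """Return a Windows path rewritten with '/' as the separator."""
    # PureWindowsPath parses the path without touching the file system,
    # so this also works on a non-Windows machine.
    return PureWindowsPath(windows_path).as_posix()


data_dir = unix_style(r'C:\test\imdb_db\text_files')
print(data_dir)
```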

 CURS.executemany(self.sqlstr, self.converter(dataList))
 sqlite3.ProgrammingError: You must not use 8-bit bytestrings unless you use a
 text_factory that can interpret 8-bit bytestrings (like text_factory = str). It
 is highly recommended that you instead just switch your application to Unicode
 strings.

I see - that's a new feature introduced in the SQLite module
shipped with Python 2.6.

Thank you very much for the report: I wasn't aware of it.  I've already
fixed it in the Mercurial repository: http://imdbpy.sf.net/?page=download#hg
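
For reference, a minimal sketch of the "switch to Unicode strings" route
that the error message recommends; it is not necessarily what the fix in
the repository does.  It is written for Python 3 (where the byte strings
are 'bytes' objects), and the latin_1 encoding, the toy table and the
helper names are all assumptions made for the example.

```python
import sqlite3


def to_unicode(value, encoding='latin_1'):
    """Decode byte strings to text; leave every other value untouched."""
    if isinstance(value, bytes):
        return value.decode(encoding)
    return value


def unicode_rows(rows):
    """Yield each row with its byte strings converted to text."""
    for row in rows:
        yield tuple(to_unicode(value) for value in row)


conn = sqlite3.connect(':memory:')
conn.execute('CREATE TABLE name (id INTEGER PRIMARY KEY, name TEXT)')

# One row still holds raw bytes, the other is already text.
raw_rows = [(1, b'Almod\xf3var, Pedro'), (2, 'De Niro, Robert')]
conn.executemany('INSERT INTO name (id, name) VALUES (?, ?)',
                 unicode_rows(raw_rows))

stored = [row[0] for row in conn.execute('SELECT name FROM name ORDER BY id')]
print(stored)
```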

By the way, I keep noticing other problems with SQLite and SQLAlchemy.
I see a 'cannot commit - no transaction is active' error when trying to
execute the 'COMMIT;' statement (--sqlite-transactions option).
The script goes on but... who knows what's going on.

Anyway, thank you again for the bug report: I've added your name
to the credits.

Bye!
-- 
Davide Alberani davide.alber...@gmail.com [GPG KeyID: 0x465BFD47]
http://www.mimante.net/



Re: [Imdbpy-help] a person photo

2010-06-16 Thread Davide Alberani
On Jun 15, Karim Ihab ki...@connectmetv.com wrote:

 Dear all,

Wrong address: it should go to imdbpy-help@lists.sourceforge.net :-)

 But it didn't include any photos or posters or any thing that I can use can
 u help!

Person instances (can) have the 'headshot' keyword:
  person.get('headshot')

Will give you 
'http://ia.media-imdb.com/images/M/MV5BMTQzNjU3MDczN15BMl5BanBnXkFtZTYwNzY2Njc4._V1._SX94_SY140_.jpg'

You can get a larger version of the image with a function in
the 'helpers' module.  E.g.:
   import imdb.helpers

   print imdb.helpers.fullSizeCoverURL(person)

It works for Movie instances, too.


Bye!
-- 
Davide Alberani davide.alber...@gmail.com [GPG KeyID: 0x465BFD47]
http://www.mimante.net/



[Imdbpy-help] IMDbPY 4.6 released

2010-06-20 Thread Davide Alberani
IMDbPY 4.6 is available: http://imdbpy.sf.net/

In this release, a lot of minor fixes and some overall improvements.

Sorry for the long delay -  we really need a release manager. :-)


Have fun!
-- 
Davide Alberani davide.alber...@gmail.com [GPG KeyID: 0x465BFD47]
http://www.mimante.net/



Re: [Imdbpy-help] TV series with unknown episode numbers

2010-06-27 Thread Davide Alberani
On Jun 27, Mike Castle dalg...@gmail.com wrote:

 I assume that the bit that processes XML or whatever would be the
 ideal place to have a counter to keep track of it?

Sure, or - even better - rethink the data structure: after all,
maybe we don't really need a dictionary of dictionaries to store
seasons/episodes.
The current choice was probably made to accommodate things like
an unknown season or episode 0, but maybe we don't need these
pieces of information and can move to a list of lists or a dictionary
of lists.

 But I was busy and didn't have time to figure out your entire
 code base.   :-

Feel free to poke around, clone/fork the repository, and ask if
you need help!

 For a personal fix, I ended up using just an integer
  [...]
 I've not yet looked at yours, though.

It does the same. :-)

 I suppose on of the problems with this is that there is no good
 way to guarantee an order.

That's why there are the sortedSeasons and sortedEpisodes
functions in the 'helpers' module (but they can't guarantee
to return the same order you've read on the web page).

Hmmm... I have to think more carefully about it, but right now
I'm inclined to use a dictionary of lists (keeping the 'season'
as a key in a dictionary - when possible an integer - and the list
of episodes as its value).

E.g., for movie['episodes']:
  {1: [Movie1.1, Movie1.2, ...],
   2: [Movie2.1, Movie2.2, ...],
   'unknown': [MovieX.1, MovieX.2, ...]}

Where MovieX.Y obviously are Movie instances, like now.
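
A small sketch of how such a structure could be built and walked in a
stable order (Python 3).  Strings stand in for the Movie instances, and
both helper functions are hypothetical, just to make the idea concrete:

```python
from collections import defaultdict


def group_episodes(episodes):
    """Group (season, episode) pairs into a dictionary of lists.

    A season that is not an integer ends up under the 'unknown' key;
    episodes keep the order in which they were seen.
    """
    seasons = defaultdict(list)
    for season, episode in episodes:
        key = season if isinstance(season, int) else 'unknown'
        seasons[key].append(episode)
    return dict(seasons)


def sorted_season_keys(seasons):
    """Integer seasons in ascending order, with 'unknown' last."""
    return sorted(seasons, key=lambda k: (1, 0) if k == 'unknown' else (0, k))


# Strings stand in for Movie instances; None marks an unknown season.
parsed = [(1, 'Pilot'), (2, 'Second season opener'),
          (1, 'Second episode'), (None, 'Unaired special')]
episodes = group_episodes(parsed)
for season in sorted_season_keys(episodes):
    print(season, episodes[season])
```

The order of the episodes inside each list is whatever the parser met
first, so it still can't promise to match the web page.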

What do you think?


Thanks for your help!
-- 
Davide Alberani davide.alber...@gmail.com [GPG KeyID: 0x465BFD47]
http://www.mimante.net/



Re: [Imdbpy-help] Better imdbID support for imdbpy2sql

2010-07-30 Thread Davide Alberani
On Wed, Jul 28, 2010 at 8:41 PM, Alexmipego alexmip...@gmail.com wrote:

 For the project I've in mind I really need
 to have as many imdbid values mapped as possible. During research, and
 checking the raw files myself, I found that many people ask for it but
 it's kinda impossible for imdbpy2sql to do better than it does at
 guessing ids.

More or less. :-)
The basic problem is that the imdbIDs are not distributed in
the plain text data files.

 My solution is based on the fact that searching imdb for the raw names
 (in the movies.list file) returns an exact match almost aways. That
 means, overtime, some applications will end up getting the true id of
 a movie but there is no way for imdbpy2sql/database to recover the
 original raw title.

I'm not sure I've understood your point.
What's the advantage of the MD5 sum, over the normal title?
I mean: if the title changes, also its MD5 will change and you will
not be able to find the imdbID.

 When changes in titles, new titles, etc... would occur it would simply
 fail gracefully and over time those new hash-imdbid codes could be
 made available.

Well, it may work and it's easy to implement, but it means that you
need a central repository for this hash table.
Since I (as IMDbPY) don't want to provide it, most of the users
will use none or create their own.
By the way, it's not clear to me when you want to ask the hash table
for an imdbID: when the imdbpy2sql.py script runs (but this will have
a heavy impact on performance, I fear) or when a single item (movie,
person, character or company) is requested.

 Let me know what you think. The changes to support a MD5 column are
 just 2-3 lines iirc and it shouldn't cause any problems to anyone, yet
 it would allow for this type of feature to be implemented even if
 outside the imdbpy code base

Yup - I see your point on this, and I'll take it into consideration.
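
To make the discussion concrete, a sketch of the kind of lookup being
proposed (Python 3).  The hash table, the utf-8 encoding and the function
names are assumptions for the example; nothing like this is part of
IMDbPY.  It also shows the point made above: when the raw title changes,
its MD5 changes too, and the lookup just returns None.

```python
import hashlib


def title_hash(raw_title):
    """MD5 hex digest of a raw title, as it would appear in movies.list."""
    return hashlib.md5(raw_title.encode('utf-8')).hexdigest()


# Hypothetical, externally maintained table: hash of the raw title -> imdbID.
known_ids = {title_hash('The Untouchables (1987)'): '0094226'}


def lookup_imdb_id(raw_title, table=known_ids):
    """Return the imdbID for a raw title, or None if it's not in the table."""
    return table.get(title_hash(raw_title))


print(lookup_imdb_id('The Untouchables (1987)'))
# A changed title has a different hash, so it fails gracefully:
print(lookup_imdb_id('The Untouchables (1987) (V)'))
```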


-- 
Davide Alberani d...@mimante.net  [PGP KeyID: 0x465BFD47]
http://www.mimante.net/



Re: [Imdbpy-help] Better imdbID support for imdbpy2sql

2010-07-30 Thread Davide Alberani
On Jul 30, Alexmipego alexmip...@gmail.com wrote:

 Once the MD5 column was present on the
 distribution everyone would be compatible with my solution. You ask
 about when and where this matching would be done, and the beauty of
 this (for you) is that it can be done completely outside imdbpy2sql.

I admit you've more or less persuaded me. ;-)
If I have time, this week or the next one, I'll look at what
can be done.

 Btw, I initially thought that I could simply assume the ID you assign
 to each title is sequential, so I could simply assume the first line
 in the CSV was ID 1 and the last would simply match. However, I found
 out that the final sql table has more rows than the raw file and that
 means something is either wrong or some extra processing is done.

That's due to the fact that not every file in the set of the
plain text data files is updated at the same time, and so there
can be inconsistencies and even errors here and there.
So, every time imdbpy2sql is asked to handle a movie title (or a
person or company name, for instance), it checks if it's already
known and creates it otherwise.
This means that imdbpy2sql will run nicely even if the movies.list.gz
file is removed.
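
The "check if it's known, create it otherwise" idea can be sketched like
this (Python 3).  It is only an illustration of the principle with a
hypothetical TitleCache class, not the code used by imdbpy2sql.py; it
shows why the final table can have more rows than movies.list.

```python
class TitleCache:
    """Assign a local id to each title the first time it is seen."""

    def __init__(self):
        self._ids = {}
        self._next_id = 1

    def get_or_create(self, title):
        """Return the id already assigned to title, creating one if needed."""
        if title not in self._ids:
            self._ids[title] = self._next_id
            self._next_id += 1
        return self._ids[title]

    def __len__(self):
        return len(self._ids)


cache = TitleCache()
# Titles as they might be met while reading the list of movies...
first = cache.get_or_create('The Matrix (1999)')
second = cache.get_or_create('Snatch (2000)')
# ...and later in another file, which also mentions a title missing above.
again = cache.get_or_create('The Matrix (1999)')
extra = cache.get_or_create('Title Only In Another File (2010)')
print(first, second, again, extra, len(cache))
```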


Bye,
-- 
Davide Alberani davide.alber...@gmail.com [GPG KeyID: 0x465BFD47]
http://www.mimante.net/



Re: [Imdbpy-help] IMDb redesign: call for help

2010-09-25 Thread Davide Alberani
On Sep 19, Davide Alberani davide.alber...@gmail.com wrote:

 So... is anyone out there willing to help and be in charge of
 one or more parsers?

I forgot to mention how I arranged the development of the new parsers: the
old account (automatically used by IMDbPY) was changed to use the old
set of web pages (mostly: the ones about people still need to be fixed), so
it can't be used to develop the new parsers.

I've then created a new fork of IMDbPY on bitbucket, which uses a new account
set to refer to the new web pages; this repository can be cloned from here:
  http://bitbucket.org/alberanid/imdbpy_parsers2010/

Once you have cloned this repository, you can install this version on your
system (or in a virtualenv) and modify it to fix the parsers.

You can test each page as you wish; there's also a more comprehensive (well,
more or less...) set of tests: http://bitbucket.org/alberanid/imdbpy-testsuite

Specifically in the http-mobile directory.
The steps:
- download from http://erlug.linux.it/~da/erlugtmp/imdbpy_p.tar.gz a
  more-or-less correct set of .p files (dumps of IMDbPY objects taken when
  the parsers were in a good state) and untar it in the http-mobile directory.
- fetch the new .html web pages with ./test_parser.py -f
- run the tests with ./test_parser.py -t 21 | less
- spot a problem (missing information or something like that), change the
  parsers and re-run the tests until the problem is fixed. :-)

In the 'standalone/' directory there is a separate test for each file (the
ones labeled *lxml* are faster than the *bsoup* ones).

Keep in mind that it's normal to see errors about things like changes
in the number of votes, or new cast/companies information; what really
matters is that the parser - from one run to the other - doesn't lose complete
sets of information (and that no crap ends up in the strings, movie titles and
so on).  If a key is completely missing, the test_parser.py script will report
it in the lists of keys that are only in the expected or in the received
information.
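
The kind of report described here boils down to a difference between two
sets of keys.  A minimal sketch in Python 3, with made-up data and a
hypothetical compare_keys function (this is not the code of
test_parser.py):

```python
def compare_keys(expected, received):
    """Return the keys found only in expected and only in received."""
    only_expected = sorted(set(expected) - set(received))
    only_received = sorted(set(received) - set(expected))
    return only_expected, only_received


# Made-up data: 'votes' differs (harmless), 'plot' got lost (a real problem).
expected = {'title': 'The Matrix', 'votes': 100, 'cast': ['Actor A'],
            'plot': ['Some plot.']}
received = {'title': 'The Matrix', 'votes': 105, 'cast': ['Actor A'],
            'genres': ['Action']}

missing, new = compare_keys(expected, received)
print('only in expected:', missing)
print('only in received:', new)
```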

If this was not clear enough, feel free to ask me anything!


-- 
Davide Alberani davide.alber...@gmail.com [GPG KeyID: 0x465BFD47]
http://www.mimante.net/



Re: [Imdbpy-help] [Imdbpy-devel] IMDb redesign: call for help

2010-09-27 Thread Davide Alberani
On Sun, Sep 26, 2010 at 4:31 PM, H. Turgut Uyar u...@itu.edu.tr wrote:


 I'll try to help. I have quite lot of work these days but I'll get to
 the parsers as soon as I can.

As usual, thank you! :-)

I hope to have time to check at least the main problems with people's pages
within this week.


-- 
Davide Alberani davide.alber...@gmail.com  [PGP KeyID: 0x465BFD47]
http://www.mimante.net/



Re: [Imdbpy-help] imdbpy 4.6 can't fetch person head shot (and who add IMDb to the names ?)

2010-09-29 Thread Davide Alberani
On Tue, Sep 28, 2010 at 10:50 PM, Israel Fruchter
israel.fruch...@gmail.com wrote:
 I fixed my two issues (Person name  headshot),

Great!  Thank you very much.

I'll have time to check the patch and commit it to Mercurial only
tomorrow.  By the way, anyone should feel free to fork the IMDbPY
repository on Bitbucket (specifically
http://bitbucket.org/alberanid/imdbpy_new_search_parsers/ ),
commit his changes and ask for a pull.

 BTW, I really like the parser, it's a good scraper,

Most of the credit should go to H. Turgut Uyar, who wrote that
wonderful DOM/XPath-based parser.

 tried building it myself, imdb page killed lxml, elementtree and
 BeautifulSoup,

Strange: we're based on lxml (falling back to BeautifulSoup if
lxml is not installed).
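
The general pattern of preferring one module and falling back to another
can be sketched as follows (Python 3).  This is not how IMDbPY itself
selects its parser: pick_module is a hypothetical helper, and 'bs4' is
assumed here as the import name of a current BeautifulSoup.

```python
import importlib.util


def pick_module(candidates):
    """Return the first name in candidates that can be imported, else None."""
    for name in candidates:
        # find_spec returns None when a top-level module is not installed.
        if importlib.util.find_spec(name) is not None:
            return name
    return None


# Preference order: lxml first, then BeautifulSoup.
chosen = pick_module(['lxml', 'bs4'])
print(chosen)
```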

 you think they are writing an ill-formed html deliberately to keep us
 scrapers away ??

Hmmm... I fear that, like every big portal, they have to cope with
so many browsers and environments that a nice and standard HTML
can't work. :-)


Thanks again!
-- 
Davide Alberani davide.alber...@gmail.com  [PGP KeyID: 0x465BFD47]
http://www.mimante.net/



Re: [Imdbpy-help] imdbpy 4.6 can't fetch person head shot (and who add IMDb to the names ?)

2010-09-29 Thread Davide Alberani
On Wed, Sep 29, 2010 at 12:45 PM, H. Turgut Uyar u...@itu.edu.tr wrote:

 Is it that fork or the one called imdbpy_parsers2010?

No, pardon: my mistake.

The correct project to fork is imdbpy_parsers2010 :
  http://bitbucket.org/alberanid/imdbpy_parsers2010


Thanks!

-- 
Davide Alberani davide.alber...@gmail.com  [PGP KeyID: 0x465BFD47]
http://www.mimante.net/



Re: [Imdbpy-help] Suppressing lxml warning

2010-11-01 Thread Davide Alberani
On Sun, Oct 31, 2010 at 9:15 PM, Dustin Wyatt dustin.wy...@gmail.com wrote:
 What's the best way to suppress the warning about not having lxml?

Probably the best way is to set the 'useModule' parameter when initializing
the IMDb instance.
Something like:
  from imdb import IMDb
  ia = IMDb('http', useModule='beautifulsoup')

It can also be set in the imdbpy.cfg configuration file (it should be copied
in your home directory as a hidden file, or in /etc).

Anyway, it would be a good idea to have a way to turn off all the
warnings from the code/configuration options.


HTH,
-- 
Davide Alberani davide.alber...@gmail.com  [PGP KeyID: 0x465BFD47]
http://www.mimante.net/



[Imdbpy-help] IMDb redesign: IMDbPY status

2010-11-13 Thread Davide Alberani
Hi all,
I've just pulled into the 'imdbpy' Mercurial repository some changes made in
the 'imdbpy_parsers2010' repository.

Some things will work now, others are still to be done.
Persons' biographies work, and so does most of the other information.
The 'httpThin' parser is broken, and I have no plan to fix it (it was never
really useful: let's consider it gone).
I'm not too sure about the parsers for Characters and Companies, but
they seem to be ok.

The development will continue in the 'imdbpy_parsers2010' repository: I plan
to work on the 'mobile' parsers.

As usual, any help is welcome.

-- 
Davide Alberani davide.alber...@gmail.com  [PGP KeyID: 0x465BFD47]
http://www.mimante.net/



Re: [Imdbpy-help] Mysql foreign keys not created using innodb

2010-11-22 Thread Davide Alberani
On Thu, Oct 28, 2010 at 5:00 PM, laurent l.ver...@libertysurf.fr wrote:
  Hi everybody,

Hi!
I'm very sorry for the time it took to approve your mail: I didn't notice
it in the list of pending messages.

  I installed imbdpy and downloaded the imdb files to install a mysql
 database of it. I ran
   ./imdbpy2sql.py -d /mnt/win_h/video/imdb_data/ --mysql-innodb -u
 'mysql://user:passw...@localhost/base'  and everything went fine. It
 took about 5 hours to complete. At the end, i got the innodb tables
 created but the foreign keys weren't created. Here is the end of the logs:

  [...]
 # TIME createForeignKeys() : 49min, 26sec (wall) 0min, 0sec (user) 0min,
 0sec (system)
 EXECUTING END:FOR_EVERY_TABLE:ALTER TABLE %(table)s ENGINE=InnoDB;...
 EXECUTING ALTER TABLE aka_name ENGINE=InnoDB;... DONE!
 # TIME END command : 0min, 35sec (wall) 0min, 0sec (user) 0min, 0sec
 (system)

Well, that's odd: it seems that the script _did_ something.

   I don't know if i did something wrong but i suspect that creating the
 foreign keys before having tables in innodb format is the problem
 because in myisam, we can't have foreign keys.

Well, I assume you're right: probably the previous step just created
some normal indexes, instead of foreign keys in the MyISAM tables.

 Right now i'm generating the foreign keys 1 by 1 so i have some time
 to report the problem :)

Wooops. :-)
To tell the truth foreign keys are not required, for IMDbPY, so you can
live without them.

Anyway, I've modified the imdbpy2sql.py script in the Mercurial repository [1]
so that it would modify the tables (from MyISAM to InnoDB) _before_ the
creation of the indexes and foreign keys.
I've not tested it, but it should work.
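
The intended order can be sketched like this (Python 3): first the engine
switch for every table, then the foreign keys.  It only illustrates the
ordering; the function and the foreign key in the example are
hypothetical, not taken from imdbpy2sql.py.

```python
def conversion_statements(tables, foreign_keys):
    """Return SQL statements with the engine switch before any foreign key.

    tables: names of the tables to convert to InnoDB.
    foreign_keys: (table, column, referenced_table) triples.
    """
    statements = ['ALTER TABLE %s ENGINE=InnoDB;' % table for table in tables]
    for table, column, referenced in foreign_keys:
        statements.append(
            'ALTER TABLE %s ADD FOREIGN KEY (%s) REFERENCES %s (id);'
            % (table, column, referenced))
    return statements


# Hypothetical example: a single foreign key from cast_info to title.
plan = conversion_statements(['title', 'cast_info'],
                             [('cast_info', 'movie_id', 'title')])
for statement in plan:
    print(statement)
```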

PS: thank you very much - I've added your name to the credits.

+++
[1] http://bitbucket.org/alberanid/imdbpy/
-- 
Davide Alberani davide.alber...@gmail.com  [PGP KeyID: 0x465BFD47]
http://www.mimante.net/



Re: [Imdbpy-help] Fwd: IMDB Analysis

2010-11-28 Thread Davide Alberani
On Sun, Nov 28, 2010 at 2:50 AM, Derek Ditch derek.ditch...@gmail.com wrote:

 I'm working on a project that analyzes graph structures using a modified
 version of PageRank for a sample data set, I'm considering IMDB, using
 imdbpy,

Hi!
Your project sounds extremely cool. :-)

 So, it doesn't look like imdbpy has the ability built-in to iterate through
 all movies, or movies of a specific genre, nor of actors. So I suppose I
 will create a new method of the Movie class and implementation within the
 sql parser to return all results.

You're right: by itself IMDbPY doesn't have the ability to iterate over a
large subset of the IMDb database, and I don't think it's a feature we
should integrate too much; keep in mind that as a principle IMDbPY tries
to be transparent regarding its access to the information: such a feature
would be too specific of the SQL database, and impossible - or at least
legally dubious - to implement for the HTTP access.

Obviously a more or less separated package/framework to work on bunches
of items extracted from the SQL database would be more than welcome and,
to tell the truth, a way to express more complex searches on the SQL database
could be a really nice and useful feature. :-)

Basically, it works this way: each item which must be uniquely identified
(Person, Movie, Character and Company instances) uses the 'id' primary key
column of its database table as ID (as you may have noticed, the ID used for a
movie in the SQL database is _not_ the imdbID used by IMDb on its web site,
since the latter is not included in the plain text data files).

So the best approach would be to access the SQL database in the same way
IMDbPY does: since we're amazingly cool ;-) we didn't settle on one ORM, but
we transparently support both SQLObject (we use its semantics, in our code)
and SQLAlchemy.
Their interface is abstracted in the dbschema.py, alchemyadapter.py and
objectadapter.py modules (beware: there's a certain amount of black magic involved :-)

To use them, the process is somewhat manual, and could probably be more
automated; to import what you need, see the __init__ method of the
IMDbSqlAccessSystem class in the sql/__init__.py file (very similar code can
be found in the imdbpy2sql.py script, around line 277).
After that, as said, you can access the database using the created objects
(the ones returned by the getDBTables function) using the SQLObject syntax.
With these objects, you can create complex queries on the database; once
you have the list of IDs you're interested in (or a generator of IDs), you can
use a normal imdb.IMDb('sql', ...) instance to access all the information you
need using IMDbPY.

It goes without saying that it's possible that the information that you need
is somewhat limited, and such a solution could be too much: maybe you
can give up using the ORM abstraction and even IMDbPY, working directly
on the database.
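
As an idea of what "working directly on the database" could look like for
a graph of actors, here is a sketch in Python 3 with the standard sqlite3
module.  The in-memory cast_info table is a toy with only the two columns
needed and made-up rows; the costar_edges function is hypothetical.

```python
import sqlite3

# Toy stand-in for the cast_info table: only the columns used below.
conn = sqlite3.connect(':memory:')
conn.execute('CREATE TABLE cast_info '
             '(id INTEGER PRIMARY KEY, person_id INTEGER, movie_id INTEGER)')
conn.executemany(
    'INSERT INTO cast_info (person_id, movie_id) VALUES (?, ?)',
    [(10, 1), (11, 1), (12, 1), (10, 2), (12, 2), (13, 3)])


def costar_edges(connection):
    """Yield (person_a, person_b, shared_movies) with person_a < person_b."""
    query = '''
        SELECT a.person_id, b.person_id, COUNT(DISTINCT a.movie_id)
        FROM cast_info AS a
        JOIN cast_info AS b
          ON a.movie_id = b.movie_id AND a.person_id < b.person_id
        GROUP BY a.person_id, b.person_id
        ORDER BY a.person_id, b.person_id'''
    for row in connection.execute(query):
        yield row


edges = list(costar_edges(conn))
print(edges)
# The person ids are valid for the local database only; they can be passed
# to an IMDb('sql', ...) instance to fetch names and other details.
```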

Let me know how you decide to proceed, and if you need any help - as ideas
or clarification on the internals of IMDbPY: unfortunately right now I don't
have any time to write code :-/

-- 
Davide Alberani davide.alber...@gmail.com  [PGP KeyID: 0x465BFD47]
http://www.mimante.net/



Re: [Imdbpy-help] Fwd: IMDB Analysis

2010-12-02 Thread Davide Alberani
On Thu, Dec 2, 2010 at 7:15 PM, Derek Ditch derek.ditch...@gmail.com wrote:
 So, I've been able to begin building a graph reflecting relationships among
 actors, but I've reached the point where it's huge iterations, so I'm
 parallelizing it (using pp). The issue I'm having is that since I'm running
 queries manually, I have to use sqlobject objects directly (i.e. CastInfo,
 MovieInfoIdx, etc). I can do this in IPython by using from imdb.parser.sql
 import *, and then running my script (even though my script imports the
 same).
 When I run the script standalone, I get errors telling me that CastInfo
 isn't defined. I see that these are exported into globals in the SQL imdb
 access class __init__, so, how can I ensure I'm importing these correctly?

Those instances/classes are not created until you run the getDBTables
function of the proper module (objectadapter or alchemyadapter).
The easiest way to do it:
import imdb
ia = imdb.IMDb('sql', 'sqlite:///tmp/imdb.db')
print imdb.parser.sql.CastInfo

As you can see, after the imdb.IMDb object is instantiated, you can access
the CastInfo (and other) classes in the imdb.parser.sql module.

It goes without saying that you have to replace 'sqlite:///tmp/imdb.db' with
the correct credentials to access your database. :-)


HTH,
-- 
Davide Alberani davide.alber...@gmail.com  [PGP KeyID: 0x465BFD47]
http://www.mimante.net/



[Imdbpy-help] IMDbPY 4.7 released

2011-01-23 Thread Davide Alberani
Released after a long delay, and despite that in a hurry, IMDbPY 4.7 can
be found here:
  http://imdbpy.sf.net/

This is a transitional release, after the recent redesign of the IMDb pages.
A new account is used and the new pages are parsed; for sure there are
still many bugs; please read the README.redesign file for other details.

Please contribute to the development with fixes and bug reports.

Enjoy!
-- 
Davide Alberani davide.alber...@gmail.com  [PGP KeyID: 0x465BFD47]
http://www.mimante.net/



Re: [Imdbpy-help] Proxy issue

2011-01-24 Thread Davide Alberani
On Mon, Jan 24, 2011 at 7:15 PM, David Kaufman david.kauf...@gmx.de wrote:

  File G:\My Dropbox\Entwicklung\diribeo\imdb\parser\http\__init__.py,
 line 237, in http_error_default
    'proxy': self.get_proxy()}
 IMDbDataAccessError: {'url':
 'http://akas.imdb.com/title/tt0898266/epdate', 'error type':
 'http_error_default', 'errcode': 500, 'headers': httplib.HTTPMessage
 instance at 0x03838FD0, 'proxy': '', 'errmsg': 'Internal Server
 Error'}

 This actually already should have been fixed:
 http://www.mail-archive.com/imdbpy-help@lists.sourceforge.net/msg00204.html

Hi!
From a quick check (I'm looking at a tcpdump), my headers seem to be
correct.

Are you using it in a strange environment of some kind, which could
rewrite the headers adding its own User-agent?
Which version of Python are you using, and what's the operating system?
If you can, try to look at what goes over the network, with
wireshark or a similar tool.


Bye,
-- 
Davide Alberani davide.alber...@gmail.com  [PGP KeyID: 0x465BFD47]
http://www.mimante.net/



Re: [Imdbpy-help] [Imdbpy-devel] IMDbPY 4.7 released

2011-01-26 Thread Davide Alberani
On Wed, Jan 26, 2011 at 8:56 AM, H. Turgut Uyar u...@itu.edu.tr wrote:
 On 01/23/2011 05:12 PM, Davide Alberani wrote:
 Please contribute to the development with fixes and bug reports.

 Will development continue on the main fork or on the imdbpy_parsers2010
 fork?

Good question. :-)
If there's no reason to do otherwise, I'd prefer to move future
development to the main repository (removing imdbpy_parsers2010).


-- 
Davide Alberani davide.alber...@gmail.com  [PGP KeyID: 0x465BFD47]
http://www.mimante.net/



Re: [Imdbpy-help] Filmografie

2011-02-07 Thread Davide Alberani
On Mon, Feb 7, 2011 at 1:47 AM, Sereno Polveroso
serenopolver...@hotmail.com wrote:
 Thanks, now it's ok!

Good!

 I have only one more question: I've seen that IMDb immediately shows you the
 films' Italian AKAs; for example, if you search for Tarantino, the filmography
 shows Bastardi senza gloria instead of the English title. Is there a
 way to obtain this in the 'director' field, for example in the program you
 sent me? Or do I always have to update the film, look up the 'akas' key,
 and then search for the Italian AKA?

I fear there's no easy way.  Some time ago it was possible,
setting IMDbPY to point to the italian.imdb.com site, but now
that site is an alias for imdb.it, which has slightly different web pages.
You can do some tests changing the imdbURL_base variable
in the imdb/__init__.py file, but expect some problems.

Once you have all the information about a movie (after the 'update'),
you can use the guessLanguage method of a Movie instance, to
get its language.
With that, one can more or less easily write a function to extract the
AKA in the original language.  To tell the truth we can also check for
some special notes in the AKA list (e.g.: original title or something
like that).
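
The idea above could be sketched roughly as follows. This is only a
sketch, not part of IMDbPY: the helper name pick_aka is hypothetical, and
it assumes that each AKA is a plain string of the form 'title::note',
where the note mentions a country or a marker such as 'original title'.
Check what movie['akas'] really contains in your version before relying
on that format.

```python
def pick_aka(akas, marker):
    """Return the first AKA title whose note contains `marker`, or None.

    Assumption: each entry looks like 'title::note'; entries without
    the '::' separator carry no note and are skipped.
    """
    wanted = marker.lower()
    for entry in akas:
        title, sep, note = entry.partition('::')
        if sep and wanted in note.lower():
            return title.strip()
    return None


# Hypothetical sample data, only to show the call.
sample_akas = [
    'Bastardi senza gloria::Italy',
    'Malditos bastardos::Spain',
]
italian_title = pick_aka(sample_akas, 'italy')
```

The same function can be called with 'original title' as the marker, if
the notes in the AKA list turn out to carry such a label.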

Hmmm... this is a feature we can take into consideration for a future
release. :-)

-- 
Davide Alberani davide.alber...@gmail.com  [PGP KeyID: 0x465BFD47]
http://www.mimante.net/



Re: [Imdbpy-help] IMDbPY User Support

2011-02-20 Thread Davide Alberani
On Sun, Feb 20, 2011 at 16:46, Te Kim tk...@ucsc.edu wrote:
 My group is using a SQL database to make these calls. Is this a bug, or
 could there possibly be something wrong with our database?

What you're trying to do should work.
Are you sure you have downloaded and processed all the
available plain text data files?
Please download each available file (and be sure that they're
not corrupted) and - if possible - keep track of the output of
the imdbpy2sql.py script.
In a Unix environment, use something like:
  imdbpy2sql.py -d /path/to/imdb_files/ -u db::/connection_params 2>&1 | tee debug.txt

Notice that you can speed up the process somewhat by creating temporary
CSV files (see the README.sqldb for complete information - essentially
you need to add a '-c /path/to/a/tmp/directory/' argument to the command line).

 In addition, I am unable to find ratings or votes in the database.
 The query:
 select * from  movie_info mi join info_type it on mi.info_type_id
 = it.id where it.info = 'runtimes' group by mi.info;
 yields results, as well as in the case where it.info = 'genres'.
 However, if I change it.info to 'votes', 'cast', or 'rating', I get 0 rows.

Well, 'cast' information is not there: it is in the cast_info table.
Votes and rating should be present.
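
To narrow down where a given kind of information ended up, it may help
to count the rows per info type in each table. The sketch below uses an
in-memory SQLite database with a toy schema: movie_info and info_type
mirror the query quoted above, while the movie_info_idx table (suggested
by the MovieInfoIdx class mentioned elsewhere on this list) and all the
sample rows are assumptions made for illustration. Whether ratings and
votes live in that table in your version is something to verify, not a
given.

```python
import sqlite3

conn = sqlite3.connect(':memory:')
conn.executescript("""
    CREATE TABLE info_type (id INTEGER PRIMARY KEY, info TEXT);
    CREATE TABLE movie_info (movie_id INTEGER, info_type_id INTEGER, info TEXT);
    CREATE TABLE movie_info_idx (movie_id INTEGER, info_type_id INTEGER, info TEXT);
    INSERT INTO info_type VALUES (1, 'runtimes'), (2, 'genres'), (3, 'rating');
    INSERT INTO movie_info VALUES (10, 1, '120'), (10, 2, 'Drama');
    INSERT INTO movie_info_idx VALUES (10, 3, '7.5');
""")


def count_rows(conn, table, info):
    """Count rows of `table` whose info type is named `info`."""
    # `table` is checked against a fixed whitelist, never taken from user input.
    if table not in ('movie_info', 'movie_info_idx'):
        raise ValueError('unexpected table: %s' % table)
    query = ('SELECT COUNT(*) FROM %s t JOIN info_type it '
             'ON t.info_type_id = it.id WHERE it.info = ?' % table)
    return conn.execute(query, (info,)).fetchone()[0]


counts = {
    (table, info): count_rows(conn, table, info)
    for table in ('movie_info', 'movie_info_idx')
    for info in ('runtimes', 'genres', 'rating')
}
```

Running the same two counts against the real database shows quickly
whether a type is missing altogether or is simply stored in a different
table.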

 What could be the reason for the missing information?

Hard to tell: either you don't have some of the information, or something
went wrong while importing it.
Unfortunately I'll be unable to do deeper tests on the database
access system for another day or two, so in the meanwhile try to
collect as much information as possible to spot the problem.


HTH,
-- 
Davide Alberani davide.alber...@gmail.com  [PGP KeyID: 0x465BFD47]
http://www.mimante.net/



Re: [Imdbpy-help] IMDb IDs aren't really necessary for names and titles

2011-03-30 Thread Davide Alberani
On Tue, Mar 29, 2011 at 07:13, Michael Grier mr.michael.gr...@gmail.com wrote:
 I found this out by using the moviedb-3.24 programs.

 If you url encode the entire original movie line from movies.list...
 for example, on an episode:

Yes, it's the old search system and more or less what used to be the Exact
Primary search feature of IMDb.
Until some years ago, using a special key in the query, it was possible to do
a strict search for a match (getting back the searched page if and only
if an exact match existed, or an error page otherwise, if I recall correctly).
That key is gone now; at the time, IMDbPY used it to convert
titles/names to IDs.

What's wrong with your query is that it doesn't work for titles or names
containing what we call an imdbIndex (the Roman numeral after the name/title,
used to distinguish between homonyms).
E.g., try to search for Gibson Mel (I):
  http://www.imdb.com/Name?Gibson+Mel+%28I%29

It doesn't show you the page of Mel Columcille Gerard Gibson, but just
a list of people named Mel Gibson or something similar.
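
For reference, the encoded query string in the URL above is what the
standard library produces for that name. A minimal sketch in Python 3
syntax; the helper name name_search_url is hypothetical, and nothing
here implies that the old endpoint still answers this way.

```python
from urllib.parse import quote_plus

BASE_URL = 'http://www.imdb.com/Name?'  # taken from the example above


def name_search_url(name):
    """Build the old-style name lookup URL for `name`."""
    # quote_plus turns spaces into '+' and parentheses into %28 / %29.
    return BASE_URL + quote_plus(name)


url = name_search_url('Gibson Mel (I)')
```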

Now, the message of Petite Abeille seems to contradict me somehow,
so I suppose I have to do some tests. :-)


Bye!

-- 
Davide Alberani davide.alber...@gmail.com  [PGP KeyID: 0x465BFD47]
http://www.mimante.net/



Re: [Imdbpy-help] imdbpy2sql 4.7 - invalid byte sequence for encoding UTF8

2011-04-11 Thread Davide Alberani
On Mon, Apr 11, 2011 at 18:35, darklow dark...@gmail.com wrote:

   File ./imdbpy2sql.py, line 1194, in _toDB
     CURS.executemany(self.sqlstr, self.converter(l))
 psycopg2.DataError: invalid byte sequence for encoding UTF8: 0xc320
 HINT:  This error can also happen if the byte sequence does not match the
 encoding expected by the server, which is controlled by client_encoding.

 Any suggestions? I found similar topic, but there were also no solutions.

Yes, I've had other reports about this bug.
Seems to be related to some garbage in the actors.list.gz file.
I hope to have time to investigate the problem within a week or two.

Thanks for the bug report!

-- 
Davide Alberani davide.alber...@gmail.com  [PGP KeyID: 0x465BFD47]
http://www.mimante.net/



Re: [Imdbpy-help] imdbpy2sql 4.7 - invalid byte sequence for encoding UTF8

2011-04-13 Thread Davide Alberani
On Mon, Apr 11, 2011 at 18:35, darklow dark...@gmail.com wrote:

   File ./imdbpy2sql.py, line 1194, in _toDB
     CURS.executemany(self.sqlstr, self.converter(l))
 psycopg2.DataError: invalid byte sequence for encoding UTF8: 0xc320
 HINT:  This error can also happen if the byte sequence does not match the
 encoding expected by the server, which is controlled by client_encoding.

Hi all,
I'm writing regarding the recent 0xc320 problem with IMDbPY.
The above notice is extremely interesting, and should be investigated:
how can it be that 0xc320 is not UTF8 encodable?
It should work; from the Python prompt:
  >>> unichr(0xc320).encode('utf8')
  '\xec\x8c\xa0'
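
One possible reading of the error, not confirmed at this point in the
thread: the 0xc320 in the PostgreSQL message may be a pair of raw bytes
(0xC3 followed by 0x20) rather than a code point. In UTF-8, 0xC3 opens a
two-byte sequence and must be followed by a continuation byte in the
range 0x80-0xBF, so 0xC3 followed by a space is not valid. The sketch
below uses Python 3 syntax (chr and bytes literals instead of the unichr
call above).

```python
# The code point U+C320 encodes fine, as shown above (Python 3 spelling).
encoded = chr(0xC320).encode('utf-8')

# The two raw bytes 0xC3 0x20, however, are not valid UTF-8.
raw = b'\xc3\x20'
try:
    raw.decode('utf-8')
    raw_is_valid = True
except UnicodeDecodeError:
    raw_is_valid = False
```

If that reading is right, replacing '\xec\x8c\xa0' in the data would not
match anything, because those three bytes never occur in it.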

Anyway, as a very fast and dirty fix (the main problem is probably some
crap in the data files), try this: after line 1181 of imdbpy2sql.py, add:
  k = k.replace('\xec\x8c\xa0', '')

So that the nearby lines will become:
    try:
        k = k.replace('\xec\x8c\xa0', '')
        t = analyze_name(k)
    except IMDbParserError:

Please be aware that this fix was not tested at all, but I'm
almost sure that, at the above point, 'k' is a string encoded in utf8.

Anyway, besides the garbage theory, I have another idea
about the source of the error, but I have to verify it later...

Bye, and let me know if it works!

-- 
Davide Alberani davide.alber...@gmail.com  [PGP KeyID: 0x465BFD47]
http://www.mimante.net/



Re: [Imdbpy-help] imdbpy2sql 4.7 - invalid byte sequence for encoding UTF8

2011-04-16 Thread Davide Alberani
On Wed, Apr 13, 2011 at 08:46, darklow dark...@gmail.com wrote:
 Maybe someone knows a quick and dirty fix, at least to skip such invalid
 byte sequence strings while there is no official fix, so I can finish the
 import?
 Can we detect invalid byte characters?

Hi again,
actually my problem is that I'm unable to reproduce this bug. :-)
Using Postgresql and SQLObject, my run goes smoothly.

I have downloaded the 'actors.list.gz' file today, so it's possible that some
garbage was removed.

Anyway, the previously proposed solution was obviously flawed, since
the problem was on _character_ names.

So, let's edit again the imdbpy2sql.py file and change the lines around 1540
so that they become:

    movieid = CACHE_MID.addUnique(title)
    if role is not None:
        roles = filter(None, [x.strip() for x in role.split('/')])
        for role in roles:
            role = role.replace('\xec\x8c\xa0', '')  # TEMPORARY FIX
            cid = CACHE_CID.addUnique(role)
            sqldata.add((pid, movieid, cid, note, order))

Maybe this will help... who knows? :-)

-- 
Davide Alberani davide.alber...@gmail.com  [PGP KeyID: 0x465BFD47]
http://www.mimante.net/



Re: [Imdbpy-help] imdbpy2sql 4.7 - invalid byte sequence for encoding UTF8

2011-04-17 Thread Davide Alberani
On Sun, Apr 17, 2011 at 14:04, darklow dark...@gmail.com wrote:
 Updated this morning to latest data files, no change and unfortunately this
 fix also doesn't work.

Hmm...  debugging a problem like this without being able to reproduce it
is extremely difficult. :-/

 This error started when we uninstalled imdbpy (leaving all the dependency libs)
 and started running it without installation. Maybe there is some kind of problem
 with hidden unicode dependencies? Maybe you can try to run it
 without installation, just from source?

Do you have a very good reason to do so? :-)
Can't you try to purge every reference to IMDbPY left on the
system (search for the scripts in /usr/bin/ and /usr/local/bin/ and
be sure that import imdb fails, at the python prompt) and see
if the problem is solved, after IMDbPY 4.7 is reinstalled?

If you have problems locating the IMDbPY package, just open
the Python prompt and:
  >>> import imdb
  >>> print imdb
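
The same check can be made without importing the package at all. A
minimal sketch in Python 3 syntax, using only the standard library; the
helper name locate is hypothetical, and 'json' merely stands in for
'imdb' so that the example runs anywhere.

```python
import importlib.util


def locate(module_name):
    """Return the file a top-level module would be loaded from, or None."""
    spec = importlib.util.find_spec(module_name)
    if spec is None:
        return None
    return spec.origin


json_path = locate('json')          # a stdlib package, used as a stand-in
missing = locate('surely_not_an_installed_module_xyz')
```

A result of None for 'imdb' would mean that no leftover copy is visible
on the import path.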

 Also every time i start the script i receive two warnings:
 2011-04-17 11:13:37,398 WARNING [imdbpy.parser.sql.aux]
 /data/web/imdb/imdbpy4.7-159671/imdb/parser/sql/__init__.py:125: Unable to
 import the cutils.ratcliff function.  Searching names and titles using the
 sql data access system will be slower.

This will force IMDbPY to use some pure-python fall-back functions.
It's entirely possible that there is some bug in these functions, even
though a run without cutils.so works fine for me (so far).

 IMPORTING psyco... FAILED (not a big deal, everything is alright...)

That's not a problem for sure.

Right now, my first guess is that somewhere, after the *.list files are
read and turned into utf-8 encoded strings, the imdbpy2sql.py
script does Something Very Wrong(tm) to a string (like cutting it at a certain
place, ending up cutting a single utf-8 encoded char in two: this could
explain the error).
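
To illustrate the guess above: slicing a UTF-8 byte string at a fixed
length can leave a lone lead byte such as 0xC3 at the end, and appending
a space to it gives exactly the bytes 0xC3 0x20. The sketch below is in
Python 3 syntax; both helper names are hypothetical and are not taken
from imdbpy2sql.py.

```python
def truncate_bytes(data, limit):
    """Naive truncation: may cut a multi-byte UTF-8 character in two."""
    return data[:limit]


def truncate_utf8(data, limit):
    """Truncate UTF-8 bytes without leaving a partial character at the end."""
    # errors='ignore' silently drops the incomplete trailing sequence
    # (and, as a side effect, any other undecodable byte).
    return data[:limit].decode('utf-8', errors='ignore').encode('utf-8')


name = 'G\u00e9rard'.encode('utf-8')   # b'G\xc3\xa9rard', 7 bytes
broken = truncate_bytes(name, 2)       # cut in the middle of the e-acute
safe = truncate_utf8(name, 2)
```

Truncating the text before encoding it, rather than the bytes after,
avoids the problem altogether.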

I've tried the conversion suggested by Petite Abeille, and it works fine.

Please, could you cut a small piece (a few kilobytes) of the actors.list file,
and attach it (no cut-and-paste)?
It goes without saying that you should choose a portion where you see
(or guess there are) the strange chars :-)

Thanks!

-- 
Davide Alberani davide.alber...@gmail.com  [PGP KeyID: 0x465BFD47]
http://www.mimante.net/



Re: [Imdbpy-help] imdbpy2sql 4.7 - invalid byte sequence for encoding UTF8

2011-04-18 Thread Davide Alberani
On Mon, Apr 18, 2011 at 08:53, darklow dark...@gmail.com wrote:

 We have Debian linux on our server and our sysadmin allows only stable
 packs. However latest version of imdbpy has these md5 checksum that are
 quite important in our situation, that is why i have to run it from source.

Ehhh... what about a virtual machine or - even easier - virtualenv [0]?

Thanks for the file, I hope to look at it within a day or two.


+++
[0] http://pypi.python.org/pypi/virtualenv
-- 
Davide Alberani davide.alber...@gmail.com  [PGP KeyID: 0x465BFD47]
http://www.mimante.net/



Re: [Imdbpy-help] imdbpy2sql 4.7 - invalid byte sequence for encoding UTF8

2011-04-19 Thread Davide Alberani
On Mon, Apr 18, 2011 at 09:30, Davide Alberani
davide.alber...@gmail.com wrote:

 Thanks for the file, I hope to look at it within a day or two.

Ok: the file is correctly encoded in iso8859-1, as expected, and contains
no garbage.
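
For readers who want to repeat a check of this kind, the conversion from
iso8859-1 to utf-8 looks like this in Python 3 syntax. It is a sketch:
the helper name and the sample line are hypothetical and are not taken
from the attached file.

```python
def latin1_to_utf8(raw):
    """Re-encode ISO-8859-1 bytes as UTF-8 bytes."""
    return raw.decode('iso-8859-1').encode('utf-8')


raw_line = b'Doe, John\tCaf\xe9 (2011)'   # hypothetical sample line
utf8_line = latin1_to_utf8(raw_line)
```

Note that every possible byte value is a valid iso8859-1 character, so
this decoding step never fails; it cannot, by itself, reveal garbage in
a file.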

Using it as the only input for imdbpy2sql.py (putting the attached file in
a directory by itself), I can run the script with no errors (besides
the expected warnings about missing files).

I'm using the version from the Mercurial repository, without the cutils.so
library.

Please, if you can't install IMDbPY in your system, consider the use
of virtualenv.
Having tried that, I'd recommend double-checking the
settings of your Postgresql server for any inconsistencies
in encodings and collations.

HTH,
-- 
Davide Alberani davide.alber...@gmail.com  [PGP KeyID: 0x465BFD47]
http://www.mimante.net/


actors.list.gz
Description: GNU Zip compressed data


Re: [Imdbpy-help] imdbpy2sql 4.7 - invalid byte sequence for encoding UTF8

2011-04-23 Thread Davide Alberani
On Wed, Apr 20, 2011 at 14:08, darklow dark...@gmail.com wrote:
 Still no luck :/ maybe the problem is in some environmental variables or
 settings, which on installed version are present, but running from source
 are missing or incorrect?

Seems unlikely to me.

 What about this, i printed out some variables:
 print sys.stdout.encoding - UTF-8
 print sys.stdin.encoding   - UTF-8
 print sys.getdefaultencoding(); - ascii
 Is it ok that  sys.getdefaultencoding(); == ascii ?

These are fine.

I've reproduced - to the best of my ability - your environment:
- no IMDbPY installed in the system.
- IMDbPY from source (the latest version in the Mercurial repository),
  setting the PYTHONPATH environment variable to point to the
  source directory.
- the cutils C module was not compiled.
- the latest actors.list.gz file.
- postgres 8.4; my database was created with these settings:
  CREATE DATABASE imdb
    WITH OWNER = postgres
         ENCODING = 'UTF8'
         TABLESPACE = pg_default
         LC_COLLATE = 'it_IT.utf8'
         LC_CTYPE = 'it_IT.utf8'
         CONNECTION LIMIT = -1;

I've run it with your and other portions of the actors.list.gz file, and
everything went fine.

Now... if I were you, I'd:
- create a virtualenv environment with:
    virtualenv --no-site-packages
- install IMDbPY in it, using easy_install or pip (the executable in
  your virtualenv, I mean) so that you'll have all the correct dependencies
  available.
- run imdbpy2sql.py within your virtualenv.

If it still fails:
- check your postgres settings.
- try using SQLite (just for a test) - see notes in README.sqldb


HTH,
-- 
Davide Alberani davide.alber...@gmail.com  [PGP KeyID: 0x465BFD47]
http://www.mimante.net/



Re: [Imdbpy-help] imdbpy2sql 4.7 - invalid byte sequence for encoding UTF8

2011-04-24 Thread Davide Alberani
On Sun, Apr 24, 2011 at 20:03, Thomas Stewart tho...@stewarts.org.uk wrote:

 I've just had a try using sqlite with fresh lists and on my Debian
 system and I get this:

 thomas@ikaite:~$ /tmp/imdbpy2sql.py -d /home/thomas/Desktop/imdb/lists -u 
 sqlite:///home/thomas/Desktop/imdb/imdb.db --sqlite-transactions
 IMPORTING psyco... DONE!
  [...]
    CURS.executemany(self.sqlstr, self.converter(dataList))
 pysqlite2.dbapi2.ProgrammingError: You must not use 8-bit bytestrings unless 
 you use a text_factory that can interpret 8-bit bytestrings (like 
 text_factory = str). It is highly recommended that you instead just switch 
 your application to Unicode strings.

This specific bug (a bad interaction between SQLObject and SQLite) should
be fixed in the version in the Mercurial repository, shouldn't it?


-- 
Davide Alberani davide.alber...@gmail.com  [PGP KeyID: 0x465BFD47]
http://www.mimante.net/



Re: [Imdbpy-help] imdbpy2sql 4.7 - invalid byte sequence for encoding UTF8

2011-04-24 Thread Davide Alberani
On Sun, Apr 24, 2011 at 21:03, darklow dark...@gmail.com wrote:

 I tried reinstalling all the installed dependencies and running from clean
 sources, but no luck.
 I tried to run the scripts with SQLAlchemy instead of SQLObject, but got the
 same error, so the problem is not there.

Perfect - these tests are really important to spot the problem.

 Every test takes about 1h, because error takes place in Actors Cast list.

Wait: I'll read the rest of your mails tomorrow, but this can help you
do things faster: you don't need the other files at all.
Simply put the actors.list.gz file in a directory by itself, and run
imdbpy2sql.py with this directory as the -d argument.
You can even use a shorter version of actors.list.gz; just remember to leave
the lines at the beginning and at the end (various separators are used to identify
where the data begins), like I did with the actors.list.gz file that I attached
some days ago.

In the 'docs/goodies' directory you'll find the 'reduce.sh' script, which
takes a whole directory of *.list.gz files and reduces them to 1% of
their length.
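
For a single file, the manual shortening described above might look like
the following. This is a hypothetical helper in Python 3 syntax, not a
port of reduce.sh: how many header and footer lines must be preserved is
a guess, so the three defaults are placeholders to adjust after looking
at the file.

```python
import gzip


def reduce_list_file(src, dst, head=50, body=1000, tail=50):
    """Write a shorter copy of a gzipped list file and return its line count.

    Keeps the first `head` lines (header and separators), the `body`
    lines that follow them, and the last `tail` lines (footer).
    """
    with gzip.open(src, 'rb') as fin:
        lines = fin.readlines()          # loads the whole file in memory
    total = len(lines)
    if total <= head + body + tail:
        kept = lines
    else:
        kept = lines[:head + body] + lines[total - tail:]
    with gzip.open(dst, 'wb') as fout:
        fout.writelines(kept)
    return len(kept)
```

Keeping one contiguous chunk of data, rather than sampling scattered
lines, avoids splitting most multi-line records; only the record at the
cut point may end up incomplete.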

 It makes me think that the problem might be
 somewhere in cutting a line into pieces: maybe it does something wrong, cuts
 some good unicode character into pieces, and so an invalid byte sequence
 appears.

My guess, too... it's just that I can't see where it happens... :-/

Thanks for your tests!

-- 
Davide Alberani davide.alber...@gmail.com  [PGP KeyID: 0x465BFD47]
http://www.mimante.net/



Re: [Imdbpy-help] imdbpy2sql 4.7 - invalid byte sequence for encoding UTF8

2011-04-24 Thread Davide Alberani
On Sun, Apr 24, 2011 at 22:44, darklow dark...@gmail.com wrote:
 Yes i can confirm - Script version 4.6 works perfectly on same server with
 same files.
 And i think by this we come closer to solution.

Excellent!  (well, it still baffles me why I'm absolutely unable to
reproduce the problem on my system, but that's another story...)

 Maybe this helps to identify the problem, this is what we did on our server.
 (Remember, we are doing this copying because there are only stable versions
 for Debian on server allowed, but we need those md5 hashes from 4.7 version)

I'll look at your setup tomorrow.  I'll surely sound pedantic, but... seriously:
why don't you use a virtualenv environment?  It's easy to install and
doesn't require root privileges.


-- 
Davide Alberani davide.alber...@gmail.com  [PGP KeyID: 0x465BFD47]
http://www.mimante.net/



Re: [Imdbpy-help] imdbpy2sql 4.7 - invalid byte sequence for encoding UTF8

2011-04-26 Thread Davide Alberani
On Tue, Apr 26, 2011 at 09:36, darklow dark...@gmail.com wrote:
 Thanks, let me know if you have any ideas, how to fix the problem...

Eh... As usual, right now I'm really busy. :-(

 I looked at virtualenv documentation, i didn't understand how to use it,

Ok, let's try:
- download virtualenv from http://pypi.python.org/pypi/virtualenv#downloads
- tar xvfz virtualenv-1.6.tar.gz
- cd virtualenv-1.6
- python virtualenv.py --no-site-packages ~/myvenv
- cd ~/myvenv
- . ./bin/activate # notice the initial dot
- pip install formencode # bug with the dependencies. :(
- pip install IMDbPY # or download from the Mercurial repository and
  run 'python setup.py install'

The most important step is the activation of the virtualenv: your prompt
should change to something like (myvenv)$ to denote that your virtualenv
is active.

Now, still from inside the virtualenv, you can run the imdbpy2sql.py script:
everything was installed locally to your ~/myvenv/ directory (the local python
interpreter is in ~/myvenv/bin/python).
If you need to deactivate the virtualenv, simply run the deactivate command.
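
Besides looking at the prompt, a script can check by itself whether it
is running inside a virtual environment. A minimal sketch in Python 3
syntax; the helper name in_virtualenv is hypothetical.

```python
import sys


def in_virtualenv():
    """Return True when the running interpreter belongs to a virtual environment."""
    # The stdlib venv module makes sys.base_prefix differ from sys.prefix;
    # older releases of the virtualenv tool set sys.real_prefix instead.
    base = getattr(sys, 'base_prefix', sys.prefix)
    return hasattr(sys, 'real_prefix') or base != sys.prefix


active = in_virtualenv()
interpreter = sys.executable   # path of the python binary actually in use
```

Printing both values at the start of a long import run makes it obvious
which interpreter, and therefore which copy of IMDbPY, is being used.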

HTH,
-- 
Davide Alberani davide.alber...@gmail.com  [PGP KeyID: 0x465BFD47]
http://www.mimante.net/



Re: [Imdbpy-help] imdbpy2sql 4.7 - invalid byte sequence for encoding UTF8

2011-04-28 Thread Davide Alberani
On Thu, Apr 28, 2011 at 22:52, darklow dark...@gmail.com wrote:

 However, the last command, pip install IMDbPY, didn't go so well: it looks like
 I got exactly the same error that another user reported some days ago in
 the same discussion, and he also has the UTF-8 encoding problem:

Sure: you don't have the python-dev package installed
in your system. :-/
A per-user installation is possible, but a little tricky...

 By running python setup.py install  I receive the same error. I also tried
 latest version (4.8dev20110425) but got same error.

Using the latest version sources, run (after you've activated your
virtualenv!):
  python setup.py install --without-cutils

 Maybe this explains why the script doesn't handle UTF-8 in the first
 place - some strange incompatibilities with cutils.c

I've run some tests without the compiled C module, so I think this
is not the cause, but at this point... who knows. :-)



-- 
Davide Alberani davide.alber...@gmail.com  [PGP KeyID: 0x465BFD47]
http://www.mimante.net/



Re: [Imdbpy-help] imdbpy2sql 4.7 - invalid byte sequence for encoding UTF8

2011-05-02 Thread Davide Alberani
On Mon, May 2, 2011 at 08:47, darklow dark...@gmail.com wrote:

 Thank you for your patience and for guiding me through the tests; I'm really
 glad we finally found the problem and fixed it.

Yep, even if it took a little too long. :-)

 Just curious, why only me and one another user encountered this problem, but
 when you run the same tests, you didn't see the error? :)

It may have something to do with the Python library used to connect to
Postgres.  Maybe some libraries handle this kind of error gracefully; I have
to check more carefully the versions installed on my system and in the virtualenv
I've used to reproduce the bug.
In fact the right thing to do in such cases is to raise an exception (as in
our case); other databases - or libraries to connect to databases - like MySQL
simply ignore these errors with a warning (not a great idea).
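
The two behaviours can be compared with Python's own codecs. This is
only an analogy in Python 3 syntax, not the code of any database driver;
the sample bytes and the helper names are hypothetical.

```python
bad = b'Caf\xc3 Central'   # 0xC3 followed by a space: invalid UTF-8


def decode_strict(data):
    """Fail loudly on invalid input, as PostgreSQL did in this thread."""
    return data.decode('utf-8')


def decode_lenient(data):
    """Keep going, replacing each undecodable byte with U+FFFD."""
    return data.decode('utf-8', errors='replace')


try:
    decode_strict(bad)
    strict_failed = False
except UnicodeDecodeError:
    strict_failed = True

lenient = decode_lenient(bad)
```

The lenient variant never fails, but the damaged character ends up
stored in the database and the original problem goes unnoticed.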

-- 
Davide Alberani davide.alber...@gmail.com  [PGP KeyID: 0x465BFD47]
http://www.mimante.net/



Re: [Imdbpy-help] Key Error when trying to access show['episodes']

2011-05-09 Thread Davide Alberani
On Mon, May 9, 2011 at 09:26, Fruch israel.fruch...@gmail.com wrote:
 Found the problem:
 in DOMHTMLEpisodesParser
 file: \imdb\parser\http\movieParser.py
 lines: 1703 - 1704
 should change to:
            if key.startswith('filter-season-'):
                 season_key = key[len('filter-season-'):]

Wasn't it already fixed in the mercurial repository [1]?


Thanks for your help!

+++
[1] http://bitbucket.org/alberanid/imdbpy/
-- 
Davide Alberani davide.alber...@gmail.com  [PGP KeyID: 0x465BFD47]
http://www.mimante.net/



Re: [Imdbpy-help] website redesign

2011-07-19 Thread Davide Alberani
On Tue, Jul 19, 2011 at 16:57, Alberto Malagoli albem...@gmail.com wrote:
 Hello,
 I am an habitual user of IMDbPY, I use it for my projects, and compared with
 other similar tools I must say this is the best one!

Thanks!
Unfortunately, in recent months it has been affected by many problems,
due to the IMDb redesign and my total lack of free time. :-(

 So, as an homage to the
 hard work you do on it, I would like to contribute to the project a bit by
 proposing a new style for the website.

Hey, thank you for the offer!  Any help is greatly appreciated.
Do you have any ideas for the layout and the technologies to use?


Thanks!

-- 
Davide Alberani davide.alber...@gmail.com  [PGP KeyID: 0x465BFD47]
http://www.mimante.net/



Re: [Imdbpy-help] website redesign

2011-07-22 Thread Davide Alberani
On Thu, Jul 21, 2011 at 09:25, Alberto Malagoli albem...@gmail.com wrote:

 By the way, could you add it to the list of programs using IMDbPY?

Done!  Thank you!

 Yes, I have some ideas, but I want to think about them a bit more, than I'll
 send you some mockups..
 But I want to inform you that in the next days I will be on holiday, so I'll
 start working on it the next week!

eheheh, we're in no hurry.  Have a nice holiday. :-)

 I also would like to ask you if want to add or remove something from the
 current website, I mean some parts or pages, some text,

Don't know... the altar tab can go away; the polls can be replaced
with something more functional than the ones I've written.
Besides that, it's just informative text.  I don't have any preferences
about how it's organized (ok: no frames and no blinking texts ;-)

 p.s. Anyway, I'm italian too :)

Noticed. :-)


Bye!

-- 
Davide Alberani davide.alber...@gmail.com  [PGP KeyID: 0x465BFD47]
http://www.mimante.net/



Re: [Imdbpy-help] Imdbpy

2011-07-26 Thread Davide Alberani
On Tue, Jul 26, 2011 at 02:25, Zach Anderson zpanderso...@gmail.com wrote:

 I would love to use imdbpy in an add-on that I'm writing for HTPC program
 XBMC.  The only issue that I'm having is that I'd prefer to only have 1
 include file.  Is this possible?

I guess you can try to create an egg.
From the source tree, you have to run the command:
  python ./setup.py bdist_egg

After that you can install the egg on the target system with something like:
  easy_install name_of_the.egg

I'm not sure that the current setup can suit your needs.
Maybe you can try to set the zip_safe parameter in
the setup.py file to True, instead of False.

By the way, there's a known problem with the dependencies of
SQLObject, which depends on FormEncode.
If the installation fails, try to run it again.  If it fails again,
manually install FormEncode first (or exclude SQLObject, if you
don't need it).

HTH,
-- 
Davide Alberani davide.alber...@gmail.com  [PGP KeyID: 0x465BFD47]
http://www.mimante.net/



Re: [Imdbpy-help] website redesign

2011-07-30 Thread Davide Alberani
On Fri, Jul 29, 2011 at 16:51, Alberto Malagoli albem...@gmail.com wrote:

 http://dl.dropbox.com/u/921872/IxD/IMDbPY%20logo.png

Cool!  I like the second of the second row, but I'm not sure if
it can work on a light/white background...

 Ok.. I decided to join some menus, for a simpler navigation, renaming some
 of them.

Ok, good for me.

 I still don't know how to replace the polls.. Which technology did you used
 to write the current website?

It was manually coded by me in PHP, using a MySQL database.
Since I'm out of ideas for new polls subjects, I think we can also
leave it out, for the moment! :-)

Thanks!
-- 
Davide Alberani davide.alber...@gmail.com  [PGP KeyID: 0x465BFD47]
http://www.mimante.net/



Re: [Imdbpy-help] website redesign

2011-07-31 Thread Davide Alberani
On Sun, Jul 31, 2011 at 12:22, Alberto Malagoli albem...@gmail.com wrote:

 Don't worry, I would like to use the same colors I used on that mockup for
 the header on top of the page. Do you like them?

Hmmm... have I missed a link, here? :-)
I've seen only the logos.
Anyway, my taste for colors is not really meaningful, so I'm fine with
anything you think is ok. ;-)


-- 
Davide Alberani davide.alber...@gmail.com  [PGP KeyID: 0x465BFD47]
http://www.mimante.net/



Re: [Imdbpy-help] [Imdbpy-devel] Getting information about genres only

2011-07-31 Thread Davide Alberani
On Sun, Jul 31, 2011 at 09:58, Saravanan
saravanan.thirumuruganat...@mavs.uta.edu wrote:

 I am trying to write some Ubuntu Unity lens using IMDBPy.

Great!  Let me know when it's done!

 For this purpose, I need only genre information. I took a look at the code and
 did not find any get_movie_genre kinda function. So currently, I am
 using ia.update(x,main) . Is there anyway to just the genre information?

There's no way to get only the 'genres' key; you're doing right, asking for
only the 'main' info set (after that, you can access the 'genres' key of the
Movie object).

If you're worried about performance, you can consider using the
'mobile' data access system (i.e.: ia = imdb.IMDb('mobile') ) and
leaving the rest of your code untouched.
It should be slightly faster, but... I notice right now that it fails
to parse the 'genres' key.  I have to fix it. :-)
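
A minimal sketch of the approach described above: update only the 'main'
info set, then read the 'genres' key.  The helper name genres_of is
hypothetical (it is not part of IMDbPY); it only assumes that ia offers
update(movie, info) and that movie behaves like a dictionary, as in the
message above.

```python
def genres_of(ia, movie):
    """Return the genres of `movie`, fetching only the 'main' info set.

    `ia` is assumed to be an object such as the one returned by
    imdb.IMDb(); `movie` is assumed to be dictionary-like.
    genres_of itself is a hypothetical helper, not an IMDbPY function.
    """
    ia.update(movie, 'main')           # retrieve just the 'main' info set
    return movie.get('genres') or []   # the key may be missing


# Typical use (needs the imdb package and network access):
#   import imdb
#   ia = imdb.IMDb()                   # or imdb.IMDb('mobile')
#   movie = ia.search_movie('The Matrix')[0]
#   print(genres_of(ia, movie))
```

Returning an empty list when the key is missing keeps the calling code
free of KeyError handling.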


-- 
Davide Alberani davide.alber...@gmail.com  [PGP KeyID: 0x465BFD47]
http://www.mimante.net/



Re: [Imdbpy-help] [Imdbpy-devel] Getting information about genres only

2011-07-31 Thread Davide Alberani
On Sun, Jul 31, 2011 at 13:38, Davide Alberani
davide.alber...@gmail.com wrote:

 It should be slightly faster, but... I notice right now that it fails
 to parse the 'genres' key.  I have to fix it. :-)

Ok, fixed in the Mercurial repository and added your name to the
credits.

Thanks!

-- 
Davide Alberani davide.alber...@gmail.com  [PGP KeyID: 0x465BFD47]
http://www.mimante.net/



Re: [Imdbpy-help] website redesign - first proposal

2011-08-23 Thread Davide Alberani
On Tue, Aug 23, 2011 at 12:29, Alberto Malagoli albem...@gmail.com wrote:

 Anyway, let me know if you like it

Hi!

Yeah, I like it a lot, thanks!
Nice design: very clean.

  - links in the middle of the page are like fast paths to the most used
 pages. Is there a guests counter in the current website, so we could know
 which pages are currently the most visited ones?

There are google analytics and piwik; do you need some stats?

By the way, in the last week a lot of bugs were fixed, and so I think
we can be ready for a new release... let's say before the end
of September. :-)  It would be cool to have the new web site for it.


-- 
Davide Alberani davide.alber...@gmail.com  [PGP KeyID: 0x465BFD47]
http://www.mimante.net/



Re: [Imdbpy-help] website redesign - first proposal

2011-08-23 Thread Davide Alberani
On Tue, Aug 23, 2011 at 15:04, ronaldhatcher ronaldhatc...@mac.com wrote:
  was wondering what the changes of moving
 from hg to git would be?

Not much, not only for the fact that I slightly prefer mercurial. :-D
No, seriously: they are both wonderful tools, and both github
and bitbucket are great services.  The differences from each
other are very few, compared to the differences with svn.
Did I already mention that I prefer mercurial? ;-)

 I personally find github excellent and since most
 of my other work is in git so I think it would be great.

But IMDbPY already has a github official repository:
  https://github.com/alberanid/imdbpy

I keep it in sync with the mercurial repository on bitbucket [0]
using the hg-git plugin [1].
So if you want to fork the project on github and develop
using git, feel free!

+++
[0] http://bitbucket.org/alberanid/imdbpy/
[1] http://hg-git.github.com/
-- 
Davide Alberani davide.alber...@gmail.com  [PGP KeyID: 0x465BFD47]
http://www.mimante.net/



Re: [Imdbpy-help] website redesign - first proposal

2011-08-25 Thread Davide Alberani
On Wed, Aug 24, 2011 at 21:27, Alberto Malagoli albem...@gmail.com wrote:

 There are google analytics and piwik; do you need some stats?

 Yes, I would like to know which pages are the most visited one. So I will
 add these pages on the home as fast links..

Ok, I'll send you the data.


 That's great! I saw IMDB pages are chaged in the last few weeks, so the last
 fixes takes into account these changements?

Most of them, I hope.

 I also would like to ask you if there are any plans about a Python 3
 porting...

Not sure about it: it also depends on the porting of our dependencies,
like SQLAlchemy/SQLObject.

  - I cannot understend the difference, under the downloads page, between
 official packages and unofficial packages. May I merge them into one or
 not?

If you want, yes.
The difference is that the official ones were made by the package managers
of the distribution, the others by me, so they may not be fully compliant with
the policies of the distribution.

  - Is the mobile version still under development? Where can I put it, under
 a development page or under the downloads page?

No, the Symbian interface is no longer developed or supported. :-/

 I fixed some problems on the home page, and added some new stuff, so check
 it:
 http://dl.dropbox.com/u/921872/IxD/IMDbPY/index.html

 I also created the License page:

Good - I link them!


Thanks!

-- 
Davide Alberani davide.alber...@gmail.com  [PGP KeyID: 0x465BFD47]
http://www.mimante.net/



Re: [Imdbpy-help] website redesign - first proposal

2011-08-31 Thread Davide Alberani
On Mon, Aug 29, 2011 at 23:21, Alberto Malagoli albem...@gmail.com wrote:

 No, the Symbian interface is no longer developed or supported. :-/

 Mhh, ok... I'm still thinking about where I could put it, but I have no
 ideas..

I guess it can be removed.
After all, there's no way that we'll return to work on Symbian. :-)

 I'm working also on new pages like the downloads one. Here's the last
 version:
 http://dl.dropbox.com/u/921872/IxD/IMDbPY/downloads.html

 From now on, you can start navigating the website through the menu on top.
 Take a look at the home page also (I added some new stuff).

Wow!  This site is awesome! It's really wonderful.

 Taking a look at the documentation part, I have to say that by now is a bit
 confusing. Contents should be organized in a different way. I don't want to
 say that it has to be changed now, but a reorganization of the documentation
 could be a necessary future work.

True, I've always created a new text document every time there was
something new, but they are too many and too long. :-(

By the way, if you need access to the sourceforge web server, let me know.
What's your username on sourceforge?

-- 
Davide Alberani davide.alber...@gmail.com  [PGP KeyID: 0x465BFD47]
http://www.mimante.net/



Re: [Imdbpy-help] Limit imdbpy2sql to only movies - KIND_ID=1

2011-09-02 Thread Davide Alberani
On Fri, Sep 2, 2011 at 09:09, darklow dark...@gmail.com wrote:

  Since i dont need other movie types than Movies, which KIND_ID=1
  so it means i could narrow down from 1 985 650 to  560 606 titles, which
 would greatly improve performance for some of my scripts. So i could get rid
 of ~1.4 million titles, it means event more million names.

Hmmm... I have to think about it and look at the code, but I fear it's nearly
impossible.

The information about the kind of a movie is not taken from a single file.
Instead, each time a title is found (in any of the parsed files), it's analyzed
(and amongst other things its kind is detected) and checked to see if
we already know this title and if it already has a titleID (otherwise,
a new progressive one is assigned).

The whole process is automated, and I don't think it would be easy to
exclude only a certain kind of title, sorry.

This way of working makes it possible to import only a single file (e.g.
special-effects-companies.list.gz) and incidentally makes the movies.list.gz
file almost useless.

-- 
Davide Alberani davide.alber...@gmail.com  [PGP KeyID: 0x465BFD47]
http://www.mimante.net/



Re: [Imdbpy-help] website redesign - first proposal

2011-09-09 Thread Davide Alberani
On Thu, Sep 8, 2011 at 16:46, Alberto Malagoli albem...@gmail.com wrote:
 Sorry if I haven't posted any news or reply for a while but I had some
 problems with my computer, which is to the doctor by now.. :(

No problem at all... I'm as busy as hell, at the moment. :-/

 Mhh, I don't think I need it by now, anyway my username is alpert_ (with an
 underscore at the end).

Perfect.  Once you're ready, you can give me your public ssh key, so that
you can directly upload files to the web server (if you need instructions,
just ask).

Bye!

-- 
Davide Alberani davide.alber...@gmail.com  [PGP KeyID: 0x465BFD47]
http://www.mimante.net/



Re: [Imdbpy-help] Problem with importing to MySQL database

2011-09-20 Thread Davide Alberani
On Tue, Sep 20, 2011 at 12:50, K Hein khei...@gmail.com wrote:

 It throws the following error. It
 seems like the exception handler code tries to insert a duplicated record
 into cast_info table with existing primary key.

Yep, that's the case.

 Can anyone help me fix this
 problem or suggest any work around solution?

Unfortunately I'm unable to reproduce the problem.
Do you have the latest version of IMDbPY installed from the Mercurial
repository?
You can download it from here: https://bitbucket.org/alberanid/imdbpy/
(see the get source link)

By the way, do you have SQLObject or SQLAlchemy installed? (or both?)

Another thing you can try, is to use the -c /path/to/an/empty/directory
argument: imdbpy2sql.py will write a series of .csv files in that directory,
that will be later imported into the database.


-- 
Davide Alberani davide.alber...@gmail.com  [PGP KeyID: 0x465BFD47]
http://www.mimante.net/



Re: [Imdbpy-help] Fwd: imdbpy2sql 4.7 - invalid byte sequence for encoding UTF8

2011-09-20 Thread Davide Alberani
On Mon, Sep 19, 2011 at 21:36, darklow dark...@gmail.com wrote:

 This fix worked for about some months and unfortunately there is similar
 encoding error in latest data files (16.sep.2011)

Tried right now, and everything went fine for me, using
the same version. :-(

Are you using SQLObject or SQLAlchemy?
Version of MySQL?

-- 
Davide Alberani davide.alber...@gmail.com  [PGP KeyID: 0x465BFD47]
http://www.mimante.net/



Re: [Imdbpy-help] website redesign - first proposal

2011-09-22 Thread Davide Alberani
On Thu, Sep 22, 2011 at 11:10, Alberto Malagoli albem...@gmail.com wrote:

 After a long time, I had my computer back and I had a chance to work on the
 IMDbPY website, so here you are the last version:
 http://dl.dropbox.com/u/921872/IxD/IMDbPY/index.html

As usual, very very cool. :-)  I like the various effects a lot.

 I would like to ask if someone could help me on that, because it's a big
 work and if I do it all alone I will probably make mistakes... Thanks!

I surely can, over a weekend (which one, and of which month, is still to
be decided... :)

What's the best way to coordinate the effort?  A repository somewhere?


-- 
Davide Alberani davide.alber...@gmail.com  [PGP KeyID: 0x465BFD47]
http://www.mimante.net/



Re: [Imdbpy-help] website redesign - first proposal

2011-10-01 Thread Davide Alberani
On Fri, Sep 30, 2011 at 18:43, Alberto Malagoli albem...@gmail.com wrote:

 What's the best way to coordinate the effort?  A repository somewhere?

 Well, probably a repository would be a good idea, anyway I have a kind of
 hate against them, I always have problems and I've never completly
 understood them :P

Eheh, no problem.
I've arranged a Mercurial repository on sourceforge:
  http://imdbpy.hg.sourceforge.net/hgweb/imdbpy/imdbpy-site/

You can clone it (using the 'hg' command from the mercurial package):
  hg clone ssh://your_user_n...@imdbpy.hg.sourceforge.net/hgroot/imdbpy/imdbpy-site

I've also added a site-sync rsync script to it, to synchronize
the local content to http://imdbpy.sf.net/new/ (for now).
If you are working on a Windows environment, let me
know: I'm sure there are replacements for these tools.

Basic introduction to the sf repository is here:
  https://sourceforge.net/apps/trac/sourceforge/wiki/Mercurial

Very basically:
- to add a new file or directory: hg add FILE_NAME
- to update from the repository: hg pull
- once you're happy with a set of changes you made:
  # download new things
  hg pull
  # apply them to the local files - there may be conflicts.
  hg update
  # if any, resolve conflicts; then save your changes locally:
  hg commit -m 'description of what was changed'
  # send your changes to the server:
  hg push
- to see if there's something new in the repository:
  hg in
- to see if you've something not committed (locally!):
  hg status
- to see if you've something to send to the repository:
  hg out
- to see what you've changed and not committed yet (locally!):
  hg diff

A very good introduction:
   http://hginit.com/

But fear not: you only need very basic operations, trust me. :-)
And if you need any help, ask!

 and here you can find a zipped version of it:
 http://dl.dropbox.com/u/921872/IxD/IMDbPY/IMDbPY.7z

Copied to the repository and synced on the web site (on the /new path,
for now)

 I will not work on the website on this weekend

Eh, I'm busy too. :-/

 Bye, have a good weekend!

You too!


-- 
Davide Alberani davide.alber...@gmail.com  [PGP KeyID: 0x465BFD47]
http://www.mimante.net/



Re: [Imdbpy-help] website redesign - first proposal

2011-10-03 Thread Davide Alberani
On Tue, Aug 23, 2011 at 15:04, ronaldhatcher ronaldhatc...@mac.com wrote:

 since most of my other work is in git so I think it would be great.

And now bitbucket also supports Git. :-)
  http://blog.bitbucket.org/2011/10/03/bitbucket-now-rocks-git/


-- 
Davide Alberani davide.alber...@gmail.com  [PGP KeyID: 0x465BFD47]
http://www.mimante.net/



Re: [Imdbpy-help] imdbpy2sql.py

2011-10-10 Thread Davide Alberani
On Mon, Oct 10, 2011 at 13:47, Tiago Serra tiagofse...@gmail.com wrote:
 Thanks i find it in the /usr/share/doc…..

Ok, then you have an old version. :-P

Sorry, you have to use the one in the Mercurial repository:
  https://bitbucket.org/alberanid/imdbpy/

(notice the get the source link, on the right).
To install it (as root):
   python setup.py install

You may need to remove the installed package first; to compile it you
need python-dev and possibly sqlobject, sqlalchemy and some other
packages (easy_install tries to satisfy these dependencies, but sometimes
it fails on sqlobject... if that's the case, let me know).


-- 
Davide Alberani davide.alber...@gmail.com  [PGP KeyID: 0x465BFD47]
http://www.mimante.net/



Re: [Imdbpy-help] cant retrieve movie list from person

2011-10-28 Thread Davide Alberani
On Fri, Oct 28, 2011 at 19:57, Gustaf Nilsson gus...@laserpanda.com wrote:
 Hi
 Im trying to retrieve a list of movies a person has worked on, but it only
 seems to work on famous people (?)

Hi!

First of all, if you're not already doing it, I suggest
to use the version from the Mercurial repository:
  http://bitbucket.org/alberanid/imdbpy/

The stable version is very old: I hope to release a
new one in a few days.

 b = x.get_person(1459663)
 b.get_titlesRefs()

Ok, get_titlesRefs is (mostly) an internal method,
and only returns links to titles as found in
the texts (biographies, plot summaries and so on).

To get the other information, instances of the Person class (like
those of the Movie class) work like Python dictionaries.
You can get a list of available information with:
  b.keys()

After that, you can do things like:
  b.get('actor')
  b['visual effects']
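
Put together, the dictionary-style access above could be wrapped in
something like the following sketch.  The function name credits_by_role
is hypothetical (not an IMDbPY API); it only assumes that the person
object supports get() like a dictionary, as described above.

```python
def credits_by_role(person, roles):
    """Collect the titles a person is credited for, grouped by role.

    `person` is assumed to be dictionary-like (e.g. the object returned
    by get_person); `roles` are keys such as 'actor' or 'visual effects'.
    credits_by_role is a hypothetical helper, not part of IMDbPY.
    """
    found = {}
    for role in roles:
        titles = person.get(role)       # None if the key is missing
        if titles:
            found[role] = list(titles)  # keep only roles with credits
    return found


# Typical use (needs the imdb package and network access):
#   b = x.get_person(1459663)
#   print(credits_by_role(b, ['actor', 'visual effects']))
```

Roles with no credits are simply left out of the result, so the same
call works for famous and less famous people alike.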


Hope this helps! (by the way: wow, you worked on some
very cool movies!)

-- 
Davide Alberani davide.alber...@gmail.com  [PGP KeyID: 0x465BFD47]
http://www.mimante.net/



[Imdbpy-help] IMDbPY 4.8 and new site released

2011-11-01 Thread Davide Alberani
Hi all,
I've just released the long-awaited IMDbPY 4.8, with too many bug fixes
to mention.  Sorry for the slowdown in the development; I'm sure there
are still many bugs and I'd like to see some fixes to some core pieces of
code (after more than 7 years and almost 50 releases, it's probably a good
idea to rewrite a function or two ;-)

So, if anyone wants to help, let us know!

With this release, we also have a shiny new web site, courtesy of
Alberto Malagoli who kindly joined the development team (thanks
and welcome aboard, Alberto!)

As usual, you can download IMDbPY from: http://imdbpy.sf.net/

Enjoy!

-- 
Davide Alberani davide.alber...@gmail.com  [PGP KeyID: 0x465BFD47]
http://www.mimante.net/



Re: [Imdbpy-help] getting grossing data

2011-11-13 Thread Davide Alberani
On Sat, Nov 12, 2011 at 20:44, Zsolt Ero zsolt@gmail.com wrote:

 I have just started using IMDbPY. I would like to get the grossing
 field of a given movie, but I don't know how.

This information is included in the 'business' data set, which
you have to retrieve first.

An example:
import imdb
ia = imdb.IMDb()
# pass the ID as a string: 0499549 is not a valid integer literal
avatar = ia.get_movie('0499549')
ia.update(avatar, 'business')  # get the business information

business = avatar.get('business') or {}  # may be empty
# to see what else is available: print(business.keys())
print(business.get('gross'))


Unfortunately I notice now that a lot of garbage is collected, too:
you should just get a list of information... :-/
I'll try to fix it ASAP.
When the data is retrieved from a SQL db, there are no problems.


-- 
Davide Alberani davide.alber...@gmail.com  [PGP KeyID: 0x465BFD47]
http://www.mimante.net/



Re: [Imdbpy-help] getting grossing data

2011-11-13 Thread Davide Alberani
On Sun, Nov 13, 2011 at 15:00, Zsolt Ero zsolt@gmail.com wrote:

 Yesterday I finally figured out how to get the grossing data out.

Good. :-)

 match = re.match(r'\$([1-9][0-9,]+)', movie['business']['gross'][0])
 gross = match.group()[1:]
 grossint = int(gross.replace(',', ''))

Ok, but keep in mind that the currency may also be British pounds
or anything else... (and I think the symbol can come before the value
or even after it)
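
A rough sketch of a parser that does not assume a leading dollar sign.
The function name parse_gross and the sample strings are assumptions
made for illustration; the real format of the 'gross' entries may
differ, so treat this as a starting point only.

```python
import re

# A run of non-digit, non-space characters before or after the number is
# taken as the currency marker (e.g. '$', 'GBP', 'EUR').
_AMOUNT = re.compile(
    r'(?P<pre>[^\d\s(]*)\s*(?P<num>\d[\d,]*)\s*(?P<post>[^\d\s(]*)'
)


def parse_gross(text):
    """Return (currency, amount) for the first amount in `text`, or None.

    parse_gross is a hypothetical helper; `currency` is None when no
    marker is found next to the number.
    """
    match = _AMOUNT.search(text)
    if match is None:
        return None
    currency = match.group('pre') or match.group('post') or None
    amount = int(match.group('num').replace(',', ''))
    return currency, amount
```

For example, parse_gross('$760,505,847 (USA)') gives ('$', 760505847),
while parse_gross('1,234 EUR') gives ('EUR', 1234).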

 Do you say that you have implemented this now in the SVN?

No, the previous code introduced in the list things that were not
business information at all: I just stripped those.

 Also, can you help me how to get the English title of a movie, what is
 listed on the www site not on the aka site?

Hmmm... do they ever differ?  Do you have an example?

 Or to figure out which is the www title from the akas list?

I'd probably need to do some tests (and have at least an example
to work on).
Maybe you can use the list of akas and the 'guessLanguage' of
the Movie instances (it tries to guess the language of the title/movie),
but I'm not too sure.


-- 
Davide Alberani davide.alber...@gmail.com  [PGP KeyID: 0x465BFD47]
http://www.mimante.net/



Re: [Imdbpy-help] imdbpy2sql: KeyError: md5sum

2011-11-15 Thread Davide Alberani
On Tue, Nov 15, 2011 at 16:33, Frederik Vaassen
frederik.vaas...@gmail.com wrote:
 Hi,

 I'm attempting to transform the IMDb text files (acquired from one of
 the mirrors today) into a mySQL database using imdbpy2sql

Hi!

Are you working on a brand new database, or are you using one that
was populated by an older version of IMDbPY?

Can you provide the whole command line that you've used?

Thanks for the report!

-- 
Davide Alberani davide.alber...@gmail.com  [PGP KeyID: 0x465BFD47]
http://www.mimante.net/



Re: [Imdbpy-help] Retrieving actor info set from Person

2011-11-17 Thread Davide Alberani
On Thu, Nov 17, 2011 at 07:10, Nicholas Cafarchio cafni...@pct.edu wrote:

 I'm attempting to create a script that lets you find the connections 
 between two actors (i.e. through movies and actors they have worked with, how 
 are they connected, 6 Degrees of Kevin Bacon?). Retrieving the information is 
 easy enough, but the logic that looks for the connections is slightly more 
 complicated than I can handle in Python, so, being a Java programmer, I 
 decided to use Jython. Here's where I run in to some issues, and I'm fairly 
 confused to say the least.

Right, 'cause the task was not already difficult enough,
so better put jython into the equation!  Makes sense... :P


 5a. When I call i.update(bacon) in the interactive python shell, the 'actor' 
 info set is now listed as an info set of bacon
 5b. When I call pi.exec(i.update(bacon)); inside my Jython script, about 
 1/4 of the info sets that should be returned do not show up when you call 
 .keys(), 'actor' being one of them.

:-/

 6b. Using http instead of mobile in my Jython script gets me some of the info 
 sets, but still not 'actor'.
 7b. Even though I import sys and append the path to lxml to path, the script 
 always falls back to beautifulsoup, and it takes forever. This doesn't happen 
 in the interactive shell.

lxml is (partially) a C python module.  I don't remember how Jython works
with these.

 So, now that the background is out of the way, does anyone know why I'm 
 getting different results in the shell and the script? Better yet, does 
 anyone know how to fix it? If someone can help me fix the lxml thing, that 
 would be helpful, but it's not a primary concern.

My only idea is crazy and already disproved by a quick test I made:
while I swore some time ago that the keys used by the Movie/Person/...
classes to store information are strings (besides the few cases where
they are instances of other objects), the truth is that many times they
are unicode.
Not that it could (should?) create any problem, but I noticed that for
'mobile' some are strings and some are unicode (u'actor' is unicode).
The same is true for 'http', but in that case 'actor' is a string... :-/

Another absolutely crazy option: is it possible that, of the environments
you're using (Jython and the normal Python), there's a proxy used in one
and not in the other?  (i.get_proxy() should tell you).

And, by the way, how do Jython and its PythonInterpreter share
memory, if they do?  i is a very common variable name... ;-)

Ok, I'm officially out of ideas. :D

-- 
Davide Alberani davide.alber...@gmail.com  [PGP KeyID: 0x465BFD47]
http://www.mimante.net/



Re: [Imdbpy-help] Error returned when running imdbpy2sql.py with MySQL database

2011-11-19 Thread Davide Alberani
On Sat, Nov 19, 2011 at 20:17, Cody Bohn codybohnh...@gmail.com wrote:

 I am getting this issue when I run imdbpy2sql.py with the proper arguments
 with a mysql scheme and I can't seem to figure out the issue. I posted on
 Stack Overflow but I only managed to resolve part of the problem and this is
 what remains.

Hi!
Looks like you've some serious problem with dependencies.

Can you tell us how and where you're installing IMDbPY, and what's
your environment?
It would help to know:
1. your distribution;
2. whether you're installing IMDbPY from the tar.gz, from a
package, or with easy_install or pip;
3. whether you're installing it in a virtualenv.

Also, check that you don't have other versions of IMDbPY installed
on your system.

If you're using easy_install, please make sure you have only one
executable installed in your path.

Thanks for the report!


-- 
Davide Alberani davide.alber...@gmail.com  [PGP KeyID: 0x465BFD47]
http://www.mimante.net/



Re: [Imdbpy-help] Adding foreign keys fail with postgresql database

2011-12-26 Thread Davide Alberani
On Wed, Dec 14, 2011 at 20:58, Davide Alberani
davide.alber...@gmail.com wrote:

 Thank you very much for the detailed report.

I think I have fixed it in the Mercurial repository: the problem was caused
by the fact that, right now, the crazy credits don't contain any reference
to movie titles. :-/
I hope they will fix this soon.

Thank you very much for your report - I'll add your name to the credits.

-- 
Davide Alberani davide.alber...@gmail.com  [PGP KeyID: 0x465BFD47]
http://www.mimante.net/



Re: [Imdbpy-help] imdbpy2sql - some movies are missing nr_order for the cast

2011-12-26 Thread Davide Alberani
On Tue, Dec 13, 2011 at 19:58, Davide Alberani
davide.alber...@gmail.com wrote:

 However, this doesn't explain your 19 and 0... :-/
 My db is not up-to-date: I'll recreate it tomorrow and see the result
 of your queries.

Pardon for the delay.
I have the same data, and they are from the 'writer.list' file.  Most probably,
they are wrong submissions or have some arcane meaning for the IMDb
database. :-)

Happy holidays!

-- 
Davide Alberani davide.alber...@gmail.com  [PGP KeyID: 0x465BFD47]
http://www.mimante.net/



Re: [Imdbpy-help] IMDB database creation

2012-02-15 Thread Davide Alberani
On Wed, Feb 15, 2012 at 15:39, Márton Szabó habaku...@gmail.com wrote:

 First of all thanks for IMDbPY it works great.

Thanks. :-)

 I wanted to play with the IMDB UNIX search programs, that can be found here:
 http://www.imdb.com/interfaces

Wow!  They are a little... démodé. :-)
I don't really expect them to work on a recent set of data:
for some time now, all the titles in the plain text data files
have been listed in the "The Title" format, while previously they
were "Title, The".
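
Just to show what the two formats mean in practice, here is a small sketch that converts between them. The function names and the short English-only article list are assumptions made for the example, and it works on bare titles only (no year or other suffix).

```python
# Illustrative subset of articles; real data would need more languages.
ARTICLES = ('The', 'A', 'An')


def to_natural_order(title):
    """Turn 'Title, The' into 'The Title'; leave other titles alone."""
    for article in ARTICLES:
        suffix = ', ' + article
        if title.endswith(suffix):
            return article + ' ' + title[:-len(suffix)]
    return title


def to_canonical_order(title):
    """Turn 'The Title' into 'Title, The'; leave other titles alone."""
    for article in ARTICLES:
        prefix = article + ' '
        if title.startswith(prefix):
            return title[len(prefix):] + ', ' + article
    return title
```

A tool written for the older "Title, The" layout would need something like to_canonical_order applied to every title before it could match the newer files.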

We supported the output of moviedb up to IMDbPY 4.1;
after that the changes were so many that it was no longer
worth the effort (and the 'sql' method works much better).

 I was able to compile the programs, but I couldn't create the databases.
 Lots of .list files failed to convert to .data/.names/.titles files with
 error codes like these:

 /usr/local/moviedb-3.24/etc/mkdb  -movie
 Adding Movies List...
 make[3]: *** [movies.data] Bus error

Here's the most important excerpt from my old README.local:

NOTE: the current (3.24) moviedb version is old, and it was not
designed with TV series episodes support in mind.
It can still work very well, but you have to modify some constants
in the code: edit the moviedb.h file in the src directory,
and change MAXTITLES to _at least_ 160, MAXNAKAENTRIES
to 70, MAXFILMOGRAPHIES to 20470, LINKSTART to 100
and MAXBIOENTRIES to 50.
Also, setting MXLINELEN to 1023 is a good idea.
See http://us.imdb.com/database_statistics for more up-to-date
statistics.


You can read a complete copy here:
http://erlug.linux.it/~da/erlugtmp/README.local
The current version:
https://bitbucket.org/alberanid/imdbpy/src/74e6f583f9cf/docs/README.local

If you need the other tools we developed to use these data, you
can download IMDbPY 4.1 from
http://sourceforge.net/projects/imdbpy/files/IMDbPY/4.1/
but I guess that at this point you've changed your mind. :-P


HTH,
-- 
Davide Alberani davide.alber...@gmail.com  [PGP KeyID: 0x465BFD47]
http://www.mimante.net/



Re: [Imdbpy-help] install fails

2012-03-18 Thread Davide Alberani
On Sun, Mar 18, 2012 at 18:50, Jan Behrendt jan.behre...@gmail.com wrote:

 Hi all! I've a problem with the installation. Could you help me please?

 root@ubuntu:/home/ubuntu/imdbpy/IMDbPY-4.8.2# python ./setup.py
 /usr/lib/python2.7/distutils/dist.py:267: UserWarning: Unknown distribution
 option: 'zip_safe'
   warnings.warn(msg)

I admit that this error is somewhat strange, and I'm unable
to reproduce it (Python 2.7, Ubuntu 11.10 - but distutils doesn't
seem to have changed).
Are you sure that you have the python-setuptools package installed,
and that there are no other packages/modules with the
same name installed somewhere else on the system?

Anyway, the command to install it should be:
  python ./setup.py install

Let us know if this fixes the problem.

-- 
Davide Alberani davide.alber...@gmail.com  [PGP KeyID: 0x465BFD47]
http://www.mimante.net/



Re: [Imdbpy-help] timing out

2012-03-24 Thread Davide Alberani
On Thu, Mar 22, 2012 at 09:29, Tomáš Hnyk tomash...@gmail.com wrote:

 Thanks a lot, that will be great:-).

Ok, in the Mercurial repository you can now set the 'reraiseExceptions'
argument (of the IMDb() instance, or in the imdbpy.cfg file if you use it)
to True, to re-raise every caught exception.  By default, it's False.

Thanks for the idea, I've added your name to the credits.


-- 
Davide Alberani davide.alber...@gmail.com  [PGP KeyID: 0x465BFD47]
http://www.mimante.net/



Re: [Imdbpy-help] timing out

2012-03-24 Thread Davide Alberani
On Sat, Mar 24, 2012 at 11:51, Tomáš Hnyk tomash...@gmail.com wrote:

 and it has been hanging at this for about two minutes now and counting.
 Should not it raise some sort of exception, or is it going to wait for
 something indefinitely?

:-/
It's very possible that it will wait until the end of time, I fear.

 I was originally asking if there were a way to set a timeout for retrieving
 film info. Is there some internal timeout in imdbpy after which it just
 gives up?

My fault: I misunderstood the problem.
No, IMDbPY uses urllib, and it doesn't provide a way to set
a timeout.  urllib2 has it, but only in very recent python versions,
so I'd prefer not to use it (and, by the way, a conversion from urllib
to urllib2 would take me some time).

 I found a way to do this on the internet:
 import signal

Yes, it's more or less reasonable, but I want to think twice about
it before I introduce an alarm in the IMDbPY code... after
all it's a library, and it could heavily interfere with the signal handling
done by the programs that use it. :-(

The real point is, it's very strange that it happens so methodically on
your system... I think I've never seen it hang during a request.

Anyway, if you can, try putting this _before_ IMDb is imported/instantiated:
  import socket
  socket.setdefaulttimeout(10)

With that, at least the 'open' should fail after at most 10 seconds.
If this doesn't work for you, it means that the connection is established,
but hangs (or is really slow) while reading the data.
If so, I fear an alarm-based solution is the only way... :-/
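
For reference, an alarm-based wrapper kept in your own program (not inside the library) could look more or less like the sketch below. The names Timeout and call_with_alarm are made up for the example; it relies on SIGALRM, so it is assumed to run on a Unix-like system and in the main thread only.

```python
import signal
import time


class Timeout(Exception):
    """Raised when the guarded call runs longer than allowed."""


def call_with_alarm(func, seconds, *args, **kwargs):
    """Call func(*args, **kwargs), raising Timeout after `seconds`."""
    def on_alarm(signum, frame):
        raise Timeout('no answer after %s seconds' % seconds)

    # Install our handler, remembering whatever the program had before.
    previous = signal.signal(signal.SIGALRM, on_alarm)
    signal.setitimer(signal.ITIMER_REAL, seconds)
    try:
        return func(*args, **kwargs)
    finally:
        # Always disarm the timer and restore the previous handler.
        signal.setitimer(signal.ITIMER_REAL, 0)
        signal.signal(signal.SIGALRM, previous)


if __name__ == '__main__':
    try:
        call_with_alarm(time.sleep, 0.5, 5)  # stands in for a slow request
    except Timeout as exc:
        print('gave up:', exc)
```

With an IMDb instance called ia, the call would be something like call_with_alarm(ia.get_movie, 30, movieID).  Restoring the previous handler in the 'finally' limits the interference with other signal handling, but it does not remove it.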

Thanks for the debugging!

-- 
Davide Alberani davide.alber...@gmail.com  [PGP KeyID: 0x465BFD47]
http://www.mimante.net/



Re: [Imdbpy-help] Episode information of not available

2012-03-24 Thread Davide Alberani
On Sat, Mar 24, 2012 at 14:44, David Kaufman david.kauf...@gmx.de wrote:

 I was looking into whether I could retrieve episode information through
 imdbpy, but was unable to do so:

Recently the episode pages changed a lot.
A fix is already available on the Mercurial repository:
  https://bitbucket.org/alberanid/imdbpy/

Let me know if you encounter other problems!


-- 
Davide Alberani davide.alber...@gmail.com  [PGP KeyID: 0x465BFD47]
http://www.mimante.net/



Re: [Imdbpy-help] SQL Server Express 2008 R2 - Connection Error

2012-03-31 Thread Davide Alberani
On Mon, Mar 26, 2012 at 09:25, Damien damien.stew...@profero.com wrote:

 Traceback (most recent call last):
   File "imdbpy2sql.py", line 519, in <module>
     connectObject = conn.getConnection()
   File "C:\Python27\lib\site-packages\sqlobject\dbconnection.py", line 336, in getConnection
     conn = self.makeConnection()
   File "C:\Python27\lib\site-packages\sqlobject\mssql\mssqlconnection.py", line 94, in makeConnection
     con = self.dbconnection( *self.make_conn_str(self) )
   File "pymssql.pyx", line 552, in pymssql.connect (pymssql.c:7162)
 pymssql.InterfaceError: Connection to the database failed for an unknown reason.

 Any ideas?

First of all, sorry for the long delay. :-/

Hmmm... not many, since I don't know enough about SQL Server.
Maybe you can try interfacing with it using SQLAlchemy instead of the
default SQLObject.  Try adding the -o sqlalchemy argument to the
imdbpy2sql.py command line...


HTH,

-- 
Davide Alberani davide.alber...@gmail.com  [PGP KeyID: 0x465BFD47]
http://www.mimante.net/



Re: [Imdbpy-help] SQL Server Express 2008 R2 - Connection Error

2012-04-08 Thread Davide Alberani
On Tue, Apr 3, 2012 at 05:42, Damien damien.stew...@profero.com wrote:

   File "pymssql.pyx", line 549, in pymssql.connect (pymssql.c:7110)
 sqlalchemy.exc.OperationalError: (OperationalError) (20009, 'Net-Lib error during Unknown error') None None
   [...]
 I have created a blank DB in SQL Server which I can connect to okay using SQL
 Server Management Studio. I would expect the error messages to be a little more
 helpful, perhaps stating the login is incorrect or something - but the last few
 lines seem to indicate something is broken :(

Yep. :-(
Sorry, but I know next to nothing about databases in a Windows environment,
so I can't be of any help with this.


-- 
Davide Alberani davide.alber...@gmail.com  [PGP KeyID: 0x465BFD47]
http://www.mimante.net/



Re: [Imdbpy-help] Want to create xml files for every type of supported imdbpy object from mysql database

2012-04-29 Thread Davide Alberani
On Sat, Apr 28, 2012 at 00:48, Benjamin benjaminbondt...@gmail.com wrote:

 I've followed the instructions and populated a mysql db w/ the info from the
 imdb flat files.
 I want to output from this db an xml file for every single imdbpy object
 type(Movie, Person, etc.).
 Is there a way to do this?

Sort of, but I can't guarantee you that it will not be painfully slow. :-)

First of all, you have to know the maximum ID of the object
you're considering (movies, persons, ...)
For movies, you can use something like:
  SELECT MAX(id) FROM title;
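
If you prefer to get that value from Python, the sketch below shows the idea. To keep it runnable anywhere it uses an in-memory SQLite database with a stand-in 'title' table; with MySQL you would use your own DB-API connection instead. The max_id helper and the 'name' table in its whitelist are assumptions for the example.

```python
import sqlite3


def max_id(conn, table):
    """Return the highest id stored in `table`, or 0 if it is empty."""
    # Table names cannot be passed as bound parameters, so only accept
    # the names we expect (assumed here: 'title' and 'name').
    if table not in ('title', 'name'):
        raise ValueError('unexpected table: %r' % table)
    cur = conn.cursor()
    cur.execute('SELECT MAX(id) FROM %s' % table)
    row = cur.fetchone()
    return row[0] or 0


# Stand-in data, only to make the example self-contained.
conn = sqlite3.connect(':memory:')
conn.execute('CREATE TABLE title (id INTEGER PRIMARY KEY, title TEXT)')
conn.executemany('INSERT INTO title (id, title) VALUES (?, ?)',
                 [(1, 'first'), (2, 'second'), (7, 'seventh')])
MAX_ID = max_id(conn, 'title')
```

Note that the ids are not guaranteed to be contiguous, so some values below MAX_ID may not correspond to any row.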

After that you can write a Python script to extract the information.
Something like this (beware that it's mostly pseudo-code) should work:

from imdb import IMDb

ia = IMDb('sql', uri='mysql://USERNAME:PASSWORD@localhost/imdb')

# replace 0 with the value returned by: SELECT MAX(id) FROM title;
MAX_ID = 0

# ids are assumed to start from 1.
for idx in xrange(1, MAX_ID + 1):
    # get the movie object.
    movie = ia.get_movie(idx)
    # fetch all other information, if you need them.
    # ia.update(movie, 'all')
    # or maybe you're only interested in goofs:
    # ia.update(movie, 'goofs')
    # get the XML representation, and do what you want with it.
    xml = movie.asXML()
    # YOUR XML PROCESSING HERE

You better put all the code inside the 'for' cycle in a try/except clause,
since a lot of things can go wrong... just skip to the next item.
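
The "skip to the next item" part can be sketched like this. export_all is a hypothetical helper that knows nothing about IMDbPY: it takes any function that returns the XML for a given id, so in practice you would pass a small wrapper around ia.get_movie(idx).asXML().

```python
def export_all(fetch_xml, first_id, last_id, handle):
    """Call handle(idx, xml) for every id; return the ids that failed.

    fetch_xml and handle are supplied by the caller (hypothetical names).
    """
    failed = []
    for idx in range(first_id, last_id + 1):
        try:
            xml = fetch_xml(idx)
        except Exception as exc:
            # A lot of things can go wrong: note it and move on.
            failed.append((idx, exc))
            continue
        handle(idx, xml)
    return failed
```

Keeping the list of failed ids makes it possible to retry only those later, instead of restarting the whole (slow) run.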

A recent suggestion that I want to implement is a JSON output,
but I haven't had time to look at it yet. :-(


HTH

-- 
Davide Alberani davide.alber...@gmail.com  [PGP KeyID: 0x465BFD47]
http://www.mimante.net/



Re: [Imdbpy-help] help running imdbpy2sql

2012-05-24 Thread Davide Alberani
On Thu, May 24, 2012 at 11:14 PM, Brian Abelson
br...@harmony-institute.org wrote:

 4. Downloaded the stable version of IMDbPY  - 4.8.2

If possible, use the development version:
  https://bitbucket.org/alberanid/imdbpy/

 Traceback (most recent call last):
   File "bin/imdbpy2sql.py", line 34, in <module>
     from imdb.parser.sql.dbschema import *
 ImportError: No module named imdb.parser.sql.dbschema

You can't use the downloaded files without installing them.
You can try to run, from the main directory of the IMDbPY
package, this:
  python setup.py install

it should take care of all the dependencies, provided that you
have a working environment to compile the needed libraries
and/or that they are pre-packaged for MacOS X (I don't know,
since I don't use it).

Basically, the point is:
- try to install it (it will try to fetch some dependencies and
compile some modules)
- if some dependency fails, try to satisfy it manually.
- if there's no way and/or the compile step stops you, see the options shown
  by the setup.py file to exclude some pieces of IMDbPY.

It would be cool to have an auto-installer for Mac OS X or at least
the steps to follow to manage the installation of python modules;
is anyone out there expert on the subject? :-)


HTH,

-- 
Davide Alberani davide.alber...@gmail.com  [PGP KeyID: 0x465BFD47]
http://www.mimante.net/



[Imdbpy-help] workaround for imdbpy2sql.py slowness

2012-05-27 Thread Davide Alberani
Hi all,
lately I've got some reports about a huge slowdown of the imdbpy2sql.py script,
especially on Ubuntu 12.04, with both MySQL and Postgresql.  Maybe even
other databases.

I'm not too sure about what the problem is, but the workaround seems easy:
do not use the default SQLObject, but install SQLAlchemy and add to the
command line this argument: -o sqlalchemy

If someone knows about the reason and has any idea about a real fix (maybe
it's just an option that must be added to recent versions of SQLObject), please
let me know.



-- 
Davide Alberani davide.alber...@gmail.com  [PGP KeyID: 0x465BFD47]
http://www.mimante.net/



[Imdbpy-help] IMDbPY 4.9 released

2012-06-16 Thread Davide Alberani
Hi all,
I've just released IMDbPY 4.9, containing the fix for search queries and
all the latest fixes and improvements.
It can be downloaded from http://imdbpy.sf.net/

As usual, let me know of any problems.

Enjoy, and thanks to everyone for the help!

-- 
Davide Alberani davide.alber...@gmail.com  [PGP KeyID: 0x465BFD47]
http://www.mimante.net/



Re: [Imdbpy-help] imdbpy2sql.py problem on WINDOWS xp--hanging on FLUSHING MoviesCache

2012-06-30 Thread Davide Alberani
On Sat, Jun 30, 2012 at 9:31 AM, qiuxia lu luqiu...@gmail.com wrote:

 I am running imdbpy2sql.py using IMDbPY 4.9. It hangs on FLUSHING
 MoviesCache for hours, is that normal? The message is as follows:

Hi,
it seems to be a known problem with recent versions of SQLObject.

Until we find a solution, as a workaround you can install SQLAlchemy and
use it by adding the -o sqlalchemy argument to the command line.

I think I'll add a warning about SQLObject in the code...

Let us know if it fixes your problem.


-- 
Davide Alberani davide.alber...@gmail.com  [PGP KeyID: 0x465BFD47]
http://www.mimante.net/



Re: [Imdbpy-help] About imdbPY plugin.

2012-07-13 Thread Davide Alberani
On Tue, Jul 10, 2012 at 4:09 PM, Venom Panic venom.pa...@yahoo.com wrote:

 Is there a way to integrate  IMDbPy in Wordpress (as plugin, maybe or some)?

Not that I'm aware of.

 It's hard to do that, can you help me, please?

I don't know how hard it can be... I guess it should be quite easy, given
how many wordpress plugins exist. :-)

If you don't find any help here, I suggest asking on some WordPress
development mailing lists.

Good luck and let us know if you release it publicly!

-- 
Davide Alberani davide.alber...@gmail.com  [PGP KeyID: 0x465BFD47]
http://www.mimante.net/
