Re: [Imdbpy-help] imdbpy to mysql help

2013-02-24 Thread D L



> Date: Sun, 24 Feb 2013 11:52:45 +0100
> Subject: Re: [Imdbpy-help] imdbpy to mysql help
> From: davide.alber...@gmail.com
> To: dlm...@hotmail.com
> CC: imdbpy-help@lists.sourceforge.net
> 
> On Sun, Feb 24, 2013 at 12:32 AM, D L  wrote:
> >
> > Ok, well here's an update. I just let the foreign keys run for a little over
> > a full day and it actually completed for mysql:
> > # TIME FINAL : 1883min, 1sec (wall) 23min, 57sec (user) 0min, 5sec (system)
> 
> I see.
> I've just run it with a subset of the db (1% taken from each file) and
> my numbers are:
> # TIME TOTAL TIME TO INSERT/WRITE DATA : 12min, 18sec (wall) 5min,
> 23sec (user) 0min, 43sec (system)
> building database indexes (this may take a while)
> # TIME createIndexes() : 1min, 25sec (wall) 0min, 0sec (user) 0min,
> 0sec (system)
> adding foreign keys (this may take a while)
> # TIME createForeignKeys() : 10min, 2sec (wall) 0min, 0sec (user)
> 0min, 0sec (system)
> RESTORING imdbIDs values for movies... DONE! (restored 0 entries out of 0)
> # TIME restore movies : 0min, 0sec (wall) 0min, 0sec (user) 0min, 0sec 
> (system)
> RESTORING imdbIDs values for people... DONE! (restored 0 entries out of 0)
> # TIME restore people : 0min, 0sec (wall) 0min, 0sec (user) 0min, 0sec 
> (system)
> RESTORING imdbIDs values for characters... DONE! (restored 0 entries out of 0)
> # TIME restore characters : 0min, 0sec (wall) 0min, 0sec (user) 0min,
> 0sec (system)
> RESTORING imdbIDs values for companies... DONE! (restored 0 entries out of 0)
> # TIME restore companies : 0min, 0sec (wall) 0min, 0sec (user) 0min,
> 0sec (system)
> # TIME FINAL : 23min, 45sec (wall) 5min, 23sec (user) 0min, 43sec (system)
> 
> What kind of CPU/RAM/disk have you used?

I'm doing it on a laptop that has an i3 2.53GHZ, 4GB DDR3, with about 200gb of 
space (not SSD). I'm planning to just try to get everything working on my 
laptop before I buy web hosting to put it on there.


> > One of my main questions  right now is the difference in results between the
> > web search and the sql search. For example, if I ran a search on all the
> > movies that Denzel Washington has acted in via the web search, it basically
> > outputs all  the main ones,
> 
> Yep, they are just grouped in a different way.
> It would be not easy for us (even if it's not impossible, I guess) to identify
> alle the various categories used on the web and the rules used to categorize
> the movies, but...
> 
> For the moment, I think you could take the whole filmography and search
> for tv series and/or movies in which an actor is playing Himself (or anything
> that starts with Himself/Herself/Themselves)

Yeah, but filtering that may require even more processing time..?

> > And I haven't tested it that much, but it appears that sqlite and mysql have
> > roughly the same speeds in running these queries, but I'm not completely
> > sure yet.
> 
> I expect them to be comparable in speed, but not to be slower than a
> web search. :-/

They may (hopefully) be faster once I get it up on a web hosting machine 
instead of my laptop. 

> --
> Davide Alberani   [PGP KeyID: 0x465BFD47]
> http://www.mimante.net/
  --
Everyone hates slow websites. So do we.
Make your web apps faster with AppDynamics
Download AppDynamics Lite for free today:
http://p.sf.net/sfu/appdyn_d2d_feb___
Imdbpy-help mailing list
Imdbpy-help@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/imdbpy-help


Re: [Imdbpy-help] imdbpy to mysql help

2013-02-24 Thread Davide Alberani
On Sun, Feb 24, 2013 at 12:32 AM, D L  wrote:
>
> Ok, well here's an update. I just let the foreign keys run for a little over
> a full day and it actually completed for mysql:
> # TIME FINAL : 1883min, 1sec (wall) 23min, 57sec (user) 0min, 5sec (system)

I see.
I've just run it with a subset of the db (1% taken from each file) and
my numbers are:
# TIME TOTAL TIME TO INSERT/WRITE DATA : 12min, 18sec (wall) 5min,
23sec (user) 0min, 43sec (system)
building database indexes (this may take a while)
# TIME createIndexes() : 1min, 25sec (wall) 0min, 0sec (user) 0min,
0sec (system)
adding foreign keys (this may take a while)
# TIME createForeignKeys() : 10min, 2sec (wall) 0min, 0sec (user)
0min, 0sec (system)
RESTORING imdbIDs values for movies... DONE! (restored 0 entries out of 0)
# TIME restore movies : 0min, 0sec (wall) 0min, 0sec (user) 0min, 0sec (system)
RESTORING imdbIDs values for people... DONE! (restored 0 entries out of 0)
# TIME restore people : 0min, 0sec (wall) 0min, 0sec (user) 0min, 0sec (system)
RESTORING imdbIDs values for characters... DONE! (restored 0 entries out of 0)
# TIME restore characters : 0min, 0sec (wall) 0min, 0sec (user) 0min,
0sec (system)
RESTORING imdbIDs values for companies... DONE! (restored 0 entries out of 0)
# TIME restore companies : 0min, 0sec (wall) 0min, 0sec (user) 0min,
0sec (system)
# TIME FINAL : 23min, 45sec (wall) 5min, 23sec (user) 0min, 43sec (system)

What kind of CPU/RAM/disk have you used?

> One of my main questions  right now is the difference in results between the
> web search and the sql search. For example, if I ran a search on all the
> movies that Denzel Washington has acted in via the web search, it basically
> outputs all  the main ones,

Yep, they are just grouped in a different way.
It would be not easy for us (even if it's not impossible, I guess) to identify
alle the various categories used on the web and the rules used to categorize
the movies, but...

For the moment, I think you could take the whole filmography and search
for tv series and/or movies in which an actor is playing Himself (or anything
that starts with Himself/Herself/Themselves)

> And I haven't tested it that much, but it appears that sqlite and mysql have
> roughly the same speeds in running these queries, but I'm not completely
> sure yet.

I expect them to be comparable in speed, but not to be slower than a
web search. :-/

--
Davide Alberani   [PGP KeyID: 0x465BFD47]
http://www.mimante.net/

--
Everyone hates slow websites. So do we.
Make your web apps faster with AppDynamics
Download AppDynamics Lite for free today:
http://p.sf.net/sfu/appdyn_d2d_feb
___
Imdbpy-help mailing list
Imdbpy-help@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/imdbpy-help