Thank you, for your response!
I'm testing with exactly that table structure to evaluate flexibility vs
speed. I only use images, as it gives me easy a vast amount of sample date.
I'm testing with 30,000 images. :-)
So far I was testing multi-table joins in various combinations, trying to
help mysql's optimization routines using IN() etc.
But comparing for 2 attributes costs 1.8 seconds/ query with 7 attributes
per image and 30,000 images.
Not acceptable. ;-)
Views without conditions on the values resulted in no speed advantages.
(figures ... )
I see chances, that temporary tables will speed things up a bit. I'm testing
Another attempt could be to switch DB engines from InnoDB to MyISAM ... I'll
see what it results in. :D.
This is just a private test on a sandbox machine. I earn my living flying
airliners, not programming.
Thanks and Regards,
From: joerg.bru...@sun.com [mailto:joerg.bru...@sun.com]
Sent: Wednesday, June 16, 2010 2:49 PM
To: MYSQL General List
Cc: Jan Reiter
Subject: Re: [PHP] SQL Syntax
Daniel Brown wrote:
> You'll probably have much better luck on the MySQL General list.
> CC'ed on this email.
> On Tue, Jun 15, 2010 at 20:58, Jan Reiter <the-fal...@gmx.net> wrote:
>> Hi folks!
>> I have 2 tables. Table A containing 2 fields. A user ID and a picture ID
>> A(uid,pid) and another table B, containing 3 fields. The picture ID, an
>> attribute ID and a value for that attribute => B(pid,aid,value).
>> Table B contains several rows for a single PID with various AIDs and
>> Each AID is unique to a PID. (e.g. AID = 1 always holding the value for
>> image size and AID = 3 always holding a value for the image type)
>> The goal is now to join table A on table B using pid, and selecting the
>> based on MULTIPLE attributes.
>> So the result should only contain rows for images, that relate to an
>> attribute ID = 1 (size) that is bigger than 100 AND!!!!!!! an attribute
>> 5 that equals 'jpg'.
You need to do a multi-table join, table A joined to one instance of
table B for each attribute relevant to your search.
Roughly, syntax not tested, it is something like
SELECT a.uid, a.pid FROM a JOIN b AS b1 ON a.pid=b1.pid
JOIN b AS b2 ON a.pid=b2.pid
WHERE b1.aid = 1 AND b1.value > 100
AND b2.aid = 3 AND b2.value = 5
(assuming 'jpg' is coded as 5, what I take from your text).
Now, I see some difficulties with this:
1) You are using the "value" column for anything, that may cause data
2) AFAIR, there was a post recently claiming the alias names (b1, b2,
...) could not be used in WHERE conditions, and the recommendation
was to replace WHERE by HAVING.
3) If you need to support many attributes in one search, the number of
tables joined grows, and the amount of data to handle (cartesian
product!) will explode.
What works fine with 3 criteria on 10 pictures (10 * 10 * 10 = 1000)
may totally fail with 4 criteria on 200 pictures
(200**4 = 800.000.000 = 800 million)
4) The more different attributes you store per picture, the larger your
table B will become, and this will make the data grow for each join
If you store 4 attributes each for 200 pictures, table B will already
have 800 entries. In itself, that isn't much, but now the 4-way join
will produce a cartesian product of
800**4 = 8**4 * 100**4 = 4096 * 100.000.000 = 409.600.000.000
In your place, I would use a separate table for attributes which are
expected to be defined for all pictures, like size and image type.
Then your general attributes table B will hold much fewer rows, thus
each join step will profit.
5) Because of that explosion, it may be better to work with a temporary
table, joining it to B for one attribute and thus reducing the data,
then looping over such a step for all the relevant attributes.
Good luck in experimenting!
Joerg Bruehe, MySQL Build Team, joerg.bru...@sun.com
Sun Microsystems GmbH, Komturstrasse 18a, D-12099 Berlin
Geschaeftsfuehrer: Juergen Kunz
Amtsgericht Muenchen: HRB161028
PHP General Mailing List (http://www.php.net/)
To unsubscribe, visit: http://www.php.net/unsub.php