Re: A common request

Gregory Magarshak Tue, 29 Mar 2011 11:28:08 -0700

Yes, this would be fine. But often, the list of friends is obtained froma social network like facebook, and is not stored internally. Basically,I obtain the friend list in a request to facebook, and then see which ofthose users have created things. So would I have to create a temporarytable and insert all those uids just to make a join? Why not optimizethe IN ( ... ) to do the same type of thing?

There is also a second problem: I want to use MySQL Cluster, because Iexpect to have many users. Would it be efficient to use JOIN between thefriends table and the articles table? Both tables are partitioned byuser_id as the primary key, so the join would have to hit many differentnodes. I always tried to avoid joins because I am planning tohorizontally partition my data. But if MySQL cluster can handle thisjoin transparently and split it up based on the partition, then that'sfine. Do you have any info on this?


Greg

On 3/29/11 2:10 PM, Peter Brawley wrote:

> How can I quickly find all the articles written by this user'sfriends, and not just random articles?
Taking the simplest possible case, with table friends(userID,friendID)where each friendID refers to a userID in another row, the friends ofuserID u are ...
select friendID from user where userID=u;

so articles by those friends of u are ...
select a.* from article a join ( select friendID from user whereuserID=u ) f on a.userID=f.friendID;
PB

-----

On 3/29/2011 12:50 PM, Gregory Magarshak wrote:
Hey there. My company writes a lot of social applications, and thereis one operation that is very common, but I don't know if MySQLsupports it in a good way. I thought I'd write to this list for tworeasons:
1) Maybe MySQL has a good way to do this, and I just don't knowabout it
2) Propose to MySQL developers a simple algorithm which wouldgreatly improve MySQL support for social networking apps.
Here is the situation. Let's say I have built a social networkingapplication where people create and edit some item (article, photo,music mix, whatever). Now, a typical user logs in, and this user has3000 friends. How can I quickly find all the articles written by thisuser's friends, and not just random articles?
    Ideally, I would want to write something like this:
SELECT * FROM article WHERE user_id IN (345789, 324875, 398, ...,349580)
basically, execute a query with a huge IN ( ... ). Maybe if thiswould exceed the buffer size for the MySQL wire protocol, I wouldbreak up the list into several lists, and execute several queries,and union the results together myself.
But my point is, this is very common for social networking apps.Every app wants to show "the X created by your friends", or "friendsof yours (given some list from a social network) who have takenaction X".
Here is how I would do it if I had raw access to the MySQL indexin memory:
    a) Sort the list of entries in the IN, in ascending order.
b) Do *ONE* binary search through the index (assuming it's aBTREE index) and get them all in one pass. If it's a HASH index orsomething, I would have to look up each one individually.
The benefits of this approach would be that this common operationwould be done extremely quickly. If the index fits entirely inmemory, and I just want to get the primary keys (i.e. get the list offriends who did X), the disk isn't even touched. In addition, forBTREE indexes, I would just need ONE binary search, because theentries have been sorted in ascending order.
Does MySQL have something like this? And if not, perhaps you canadd it in the next version? It would really boost MySQL's support forsocial networking apps tremendously. Alternative, how can I add thisto my MySQL? Any advice would be appreciated.
Sincerely,
Gregory Magarshak
Qbix



--
MySQL General Mailing List
For list archives: http://lists.mysql.com/mysql
To unsubscribe:    http://lists.mysql.com/[email protected]

Re: A common request

Reply via email to