Re: [Tutor] design advise

Dave Angel Thu, 27 Aug 2009 09:29:55 -0700

Alan Gauld wrote:

<div class="moz-text-flowed" style="font-family: -moz-fixed">
<[email protected]> wrote
The thing that bothers me is that I ma have 10 users or 100,000 usersand really wanted to get an opinion as to which option would scalebetter, leaving aside the relational DB approach.
If you have to cater for 100,000 users all with different views on acommon set of resources I don;t think you can afford to leave asidethe database approach! Almost anything else will run like a dog with abroken leg...
10 users with 100,000 resources would be fine but 100,000 users
will be a problem if you try to use the filesystem as an organising tool.
The trick to using the database is to build the relationships in thedatabase but keep the resources in the filesystem. You can then querythe database for which resources to display then access the resourcesdirectly from disk using their filenames etc
HTH,

+1 you need a database to keep track of 100,000 users. Scaling iswhat databases do best.

That doesn't mean you necessarily need a "real database" yet. But ifyou start with a database-compatible approach, then you'll be able toscale it into a database when the users grows enough.

However, whether or not you use a database, you still have to design theinteractions. If these filenames have to be unique, then it'd be quitedifficult to check that if they were all in separate directories. Soadding a new file would require that you check in all the directories tomake sure the selected name is unique. So updating a single file wouldbe very slow. Cure for that is either to put them all centrally, ormake the name arbitrary. This is equivalent in database terms to usingan ID (integer) abstraction to identify people, since their name fieldmight not be unique.

If you're trying to use the file system as your database, you have toconsider three tradeoffs:

1) how to make sure things are self-consistent, according to whateverthe business rules are.

2) how to minimize access time for more than one kind of query

3) how to reconstruct things when something goes wrong (which it will).And of course you have to decide which problems are to be recoverableand which ones are catastrophic.



DaveA
_______________________________________________
Tutor maillist  -  [email protected]
http://mail.python.org/mailman/listinfo/tutor

Re: [Tutor] design advise

Reply via email to