Re: [9fans] threads vs forks
On Fri, 06 Mar 2009 12:38:57 PST David Leimbach leim...@gmail.com wrote: Things like Clojure, or Scala become a bit more interesting when the VM is extended to allow tail recursion to happen in a nice way. A lack of TCO is not something that will prevent you from writing many interesting programs (except things like a state machine as a set of mutually calling functions!). There is nothing in Clojure, or C for that matter, that will disallow tail call optimization should an implementation provide it. It is just that, unlike Scheme, most programming languages do not *mandate* that tail calls be optimized.
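Without TCO, a state machine written as mutually calling functions will eventually overflow the stack; one standard workaround is a trampoline. A minimal sketch in Python, which famously lacks TCO (the state functions here are invented for illustration):

```python
# Two mutually recursive "states". Instead of calling each other
# directly (which would grow the stack without TCO), each state
# returns a thunk for the trampoline to invoke.
def even_state(n):
    if n == 0:
        return True
    return lambda: odd_state(n - 1)

def odd_state(n):
    if n == 0:
        return False
    return lambda: even_state(n - 1)

def trampoline(thunk):
    # Keep invoking thunks until a non-callable result appears;
    # the stack never grows deeper than one frame per state call.
    while callable(thunk):
        thunk = thunk()
    return thunk

# Far deeper than CPython's default recursion limit (~1000):
print(trampoline(lambda: even_state(100000)))  # True
```

The thunks trade direct calls for an explicit loop, which is essentially what a TCO-capable implementation would do for you.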
Re: [9fans] threads vs forks
On Sat Mar 7 01:02:31 EST 2009, j...@eecs.harvard.edu wrote: On Fri, Mar 06, 2009 at 10:31:59PM -0500, erik quanstrom wrote: it's interesting to note that the quoted mtbf numbers for ssds are within a factor of 2 of enterprise hard drives. if one considers that one needs ~4 ssds to cover the capacity of 1 hard drive, the quoted mtbf/byte is worse for ssd. That's only if you think of flash as a direct replacement for disk. i think that's why they put them in a 2.5 form factor with a standard SATA interface. what are you thinking of? SSDs are expensive on a $/MB basis compared to disks. The good ones not as much as you think. a top-drawer 15k sas drive is on the order of 300GB and $350+. the intel ssd is only twice as much. if you compare the drives supported by the big-iron vendors, intel ssd already has cost parity. For short-lived data you only need go over the I/O bus twice vs. three times for most NVRAMs based on battery-backed DRAM. i'm missing something here. what are your assumptions on how things are connected? also, isn't there an assumption that you don't want to be writing short-lived data to flash if possible? - erik
Re: [9fans] threads vs forks
On Sat Mar 7 09:39:38 EST 2009, j...@eecs.harvard.edu wrote: On Sat, Mar 07, 2009 at 08:58:42AM -0500, erik quanstrom wrote: i think that's why they put them in a 2.5 form factor with a standard SATA interface. what are you thinking of? No, the reason they do that is for backwards compatibility. it's kind of funny to call sata backwards compatibility. if things go as you suggest — pcie connected, i think we'll all long for the day when we could write one driver per hba rather than one driver per storage device. new boss, same as the old boss. SSDs are expensive on a $/MB basis compared to disks. The good ones not as much as you think. a top-drawer 15k sas drive is on the order of 300GB and $350+. the intel ssd is only twice as much. if you compare the drives supported by the big-iron vendors, intel ssd already has cost parity. The Intel SSD is cheap and slow :-) pick a lane! first you argued that they are expensive. ☺ Take a gander at the NetApp NAS filers or DataDomain restorers. so you're saying that these machines don't differentiate between primary cache and their write log (or whatever they call it)? My point isn't that it is a bad idea, just that it isn't likely to provide enough business to keep manufacturers interested. Moreover, for capacity disks will keep on winning for a long time. They just start to look more and more like tape. no. i agree. worm storage in general is not a popular topic, but the few companies that do use it pay the big bucks for it. it's always great when the backup media is less reliable than the primary media. - erik
Re: [9fans] threads vs forks
Clojure is definitely something that I would like to play with extensively. Looks very promising from the outset, so the only question that I have is how does it feel when used for substantial things. Thanks, Roman. P.S. My belief in it was actually reaffirmed by a raving endorsement it got from an old LISP community. Those guys are a bit like 9fans, if you know what I mean ;-) On Tue, 2009-03-03 at 10:38 -0800, Bakul Shah wrote: On Tue, 03 Mar 2009 10:11:10 PST Roman V. Shaposhnik r...@sun.com wrote: On Tue, 2009-03-03 at 07:19 -0800, David Leimbach wrote: My knowledge on this subject is about 8 or 9 years old, so check with your local Python guru The last I'd heard about Python's threading is that it was cooperative only, and that you couldn't get real parallelism out of it. It serves as a means to organize your program in a concurrent manner. In other words no two threads run at the same time in Python, even if you're on a multi-core system, due to something they call a Global Interpreter Lock. I believe the GIL is as present in Python nowadays as ever. On a related note: does anybody know any sane interpreted languages with a decent threading model to go along? Stackless Python is the only thing that I'm familiar with in that department. Depends on what you mean by sane interpreted language with a decent threading model and what you want to do with it but check out www.clojure.org. Then there is Erlang. Its wikipedia entry has this to say: Although Erlang was designed to fill a niche and has remained an obscure language for most of its existence, it is experiencing a rapid increase in popularity due to increased demand for concurrent services, inferior models of concurrency in most mainstream programming languages, and its substantial libraries and documentation.[7][8] Well-known applications include Amazon SimpleDB,[9] Yahoo! Delicious,[10] and the Facebook Chat system.[11]
Re: [9fans] threads vs forks
Things like Clojure, or Scala become a bit more interesting when the VM is extended to allow tail recursion to happen in a nice way. On Fri, Mar 6, 2009 at 10:47 AM, Roman V Shaposhnik r...@sun.com wrote: Clojure is definitely something that I would like to play with extensively. Looks very promising from the outset, so the only question that I have is how does it feel when used for substantial things. Thanks, Roman. P.S. My belief in it was actually reaffirmed by a raving endorsement it got from an old LISP community. Those guys are a bit like 9fans, if you know what I mean ;-) On Tue, 2009-03-03 at 10:38 -0800, Bakul Shah wrote: On Tue, 03 Mar 2009 10:11:10 PST Roman V. Shaposhnik r...@sun.com wrote: On Tue, 2009-03-03 at 07:19 -0800, David Leimbach wrote: My knowledge on this subject is about 8 or 9 years old, so check with your local Python guru The last I'd heard about Python's threading is that it was cooperative only, and that you couldn't get real parallelism out of it. It serves as a means to organize your program in a concurrent manner. In other words no two threads run at the same time in Python, even if you're on a multi-core system, due to something they call a Global Interpreter Lock. I believe the GIL is as present in Python nowadays as ever. On a related note: does anybody know any sane interpreted languages with a decent threading model to go along? Stackless Python is the only thing that I'm familiar with in that department. Depends on what you mean by sane interpreted language with a decent threading model and what you want to do with it but check out www.clojure.org. Then there is Erlang. 
Its wikipedia entry has this to say: Although Erlang was designed to fill a niche and has remained an obscure language for most of its existence, it is experiencing a rapid increase in popularity due to increased demand for concurrent services, inferior models of concurrency in most mainstream programming languages, and its substantial libraries and documentation.[7][8] Well-known applications include Amazon SimpleDB,[9] Yahoo! Delicious,[10] and the Facebook Chat system.[11]
Re: [9fans] threads vs forks
On Fri, 06 Mar 2009 10:47:20 PST Roman V Shaposhnik r...@sun.com wrote: Clojure is definitely something that I would like to play with extensively. Looks very promising from the outset, so the only question that I have is how does it feel when used for substantial things. You can browse various Clojure related google groups but there is only one way to find out if it is for you! P.S. My belief in it was actually reaffirmed by a raving endorsement it got from an old LISP community. Those guys are a bit like 9fans, if you know what I mean ;-) No comment :-)
Re: [9fans] threads vs forks
P.S. My belief in it was actually reaffirmed by a raving endorsement it got from an old LISP community. Those guys are a bit like 9fans, if you know what I mean ;-) You mean intelligent people who appreciate elegance? :) Sorry. Couldn't resist. BLS
Re: [9fans] threads vs forks
To be less flippant, what makes high performance flash difficult is the slow erasure time and large erasure blocks relative to the size of individual flash pages. Being full hurts since the flash is typically managed by a log-structured storage system with a garbage collector. Small random writes require updating the logical-physical mapping efficiently and crash-recoverably. You also need to do copy-on-write, which leads to what is commonly called write amplification, which reduces the usable number of writes. Small writes tend to exacerbate a lot of these problems. Where does all this fancy stuff belong? In the storage medium, in the HBA, in the device driver, in the file system, or in the application? it's interesting to note that the quoted mtbf numbers for ssds are within a factor of 2 of enterprise hard drives. if one considers that one needs ~4 ssds to cover the capacity of 1 hard drive, the quoted mtbf/byte is worse for ssd. the obvious conclusion is that if you think you need raid for hard drives, then you also need raid for ssds. at least if you believe the mtbf numbers. i think that it's a real good question where the fancy flash tricks belong. the naive guess would be that for backwards compatibility reasons, the media will get much of the smarts. - erik
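As a rough illustration of why being full hurts: under a crude first-order model of a log-structured FTL with greedy garbage collection (a toy model of my own, not vendor data), reclaiming a victim block whose pages are a fraction u live means rewriting those live pages, so each page of host data costs roughly 1/(1-u) physical page writes:

```python
def write_amplification(live_fraction):
    """Crude greedy-GC estimate: erasing a block whose pages are
    `live_fraction` live requires copying the live pages elsewhere,
    yielding only (1 - live_fraction) of a block of free space."""
    return 1.0 / (1.0 - live_fraction)

# The fuller the device, the more each host write is amplified,
# and the faster the usable erase budget is burned:
for u in (0.5, 0.8, 0.9):
    print("%.0f%% live -> ~%.0fx amplification" % (u * 100, write_amplification(u)))
```

Real devices reserve spare capacity precisely to keep the effective live fraction, and hence the amplification, down.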
Re: [9fans] threads vs forks
Where does all this fancy stuff belong? In the storage medium, in the HBA, in the device driver, in the file system, or in the application? In a very intelligent cache? Or did you mention that above and in my ignorance I missed it? OK, let's try this: . Storage medium: only the hardware developers have access to that and they have never seemed interested in matching anyone else's requirements or suggestions. . The HBA (?). If that's the device adapter, the same applies as above. . The device driver should not be very complex, and the block handling should hopefully be shared by more than one device driver, which with the effective demise of Streams is not a very easy thing to implement without resorting to jumping through flaming hoops. . The application? That's being facetious, surely? . A cache? As quanstro pointed out, flash makes a wonderful WORM. Now we need only get Fossil to work as originally intended, or a more suitable design and implementation to take its place in this role, and we have a winner. ++L
Re: [9fans] threads vs forks
Much of the intelligence actually resides in the device driver. It is that secret sauce that gets you good performance. In theory it could be pushed down, but it takes CPU, memory, and memory bandwidth that may not be cost effective there. That would entail a really intelligent controller, which brings us back to a cache, does it not, this time hidden inside a black box. I have been thinking that the obsession with SMP has a negative impact on diverse engineering where intelligent peripherals take over operations that are too slow or too demanding on the generic CPU. Smacks of AoE to me, with a lot more packed into the A. But I'm just an old software developer with a hobbyist interest in electronic engineering and my opinions are not backed by much research. ++L
Re: [9fans] threads vs forks
Sadly, if a WORM is your only application, then no one cares. At least not enough to pony up for real performance. The folks at places like Sandia are interested in running HPC applications and there are a lot of people in other industries such as big oil and finance that are willing to pay for performance for running HPC applications, VMs which tend to have high I/O requirements when an OS patch comes out, etc. ask not what a technology can do for the world, ask what a technology can do for you! - erik
Re: [9fans] threads vs forks
That's a fact. If you have access to ACM Queue, check out p16-cantrill-concurrency.pdf (Cantrill and Bonwick on concurrency). Or you can rely on one of the hackish attempts at email attachment management or whatever conceptual error led to this: https://agora.cs.illinois.edu/download/attachments/18744240/p16-cantrill.pdf?version=1 courtesy of a google datacentre near you
Re: [9fans] threads vs forks
John Barham wrote: On Tue, Mar 3, 2009 at 3:52 AM, hugo rivera uai...@gmail.com wrote: I have to launch many tasks running in parallel (~5000) in a cluster running linux. Each of the tasks performs some astronomical calculations and I am not quite sure if using fork is the best answer here. First of all, all the programming is done in python and c... Take a look at the multiprocessing package (http://docs.python.org/library/multiprocessing.html), newly introduced with Python 2.6 and 3.0: multiprocessing is a package that supports spawning processes using an API similar to the threading module. The multiprocessing package offers both local and remote concurrency, effectively side-stepping the Global Interpreter Lock by using subprocesses instead of threads. It should be a quick and easy way to set up a cluster-wide job processing system (provided all your jobs are driven by Python). Better: use parallelpython (www.parallelpython.org). Afaik multiprocessing is geared towards multi-core systems (one machine), while pp is also suitable for real clusters with more PCs. No special cluster software needed. It will start (here's your fork) a (some) python interpreters on each node, and then you can submit jobs to those 'workers'. The interpreters are kept alive between jobs, so the startup penalty becomes negligible when the number of jobs is large enough. Using it here to process massive amounts of satellite data, works like a charm. Vincent. It also looks like it's been (partially?) back-ported to Python 2.4 and 2.5: http://pypi.python.org/pypi/processing. John
Re: [9fans] threads vs forks
Thanks for the advice. Nevertheless, I am in no position to decide what pieces of software the cluster will run, I just have to deal with what I have, but anyway I can suggest other possibilities. 2009/3/4, Vincent Schut sc...@sarvision.nl: John Barham wrote: On Tue, Mar 3, 2009 at 3:52 AM, hugo rivera uai...@gmail.com wrote: I have to launch many tasks running in parallel (~5000) in a cluster running linux. Each of the tasks performs some astronomical calculations and I am not quite sure if using fork is the best answer here. First of all, all the programming is done in python and c... Take a look at the multiprocessing package (http://docs.python.org/library/multiprocessing.html), newly introduced with Python 2.6 and 3.0: multiprocessing is a package that supports spawning processes using an API similar to the threading module. The multiprocessing package offers both local and remote concurrency, effectively side-stepping the Global Interpreter Lock by using subprocesses instead of threads. It should be a quick and easy way to set up a cluster-wide job processing system (provided all your jobs are driven by Python). Better: use parallelpython (www.parallelpython.org). Afaik multiprocessing is geared towards multi-core systems (one machine), while pp is also suitable for real clusters with more PCs. No special cluster software needed. It will start (here's your fork) a (some) python interpreters on each node, and then you can submit jobs to those 'workers'. The interpreters are kept alive between jobs, so the startup penalty becomes negligible when the number of jobs is large enough. Using it here to process massive amounts of satellite data, works like a charm. Vincent. It also looks like it's been (partially?) back-ported to Python 2.4 and 2.5: http://pypi.python.org/pypi/processing. John -- Hugo
Re: [9fans] threads vs forks
hugo rivera wrote: Thanks for the advice. Nevertheless I am in no position to decide what pieces of software the cluster will run, I just have to deal with what I have, but anyway I can suggest other possibilities. Well, depends on how you define 'software the cluster will run'. Do you mean cluster management software, or really any program or script or python module that needs to be installed on each node? Because for pp, you won't need any cluster software. pp is just some python module and helper scripts. You *do* need to install this (pure python) module on each node, yes, but that's it, nothing else needed. Btw, you said 'it's a small cluster, about 6 machines'. Now I'm not an expert, but I don't think you can do threading/forking from one machine to another (on linux). So I suppose there already is some cluster management software involved? And while you appear to be in no position to decide what pieces of software the cluster will run, you might want to enlighten us on what this cluster /will/ run? Your best solution might depend on that... Cheers, Vincent.
Re: [9fans] threads vs forks
hugo rivera wrote: The cluster has torque installed as the resource manager. I think it runs on top of pbs (an older project). As far as I know now I just have to call a qsub command to submit my jobs on a queue, then the resource manager allocates a processor in the cluster for my process to run till it is finished. Well, I don't know either torque or pbs, but I'm guessing that when you submit a job, this job will be some program or script that is run on the allocated processor? If so, your initial question of forking vs threading is bogus. Your cluster manager will run (exec) your job, which if it is a python script will start a python interpreter for each job. I guess that's the overhead you get when running a flexible cluster system, flexible meaning that it can run any type of job (shell script, binary executable, python script, perl, etc.). However, your overhead of starting new python processes each time may seem significant when viewed in absolute terms, but if each job processes lots of data and takes, as you said, 5 min to run on a decent processor, don't you think the startup time for the python process would become non-significant? For example, on a decent machine here, the first time python takes 0.224 secs to start and shut down immediately, and consecutive starts take only about 0.009 secs because everything is still in memory. Let's take the 0.224 secs for a worst case scenario. That would be approx 0.075 percent of your job execution time. Now let's say you have 6 machines with 8 cores each and perfect scaling, all your jobs would take 6000 / (6*8) *5min = 625 minutes (10 hours 25 mins) without python starting each time, and 625 minutes and 28 seconds with python starting anew each job. Don't you think you could just live with these 28 seconds more? Just reading this message might already have taken you more than those 28 seconds... Vincent. And I am not really sure if I have access to all the nodes, so I can install pp on each one of them. 
2009/3/4, Vincent Schut sc...@sarvision.nl: hugo rivera wrote: Thanks for the advice. Nevertheless I am in no position to decide what pieces of software the cluster will run, I just have to deal with what I have, but anyway I can suggest other possibilities. Well, depends on how you define 'software the cluster will run'. Do you mean cluster management software, or really any program or script or python module that needs to be installed on each node? Because for pp, you won't need any cluster software. pp is just some python module and helper scripts. You *do* need to install this (pure python) module on each node, yes, but that's it, nothing else needed. Btw, you said 'it's a small cluster, about 6 machines'. Now I'm not an expert, but I don't think you can do threading/forking from one machine to another (on linux). So I suppose there already is some cluster management software involved? And while you appear to be in no position to decide what pieces of software the cluster will run, you might want to enlighten us on what this cluster /will/ run? Your best solution might depend on that... Cheers, Vincent.
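Vincent's back-of-the-envelope numbers above check out; spelling the arithmetic out (his measured 0.224 s startup, taken as worst case throughout):

```python
jobs = 6000
cores = 6 * 8            # 6 machines x 8 cores, assuming perfect scaling
job_min = 5.0            # minutes per job
startup = 0.224          # seconds per interpreter start, worst case

total = jobs / cores * job_min
print(total)             # 625.0 minutes, i.e. 10 h 25 min

# Startups also run 48-way parallel, so the added wall-clock time is:
extra = jobs * startup / cores
print(round(extra))      # 28 (seconds)

# Per job, startup is a tiny fraction of the 5-minute run:
print(round(startup / (job_min * 60) * 100, 3))   # 0.075 (percent)
```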
Re: [9fans] threads vs forks
you are right. I was totally confused at the beginning. Thanks a lot. 2009/3/4, Vincent Schut sc...@sarvision.nl: hugo rivera wrote: The cluster has torque installed as the resource manager. I think it runs on top of pbs (an older project). As far as I know now I just have to call a qsub command to submit my jobs on a queue, then the resource manager allocates a processor in the cluster for my process to run till it is finished. Well, I don't know either torque or pbs, but I'm guessing that when you submit a job, this job will be some program or script that is run on the allocated processor? If so, your initial question of forking vs threading is bogus. Your cluster manager will run (exec) your job, which if it is a python script will start a python interpreter for each job. I guess that's the overhead you get when running a flexible cluster system, flexible meaning that it can run any type of job (shell script, binary executable, python script, perl, etc.). However, your overhead of starting new python processes each time may seem significant when viewed in absolute terms, but if each job processes lots of data and takes, as you said, 5 min to run on a decent processor, don't you think the startup time for the python process would become non-significant? For example, on a decent machine here, the first time python takes 0.224 secs to start and shut down immediately, and consecutive starts take only about 0.009 secs because everything is still in memory. Let's take the 0.224 secs for a worst case scenario. That would be approx 0.075 percent of your job execution time. Now let's say you have 6 machines with 8 cores each and perfect scaling, all your jobs would take 6000 / (6*8) *5min = 625 minutes (10 hours 25 mins) without python starting each time, and 625 minutes and 28 seconds with python starting anew each job. Don't you think you could just live with these 28 seconds more? Just reading this message might already have taken you more than those 28 seconds... Vincent. 
And I am not really sure if I have access to all the nodes, so I can install pp on each one of them. 2009/3/4, Vincent Schut sc...@sarvision.nl: hugo rivera wrote: Thanks for the advice. Nevertheless I am in no position to decide what pieces of software the cluster will run, I just have to deal with what I have, but anyway I can suggest other possibilities. Well, depends on how you define 'software the cluster will run'. Do you mean cluster management software, or really any program or script or python module that needs to be installed on each node? Because for pp, you won't need any cluster software. pp is just some python module and helper scripts. You *do* need to install this (pure python) module on each node, yes, but that's it, nothing else needed. Btw, you said 'it's a small cluster, about 6 machines'. Now I'm not an expert, but I don't think you can do threading/forking from one machine to another (on linux). So I suppose there already is some cluster management software involved? And while you appear to be in no position to decide what pieces of software the cluster will run, you might want to enlighten us on what this cluster /will/ run? Your best solution might depend on that... Cheers, Vincent. -- Hugo
Re: [9fans] threads vs forks
What about xcpu? On Wed, Mar 4, 2009 at 12:33 PM, hugo rivera uai...@gmail.com wrote: you are right. I was totally confused at the beginning. Thanks a lot. 2009/3/4, Vincent Schut sc...@sarvision.nl: hugo rivera wrote: The cluster has torque installed as the resource manager. I think it runs on top of pbs (an older project). As far as I know now I just have to call a qsub command to submit my jobs on a queue, then the resource manager allocates a processor in the cluster for my process to run till it is finished. Well, I don't know either torque or pbs, but I'm guessing that when you submit a job, this job will be some program or script that is run on the allocated processor? If so, your initial question of forking vs threading is bogus. Your cluster manager will run (exec) your job, which if it is a python script will start a python interpreter for each job. I guess that's the overhead you get when running a flexible cluster system, flexible meaning that it can run any type of job (shell script, binary executable, python script, perl, etc.). However, your overhead of starting new python processes each time may seem significant when viewed in absolute terms, but if each job processes lots of data and takes, as you said, 5 min to run on a decent processor, don't you think the startup time for the python process would become non-significant? For example, on a decent machine here, the first time python takes 0.224 secs to start and shut down immediately, and consecutive starts take only about 0.009 secs because everything is still in memory. Let's take the 0.224 secs for a worst case scenario. That would be approx 0.075 percent of your job execution time. Now let's say you have 6 machines with 8 cores each and perfect scaling, all your jobs would take 6000 / (6*8) *5min = 625 minutes (10 hours 25 mins) without python starting each time, and 625 minutes and 28 seconds with python starting anew each job. Don't you think you could just live with these 28 seconds more? 
Just reading this message might already have taken you more than those 28 seconds... Vincent. And I am not really sure if I have access to all the nodes, so I can install pp on each one of them. 2009/3/4, Vincent Schut sc...@sarvision.nl: hugo rivera wrote: Thanks for the advice. Nevertheless I am in no position to decide what pieces of software the cluster will run, I just have to deal with what I have, but anyway I can suggest other possibilities. Well, depends on how you define 'software the cluster will run'. Do you mean cluster management software, or really any program or script or python module that needs to be installed on each node? Because for pp, you won't need any cluster software. pp is just some python module and helper scripts. You *do* need to install this (pure python) module on each node, yes, but that's it, nothing else needed. Btw, you said 'it's a small cluster, about 6 machines'. Now I'm not an expert, but I don't think you can do threading/forking from one machine to another (on linux). So I suppose there already is some cluster management software involved? And while you appear to be in no position to decide what pieces of software the cluster will run, you might want to enlighten us on what this cluster /will/ run? Your best solution might depend on that... Cheers, Vincent. -- Hugo
Re: [9fans] threads vs forks
On Wed, Mar 4, 2009 at 2:30 AM, Vincent Schut sc...@sarvision.nl wrote: hugo rivera wrote: Now I'm not an expert, but I don't think you can do threading/forking from one machine to another (on linux). You can with bproc, but it's not supported past 2.6.21 or so. ron
Re: [9fans] threads vs forks
On Tue, 2009-03-03 at 23:24 -0600, blstu...@bellsouth.net wrote: it's interesting that parallel wasn't cool when chips were getting noticeably faster rapidly. perhaps the focus on parallelization is a sign there aren't any other ideas. Gotta do something with all the extra transistors. After all, Moore's law hasn't been repealed. And pipelines and traditional caches are pretty good examples of diminishing returns. So multiple cores seem a pretty straightforward approach. Our running joke circa '05 was that the industry was suffering from the transistor overproduction crisis. One only needs to look at other overproduction crises (especially in the food industry) to appreciate the similarities. Now there is another use that would at least be intellectually interesting and possibly useful in practice. Use the transistors for a really big memory running at cache speed. But instead of it being a hardware cache, manage it explicitly. In effect, we have a very high speed main memory, and the traditional main memory is backing store. It'd give a use for all those paging algorithms that aren't particularly justified at the main memory-disk boundary any more. And you can fit a lot of Plan 9 executable images in a 64MB on-chip memory space. Obviously, it wouldn't be a good fit for severely memory-hungry apps, and it might be a dead end overall, but it'd at least be something different... One could argue that the transactional memory model is supposed to be exactly that. Thanks, Roman.
Re: [9fans] threads vs forks
On Wed, Mar 4, 2009 at 12:50 AM, erik quanstrom quans...@quanstro.net wrote: Both AMD and Intel are looking at I/O because it is and will be a limiting factor when scaling to higher core counts. i/o starts sucking wind with one core. that's why we differentiate i/o from everything else we do. And soon hard disk latencies are really going to start hurting (they already are hurting some, I'm sure), and I'm not convinced of the viability of SSDs. i'll assume you mean throughput. hard drive latency has been a big deal for a long time. tanenbaum integrated knowledge of track layout into his minix elevator algorithm. Yes, sorry. i think the gap between cpu performance and hd performance is narrowing, not getting wider. i don't have accurate measurements on how much real-world performance difference there is between a core i7 and an intel 5000. it's generally not spectacular, clock-for-clock. on the other hand, when the intel 5000-series was released, the rule of thumb for a sata hd was 50mb/s. it's not too hard to find regular sata hard drives that do 110mb/s today. the ssd drives we've (coraid) tested have been spectacular --- reading at 200mb/s. if you want to talk latency, ssds can deliver 1/100th the latency of spinning media. there's no way that the core i7 is 100x faster than the intel 5000. For the costs (in terms of power and durability) hard drives are really a pain, not just for some of the companies I've talked to that are burning out terabyte drives in a matter of weeks, but for mere mortals as well. And I'm sorry but the performance of hard drives is *not* very good, despite it improving. Every time I do something on a large directory tree, my drive (which is a model from last year) grinds and moans and takes, IMO, too long to do things. Putting 4GB of RAM in my computer helped, but the buffering algorithms aren't psychic, so I still pay a penalty the first time I use certain directories. 
Now I haven't tested an SSD for performance, but I know they are better. If I got one, this problem would likely subside, but I'm not convinced that SSDs are durable enough, despite what the manufacturers say. I haven't seen many torture tests on them, but the fact that erasing a block destroys it a little bit is scary. I do a lot of sustained writes with my typical desktop workload over the same files, and I'd rather not trust them to something that is delicate enough to need filesystem algorithms to be optimized for so they don't wear out. I guess, in essence, I just want my flying car today. - erik
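To put a number on the latency point: a spinning disk pays seek plus rotational delay on every random read, flash pays neither. Using round figures of my own (illustrative, not benchmarks):

```python
# Illustrative round numbers, not measurements:
disk_latency = 5e-3     # ~5 ms per random read on a fast spinning disk
ssd_latency = 50e-6     # ~50 us per random read on flash

print(disk_latency / ssd_latency)   # ~100: the "1/100th the latency" figure
print(round(1 / disk_latency))      # ~200 random reads/s from the disk
print(round(1 / ssd_latency))       # ~20000 random reads/s from the SSD
```

Sequential throughput differs by maybe 2x; random-access rate differs by two orders of magnitude, which is why directory-tree grinding is exactly the workload where flash shines.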
Re: [9fans] threads vs forks
On Wed, Mar 4, 2009 at 8:52 AM, J.R. Mauro jrm8...@gmail.com wrote: Now I haven't tested an SSD for performance, but I know they are better. Well that I don't understand at all. Is this faith-based performance measurement? :-) I have a friend who is doing lots of SSD testing and they're not always better. For some cases, you pay a whole lot more for 2x greater throughput. it's not as simple as know they are better. If I got one, this problem would likely subside, but I'm not convinced that SSDs are durable enough, despite what the manufacturers say. I haven't seen many torture tests on them, but the fact that erasing a block destroys it a little bit is scary. I do a lot of sustained writes with my typical desktop workload over the same files, and I'd rather not trust them to something that is delicate enough to need filesystem algorithms to be optimized for so they don't wear out. in most cases wear leveling is not in the file system. It's in the hardware or in a powerpc that is in the SSD controller. It's worth your doing some reading here. That said, I sure would like to have a fusion IO card for venti. From what my friend is telling me the fusion card would be ideal for venti -- as long as we keep only the arenas on it. ron
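To make "wear leveling in the controller" concrete, here is a toy sketch (entirely invented, and far simpler than any real FTL) of the idea: the controller remaps logical blocks so that repeated writes to one logical address still spread erases across physical blocks, invisibly to the file system above.

```python
class ToyWearLeveler:
    """Toy flash translation layer: always place new data on the
    least-worn free physical block, remapping the logical address."""
    def __init__(self, nblocks):
        self.erases = [0] * nblocks   # per-physical-block erase count
        self.mapping = {}             # logical block -> physical block
        self.free = set(range(nblocks))

    def write(self, logical):
        old = self.mapping.get(logical)
        if old is not None:
            # Invalidate and erase the old copy (a real FTL defers
            # this to a garbage collector).
            self.erases[old] += 1
            self.free.add(old)
        # Pick the least-worn free block.
        phys = min(self.free, key=lambda b: self.erases[b])
        self.free.remove(phys)
        self.mapping[logical] = phys
        return phys

ftl = ToyWearLeveler(4)
# Hammering one logical block still cycles through physical blocks:
placements = [ftl.write(0) for _ in range(8)]
print(placements)
```

The file system sees one stable block address the whole time; the wear is evened out underneath it, which is ron's point about where the smarts live.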
Re: [9fans] threads vs forks
That said, I sure would like to have a fusion IO card for venti. From what my friend is telling me the fusion card would be ideal for venti -- as long as we keep only the arenas on it. even better for ken's fs. i would imagine the performance difference between the fusion i/o card and mass storage is similar to that between wrens and the jukebox. - erik
Re: [9fans] threads vs forks
On Wed, Mar 4, 2009 at 12:14 PM, ron minnich rminn...@gmail.com wrote: On Wed, Mar 4, 2009 at 8:52 AM, J.R. Mauro jrm8...@gmail.com wrote: Now I haven't tested an SSD for performance, but I know they are better. Well that I don't understand at all. Is this faith-based performance measurement? :-) No, I have seen several benchmarks. The benchmarks I haven't seen are ones for "how long does it take to actually break these drives?" from anyone other than the manufacturer. I have a friend who is doing lots of SSD testing and they're not always better. For some cases, you pay a whole lot more for 2x greater throughput. it's not as simple as know they are better. What types of things degrade their performance? I'm interested in seeing other data than the handful of benchmarks I've seen. I imagine writes would be the culprit since you have to erase a whole block first? If I got one, this problem would likely subside, but I'm not convinced that SSDs are durable enough, despite what the manufacturers say. I haven't seen many torture tests on them, but the fact that erasing a block destroys it a little bit is scary. I do a lot of sustained writes with my typical desktop workload over the same files, and I'd rather not trust them to something that is delicate enough to need filesystem algorithms to be optimized for so they don't wear out. in most cases wear leveling is not in the file system. It's in the hardware or in a powerpc that is in the SSD controller. It's worth your doing some reading here. I've seen a lot about optimizing the next-generation filesystems for flash. Despite the claims that the hardware-based solutions will be satisfactory, there are a lot of people interested in making existing filesystems smarter about SSDs, both for wear and for optimizing read/write. Beyond that, though, I feel very shaky just hearing the term "wear leveling."
I've had more flash-based devices fail on me than hard drives, but maybe I'm just crazy and the technology has gotten decent enough in the past couple years to allay my worrying. It would just be nice to see a bit stronger alternative being pushed as hard as SSDs. That said, I sure would like to have a fusion IO card for venti. From what my friend is telling me the fusion card would be ideal for venti -- as long as we keep only the arenas on it. ron
Re: [9fans] threads vs forks
On Wed, Mar 04, 2009 at 10:32:55PM -0500, J.R. Mauro wrote: What types of things degrade their performance? I'm interested in seeing other data than the handful of benchmarks I've seen. I imagine writes would be the culprit since you have to erase a whole block first? Being full. Small random writes, too, although much more so for run-of-the-mill SSDs than for FusionIO. [citation needed] - erik
[9fans] threads vs forks
Hi, this is not really a plan 9 question, but since you are the wisest guys I know I am hoping that you can help me. You see, I have to launch many tasks running in parallel (~5000) in a cluster running linux. Each of the tasks performs some astronomical calculations and I am not quite sure if using fork is the best answer here. First of all, all the programming is done in python and c, and since we are using the os.fork() python facility I think that it is somehow related to the underlying c fork (well, I really do not know much about forks in linux; the few things I do know about forks and threads I got from Francisco Ballesteros' Introduction to operating system abstractions). The point here is: should I use forks or threads to deal with the job at hand? I have heard that there are some problems if you fork too many processes (I am not sure how many are too many), so I am thinking of using threads. I know some basic differences between threads and forks, but I am not aware of the details of the implementation (probably I will never be). Finally, if this is a question that does not belong on the plan 9 mailing list, please let me know and I'll shut up. Saludos -- Hugo
Re: [9fans] threads vs forks
On Tue, Mar 3, 2009 at 3:52 AM, hugo rivera uai...@gmail.com wrote: Hi, this is not really a plan 9 question, but since you are the wisest guys I know I am hoping that you can help me. You see, I have to launch many tasks running in parallel (~5000) in a cluster running linux. Each of the task performs some astronomical calculations and I am not pretty sure if using fork is the best answer here. First of all, all the programming is done in python and c, and since we are using os.fork() python facility I think that it is somehow related to the underlying c fork (well, I really do not know much of forks in linux, the few things I do know about forks and threads I got them from Francisco Ballesteros' Introduction to operating system abstractions). My knowledge on this subject is about 8 or 9 years old, so check with your local Python guru. The last I'd heard about Python's threading is that it was cooperative only, and that you couldn't get real parallelism out of it. It serves as a means to organize your program in a concurrent manner. In other words, no two threads run at the same time in Python, even if you're on a multi-core system, due to something they call a Global Interpreter Lock. The point here is if I should use forks or threads to deal with the job at hand? I heard that there are some problems if you fork too many processes (I am not sure how many are too many) so I am thinking to use threads. I know some basic differences between threads and forks, but I am not aware of the details of the implementation (probably I will never be). Finally, if this is a question that does not belong to the plan 9 mailing list, please let me know and I'll shut up. Saludos I think you need to understand the system limits, which is something you can look up for yourself. Also you should understand what kind of runtime model threads in the language you're using actually implements. Those rules basically apply to any system. -- Hugo
Re: [9fans] threads vs forks
thanks a lot guys. I think I should study this issue in greater detail. It is not as easy as I thought it would be. 2009/3/3, David Leimbach leim...@gmail.com: On Tue, Mar 3, 2009 at 3:52 AM, hugo rivera uai...@gmail.com wrote: Hi, this is not really a plan 9 question, but since you are the wisest guys I know I am hoping that you can help me. You see, I have to launch many tasks running in parallel (~5000) in a cluster running linux. Each of the task performs some astronomical calculations and I am not pretty sure if using fork is the best answer here. First of all, all the programming is done in python and c, and since we are using os.fork() python facility I think that it is somehow related to the underlying c fork (well, I really do not know much of forks in linux, the few things I do know about forks and threads I got them from Francisco Ballesteros' Introduction to operating system abstractions). My knowledge on this subject is about 8 or 9 years old, so check with your local Python guru The last I'd heard about Python's threading is that it was cooperative only, and that you couldn't get real parallelism out of it. It serves as a means to organize your program in a concurrent manner. In other words no two threads run at the same time in Python, even if you're on a multi-core system, due to something they call a Global Interpreter Lock. The point here is if I should use forks or threads to deal with the job at hand? I heard that there are some problems if you fork too many processes (I am not sure how many are too many) so I am thinking to use threads. I know some basic differences between threads and forks, but I am not aware of the details of the implementation (probably I will never be). Finally, if this is a question that does not belong to the plan 9 mailing list, please let me know and I'll shut up. Saludos I think you need to understand the system limits, which is something you can look up for yourself.
Also you should understand what kind of runtime model threads in the language you're using actually implements. Those rules basically apply to any system. -- Hugo -- Hugo
Re: [9fans] threads vs forks
Python 'threads' are the same pthreads turds all other lunix junk uses. The only difference is that the interpreter itself is not threadsafe, so they have a global lock which means threads suck even more than usual. Forking a python interpreter is a *bad* idea, because python's start up takes billions of years. This has nothing to do with the merits of fork, and all with how much python sucks. There is Stackless Python, which has proper CSP threads/procs and channels, very similar to limbo. http://www.stackless.com/ But that is too sane for the mainline python folks obviously, so they stick to the pthreads turds, ... My advice: unless you can use Stackless, stay as far away as you can from any concurrent python stuff. (And don't get me started on twisted and their event based hacks). Oh, and as I mentioned in another thread, in my experience if you are going to fork, make sure you compile statically, dynamic linking is almost as evil as pthreads. But this is lunix, so what do you expect? uriel On Tue, Mar 3, 2009 at 4:19 PM, David Leimbach leim...@gmail.com wrote: On Tue, Mar 3, 2009 at 3:52 AM, hugo rivera uai...@gmail.com wrote: Hi, this is not really a plan 9 question, but since you are the wisest guys I know I am hoping that you can help me. You see, I have to launch many tasks running in parallel (~5000) in a cluster running linux. Each of the task performs some astronomical calculations and I am not pretty sure if using fork is the best answer here. First of all, all the programming is done in python and c, and since we are using os.fork() python facility I think that it is somehow related to the underlying c fork (well, I really do not know much of forks in linux, the few things I do know about forks and threads I got them from Francisco Ballesteros' Introduction to operating system abstractions).
My knowledge on this subject is about 8 or 9 years old, so check with your local Python guru The last I'd heard about Python's threading is that it was cooperative only, and that you couldn't get real parallelism out of it. It serves as a means to organize your program in a concurrent manner. In other words no two threads run at the same time in Python, even if you're on a multi-core system, due to something they call a Global Interpreter Lock. The point here is if I should use forks or threads to deal with the job at hand? I heard that there are some problems if you fork too many processes (I am not sure how many are too many) so I am thinking to use threads. I know some basic differences between threads and forks, but I am not aware of the details of the implementation (probably I will never be). Finally, if this is a question that does not belong to the plan 9 mailing list, please let me know and I'll shut up. Saludos I think you need to understand the system limits, which is something you can look up for yourself. Also you should understand what kind of runtime model threads in the language you're using actually implements. Those rules basically apply to any system. -- Hugo
Re: [9fans] threads vs forks
On Tue, Mar 3, 2009 at 3:52 AM, hugo rivera uai...@gmail.com wrote: You see, I have to launch many tasks running in parallel (~5000) in a cluster running linux. Each of the task performs some astronomical calculations and I am not pretty sure if using fork is the best answer here. lots of questions first: how many cluster nodes? how long do the jobs run? input files or args? output files? how big? You can't say much with the information you gave. ron
Re: [9fans] threads vs forks
2009/3/3, Uriel urie...@gmail.com: Oh, and as I mentioned in another thread, in my experience if you are going to fork, make sure you compile statically, dynamic linking is almost as evil as pthreads. But this is lunix, so what do you expect? not much. Wish I could get it done with plan 9. -- Hugo
Re: [9fans] threads vs forks
2009/3/3, ron minnich rminn...@gmail.com: lots of questions first . how many cluster nodes. how long do the jobs run. input files or args? output files? how big? You can't say much with the information you gave. It is a small cluster of 6 machines. I think each job runs for a few minutes (~5), takes some input files and generates a couple of files (I am not really sure how many output files each process generates). The size of the output files is ~1MB. -- Hugo
Re: [9fans] threads vs forks
On Tue, Mar 3, 2009 at 3:52 AM, hugo rivera uai...@gmail.com wrote: I have to launch many tasks running in parallel (~5000) in a cluster running linux. Each of the task performs some astronomical calculations and I am not pretty sure if using fork is the best answer here. First of all, all the programming is done in python and c... Take a look at the multiprocessing package (http://docs.python.org/library/multiprocessing.html), newly introduced with Python 2.6 and 3.0: multiprocessing is a package that supports spawning processes using an API similar to the threading module. The multiprocessing package offers both local and remote concurrency, effectively side-stepping the Global Interpreter Lock by using subprocesses instead of threads. It should be a quick and easy way to set up a cluster-wide job processing system (provided all your jobs are driven by Python). It also looks like it's been (partially?) back-ported to Python 2.4 and 2.5: http://pypi.python.org/pypi/processing. John
Re: [9fans] threads vs forks
On Tue, Mar 3, 2009 at 8:28 AM, hugo rivera uai...@gmail.com wrote: It is a small cluster, of 6 machines. I think each job runs for a few minutes (~5), take some input files and generate a couple of files (I am not really sure about how many output files each proccess generates). The size of the output files is ~1Mb. for that size cluster, and jobs running a few minutes, fork ought to be fine. ron
Re: [9fans] threads vs forks
On Tue, 2009-03-03 at 07:19 -0800, David Leimbach wrote: My knowledge on this subject is about 8 or 9 years old, so check with your local Python guru The last I'd heard about Python's threading is that it was cooperative only, and that you couldn't get real parallelism out of it. It serves as a means to organize your program in a concurrent manner. In other words no two threads run at the same time in Python, even if you're on a multi-core system, due to something they call a Global Interpreter Lock. I believe GIL is as present in Python nowadays as ever. On a related note: does anybody know any sane interpreted languages with a decent threading model to go along? Stackless python is the only thing that I'm familiar with in that department. Thanks, Roman.
Re: [9fans] threads vs forks
On Tue, 03 Mar 2009 10:11:10 PST Roman V. Shaposhnik r...@sun.com wrote: On Tue, 2009-03-03 at 07:19 -0800, David Leimbach wrote: My knowledge on this subject is about 8 or 9 years old, so check with your local Python guru The last I'd heard about Python's threading is that it was cooperative only, and that you couldn't get real parallelism out of it. It serves as a means to organize your program in a concurrent manner. In other words no two threads run at the same time in Python, even if you're on a multi-core system, due to something they call a Global Interpreter Lock. I believe GIL is as present in Python nowadays as ever. On a related note: does anybody know any sane interpreted languages with a decent threading model to go along? Stackless python is the only thing that I'm familiar with in that department. Depend on what you mean by sane interpreted language with a decent threading model and what you want to do with it but check out www.clojure.org. Then there is Erlang. Its wikipedia entry has this to say: Although Erlang was designed to fill a niche and has remained an obscure language for most of its existence, it is experiencing a rapid increase in popularity due to increased demand for concurrent services, inferior models of concurrency in most mainstream programming languages, and its substantial libraries and documentation.[7][8] Well-known applications include Amazon SimpleDB,[9] Yahoo! Delicious,[10] and the Facebook Chat system.[11]
Re: [9fans] threads vs forks
On Tue, Mar 3, 2009 at 1:11 PM, Roman V. Shaposhnik r...@sun.com wrote: On Tue, 2009-03-03 at 07:19 -0800, David Leimbach wrote: My knowledge on this subject is about 8 or 9 years old, so check with your local Python guru The last I'd heard about Python's threading is that it was cooperative only, and that you couldn't get real parallelism out of it. It serves as a means to organize your program in a concurrent manner. In other words no two threads run at the same time in Python, even if you're on a multi-core system, due to something they call a Global Interpreter Lock. I believe GIL is as present in Python nowadays as ever. On a related note: does anybody know any sane interpreted languages with a decent threading model to go along? Stackless python is the only thing that I'm familiar with in that department. I thought part of the reason for the big break with Python 3000 was to get rid of the GIL and clean that threading mess up. Or am I way off? Thanks, Roman.
Re: [9fans] threads vs forks
You are off. It is doubtful that the GIL will ever be removed. But that really isn't the issue, the issue is the lack of a decent concurrency model, like the one provided by Stackless. But apparently one of the things stackless allows is evil recursive programming, which Guido considers 'confusing' and won't allow in mainline python (I think another reason is that porting it to jython and .not would be hard, but I'm not familiar with the details). uriel On Wed, Mar 4, 2009 at 12:08 AM, J.R. Mauro jrm8...@gmail.com wrote: On Tue, Mar 3, 2009 at 1:11 PM, Roman V. Shaposhnik r...@sun.com wrote: On Tue, 2009-03-03 at 07:19 -0800, David Leimbach wrote: My knowledge on this subject is about 8 or 9 years old, so check with your local Python guru The last I'd heard about Python's threading is that it was cooperative only, and that you couldn't get real parallelism out of it. It serves as a means to organize your program in a concurrent manner. In other words no two threads run at the same time in Python, even if you're on a multi-core system, due to something they call a Global Interpreter Lock. I believe GIL is as present in Python nowadays as ever. On a related note: does anybody know any sane interpreted languages with a decent threading model to go along? Stackless python is the only thing that I'm familiar with in that department. I thought part of the reason for the big break with Python 3000 was to get rid of the GIL and clean that threading mess up. Or am I way off? Thanks, Roman.
Re: [9fans] threads vs forks
On Tue, Mar 3, 2009 at 6:15 PM, Uriel urie...@gmail.com wrote: You are off. It is doubtful that the GIL will ever be removed. That's too bad. Things like that just reinforce my view that Python is a hack :( Oh well, back to C... But that really isn't the issue, the issue is the lack of a decent concurrency model, like the one provided by Stackless. But apparently one of the things stackless allows is evil recursive programming, which Guido considers 'confusing' and wont allow in mainline python (I think another reason is that porting it to jython and .not would be hard, but I'm not familiar with the details). Concurrency seems to be one of those things that's too hard for everyone, and I don't buy it. There's no reason it needs to be as hard as it is. And nevermind the fact that it's not really usable for every (or even most) jobs out there. But Intel is pushing it, so that's where we have to go, I suppose. uriel On Wed, Mar 4, 2009 at 12:08 AM, J.R. Mauro jrm8...@gmail.com wrote: On Tue, Mar 3, 2009 at 1:11 PM, Roman V. Shaposhnik r...@sun.com wrote: On Tue, 2009-03-03 at 07:19 -0800, David Leimbach wrote: My knowledge on this subject is about 8 or 9 years old, so check with your local Python guru The last I'd heard about Python's threading is that it was cooperative only, and that you couldn't get real parallelism out of it. It serves as a means to organize your program in a concurrent manner. In other words no two threads run at the same time in Python, even if you're on a multi-core system, due to something they call a Global Interpreter Lock. I believe GIL is as present in Python nowadays as ever. On a related note: does anybody know any sane interpreted languages with a decent threading model to go along? Stackless python is the only thing that I'm familiar with in that department. I thought part of the reason for the big break with Python 3000 was to get rid of the GIL and clean that threading mess up. Or am I way off? Thanks, Roman.
Re: [9fans] threads vs forks
2009/3/3 J.R. Mauro jrm8...@gmail.com: Concurrency seems to be one of those things that's too hard for everyone, and I don't buy it. There's no reason it needs to be as hard as it is. That's a fact. If you have access to The ACM Queue, check out p16-cantrill-concurrency.pdf (Cantrill and Bonwick on concurrency). And nevermind the fact that it's not really usable for every (or even most) jobs out there. But Intel is pushing it, so that's where we have to go, I suppose. That's simply not true. In my world (server software and networking), most tasks can be improved by utilizing concurrent programming paradigms. Even in user interfaces, these are useful. For mathematics, there's simply no question that making use of concurrent algorithms is a win. In fact, I can't think of a single case in which doing two lines of work at once isn't better than doing one at a time, assuming that accuracy is maintained in the result. --dho
Re: [9fans] threads vs forks
On Tue, Mar 3, 2009 at 6:54 PM, Devon H. O'Dell devon.od...@gmail.com wrote: 2009/3/3 J.R. Mauro jrm8...@gmail.com: Concurrency seems to be one of those things that's too hard for everyone, and I don't buy it. There's no reason it needs to be as hard as it is. That's a fact. If you have access to The ACM Queue, check out p16-cantrill-concurrency.pdf (Cantrill and Bonwich on concurrency). Things like TBB and other libraries to automagically scale up repeated operations into parallelized ones help alleviate the problems with getting parallelization to work. They're ugly, they only address narrow problem sets, but they're attempts at solutions. And if you look at languages like LISP and Erlang, you're definitely left with a feeling that parallelization is being treated as harder than it is. I'm not saying it isn't hard, just that there are a lot of people who seem to be throwing up their hands over it. I suppose I should stop reading their material. And nevermind the fact that it's not really usable for every (or even most) jobs out there. But Intel is pushing it, so that's where we have to go, I suppose. That's simply not true. In my world (server software and networking), most tasks can be improved by utilizing concurrent programming paradigms. Even in user interfaces, these are useful. For mathematics, there's simply no question that making use of concurrent algorithms is a win. In fact, I can't think of a single case in which doing two lines of work at once isn't better than doing one at a time, assuming that accuracy is maintained in the result. I should have qualified. I mean *massive* parallelization when applied to average use cases. I don't think it's totally unusable (I complain about synchronous I/O on my phone every day), but it's being pushed as a panacea, and that is what I think is wrong. Don Knuth holds this opinion, but I think he's mostly alone on that, unfortunately. 
Of course for mathematically intensive and large-scale operations, the more parallel you can make things the better. --dho
Re: [9fans] threads vs forks
I should have qualified. I mean *massive* parallelization when applied to average use cases. I don't think it's totally unusable (I complain about synchronous I/O on my phone every day), but it's being pushed as a panacea, and that is what I think is wrong. Don Knuth holds this opinion, but I think he's mostly alone on that, unfortunately. it's interesting that parallel wasn't cool when chips were getting noticeably faster rapidly. perhaps the focus on parallelization is a sign there aren't any other ideas. - erik
Re: [9fans] threads vs forks
On Tue, Mar 3, 2009 at 7:54 PM, erik quanstrom quans...@quanstro.net wrote: I should have qualified. I mean *massive* parallelization when applied to average use cases. I don't think it's totally unusable (I complain about synchronous I/O on my phone every day), but it's being pushed as a panacea, and that is what I think is wrong. Don Knuth holds this opinion, but I think he's mostly alone on that, unfortunately. it's interesting that parallel wasn't cool when chips were getting noticably faster rapidly. perhaps the focus on parallelization is a sign there aren't any other ideas. Indeed, I think it is. The big manufacturers seem to have hit a wall with clock speed, done a full reverse, and are now just trying to pack more transistors and cores on the chip. Not that this is evil, but I think this is just as bad as the obsession with upping the clock speeds in that they're too focused on one path instead of incorporating other cool ideas (i.e., things Transmeta was working on with virtualization and hosting foreign ISAs) - erik
Re: [9fans] threads vs forks
On Tue, Mar 3, 2009 at 4:54 PM, erik quanstrom quans...@quanstro.net wrote: I should have qualified. I mean *massive* parallelization when applied to average use cases. I don't think it's totally unusable (I complain about synchronous I/O on my phone every day), but it's being pushed as a panacea, and that is what I think is wrong. Don Knuth holds this opinion, but I think he's mostly alone on that, unfortunately. it's interesting that parallel wasn't cool when chips were getting noticably faster rapidly. perhaps the focus on parallelization is a sign there aren't any other ideas. That seems to be what Knuth thinks. Excerpt from a 2008 interview w/ InformIT: InformIT: Vendors of multicore processors have expressed frustration at the difficulty of moving developers to this model. As a former professor, what thoughts do you have on this transition and how to make it happen? Is it a question of proper tools, such as better native support for concurrency in languages, or of execution frameworks? Or are there other solutions? Knuth: I don’t want to duck your question entirely. I might as well flame a bit about my personal unhappiness with the current trend toward multicore architecture. To me, it looks more or less like the hardware designers have run out of ideas, and that they’re trying to pass the blame for the future demise of Moore’s Law to the software writers by giving us machines that work faster only on a few key benchmarks! I won’t be surprised at all if the whole multithreading idea turns out to be a flop, worse than the Itanium approach that was supposed to be so terrific—until it turned out that the wished-for compilers were basically impossible to write. Full interview is at http://www.informit.com/articles/article.aspx?p=1193856.
Re: [9fans] threads vs forks
J.R. Mauro wrote: On Tue, Mar 3, 2009 at 7:54 PM, erik quanstrom quans...@quanstro.net wrote: I should have qualified. I mean *massive* parallelization when applied to average use cases. I don't think it's totally unusable (I complain about synchronous I/O on my phone every day), but it's being pushed as a panacea, and that is what I think is wrong. Don Knuth holds this opinion, but I think he's mostly alone on that, unfortunately. it's interesting that parallel wasn't cool when chips were getting noticably faster rapidly. perhaps the focus on parallelization is a sign there aren't any other ideas. Indeed, I think it is. The big manufacturers seem to have hit a wall with clock speed, done a full reverse, and are now just trying to pack more transistors and cores on the chip. Not that this is evil, but I think this is just as bad as the obsession with upping the clock speeds in that they're too focused on one path instead of incorporating other cool ideas (i.e., things Transmeta was working on with virtualization and hosting foreign ISAs) Die size has been the main focus for the foundries, reduced transistor switch time is just a benefit from that. Digital components work well here, but Analog suffers and creating a stable clock at high frequency is done in the Analog domain. It is much easier to double the transistor count than it is to double the clock frequency. Also have to consider the power/heat/noise costs from increasing the clock. I think the reason why you didn't see parallelism come out earlier in the PC market was because they needed to create new mechanisms for I/O. AMD did this with Hypertransport, and I've seen 32-core (8-socket) systems with this. Now Intel has their own I/O rethink out there. I've been trying to get my industry to look at parallel computing for many years, and it's only now that they are starting to sell parallel circuit simulators and still they are not that efficient. 
A traditionally week-long sim is now taking a single day when run on 12-cores. I'll take that 7x over 1x anytime though. /james
Re: [9fans] threads vs forks
I think the reason why you didn't see parallelism come out earlier in the PC market was because they needed to create new mechanisms for I/O. AMD did this with Hypertransport, and I've seen 32-core (8-socket) systems with this. Now Intel has their own I/O rethink out there. i think what you're saying is equivalent to saying (in terms i understand) that memory bandwidth was so bad that a second processor couldn't do much work. but i haven't found this to be the case. even the highly constrained pentium 4 gets some mileage out of hyperthreading for the tests i've run. the intel 5000-series still use an fsb. and they seem to scale well from 1 to 4 cores. are there benchmarks that show otherwise similar hypertransport systems trouncing intel in multithreaded performance? i don't recall seeing anything more than a moderate (15-20%) advantage. - erik
Re: [9fans] threads vs forks
erik quanstrom wrote: I think the reason why you didn't see parallelism come out earlier in the PC market was because they needed to create new mechanisms for I/O. AMD did this with Hypertransport, and I've seen 32-core (8-socket) systems with this. Now Intel has their own I/O rethink out there. i think what you're saying is equivalent to saying (in terms i understand) that memory bandwidth was so bad that a second processor couldn't do much work. Yes, bandwidth and latency. but i haven't found this to be the case. even the highly constrained pentium 4 gets some mileage out of hyperthreading for the tests i've run. the intel 5000-series still use an fsb. and they seem to scale well from 1 to 4 cores. Many of the circuit simulators I use fall flat on their face after 4 cores, say. However, I blame this on their algorithm, not the hardware. I wasn't making an AMD vs Intel comment, just that AMD had created HTX along with their K8 platform to address scalability concerns with I/O. are there benchmarks that show otherwise similar hypertransport systems trouncing intel in multithreaded performance? i don't recall seeing anything more than a moderate (15-20%) advantage. I don't have a 16-core Intel system to compare with, but: http://en.wikipedia.org/wiki/List_of_device_bandwidths#Computer_buses I think the reason why Intel developed their Common Systems Interconnect (now called QuickPath Interconnect) was to address its shortcomings. Both AMD and Intel are looking at I/O because it is and will be a limiting factor when scaling to higher core counts. - erik
Re: [9fans] threads vs forks
On Tue, Mar 3, 2009 at 11:44 PM, James Tomaschke ja...@orcasystems.com wrote: erik quanstrom wrote: I think the reason why you didn't see parallelism come out earlier in the PC market was because they needed to create new mechanisms for I/O. AMD did this with Hypertransport, and I've seen 32-core (8-socket) systems with this. Now Intel has their own I/O rethink out there. i think what you're saying is equivalent to saying (in terms i understand) that memory bandwidth was so bad that a second processor couldn't do much work. Yes, bandwidth and latency. but i haven't found this to be the case. even the highly constrained pentium 4 gets some mileage out of hyperthreading for the tests i've run. the intel 5000-series still use an fsb. and they seem to scale well from 1 to 4 cores. Many of the circuit simulators I use fall flat on their face after 4 cores, say. However, I blame this on their algorithms, not the hardware. I wasn't making an AMD vs Intel comment, just that AMD had created HTX along with their K8 platform to address scalability concerns with I/O. are there benchmarks that show otherwise similar hypertransport systems trouncing intel in multithreaded performance? i don't recall seeing anything more than a moderate (15-20%) advantage. I don't have a 16-core Intel system to compare with, but: http://en.wikipedia.org/wiki/List_of_device_bandwidths#Computer_buses I think the reason why Intel developed their Common Systems Interconnect (now called QuickPath Interconnect) was to address its shortcomings. Both AMD and Intel are looking at I/O because it is and will be a limiting factor when scaling to higher core counts. And soon hard disk latencies are really going to start hurting (they already are hurting some, I'm sure), and I'm not convinced of the viability of SSDs. There was an interesting article I came across that compared the latencies of accessing a register, a CPU cache, main memory, and disk, which put them in human terms.
As much as we like to say we understand the difference between a millisecond and a nanosecond, seeing cache access expressed in terms of moments and a disk access in terms of years was rather illuminating, if only to me. Same article also put a google search at only slightly slower latency than hard disk access. The internet really is becoming the computer, I suppose. - erik
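The rescaling the article did is easy to reproduce. The sketch below uses rough, order-of-magnitude latency figures (the exact numbers vary by hardware and year; these are illustrative, not from the article itself) and stretches them so that one CPU cycle takes one "human second":

```python
# Approximate latency figures in nanoseconds (order of magnitude only;
# exact values depend on the hardware -- these are illustrative).
latencies_ns = {
    "one CPU cycle (register)": 0.5,
    "L1 cache hit": 1.0,
    "main memory reference": 100.0,
    "disk seek": 10_000_000.0,
}

# Rescale so one CPU cycle becomes one "human second".
scale = 1.0 / latencies_ns["one CPU cycle (register)"]

def human(seconds):
    # Render a duration in whichever unit keeps the number small.
    if seconds < 60:
        return f"{seconds:.1f} seconds"
    if seconds < 3600:
        return f"{seconds / 60:.1f} minutes"
    if seconds < 86400:
        return f"{seconds / 3600:.1f} hours"
    return f"{seconds / 86400:.0f} days"

for name, ns in latencies_ns.items():
    print(f"{name:28s} -> {human(ns * scale)}")
```

With these figures a memory reference becomes a few human minutes and a disk seek stretches to the better part of a year, which is the contrast the article was driving at.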
Re: [9fans] threads vs forks
On Tue, Mar 3, 2009 at 10:11 AM, Roman V. Shaposhnik r...@sun.com wrote: On Tue, 2009-03-03 at 07:19 -0800, David Leimbach wrote: My knowledge on this subject is about 8 or 9 years old, so check with your local Python guru. The last I'd heard about Python's threading is that it was cooperative only, and that you couldn't get real parallelism out of it. It serves as a means to organize your program in a concurrent manner. In other words, no two threads run at the same time in Python, even if you're on a multi-core system, due to something they call a Global Interpreter Lock. I believe GIL is as present in Python nowadays as ever. On a related note: does anybody know any sane interpreted languages with a decent threading model to go along? Stackless python is the only thing that I'm familiar with in that department. I'm a fan of Erlang. Though I guess it's technically a compiled virtual machine of sorts, even when it's escript. But I've had an absolutely awesome experience over the last year using it, and so far only wishing it came with the type safety of Haskell :-). I love Haskell's threading model actually, in either the data parallelism or the forkIO interface, it's pretty sane. Typed data channels even between forkIO'd threads. Thanks, Roman.
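The GIL effect is easy to demonstrate in a few lines of CPython. This is a minimal sketch, not a proper benchmark; the absolute timings are machine-dependent, but the point is that two CPU-bound threads take roughly as long as running the work serially, because only one thread executes bytecode at a time:

```python
import threading
import time

def count(n):
    # Pure-Python CPU-bound loop; under the GIL, only one thread at a
    # time can be executing this bytecode.
    while n > 0:
        n -= 1

N = 2_000_000

# Run the work twice in sequence.
start = time.perf_counter()
count(N)
count(N)
serial = time.perf_counter() - start

# Run the same work in two threads.
start = time.perf_counter()
threads = [threading.Thread(target=count, args=(N,)) for _ in range(2)]
for t in threads:
    t.start()
for t in threads:
    t.join()
threaded = time.perf_counter() - start

# On CPython the threaded run is no faster (often a bit slower, from
# lock contention); the multiprocessing module sidesteps the GIL by
# using separate interpreter processes instead.
print(f"serial: {serial:.2f}s  threaded: {threaded:.2f}s")
```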
Re: [9fans] threads vs forks
On Tue, Mar 3, 2009 at 5:54 PM, J.R. Mauro jrm8...@gmail.com wrote: On Tue, Mar 3, 2009 at 7:54 PM, erik quanstrom quans...@quanstro.net wrote: I should have qualified. I mean *massive* parallelization when applied to average use cases. I don't think it's totally unusable (I complain about synchronous I/O on my phone every day), but it's being pushed as a panacea, and that is what I think is wrong. Don Knuth holds this opinion, but I think he's mostly alone on that, unfortunately. it's interesting that parallel wasn't cool when chips were getting noticeably faster rapidly. perhaps the focus on parallelization is a sign there aren't any other ideas. Indeed, I think it is. The big manufacturers seem to have hit a wall with clock speed, done a full reverse, and are now just trying to pack more transistors and cores on the chip. Not that this is evil, but I think this is just as bad as the obsession with upping the clock speeds in that they're too focused on one path instead of incorporating other cool ideas (e.g., things Transmeta was working on with virtualization and hosting foreign ISAs). Can we bring back the Burroughs? :-) - erik
Re: [9fans] threads vs forks
I believe GIL is as present in Python nowadays as ever. On a related note: does anybody know any sane interpreted languages with a decent threading model to go along? Stackless python is the only thing that I'm familiar with in that department. Check out Lua's coroutines: http://www.lua.org/manual/5.1/manual.html#2.11 Here's an implementation of the sieve of Eratosthenes using Lua coroutines similar to the Limbo one: http://www.lua.org/cgi-bin/demo?sieve
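For comparison with the Lua and Limbo versions linked above, here is a rough Python-generator analogue of the same coroutine-pipeline sieve (this is not the code from the link; it's a sketch showing the same structure, with each prime adding one filter stage to the pipeline):

```python
def naturals():
    # Endless stream of candidates, starting at the first prime.
    n = 2
    while True:
        yield n
        n += 1

def filter_multiples(prime, stream):
    # One pipeline stage: pass along only numbers the prime doesn't divide.
    for n in stream:
        if n % prime:
            yield n

def sieve(limit):
    # Each prime found extends the chain of generators, mimicking the
    # chain of communicating coroutines/processes in the Lua and Limbo
    # versions of the sieve.
    primes = []
    stream = naturals()
    while len(primes) < limit:
        p = next(stream)
        primes.append(p)
        stream = filter_multiples(p, stream)
    return primes

print(sieve(10))  # → [2, 3, 5, 7, 11, 13, 17, 19, 23, 29]
```

Python generators are cooperatively scheduled like Lua coroutines, so the analogy is fairly close; the Limbo version differs in that its stages are real processes communicating over channels.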
Re: [9fans] threads vs forks
Now there is another use that would at least be intellectually interesting and possibly useful in practice. Use the transistors for a really big memory running at cache speed. But instead of it being a hardware cache, manage it explicitly. In effect, we have a very high speed main memory, and the traditional main memory is backing store. It'd give a use for all those paging algorithms that aren't particularly justified at the main memory-disk boundary any more. And you can fit a lot of Plan 9 executable images in a 64MB on-chip memory space. Obviously, it wouldn't be a good fit for severely memory-hungry apps, and it might be a dead end overall, but it'd at least be something different... ken's fs already has the machinery to handle this. one could imagine a cachefs that knew how to manage this for venti. (though venti seems like a poor fit.) there are lots of interesting uses of explicitly managed, hierarchical caches. yet so far hardware has done its level best to hide this. - erik
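The explicitly managed cache described above boils down to a small fast store in front of a large slow one, with the eviction policy in software where you can see and tune it. Here is a minimal sketch of that shape (the class name, sizes, and LRU policy are all illustrative choices, not anything from Plan 9 or ken's fs):

```python
from collections import OrderedDict

class ManagedCache:
    """A software-managed cache: a small fast store in front of a large,
    slow backing store, with an explicit LRU eviction policy -- the sort
    of policy the post suggests moving out of hardware and into software.
    Names and sizes here are illustrative."""

    def __init__(self, capacity, backing):
        self.capacity = capacity
        self.backing = backing          # the "main memory as backing store"
        self.fast = OrderedDict()       # the on-chip memory, managed by us

    def read(self, key):
        if key in self.fast:
            self.fast.move_to_end(key)  # hit: mark most-recently-used
            return self.fast[key]
        value = self.backing[key]       # miss: fetch from the slow store
        self.fast[key] = value
        if len(self.fast) > self.capacity:
            self.fast.popitem(last=False)  # evict least-recently-used
        return value

# Toy backing store: squares of the first hundred integers.
backing = {i: i * i for i in range(100)}
cache = ManagedCache(capacity=4, backing=backing)
for k in [1, 2, 3, 1, 4, 5]:
    cache.read(k)
print(list(cache.fast))  # → [3, 1, 4, 5], LRU order, oldest first
```

The interesting part is exactly what erik points at: once the policy lives in software, you can swap LRU for any of the paging algorithms that no longer earn their keep at the memory-disk boundary.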
Re: [9fans] threads vs forks
Both AMD and Intel are looking at I/O because it is and will be a limiting factor when scaling to higher core counts. i/o starts sucking wind with one core. that's why we differentiate i/o from everything else we do. And soon hard disk latencies are really going to start hurting (they already are hurting some, I'm sure), and I'm not convinced of the viability of SSDs. i'll assume you mean throughput. hard drive latency has been a big deal for a long time. tanenbaum integrated knowledge of track layout into his minix elevator algorithm. i think the gap between cpu performance and hd performance is narrowing, not getting wider. i don't have accurate measurements on how much real-world performance difference there is between a core i7 and an intel 5000. it's generally not spectacular, clock-for-clock. on the other hand, when the intel 5000-series was released, the rule of thumb for a sata hd was 50mb/s. it's not too hard to find regular sata hard drives that do 110mb/s today. the ssd drives we've (coraid) tested have been spectacular --- reading at 200mb/s. if you want to talk latency, ssds can deliver 1/100th the latency of spinning media. there's no way that the core i7 is 100x faster than the intel 5000. - erik
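The arithmetic behind the figures erik quotes is worth spelling out, since it carries the argument. The numbers below are the ones from the post (circa-2009 rules of thumb, not measurements of any particular drive):

```python
# Figures quoted in the post above (approximate, circa 2009).
hd_2006_mb_s = 50      # rule-of-thumb SATA throughput when the Intel 5000 shipped
hd_2009_mb_s = 110     # a fast commodity SATA drive in 2009
ssd_mb_s = 200         # the SSDs tested at Coraid

print(f"HD throughput growth: {hd_2009_mb_s / hd_2006_mb_s:.1f}x")
print(f"SSD vs. 2009 HD:      {ssd_mb_s / hd_2009_mb_s:.1f}x")

# A rotating disk's seek-plus-rotational latency is on the order of 10 ms;
# the post puts SSD latency at roughly 1/100th of that.
hd_latency_ms = 10.0
ssd_latency_ms = hd_latency_ms / 100
print(f"SSD latency:          ~{ssd_latency_ms} ms")
```

So disk throughput roughly doubled over the period while CPU single-thread performance improved only modestly, which is the sense in which the gap is narrowing; the 100x latency win is where SSDs change the picture outright.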
Re: [9fans] threads vs forks
the ssd drives we've (coraid) tested have been spectacular --- reading at 200mb/s. you know, i've read all the reviews and seen all the windows benchmarks. but this info, coming from somebody on this list, is much more reassuring than all the slashdot articles. the tests didn't involve plan9 by any chance, did they? ;)