I forgot about links. In that case, consider:
getUniqueFilesInDirRecursive.
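A minimal sketch of what such a function could do today -- deduplicating
by canonical path so that hard or symbolic links to the same file are
reported only once. This is an illustration, not a hardened
implementation; error handling (e.g. dangling links) is omitted:

    import qualified Data.Set as Set
    import System.Directory (canonicalizePath, doesDirectoryExist, listDirectory)
    import System.FilePath ((</>))

    -- Recursively list files under a directory, skipping any file or
    -- directory whose canonical path has already been visited.
    getUniqueFilesInDirRecursive :: FilePath -> IO [FilePath]
    getUniqueFilesInDirRecursive root = snd <$> go Set.empty root
      where
        go seen dir = do
          canon <- canonicalizePath dir
          if canon `Set.member` seen
            then pure (seen, [])
            else do
              entries <- map (dir </>) <$> listDirectory dir
              walk (Set.insert canon seen) entries
        walk seen [] = pure (seen, [])
        walk seen (p : ps) = do
          isDir <- doesDirectoryExist p
          (seen', here) <-
            if isDir
              then go seen p
              else do
                canon <- canonicalizePath p
                pure $ if canon `Set.member` seen
                         then (seen, [])
                         else (Set.insert canon seen, [p])
          (seen'', rest) <- walk seen' ps
          pure (seen'', here ++ rest)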
Attacking irrelevant details in an argument is often called a
"straw man" attack. Such attacks are pointless because they do not
address the real substance of the issue. My example is easily modified
to avoid the issues you raise.
Consider the fact that many file-based operations _can be, and are,
parallelized manually by developers_. The challenge for next-generation
language and effect system designers is to figure out _how_
such operations can be automatically parallelized, given sufficient
constraints, high-level constructs, and a powerful effect system.
Saying, "I don't know exactly how it will look," is quite a bit
different from saying "It can't be done." I claim the former.
Regards,
John A. De Goes
N-Brain, Inc.
The Evolution of Collaboration
http://www.n-brain.net | 877-376-2724 x 101
On Aug 16, 2009, at 12:38 AM, Artem V. Andreev wrote:
"John A. De Goes" <j...@n-brain.net> writes:
On Aug 15, 2009, at 6:36 AM, Jason Dusek wrote:
2009/08/14 John A. De Goes <j...@n-brain.net>:
Hmmm, my point (perhaps I wasn't clear) is that different
effects have different commutability properties. In the case
of a file system, you can commute two sequential reads from
two different files.
I think this is a bad example -- it's not something that's
safe in general and discredits your idea. How would the
compiler even know that two files are not actually the same
file?
I don't think the file system is the best example. However, I do
think it's a reasonable one.
Let's say the type of the function getFilesInDir is annotated in
such a way as to tell the effect system that every file in the
returned array is unique. Further, let's say the type of the
function makeNewTempFile is annotated in such a way as to tell the
effect system that the function will succeed in creating a new temp
file, with a name distinct from that of any existing file.
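For concreteness, such annotated types might be written something like
the sketch below. The uniqueness guarantees live only in comments and
an abstract newtype here, since no current Haskell effect system can
express them; the names are John's informal annotations, not a real API:

    -- Hypothetical sketch only.
    newtype UniqueFiles = UniqueFiles [FilePath]
      -- invariant: the listed paths refer to pairwise distinct files

    getFilesInDir :: FilePath -> IO UniqueFiles
    getFilesInDir = undefined   -- implementation elided

    -- The result names a freshly created file, distinct from every
    -- existing file.
    makeNewTempFile :: IO FilePath
    makeNewTempFile = undefined -- implementation elided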
Sorry, but this example is ridiculous. While file *names* in this
case might reasonably be assumed to be unique, the *files* themselves
may not be. Every modern filesystem supports file aliasing, usually
in several forms. And what does the makeNewTempFile function do? Does
it merely generate a fresh name, like POSIX mktemp(), or does it
create and open the file atomically, like POSIX mkstemp()?
The first case is a well-known security hole, and the second, it
seems to me, does not fit well with the rest of your reasoning.
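For reference, the mkstemp()-style interface -- create and open in a
single step, returning both the name and an open handle -- is what
Haskell's standard System.IO already provides:

    import System.IO (openTempFile, hClose, hPutStr)

    main :: IO ()
    main = do
      -- openTempFile creates and opens the file atomically, so there
      -- is no window between picking a name and creating the file,
      -- i.e. the window that makes a mktemp()-style interface a
      -- security hole.
      (path, h) <- openTempFile "/tmp" "example.txt"
      hPutStr h "scratch data"
      hClose h
      putStrLn ("created " ++ path)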
However, let's consider file system tree traversal further. In some
cases you might not care whether some of the directories you descend
into are actually the same directory, so your proposed optimization
would be `safe'. In other cases a sequential traversal would work,
while a parallelized version would not, unless special additional
measures are taken. Consider, e.g., the case of a build system. It
traverses a source tree, finds source files, and, if the corresponding
object files are non-existent or outdated, does something to
regenerate them. Now if you have a directory that's actually a link to
another directory and you do a sequential traversal, everything is
fine: you descend into the directory the first time, build everything
there, and when you descend into it the second time, there's just
nothing to do. If you do a parallel traversal, you may well end up in
a situation where two threads simultaneously check for an object file,
discover it's outdated, and run two build processes simultaneously,
with the most likely result being a corrupted object file.
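The special additional measures would have to look something like the
sketch below: claim each source by canonical path before building, so
two threads reaching the same file through different links cannot both
run the build step. Here buildObject is a hypothetical stand-in for
the real build step:

    import Control.Concurrent.MVar (MVar, modifyMVar)
    import qualified Data.Set as Set
    import System.Directory (canonicalizePath)

    -- Atomically record the canonical path; only the thread that
    -- claimed it first actually builds.
    claimAndBuild :: MVar (Set.Set FilePath)
                  -> (FilePath -> IO ())   -- hypothetical buildObject
                  -> FilePath -> IO ()
    claimAndBuild claimed buildObject src = do
      canon <- canonicalizePath src
      fresh <- modifyMVar claimed $ \s ->
        pure (Set.insert canon s, not (canon `Set.member` s))
      if fresh then buildObject src else pure ()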
If you write a recursive function that loops through all the files in
a directory and, for each file, parses and compiles it into a new temp
file, then a sufficiently sophisticated compiler should be able to
safely transform the recursion into parallel parsing and compilation
-- in a way that's provably correct, assuming the original program
was correct.
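Written by hand today with the async package, that transformation
looks something like the sketch below; parseAndCompile is
hypothetical, and the point of the effect system would be to prove the
per-file steps independent so the compiler could introduce the
concurrency itself:

    import Control.Concurrent.Async (mapConcurrently_)
    import System.Directory (listDirectory)
    import System.FilePath ((</>))

    -- Manual version of the rewrite: run the per-file
    -- parse-and-compile step over every file in parallel.
    compileDirInParallel :: (FilePath -> IO ()) -> FilePath -> IO ()
    compileDirInParallel parseAndCompile dir = do
      files <- map (dir </>) <$> listDirectory dir
      mapConcurrently_ parseAndCompile files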
The promise of a language with a purely functional part and a
powerful effect system for everything else is very great, and very
important in the massively concurrent world we are entering.
Well, yes -- which sounds like there are no guarantees
in general. Something that works half the time leaves you with
two responsibilities -- the old responsibility of the work you
did when you didn't have it, and the new responsibility of
knowing when it applies and when it doesn't.
In the other thread, I brought up the example of buffering reads.
Library authors make the decision to buffer for one reason: if some
other program is messing with the data, you're screwed no matter what.
And yeah, "they might be screwing with the data in just the way you
need it to be screwed with" (Sebastian), in which case my advice is
to use C and hope for the best. :-)
Regards,
John A. De Goes
N-Brain, Inc.
The Evolution of Collaboration
http://www.n-brain.net | 877-376-2724 x 101
--
S. Y. A(R). A.
_______________________________________________
Haskell-Cafe mailing list
Haskell-Cafe@haskell.org
http://www.haskell.org/mailman/listinfo/haskell-cafe