On Monday, 25 February 2013 at 15:07:47 UTC, Vladimir Panteleev wrote:
On Monday, 25 February 2013 at 06:41:32 UTC, Lars T. Kyllingstad wrote:
Sure, I can think of another example. But I wouldn't read too much into this one; it was never meant as a demonstration of the "correct" way to open a web page. It was just a simple example of spawnProcess() usage that uses a cross-platform application everyone's heard of.

After all, you *could* argue this way about almost any kind of application which wasn't just invented for the sake of the example. (In the last one, shouldn't we open the user's preferred word processor, etc?)

The question is, what is the intent? Is it to just open some URL, or to specifically start Firefox? The same applies to the word processor case - if the document is in a file format understood by several applications, is the intent to simply open the document, or to open the document in that specific application?

Now, the documentation clearly says that the example specifically launches Firefox. However, that doesn't mean that someone won't reach out for that example when hastily putting together an application that needs to open an URL. After all, it's at the top of the file, and they may not even know about the existence of the browse function which actually does what they intend.

How about using "lynx -dump http://dlang.org/";? Dumping a text representation of a webpage is a feature specific to lynx, so the intent is clearer.

That is also incredibly obscure. I'd venture a guess that only ~10% of D's user base have even heard of Lynx. Everyone knows firefox, and will understand what the example is supposed to illustrate. (I admit that the ls/grep examples will also be rather incomprehensible to someone not familiar with the *NIX command line, and I will replace them with something else. The D toolchain, as you suggest below, is a very good idea.)

BTW, browse() should never have been added to std.process, in my opinion. Maybe to some other utility module, but then it should at least be done right, and be properly documented. What does it actually do? There is no way to tell unless you read the source. (And then, it turns out that it spawns a new process for the browser and returns immediately, but it does not return a process ID that you can poll or wait for. Bah.)


2. (Nitpick) The grep example uses a POSIX quoting syntax (single quotes). Would be better to use double quotes, or pass as array to avoid one more needless OS-specific element.

Actually, the quotes can just be removed altogether.

OK, and now it's worse: your example uses syntax that's specific to std.process2. If you type that command in the shell, you'll get different behavior (the backslash will escape the . as a shell escape, not a RE escape).

[I am going to let slip here that you almost have me convinced with many of your arguments below, but I am still going to play devil's advocate for a bit.]

It is not worse. It is a lot simpler, because the programmer does not need to know anything about the underlying platform. They only need to know one rule: If your arguments contain spaces, use the array functions. I don't think the generic process-spawning functions in std.process should be bound by, or tied to, the syntax of whatever shell the programmer (or the end user) prefers.


[...]

Personally, I don't think they should be part of the public API. They are inherently platform-specific, and we've tried to keep the module as platform-agnostic as possible.

Constructing scripts is bound to be platform-specific. The current module version allows constructing batch files on POSIX.

Here's a practical use case example for this feature: DMD uses the same syntax for response files on all platforms, and it follows the Windows command-line parsing rules. Currently, rdmd uses escapeWindowsArgument to build that response file on all platforms.

Point taken.


Besides, they are not really usable with any of the other functions, and I am afraid it will be interpreted that way if we make them public.

This is actually a design problem in the new module, which I haven't discussed yet. Have a look at the very last example in the current std.process docs. How do you accomplish that correctly in the new version, without manually piping the inputs yourself? You can't.

I grudgingly admit that this is true.


[...]

The way it is now, the rules (if you can call it that) are exceedingly simple, and they are the same on all platforms. This has the added benefit of discouraging platform-dependent client code.

OK, then picture the following situation.

A user of the new module starts using the module, and invokes a specific command using the spawnProcess overload that takes it as a single string. Convenient, right? Then, as the program evolves, the string becomes an enum, then a config variable, which the user can adjust.

Then, a end-user tries setting the config variable to a path that contains spaces, and everything breaks. Wrapping the path in quotes does not help either. Due to the way the function is designed, it is IMPOSSIBLE for the end-user to configure the application to launch a program located at a path containing spaces. To end-users, this comes off as a classical problem in badly written applications that don't handle command-line escaping properly.

Exposing the specifics of whatever programming language you are using to the end user? I would just call that bad application programming. If anything, you should be using one of the 'shell' functions in this case, not spawnProcess.


This problem is as with any case of an interface which works in simple cases, but behaves unexpectedly in more complicated cases: it is bad design (convenience or not), and must be avoided.

I suggest that either the overloads which take a single string be removed, or that they spawn a shell instead, and let the shell do the command-line splitting. Together with my command and filename escaping functions, they should allow the user to achieve any combination of executing commands with arbitrary punctuation in the program path or arguments, as well as redirecting the output to files (again, with correctly-escaped filenames) or other programs using the existing shell syntax present on both platforms.

You almost have me convinced that the single-string non-shell functions must go. In the case of pipeProcess() and execute(), pipeShell() and shell() do the same job, with (arguably, still) less surprises. Maybe it would then be a good idea to add a spawnShell() function to go with spawnProcess().

The escape*() functions need to be better documented. Am I correct that they quote according to 'cmd.exe' rules on Windows and 'sh' rules on POSIX?

Lars

Reply via email to