Re: Rosetta Commatizing numbers

Ivan Kazmenko via Digitalmars-d-learn Tue, 30 May 2017 21:36:36 -0700

On Tuesday, 30 May 2017 at 10:54:49 UTC, Solomon E wrote:

I ran into a Rosetta code solution in D that had obviouserrors. It's like the author or the previous editor wasn't eventrying to do it right, like a protest against how many detailedrules the task had. I assumed that's not the way we want to dothings in D.
...
Does anyone have any thoughts about this? Did I do right by D?

I'd say the previous version (by bearophile) suited the task muchbetter, but both aren't perfect.

As a general note, consider the following paragraph of theproblem statement:

"Some of the commatizing rules (specified below) are arbitrary,but they'll be a part of this task requirements, if only to makethe results consistent amongst national preferences and otherdisciplines."

This literally means that, while there are complex rules in thereal world for commatizing numbers, the problem is kept simple byenforcing strict rules. The minute concerns of the Real World,like "Current New Zealand dollar format overrides old Zimbabwedollar format", are irrelevant to the formal problem beingsolved. Perhaps the example inputs section ("Strings to be usedas a minimum") gets misleading, but that's what they are:examples, not general rules. By the way, as it's a wiki page,problem statement text could also be improved ;) .

Why? For example, look at Indian numbering system wherecommatizing is visibly different(https://en.wikipedia.org/wiki/Indian_numbering_system) - and wedon't know whether the string should use it or not without thecontext. Or consider that hexadecimal numbers are usually splitin groups of four digits, not three - and we don't know whether a[0-9]+ number is decimal or hexadecimal without the context.See, trying to provide an ultimate solution to real-worldcommatizing, while keeping it a single function without thecontext, can't possibly succeed.

What can be done, then? Well, the page authors already did thedifficult part for us: they extracted the essence of a complexreal-world problem into a small set of formal rules, which arenow the formal problem statement. Now comes the easy part: to doexactly what is asked in the problem statement. The flexibilitycomes from having function parameters. If we have a solution toa formal problem, using it for the real-world version of theproblem is either just specifying the right parameters(hopefully), or changing the function if the real world gets toocomplex for it. In the latter case, the more short and readablethe existing solution is, the faster can we change the functionto suit our real-world case.


-----

Now, where is the old version wrong? Turns out it just calls thefunction with default parameters for every line of input - whichis wrong since the first two input lines need to be handledspecially. Well, that's what the function parameters are for.To have a correct solution, we have to use custom parameters forthe first two lines of input. The function itself is fine.

Your solution addresses this problem by special-casing the inputsinside the function, perhaps because of the misleading inputssection in the problem statement. That's a wrong approach.First, it introduces magic numbers 33 and 36 into the code, whichis a bad programming practice (see here:https://en.wikipedia.org/wiki/Magic_number_(programming)#Unnamed_numerical_constants). Second, it's plain wrong. According to the problem statement, we don't have these rules for every possible line of >33 standalone decimals, or >36 characters in total. We just have to call our function with a concrete set of custom parameters for one concrete example, and other set of parameters for another example. That's to demonstrate that our function accepts and makes proper use of custom parameters! Special-casing example inputs inside the function is not a solution: if we go down this path, the perfect solution would be a bunch of "if" statements for every possible example input producing the respective example outputs, and empty function for all other possible inputs.

So, how do we call with special parameters? Currently, we canlook at every other language except C# as inspiration: ALGOL 68,J, Java, Perl 6, Phix, Racket, and REXX. Your solution also hasa good way to check example inputs: a unittest block. It evenshows one of D's strengths compared to other languages. Andthere, you do use custom parameters to check that the functionworks. A good approach would be to put all the examples in theunittest instead of reading them from a file. This way, theprogram will be immediately usable and runnable: no need tocreate an additional arbitrarily-named file just to test it.


-----

All in all, the only thing I'd change in bearophile's solution isto remove the file reading loop, add the unittest block from yoursolution instead, and place all the examples there. Printing theresult does not seem imperative on Rosettacode, and there are atleast some entries in D which already use unittest for checkingthe problem requirements (for example,https://rosettacode.org/wiki/Sorting_algorithms/Cocktail_sort#D).

Lastly, please note that Rosettacode supports multiple versionsin a single language (example:http://rosettacode.org/wiki/99_Bottles_of_Beer#D). Asbearophile's version certainly has its merits, I strongly suggestto keep it available, either merged with your current version toproduce the right solution, or as a second version.


Ivan Kazmenko.

Re: Rosetta Commatizing numbers

Reply via email to