Re: Implementing Half Floats in D

John Colvin Thu, 31 Jan 2013 09:05:24 -0800

On Thursday, 31 January 2013 at 15:38:04 UTC, Don wrote:

On Thursday, 31 January 2013 at 13:41:13 UTC, AndreiAlexandrescu wrote:
On 1/31/13 5:18 AM, Don wrote:
std.numeric is not superficially flawed, it's fundamentallyflawed. Whatis it for? What is its theme? The problem is, std.numeric isone of thefew good names which are left as a possible package name,after Cinsulted the mathematical community by creating a modulecalled 'math'.
Guilty as charged. I've put stuff in std.numeric as I wasworking on my thesis. I recall you added some stuff there too.As I'm sure you remember the state of D in 2007 was ratherdifferent than that of today. Overall no need to get agitatedhere, we're all on the same boat and aiming for the same shore.
Sorry if that came across as agitated, it wasn't intended to be.
As you noted, I have code in there as well.
It's just one of those old modules that needs to be cleaned up,though it reveals a deeper issue - see below.
Let's see what we have there:

entropy
CustomFloat
kullbackLeiblerDivergence
Fft
gapWeightedSimilarityIncremental
gapWeightedSimilarity
gapWeightedSimilarityNormalized
FPTemporary
findRoot
euclideanDistance
dotProduct
cosineSimilarity
gcd
jensenShannonDivergence
normalize
secantMethod
The general theme is obvious - numeric algorithms and datastructures. Many are obvious and with obvious utility to oneinterested in numerics: entropy, various distance andsimilarity measures. I think you wrote findRoot.
Yes.
The basic problem is that there are hundreds of potentialnumeric algorithms and data structures of equal importance tothese ones. In fact, the total number of mathematicalalgorithms is probably a substantial fraction of the totalalgorithms in computer science!
Even a module which contained only FFT, could be quite large,once it included all the important related transforms.
The gapWeightedSimilarity algorithms are string kernels. Theyare somewhat niche but quite powerful to anyone interested instring similarity (technically they are string edit distanceon steroids). They might belong in std.string but I figuredthey have enough numeric algorithm flavor to put them in there.
So let's itemize the grievances and see how we can sort thisout.
I'm not sure that we can solve this without addressing thehigh-level question: What is the scope of Phobos?
How big will it eventually get? Twice its current size? Tentimes? A hundred times?
Both SmallPhobos and LargePhobos are reasonable, but we do haveto pick one. Currently we have aspects of both approaches, butthey aren't compatible.
The current approach of putting everything directly into asingle level in std doesn't scale very far -- it will look veryclumsy once it gets more than (say) three times larger. Thisargues for SmallPhobos.
But if it doesn't get to be at least ten times larger, some ofthis niche stuff shouldn't be in there, they are functions fromLargePhobos. If we go with SmallPhobos then we need to move theniche stuff somewhere else.

I think having a large standard library inspires confidence indevelopers. Rightly or wrongly, code in a standard library has anappearance of permanence, as opposed to being someone's personalproject that may or may not disappear/cease to be maintainedtomorrow.

Re: Implementing Half Floats in D

Reply via email to