Re: [CF-metadata] Cell bounds associated with coordinate variable rather than data variable

Steve Hankin Thu, 12 Nov 2009 12:48:18 -0800


John Caron wrote:

1. The CDM library uses the bounds if they are present. If only thecoordinate values are present, the CDM generates bounds. These gridsbounds are used by ncWMS and other visualization software to drawcolor filled images. The IDV (I think) uses a contouring algorithmwith just the coordinate values.
2. Spatial coordinates probably want to use midpoint values.
3. I think theres a good argument that time coordinates want to usethe end-point. Seth makes the argument for numerical models. In thiscase, all the output variables should have the same time coordinate.Im trying to think of a case where thats not true (point observations,radar data etc), and im not thinking of any.

Hi John,

I'm not understanding the logic that suggests using midpoints forspatial coordinates, but endpoints for times. Whenever an applicationssees a particular reason to place the grid point at something other thanthe midpoint (on whatever axis) of course it should do so. That maylead to placing the grid point at the start, middle or end of theinterval. But the question that is before us is to say what the defaultshould be for the case where the boundaries of cell values is clearlyunderstood, but it is unclear what coordinate value best to use for thegrid point.

All other things being equal using a consistent strategy for space andtime is the simpler, "KISS", approach. Both instantaneous andtime-interval-averaged values are most naturally encoded using midpointrepresentation (disagreeing with both Seth's conclusion and yourspeculation. Are we using terms differently?). The compelling case fora time endpoint may be continuous integrals (e.g. accumulatedprecipitation). If one has a mixture of model variables to output andthe interpretations of their time coordinates needs to be different,then placing two different time axes into the file is the only way toeliminate the confusion. Arbitrarily shifting the grid point locationsby 1/2 time cell will not eliminate confusion, will it? It seems likeit would merely hide the confusion and increase the chances ofmisinterpretation.

(To be frank, although I have seen many CF datasets using both midpointand start-point times, I have never encountered one previously that usesthe end point of the time interval. It seems possible that as apractical matter this choice may introduce confusion rather than reduce it.)


   - Steve

=====================

Seth's argument about confusion remains the same if one

4. Perhaps "interval of accumulation" is different enough that oneshould just encode it in a separate attribute or auxiliary coordinateon the data variable. Numerical models can have different variableswith different intervals, possibly overlapping. This is perhaps notreally the same as the bounds on the coordinate, they just share thesame codomain (time). An advantage of this approach is that you donthave to create new coordinate variables for each data variable, whichseems like more trouble than its worth.
Seth McGinnis wrote:
In the case of 'raw' output from numerical models, it probably makessense touse the end-point of the time interval rather than the mid-point.That's themoment for which the model stores the data, whether they'reinstantaneous
values (intensive variables) or time-averages over the previous timestep
(extensive variables).

If you used the mid-point of the interval for extensive variables, they
wouldn't have the same time coordinates as the intensive variables,which would
be very confusing.  Using the end-point keeps everything aligned.

--Seth


On Thu, 12 Nov 2009 14:41:26 +0000 (UTC)
 Thomas Lavergne <[email protected]> wrote:
Dear Jonathan,

----- "Jonathan Gregory" <[email protected]> wrote:
Dear Thomas

I'm not saying the coordinate *must* be the mid-point. If there's a
good reason
for it being something else, then you could choose it to be so. I was
suggesting that we could recommend it should be the mid-point if there
is
no strong basis for making another choice. We could also say that it
must not
be outside the bounds.
I agree with your recommendation.
But I was also trying to gain support on "which axis value should Ichoose for
my variable" and your answer does not help :-).
I have rather little basis for making the choice of the end time for
representing an accumulated quantity but, at least, CF does notforbid it. Iguess I have to seek agreement inside my scientific community andthat it is
not CF's role to decide upon that.
Are there people interested in taking the discussion further? Weseek theanswer to the question: "In which cases would another choice (otherthan
mid-point) be relevant?".

Thomas
You are right, it cannot be missing data. That would break some
applications,
anyway.

Cheers

Jonathan
_______________________________________________
CF-metadata mailing list
[email protected]
http://mailman.cgd.ucar.edu/mailman/listinfo/cf-metadata
_______________________________________________
CF-metadata mailing list
[email protected]
http://mailman.cgd.ucar.edu/mailman/listinfo/cf-metadata
_______________________________________________
CF-metadata mailing list
[email protected]
http://mailman.cgd.ucar.edu/mailman/listinfo/cf-metadata
_______________________________________________
CF-metadata mailing list
[email protected]
http://mailman.cgd.ucar.edu/mailman/listinfo/cf-metadata

_______________________________________________
CF-metadata mailing list
[email protected]
http://mailman.cgd.ucar.edu/mailman/listinfo/cf-metadata

Re: [CF-metadata] Cell bounds associated with coordinate variable rather than data variable

Reply via email to