Dear Ken,
thanks for your response to me below. Would it be fair to suggest that "status" should, as far as possible, reflect a generic objective classification, with terms such as "sensor_nonfunctional" which have a comparable meaning for all datasets, while "quality" is a subjective *measure* with a meaning that may from dataset to dataset? E.g. if dataset A has a maximum "quality" of 11 and dataset B only goes up to 10, it doesn't necessarily imply that dataset A is in any sense better and B. If you want to use it in weighted means, perhaps it should be "quality_measure" rather than "quality_flag"? With "status_flag" the order of integer values does not have any meaning, but with quality perhaps it would make more sense have some concept of a sequence of quality settings (so that, for example "1" always indicates a quality between "0" and "2" within a dataset, but could have different meanings in different datasets). Could the quality also be expressed as a floating point number without any flag meanings? Responding to a point Barna raised: it is certainly possible to have more than one "status_flag" variable, but I don't think it is ideal: if information needs to be split across multiple variables we generally like to describe the difference between the variables in the standard name or in other metadata. In this case, I think there is a good case for using a new standard name. regards, Martin ________________________________ From: CF-metadata <cf-metadata-boun...@cgd.ucar.edu> on behalf of Andrew Barna <aba...@ucsd.edu> Sent: 23 July 2019 00:23 To: Kehoe, Kenneth E. Cc: cf-metadata@cgd.ucar.edu Subject: Re: [CF-metadata] New standard_name of quality_flag for corresponding quality control variables Ken, I guess, I don't see this proposed change as necessary since the distinction between the terms "quality" and "status" is really done in the "flag_meanings" attribute and is basically free form/uncontrolled. These attributes need to be used by this new name as well. Let me rephrase my suggestion/question: If this proposal is not adopted, but an example of how to use a variable, with the standard name of "status_flag", to only indicate data quality is included in the document, would that help? -Barna On Mon, Jul 22, 2019 at 1:22 PM Kehoe, Kenneth E. <kke...@ou.edu> wrote: > > Barna, > > Yes an update to the CF document should follow after the new > standard_name is implemented. I think multiple examples are needed since > status_flag covers many different types of state variables. > > Ken > > > > On 2019-7-22 10:35, Andrew Barna wrote: > > Hi Martin, Ken, > > > > Is there anything wrong with including multiple "status_flag" > > variables to capture all separate state you wish? The CF document > > unfortunately only includes an example of how to encode the status of > > a sensor, but the actual meanings of the flag values are entirely up > > to you, and this will not change with this proposal. Perhaps the CF > > document would benefit from additional examples (e.g. one that only > > shows data quality flags). > > > > -Barna > > > > > > On Mon, Jul 22, 2019 at 9:04 AM Kehoe, Kenneth E. <kke...@ou.edu> wrote: > >> Hi Martin, > >> > >> I see status encompassing multiple metadata pieces of information. For > >> example it could be a state of the instrument as it cycles through a > >> pre-programed routine (Look at calibration target, look at sky, look at > >> ground, look at second calibration target, repeat...). Or the sources of > >> the inputs for a model where the availability or some other reason could > >> require making a decision on what source(s) to use. For provenance this > >> source information is important to report on a time step basis. Or the > >> status could be a data providers method to provide uncertainty > >> information (I see this as incorrect but some people do see it this > >> way). Each of these are important metadata but the method of use is > >> different than a strictly quality variable. A quality variable provides > >> information indicating if the data should be used or possibly could be > >> used in a weighted mean method to favor high quality data over low > >> quality data. The way the metadata is used is different depending on the > >> metadata type. A state of the instrument would be used for sub-setting > >> calibration vs. data. There is no ambiguity in this as data from a > >> calibration target is not used in a weather research analysis. But > >> quality is more subjective and is decided by the data user. If the > >> quality variable has 20 different quality tests the user would need to > >> decided if all 20 test results should be used or only a subset. Also, > >> the code for applying the quality is different than the state of the > >> instrument view (in my example above). > >> > >> It is possible to have a quality test result from the state of the > >> instrument, but not the other way around (typically). So I need a way to > >> distinguish the two for automated or semi-automated tools. Hence my > >> point of quality_flag essentially being a subset of status_flag > >> > >> Ken > >> > >> > >> > >> On 2019-7-22 02:57, Martin Juckes - UKRI STFC wrote: > >>> Dear Ken, > >>> > >>> > >>> Can you expand on the distinction between "quality" and "status"? I > >>> understand that they are different in principle, but, in order to support > >>> this new standard name I think we need a clear objective statement of how > >>> we would want to distinguish between them in CF. > >>> > >>> The conventions section on flags (3.5) mixes the two up > >>> (http://cfconventions.org/cf-conventions/cf-conventions.html#flags ), so > >>> some re-wording of the document would also be needed, > >>> > >>> regards, > >>> Martin > >>> > >>> ________________________________ > >>> From: CF-metadata <cf-metadata-boun...@cgd.ucar.edu> on behalf of Kehoe, > >>> Kenneth E. <kke...@ou.edu> > >>> Sent: 19 July 2019 06:42 > >>> To: cf-metadata@cgd.ucar.edu > >>> Subject: [CF-metadata] New standard_name of quality_flag for > >>> corresponding quality control variables > >>> > >>> Dear CF, > >>> > >>> I am proposing a new standard name of "quality_flag" to indicate a > >>> variable is purely a quality control variable. A quality control variable > >>> would use flag_values or flag_masks along with flag_meanings to allow > >>> declaring levels of quality or results from quality indicating tests of > >>> the data variable. This variable be a subset of the more general > >>> "status_flag" standard name. Currently the definition of "status_flag" is: > >>> > >>> - A variable with the standard name of status_flag contains an indication > >>> of quality or other status of another data variable. The linkage between > >>> the data variable and the variable with the standard_name of status_flag > >>> is achieved using the ancillary_variables attribute. > >>> > >>> This definition includes a variable used to define the state or other > >>> status information of a variable and can not be distinguished by standard > >>> name alone from a state of the instrument, processing decision, source > >>> information, needed metadata about the data variable or other ancillary > >>> variable type. Since there is no other way to define a purely quality > >>> control variable, the use of "status_flag" is too general for strictly > >>> quality control variables. By having a method to define a variable as > >>> strictly quality control the results of quality control tests can be > >>> applied to the data with a software tool based on requests by the user. > >>> This would not affect current datasets that do use "status_flag" nor > >>> require a change to the definition outside of the indication that > >>> "quality_flag" standard name is available and a better use for pure > >>> quality control variables. > >>> > >>> Proposed addition: > >>> > >>> quality_flag = A variable with the standard name of quality_flag contains > >>> an indication of quality information of another data variable. The > >>> linkage between the data variable and the variable or variables with the > >>> standard_name of quality_flag is achieved using the ancillary_variables > >>> attribute. > >>> > >>> Proposed change: > >>> > >>> status_flag = A variable with the standard name of status_flag contains > >>> an indication of status of another data variable. The linkage between the > >>> data variable and the variable with the standard_name of status_flag is > >>> achieved using the ancillary_variables attribute. For data quality > >>> information use quality_flag. > >>> > >>> Thanks, > >>> > >>> Ken > >>> > >>> > >>> > >>> -- > >>> Kenneth E. Kehoe > >>> Research Associate - University of Oklahoma > >>> Cooperative Institute for Mesoscale Meteorological Studies > >>> ARM Climate Research Facility - Data Quality Office > >>> e-mail: kke...@ou.edu<mailto:kke...@ou.edu> | Office: 303-497-4754 > >> -- > >> Kenneth E. Kehoe > >> Research Associate - University of Oklahoma > >> Cooperative Institute for Mesoscale Meteorological Studies > >> ARM Climate Research Facility - Data Quality Office > >> e-mail: kke...@ou.edu | Office: 303-497-4754 > >> > >> _______________________________________________ > >> CF-metadata mailing list > >> CF-metadata@cgd.ucar.edu > >> http://mailman.cgd.ucar.edu/mailman/listinfo/cf-metadata > > -- > Kenneth E. Kehoe > Research Associate - University of Oklahoma > Cooperative Institute for Mesoscale Meteorological Studies > ARM Climate Research Facility - Data Quality Office > e-mail: kke...@ou.edu | Office: 303-497-4754 > > _______________________________________________ > CF-metadata mailing list > CF-metadata@cgd.ucar.edu > http://mailman.cgd.ucar.edu/mailman/listinfo/cf-metadata _______________________________________________ CF-metadata mailing list CF-metadata@cgd.ucar.edu http://mailman.cgd.ucar.edu/mailman/listinfo/cf-metadata _______________________________________________ CF-metadata mailing list CF-metadata@cgd.ucar.edu http://mailman.cgd.ucar.edu/mailman/listinfo/cf-metadata