Carriage returns in DV_TEXT not allowed

Leonardo Moretti Wed, 11 Jan 2012 01:30:32 -0800 (PST)

I don't think DV_PARAGRAPH is a good solution for me. I think the construct
of DV_TEXT/DV_PARAGRAPH is too much complex to be used: we need to have an
?ad hoc text editor? to show and set the content of DV_PARAGRAPH. Moreover,
I don't see which are the real advantages this approach gives: although
DV_PARAGRAPH gives a sort of structure to text, this remain a free text, not
coded, and without any metadata associated to DV_TEXT lines.


Whatever I need to use text containing some words that are hyperlinked /
coded / formatted, I prefer to use DV_PARSABLE, where a plain formalism is
used (formalism matches {"text/plain"}).

Anyway, carriage returns, tabs, line feeds are not parts of a specific
formalism, but just characters. I think this is an excessive restriction (at
least because I don't see the need for this restriction). Other data types,
as ISO ST, let to use any valid unicode characters.

leo


Thomas Beale-3 wrote:
> 
> On 10/01/2012 10:05, Leonardo Moretti wrote:
>> If DV_TEXT doesn't allow to use carriage returns, line feeds, or other 
>> non-printing characters, as stated in 
>> http://www.openehr.org/releases/1.0.2/architecture/rm/data_types_im.pdf, 
>> pag 29, there is a way to represent short text with minimal formatting 
>> characters (carriage returns)? Which data type should be used?
>>
> 
> It would be interesting to know how many other implementers agree with 
> this restriction. It was put in (from memory) in the very early days of 
> modelling, based on GEHR, and possibly somewhat on 13606 - nearly 10 
> years ago!
> 
> The idea was that DV_TEXT models a 'text fragment', essentially the idea 
> of a word, string of words, sentence or possibly a group of sentences. 
> No CR/LF were allowed because this is taken as a paragraph delimiter, 
> and the type DV_PARAGRAPH was defined to represent multiple DV_TEXTs 
> making up a long tract of text like a report. In proper word processing 
> & publishing, this is correct; a 'paragraph' has no CR/LF in it, which 
> is what allows resizing to work properly in different screen / form
> widths.
> 
> Additionally, any 'atomic' text item, e.g. a single disease name, single 
> sentence etc - which make up the majority of text data within structured 
> data - should not have a CR/LF.
> 
> This way of thinking may be dated, and it is a good question as to when 
> a piece of text can't be a single DV_TEXT. If we stick with the current 
> model 
> <http://www.openehr.org/uml/release-1.0.1/Browsable/_9_0_76d0249_1109597816149_873308_762Report.html>
>  
> (and remember, a DV_CODED_TEXT is-a DV_TEXT in openEHR), the openEHR RM 
> is imposing a simple word processing model of 'paragraphs' made up of 
> 'text fragments'. An alternative would be to allow anything in a 
> DV_TEXT. The decision about when you have to have a new DV_TEXT is made 
> on the basis of attributes other than the actual string value, i.e.:
> 
>   * hyperlink: if there is a hyperink, it applies to the entire DV_TEXT;
>     therefore, if you only want a link to correspond to 2 words, then
>     those 2 words = 1 DV_TEXT
>   * formattting: simple formatting like bolding, emphasis (about the
>     same level as typical wiki markup) applies to the whole DV_TEXT;
>   * mappings: coded mappings, e.g. ICD code applies to whole DV_TEXT;
>     need to use multiple DV_TEXTs if only some words are to have an
>     associated code mapping
>   * formal coding: if a DV_CODED_TEXT is to be used - i.e. when the
>     string value is the term for the code from its terminology (not just
>     some mapping), then the DV_CODED_TEXT.value can only consist of the
>     exact word string to which the code corresponds; more DV_TEXTs have
>     to be added using a DV_PARAGRAPH to construct a whole paragraph
> 
> The best approach with the current model is:
> 
>   * for atomic text items, e.g. single word/sentence answers to
>     questions, single coded terms like names of diseases, procedures
>     etc, use a single DV_CODED_TEXT or DV_TEXT.
>   * for a tract of text containing some words that are hyperlinked /
>     coded / formatted, use a DV_PARAGRAPH containing multiple DV_TEXTs.
>   * Or else you use a DV_PARSABLE, containing XML, HTML, RTF or whatever
>     you like - but ... no guarantee the receiver can read it!
> 
> This does not actually solve properly the problem of how CR/LFs are 
> added. If we assume one DV_PARAGRAPH = 1 CR/LF (as in word processing) 
> then a report needs to consist of multiple DV_PARAGRAPHs, and we don't 
> currently have a data type for that. To fix the current model we could 
> add a new type DV_DOCUMENT, which contains multiple DV_PARAGRAPHs. Or we 
> could remove the restriction on CR/LF on DV_TEXT, but that then would 
> allow CR/LFs to occur in single DV_CODED_TEXT strings, which is almost 
> certainly an error. But maybe we just assume that software never makes 
> this error?
> 
> The real question is: do we want to have any explicit word-processor 
> like model of text? 10 years ago, the answer seemed obvious: yes, 
> because there is no reliable standard of marked up text (many variants 
> of HTML, XML, wiki markup, etc). I am not sure the answer is any 
> different today. From a clinical perspective, guaranteeing readability 
> of text in a standard way is paramount.
> 
> - thomas
> 
> 
> _______________________________________________
> openEHR-technical mailing list
> openEHR-technical at openehr.org
> http://lists.chime.ucl.ac.uk/mailman/listinfo/openehr-technical
> 
> 

-- 
View this message in context: 
http://old.nabble.com/Carriage-returns-in-DV_TEXT-not-allowed-tp33113080p33119721.html
Sent from the openehr-technical mailing list archive at Nabble.com.

Carriage returns in DV_TEXT not allowed

Reply via email to