Chris Little wrote:
(I'm just going to answer those questions that I can actually offer some advice on. Most of the KJV markup is Troy's work, so he'll have to answer some questions himself.)

DM Smith wrote:
The following construct is fairly frequent:
<w src="19" lemma="strong:G4155" morph="robinson:V-IAI-3S">and took <transChange type="added">him</transChange> by the throat</w>

The problem is that <w> does not allow for <transChange>.
Should this be changed to <seg type="x-transChange" subType="added">him</seg>?
(This is the form that is used within study notes.)

If the markup is correct, this would be something to address to the OSIS TC for correction in the next version of the standard. And I could see where it could be an issue in some languages, even if it doesn't turn out to be one in Greek.

I think it is correct and also valid. In many languages, gender is communicated by context, but in translating to English it may be rendered explicitly. In the KJV these instances are indicated by italics.

Earlier today I saw "give *him* drink" where "give drink" was one word in Greek and *him* was added for clarity.

So I suggest that, for OSIS 2.5, transChange be allowed as a child of <w>.

Not knowing any meaningful quantity of Greek, I find it a bit hard to believe that they have a single word that means "to take by the throat"--so it may be that the <w> element is capturing too much. If the meaning is something more basic (without an incorporated object) like "to choke", then you can probably do something like:
Unbelievable as it may be, the Strong's entry is:
4155 pnigo pnee'-go strengthened from 4154; to wheeze, i.e. (causative, by implication) to throttle or strangle (drown):--choke, take by the throat. see GREEK for 4154


<w src="19" lemma="strong:G4155" morph="robinson:V-IAI-3S">and took</w> <transChange type="added">him</transChange> <w src="19" lemma="strong:G4155" morph="robinson:V-IAI-3S">by the throat</w>

I don't think I like splitting the word into a before and an after. This might communicate to a casual user of the document that 4155 can be translated as either "took" or "by the throat".

In such a case, the better markup probably would be what is already encoded, since the added "him" is morphologically specified by the verb identified by the <w> element.

On the other hand, <seg>, as you propose, preserves the correct information, permitting a quick and easy update to a new version of OSIS if it is changed to accommodate the more proper encoding.

I like having explicit markup as it is easier to transform later.
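
To illustrate why the explicit <seg> form is easy to transform later, here is a minimal sketch of the conversion back to <transChange>, should a future OSIS version allow it inside <w>. It is only an illustration, not part of any SWORD or OSIS tool: the function name seg_to_trans_change is hypothetical, it uses Python's standard xml.etree.ElementTree, and it assumes the elements carry no XML namespace (a real OSIS file declares one, so the tag names would need the namespace prefix).

```python
import xml.etree.ElementTree as ET


def seg_to_trans_change(root):
    """Rewrite <seg type="x-transChange" subType="X"> as <transChange type="X">.

    Hypothetical helper: modifies the tree in place and returns the number of
    elements converted. Assumes un-namespaced element names.
    """
    converted = 0
    # Materialize the list first so renaming tags cannot disturb the iteration.
    for el in list(root.iter("seg")):
        if el.get("type") != "x-transChange":
            continue  # leave every other kind of <seg> untouched
        sub_type = el.attrib.pop("subType", None)
        el.tag = "transChange"
        if sub_type is not None:
            el.set("type", sub_type)  # e.g. subType="added" becomes type="added"
        else:
            del el.attrib["type"]     # nothing to carry over
        converted += 1
    return converted


# The fragment from this thread, using the <seg> form proposed above.
sample = (
    '<w src="19" lemma="strong:G4155" morph="robinson:V-IAI-3S">and took '
    '<seg type="x-transChange" subType="added">him</seg> by the throat</w>'
)
root = ET.fromstring(sample)
converted = seg_to_trans_change(root)
result = ET.tostring(root, encoding="unicode")
```

Because the type and subType attributes carry everything the <transChange> element needs, the conversion is a mechanical rename and no information about the added word is lost.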
_______________________________________________
sword-devel mailing list: [email protected]
http://www.crosswire.org/mailman/listinfo/sword-devel
Instructions to unsubscribe/change your settings at above page