Then answer this question, please.

What value has the Encoding key ?

David

Sent from ProtonMail Mobile

On Sun, Jan 7, 2018 at 12:52, DM Smith <dmsm...@crosswire.org> wrote:

> SWORD too.
>
> I don’t yet see a value in the suggested conf entry.
>
> — DM Smith
> From my phone. Brief. Weird autocorrections.
>
> On Jan 7, 2018, at 4:03 AM, David Haslam <dfh...@protonmail.com> wrote:
>
>> You mean in the JSword API ?
>>
>> If so, that a start. Thanks, DM. :)
>>
>> Does that mean you now support the proposed new config key being accepted 
>> and documented?
>>
>> Best regards,
>>
>> David
>>
>> Sent from ProtonMail Mobile
>>
>> On Sat, Jan 6, 2018 at 23:43, DM Smith <dmsm...@crosswire.org> wrote:
>>
>>> I added -N. To make search work.
>>>
>>> — DM Smith
>>> From my phone. Brief. Weird autocorrections.
>>>
>>> On Jan 6, 2018, at 4:41 PM, David Haslam <dfh...@protonmail.com> wrote:
>>>
>>>> Thanks DM.
>>>>
>>>> Interesting observations.
>>>>
>>>> It prompts the question whether either engine includes the capability to 
>>>> normalize the search index (assuming that it does normalize the search 
>>>> key).
>>>> And that it does this by default ????
>>>> Or does indexing assume that all modules were made without using the -N 
>>>> option and are therefore already in NFC.
>>>> Yet it also remains the case that some front-ends also provide for 
>>>> non-indexed search options.
>>>>
>>>> Moreover, it raise questions as to how the front-end actually displays the 
>>>> set of search results when all or part of the underlying module is not NFC.
>>>>
>>>> It must be the case that the developers of osis2mod had a valid reason to 
>>>> provide the -N option.
>>>> Are those involved back then still with CrossWire?
>>>>
>>>> Best regards,
>>>>
>>>> David
>>>>
>>>> Sent from ProtonMail Mobile
>>>>
>>>> On Sat, Jan 6, 2018 at 21:20, DM Smith <dmsm...@crosswire.org> wrote:
>>>>
>>>>> The purpose of normalization was for the sake of search. Only when the 
>>>>> search index and the search request are normalized to the same form can a 
>>>>> result be found.
>>>>>
>>>>> It doesn’t matter if the normalized form is not readable. If SWORD (or 
>>>>> JSword) normalizes both the same, then it doesn’t matter what Unicode 
>>>>> Normalization or lack of it is used for displaying the text.
>>>>>
>>>>> Assuming that SWORD (or JSword) handles search properly, the only 
>>>>> advantage of canonical over decomposed in the module itself is space.
>>>>>
>>>>> In Him,
>>>>> DM
>>>>>
>>>>>> On Jan 6, 2018, at 2:26 PM, David Haslam <dfh...@protonmail.com> wrote:
>>>>>>
>>>>>> Good question, Tom.
>>>>>>
>>>>>> Assuming that the Latin script part of the source text actually required 
>>>>>> normalization to NFC,
>>>>>> and that at least some of the Biblical Hebrew should not be converted to 
>>>>>> NFC,
>>>>>> you'd build the module using the -N switch of osis2mod, after first 
>>>>>> applying a script
>>>>>> to the source text to ensure that both the requirements were implemented.
>>>>>>
>>>>>> It would be a very simple task for a bespoke TextPipe filter with a 
>>>>>> restrict filter
>>>>>> designed to limit the Convert to NFC subfilter to the text that was not 
>>>>>> Hebrew.
>>>>>>
>>>>>> Ignoring alphabetical presentation forms, all the Hebrew characters are 
>>>>>> in one Unicode block.
>>>>>> A PCRE to exclude the Hebrew would be very simple.
>>>>>> I could almost do it in my sleep after 17 years using TextPipe.
>>>>>> No doubt other programmers could do likewise with Perl or Python, etc.
>>>>>>
>>>>>> Best regards,
>>>>>>
>>>>>> David
>>>>>>
>>>>>> Sent from ProtonMail Mobile
>>>>>>
>>>>>> On Sat, Jan 6, 2018 at 19:14, Tom Sullivan <i...@beforgiven.info> wrote:
>>>>>>
>>>>>>> Y'all: For text, such as in a commentary, which includes both Hebrew 
>>>>>>> and English (or another modern Latin script using language), what do 
>>>>>>> you put for the normalization? Tom Tom Sullivan 
>>>>>>> info@BeForgiven.INFOFAX: 815-301-2835 --------------------- Great News! 
>>>>>>> God created you, owns you and gave you commands to obey. You have 
>>>>>>> disobeyed God - as your conscience very well attests to you. God's 
>>>>>>> holiness and justice compel Him to punish you in Hell. Jesus Christ 
>>>>>>> became Man, was crucified, buried and rose from the dead as a 
>>>>>>> substitute for all who trust in Him, redeeming them from Hell. If you 
>>>>>>> repent (turn from your sin) and believe (trust) in Jesus Christ, you 
>>>>>>> will go to Heaven. Otherwise you will go to Hell. Warning! Good works 
>>>>>>> are a result, not cause, of saving trust. More info is at 
>>>>>>> www.esig.beforgiven.infoDo you believe this? Copy this signature into 
>>>>>>> your email program and use the Internet to spread the Great News every 
>>>>>>> time you email. On 01/06/2018 12:32 PM, David Haslam wrote: > Hi Greg, 
>>>>>>> > > One area where it might turn out to be useful is for the search 
>>>>>>> features > of front-end apps. > It could be important to know that the 
>>>>>>> underlying module text is _not_ > *NFC*. > > That's not to lay down a 
>>>>>>> requirement as to how search features should be > designed, > but at 
>>>>>>> least to provide the information in case it does matter for some > 
>>>>>>> types of search option. > > Like other things in .conf files, a key can 
>>>>>>> also be _educational_. > It may prompt developers and users to ask, 
>>>>>>> /*Why did they do this?*/ > > cf. It was _almost by accident_ that in 
>>>>>>> 2014, I first came across this > aspect of using Unicode for Biblical 
>>>>>>> Hebrew. > /It applies only to texts with _both_ vowel accents and 
>>>>>>> cantillation./ > > Even though it's mentioned in our developers' wiki, 
>>>>>>> it's all too easily > missed by other CrossWire volunteers. > > Best 
>>>>>>> regards, > > David > > Sent with ProtonMail Secure Email. > >> -------- 
>>>>>>> Original Message -------- >> Subject: Re: [sword-devel] Module .conf 
>>>>>>> files, Unicode Normalization >> Local Time: 6 January 2018 5:19 PM >> 
>>>>>>> UTC Time: 6 January 2018 17:19 >> From: greg.helli...@gmail.com >> To: 
>>>>>>> David Haslam , SWORD Developers' >> Collaboration Forum  >> >> Why 
>>>>>>> would the front end or engine need to know this information? Would >> 
>>>>>>> it help the front end developers or users to know it? What do we gain 
>>>>>>> >> by adding this? (I'm not implying it wouldn't be beneficial. But the 
>>>>>>> >> only thing I know about Unicode is how the different UTF encodings 
>>>>>>> >> work, so I have no idea what use this information could be. I also 
>>>>>>> >> think changes to formats and information standards should be >> 
>>>>>>> conservative instead of liberal) >> >> --Greg >> >> On Jan 6, 2018 
>>>>>>> 11:01, "David Haslam" > > wrote: >> >> Dear all, >> >> We've known for 
>>>>>>> quite a few years that there are aspects of >> *Biblical Hebrew* that 
>>>>>>> mean we should _avoid_ converting the >> Unicode source text to *NFC* 
>>>>>>> when we build a module. >> >> This prompts me to suggest that we ought 
>>>>>>> to define a new *key* for >> .conf files. >> >> *Normalization=NFC* 
>>>>>>> (this would be the default, and may be >> _omitted_ for the vast 
>>>>>>> majority of modules) >> *Normalization=Custom* (we should include this 
>>>>>>> in certain Biblical >> Hebrew modules) >> >> This would make it clear 
>>>>>>> to front-end developers and users alike >> that the source text was 
>>>>>>> _not_ converted to NFC during module build. >> i.e. *osis2mod* was used 
>>>>>>> intentionally with the *-N* switch, in >> _accordance with the 
>>>>>>> requirements of the source text provider_. >> >> The Unicode source 
>>>>>>> text may already be encoded in *UTF-8* ; this >> memo is /only /about 
>>>>>>> normalization. >> >> In the rare eventuality that there could arise a 
>>>>>>> requrement for >> any of the other three normalization forms (*NFD*, 
>>>>>>> *NFKC*, *NFKD*) >> defined by the Unicode Consortium, >> these would 
>>>>>>> also be permitted values for the conf file key. >> >> A further benefit 
>>>>>>> arises when a module needs to be updated. >> If the modules team sees 
>>>>>>> that the .conf file includes the line >> *Normalization=Custom* >> they 
>>>>>>> would be forewarned against converting to NFC through >> 
>>>>>>> /inadvertently/ omitting the *-N* switch during module build. >> >> 
>>>>>>> _Aside_: Another language with a need for non-standard >> normalization 
>>>>>>> is *Tibetan*. We don't yet have a module in that script. >> >> Best 
>>>>>>> regards, >> >> David >> >> Sent with ProtonMail Secure Email. >> >> >> 
>>>>>>> _______________________________________________ >> sword-devel mailing 
>>>>>>> list: sword-devel@crosswire.org >>  >> 
>>>>>>> http://www.crosswire.org/mailman/listinfo/sword-devel >>  >> 
>>>>>>> Instructions to unsubscribe/change your settings at above page > > > 
>>>>>>> ______________________________________________________________________ 
>>>>>>> > This email has been scanned by the Symantec Email Security.cloud 
>>>>>>> service. > For more information please visit 
>>>>>>> http://www.symanteccloud.com > 
>>>>>>> ______________________________________________________________________ 
>>>>>>> > > > _______________________________________________ > sword-devel 
>>>>>>> mailing list: sword-devel@crosswire.org> 
>>>>>>> http://www.crosswire.org/mailman/listinfo/sword-devel > Instructions to 
>>>>>>> unsubscribe/change your settings at above page > 
>>>>>>> _______________________________________________ sword-devel mailing 
>>>>>>> list: sword-devel@crosswire.org 
>>>>>>> http://www.crosswire.org/mailman/listinfo/sword-devel Instructions to 
>>>>>>> unsubscribe/change your settings at above page @crosswire.org> 
>>>>>>> @protonmail.com> @protonmail.com> @crosswire.org> @protonmail.com>
>>>>>>
>>>>>> _______________________________________________
>>>>>> sword-devel mailing list: sword-devel@crosswire.org
>>>>>> http://www.crosswire.org/mailman/listinfo/sword-devel
>>>>>> Instructions to unsubscribe/change your settings at above page
>>>
>>>> _______________________________________________
>>>> sword-devel mailing list: sword-devel@crosswire.org
>>>> http://www.crosswire.org/mailman/listinfo/sword-devel
>>>> Instructions to unsubscribe/change your settings at above page
>
>> _______________________________________________
>> sword-devel mailing list: sword-devel@crosswire.org
>> http://www.crosswire.org/mailman/listinfo/sword-devel
>> Instructions to unsubscribe/change your settings at above page
_______________________________________________
sword-devel mailing list: sword-devel@crosswire.org
http://www.crosswire.org/mailman/listinfo/sword-devel
Instructions to unsubscribe/change your settings at above page

Reply via email to