In other words, it tells the library something significant about the module
that the engine needs to know about in order to perform a certain function
correctly.
Aside: I didn't mean to take this offline. I simply forgot to edit the
addresses.
David
Sent from ProtonMail Mobile
On Sun, Jan 7, 2018 at 13:46, DM Smith <dmsm...@crosswire.org> wrote:
> Library uses it to convert bytes to string.
>
> — DM Smith
> From my phone. Brief. Weird autocorrections.
>
> On Jan 7, 2018, at 8:44 AM, David Haslam <dfh...@protonmail.com> wrote:
>
>> Then answer this question, please.
>>
>> What value has the Encoding key ?
>>
>> David
>>
>> Sent from ProtonMail Mobile
>>
>> On Sun, Jan 7, 2018 at 12:52, DM Smith <dmsm...@crosswire.org> wrote:
>>
>>> SWORD too.
>>>
>>> I don’t yet see a value in the suggested conf entry.
>>>
>>> — DM Smith
>>> From my phone. Brief. Weird autocorrections.
>>>
>>> On Jan 7, 2018, at 4:03 AM, David Haslam <dfh...@protonmail.com> wrote:
>>>
>>>> You mean in the JSword API ?
>>>>
>>>> If so, that a start. Thanks, DM. :)
>>>>
>>>> Does that mean you now support the proposed new config key being accepted
>>>> and documented?
>>>>
>>>> Best regards,
>>>>
>>>> David
>>>>
>>>> Sent from ProtonMail Mobile
>>>>
>>>> On Sat, Jan 6, 2018 at 23:43, DM Smith <dmsm...@crosswire.org> wrote:
>>>>
>>>>> I added -N. To make search work.
>>>>>
>>>>> — DM Smith
>>>>> From my phone. Brief. Weird autocorrections.
>>>>>
>>>>> On Jan 6, 2018, at 4:41 PM, David Haslam <dfh...@protonmail.com> wrote:
>>>>>
>>>>>> Thanks DM.
>>>>>>
>>>>>> Interesting observations.
>>>>>>
>>>>>> It prompts the question whether either engine includes the capability to
>>>>>> normalize the search index (assuming that it does normalize the search
>>>>>> key).
>>>>>> And that it does this by default ????
>>>>>> Or does indexing assume that all modules were made without using the -N
>>>>>> option and are therefore already in NFC.
>>>>>> Yet it also remains the case that some front-ends also provide for
>>>>>> non-indexed search options.
>>>>>>
>>>>>> Moreover, it raise questions as to how the front-end actually displays
>>>>>> the set of search results when all or part of the underlying module is
>>>>>> not NFC.
>>>>>>
>>>>>> It must be the case that the developers of osis2mod had a valid reason
>>>>>> to provide the -N option.
>>>>>> Are those involved back then still with CrossWire?
>>>>>>
>>>>>> Best regards,
>>>>>>
>>>>>> David
>>>>>>
>>>>>> Sent from ProtonMail Mobile
>>>>>>
>>>>>> On Sat, Jan 6, 2018 at 21:20, DM Smith <dmsm...@crosswire.org> wrote:
>>>>>>
>>>>>>> The purpose of normalization was for the sake of search. Only when the
>>>>>>> search index and the search request are normalized to the same form can
>>>>>>> a result be found.
>>>>>>>
>>>>>>> It doesn’t matter if the normalized form is not readable. If SWORD (or
>>>>>>> JSword) normalizes both the same, then it doesn’t matter what Unicode
>>>>>>> Normalization or lack of it is used for displaying the text.
>>>>>>>
>>>>>>> Assuming that SWORD (or JSword) handles search properly, the only
>>>>>>> advantage of canonical over decomposed in the module itself is space.
>>>>>>>
>>>>>>> In Him,
>>>>>>> DM
>>>>>>>
>>>>>>>> On Jan 6, 2018, at 2:26 PM, David Haslam <dfh...@protonmail.com> wrote:
>>>>>>>>
>>>>>>>> Good question, Tom.
>>>>>>>>
>>>>>>>> Assuming that the Latin script part of the source text actually
>>>>>>>> required normalization to NFC,
>>>>>>>> and that at least some of the Biblical Hebrew should not be converted
>>>>>>>> to NFC,
>>>>>>>> you'd build the module using the -N switch of osis2mod, after first
>>>>>>>> applying a script
>>>>>>>> to the source text to ensure that both the requirements were
>>>>>>>> implemented.
>>>>>>>>
>>>>>>>> It would be a very simple task for a bespoke TextPipe filter with a
>>>>>>>> restrict filter
>>>>>>>> designed to limit the Convert to NFC subfilter to the text that was
>>>>>>>> not Hebrew.
>>>>>>>>
>>>>>>>> Ignoring alphabetical presentation forms, all the Hebrew characters
>>>>>>>> are in one Unicode block.
>>>>>>>> A PCRE to exclude the Hebrew would be very simple.
>>>>>>>> I could almost do it in my sleep after 17 years using TextPipe.
>>>>>>>> No doubt other programmers could do likewise with Perl or Python, etc.
>>>>>>>>
>>>>>>>> Best regards,
>>>>>>>>
>>>>>>>> David
>>>>>>>>
>>>>>>>> Sent from ProtonMail Mobile
>>>>>>>>
>>>>>>>> On Sat, Jan 6, 2018 at 19:14, Tom Sullivan <i...@beforgiven.info>
>>>>>>>> wrote:
>>>>>>>>
>>>>>>>>> Y'all: For text, such as in a commentary, which includes both Hebrew
>>>>>>>>> and English (or another modern Latin script using language), what do
>>>>>>>>> you put for the normalization? Tom Tom Sullivan
>>>>>>>>> info@BeForgiven.INFOFAX: 815-301-2835 --------------------- Great
>>>>>>>>> News! God created you, owns you and gave you commands to obey. You
>>>>>>>>> have disobeyed God - as your conscience very well attests to you.
>>>>>>>>> God's holiness and justice compel Him to punish you in Hell. Jesus
>>>>>>>>> Christ became Man, was crucified, buried and rose from the dead as a
>>>>>>>>> substitute for all who trust in Him, redeeming them from Hell. If you
>>>>>>>>> repent (turn from your sin) and believe (trust) in Jesus Christ, you
>>>>>>>>> will go to Heaven. Otherwise you will go to Hell. Warning! Good works
>>>>>>>>> are a result, not cause, of saving trust. More info is at
>>>>>>>>> www.esig.beforgiven.infoDo you believe this? Copy this signature into
>>>>>>>>> your email program and use the Internet to spread the Great News
>>>>>>>>> every time you email. On 01/06/2018 12:32 PM, David Haslam wrote: >
>>>>>>>>> Hi Greg, > > One area where it might turn out to be useful is for the
>>>>>>>>> search features > of front-end apps. > It could be important to know
>>>>>>>>> that the underlying module text is _not_ > *NFC*. > > That's not to
>>>>>>>>> lay down a requirement as to how search features should be >
>>>>>>>>> designed, > but at least to provide the information in case it does
>>>>>>>>> matter for some > types of search option. > > Like other things in
>>>>>>>>> .conf files, a key can also be _educational_. > It may prompt
>>>>>>>>> developers and users to ask, /*Why did they do this?*/ > > cf. It was
>>>>>>>>> _almost by accident_ that in 2014, I first came across this > aspect
>>>>>>>>> of using Unicode for Biblical Hebrew. > /It applies only to texts
>>>>>>>>> with _both_ vowel accents and cantillation./ > > Even though it's
>>>>>>>>> mentioned in our developers' wiki, it's all too easily > missed by
>>>>>>>>> other CrossWire volunteers. > > Best regards, > > David > > Sent with
>>>>>>>>> ProtonMail Secure Email. > >> -------- Original Message -------- >>
>>>>>>>>> Subject: Re: [sword-devel] Module .conf files, Unicode Normalization
>>>>>>>>> >> Local Time: 6 January 2018 5:19 PM >> UTC Time: 6 January 2018
>>>>>>>>> 17:19 >> From: greg.helli...@gmail.com >> To: David Haslam , SWORD
>>>>>>>>> Developers' >> Collaboration Forum >> >> Why would the front end or
>>>>>>>>> engine need to know this information? Would >> it help the front end
>>>>>>>>> developers or users to know it? What do we gain >> by adding this?
>>>>>>>>> (I'm not implying it wouldn't be beneficial. But the >> only thing I
>>>>>>>>> know about Unicode is how the different UTF encodings >> work, so I
>>>>>>>>> have no idea what use this information could be. I also >> think
>>>>>>>>> changes to formats and information standards should be >>
>>>>>>>>> conservative instead of liberal) >> >> --Greg >> >> On Jan 6, 2018
>>>>>>>>> 11:01, "David Haslam" > > wrote: >> >> Dear all, >> >> We've known
>>>>>>>>> for quite a few years that there are aspects of >> *Biblical Hebrew*
>>>>>>>>> that mean we should _avoid_ converting the >> Unicode source text to
>>>>>>>>> *NFC* when we build a module. >> >> This prompts me to suggest that
>>>>>>>>> we ought to define a new *key* for >> .conf files. >> >>
>>>>>>>>> *Normalization=NFC* (this would be the default, and may be >>
>>>>>>>>> _omitted_ for the vast majority of modules) >> *Normalization=Custom*
>>>>>>>>> (we should include this in certain Biblical >> Hebrew modules) >> >>
>>>>>>>>> This would make it clear to front-end developers and users alike >>
>>>>>>>>> that the source text was _not_ converted to NFC during module build.
>>>>>>>>> >> i.e. *osis2mod* was used intentionally with the *-N* switch, in >>
>>>>>>>>> _accordance with the requirements of the source text provider_. >> >>
>>>>>>>>> The Unicode source text may already be encoded in *UTF-8* ; this >>
>>>>>>>>> memo is /only /about normalization. >> >> In the rare eventuality
>>>>>>>>> that there could arise a requrement for >> any of the other three
>>>>>>>>> normalization forms (*NFD*, *NFKC*, *NFKD*) >> defined by the Unicode
>>>>>>>>> Consortium, >> these would also be permitted values for the conf file
>>>>>>>>> key. >> >> A further benefit arises when a module needs to be
>>>>>>>>> updated. >> If the modules team sees that the .conf file includes the
>>>>>>>>> line >> *Normalization=Custom* >> they would be forewarned against
>>>>>>>>> converting to NFC through >> /inadvertently/ omitting the *-N* switch
>>>>>>>>> during module build. >> >> _Aside_: Another language with a need for
>>>>>>>>> non-standard >> normalization is *Tibetan*. We don't yet have a
>>>>>>>>> module in that script. >> >> Best regards, >> >> David >> >> Sent
>>>>>>>>> with ProtonMail Secure Email. >> >> >>
>>>>>>>>> _______________________________________________ >> sword-devel
>>>>>>>>> mailing list: sword-devel@crosswire.org >> >>
>>>>>>>>> http://www.crosswire.org/mailman/listinfo/sword-devel >> >>
>>>>>>>>> Instructions to unsubscribe/change your settings at above page > > >
>>>>>>>>> ______________________________________________________________________
>>>>>>>>> > This email has been scanned by the Symantec Email Security.cloud
>>>>>>>>> service. > For more information please visit
>>>>>>>>> http://www.symanteccloud.com>
>>>>>>>>> ______________________________________________________________________
>>>>>>>>> > > > _______________________________________________ > sword-devel
>>>>>>>>> mailing list: sword-devel@crosswire.org>
>>>>>>>>> http://www.crosswire.org/mailman/listinfo/sword-devel > Instructions
>>>>>>>>> to unsubscribe/change your settings at above page >
>>>>>>>>> _______________________________________________ sword-devel mailing
>>>>>>>>> list: sword-devel@crosswire.org
>>>>>>>>> http://www.crosswire.org/mailman/listinfo/sword-develInstructions to
>>>>>>>>> unsubscribe/change your settings at above page @crosswire.org>
>>>>>>>>> @protonmail.com> @protonmail.com> @crosswire.org> @protonmail.com>
>>>>>>>>
>>>>>>>> _______________________________________________
>>>>>>>> sword-devel mailing list: sword-devel@crosswire.org
>>>>>>>> http://www.crosswire.org/mailman/listinfo/sword-devel
>>>>>>>> Instructions to unsubscribe/change your settings at above page
>>>>>
>>>>>> _______________________________________________
>>>>>> sword-devel mailing list: sword-devel@crosswire.org
>>>>>> http://www.crosswire.org/mailman/listinfo/sword-devel
>>>>>> Instructions to unsubscribe/change your settings at above page
>>>
>>>> _______________________________________________
>>>> sword-devel mailing list: sword-devel@crosswire.org
>>>> http://www.crosswire.org/mailman/listinfo/sword-devel
>>>> Instructions to unsubscribe/change your settings at above page
_______________________________________________
sword-devel mailing list: sword-devel@crosswire.org
http://www.crosswire.org/mailman/listinfo/sword-devel
Instructions to unsubscribe/change your settings at above page