Re: [sword-devel] Hyphens in book names

2010-10-26 Thread Troy A. Griffitts
I wanted to mention that I did have time to add preliminary support for hyphenated book names in the recently released 1.6.2 package. Please let me know if you find it sufficient. Thanks to everyone who provided valuable feedback and suggestions, especially DM who offered helpful advice for

Re: [sword-devel] Hyphens in book names

2010-09-30 Thread David Haslam
Chris, So is the problem that these languages generally have U+002D (hyphen/minus) originating from an easy to type workaround for normal keyboards (even going back to mechanical typewriters!), and that everyone got used to doing this? If so, what would the SWORD API do with U+2010 (General

Re: [sword-devel] Hyphens in book names

2010-09-30 Thread David Haslam
There are 28 letters in the modern Filipino alphabet. See http://en.wikipedia.org/wiki/Filipino_alphabet http://en.wikipedia.org/wiki/Filipino_alphabet The modern Filipino alphabet may be used as the alphabet for all autochthonous languages of the Philippines. Whence the hyphen then? Is it an

Re: [sword-devel] Hyphens in book names

2010-09-30 Thread David Haslam
See also http://en.wikipedia.org/wiki/Hyphen#Hyphens_in_computing http://en.wikipedia.org/wiki/Hyphen#Hyphens_in_computing David -- View this message in context: http://sword-dev.350566.n4.nabble.com/Hyphens-in-book-names-tp2719769p2720471.html Sent from the SWORD Dev mailing list archive at

Re: [sword-devel] Hyphens in book names

2010-09-30 Thread Daniel Owens
On 09/29/2010 05:42 PM, Troy A. Griffitts wrote: I do have a question to the original poster (and regarding Vietnamese). Do you also use '-' in ranges or do you use another character? This might introduce localized logic differences to the parser, not simply localization strings. Troy

Re: [sword-devel] Hyphens in book names

2010-09-30 Thread Greg Hellings
All of this discussion about whether the hyphen is the proper character that should have been used or whether some holy, blessed POSIX/Unicode committee deemed - is valid for use as a letter or whether it can only be punctuation is probably interesting. But it's probably not interesting to

Re: [sword-devel] Hyphens in book names

2010-09-30 Thread David Troidl
Hi Robert, There are many Unicode characters for hyphens and dashes. Could you substitute, for example, the hyphen from General Punctuation (#x2010;)? This would give the proper appearance, without conflicting with the 'normal' hyphen separator. Peace, David On 9/29/2010 5:28 PM,

Re: [sword-devel] Hyphens in book names

2010-09-30 Thread DM Smith
On 09/30/2010 11:11 AM, David Troidl wrote: Hi Robert, There are many Unicode characters for hyphens and dashes. Could you substitute, for example, the hyphen from General Punctuation (#x2010;)? This would give the proper appearance, without conflicting with the 'normal' hyphen separator.

Re: [sword-devel] Hyphens in book names

2010-09-30 Thread jhphx
Best practice and convention are important and pointing out why something breaks them is not just pedantic. Structure and order in these things helps leads to code and standards that are clear and efficient. It can however also lead to code and standards that are less flexible and usable in

Re: [sword-devel] Hyphens in book names

2010-09-30 Thread Weston Ruter
I think the fundamental problem here is that the SWORD reference parser is too simple. Namely, the parser needs to not blindly split on a hyphen character but rather tokenize the input stream and contextually determine what each token is as it processes the tokens in sequence. For example, if I

Re: [sword-devel] Hyphens in book names

2010-09-30 Thread DM Smith
It's not quite as simple as working with the fully spelled out names. SWORD allows other alternates as well. For example, perhaps the following would work just as well for Apostle-Works: A-W AW Wrks Wrk Wks Wk and any proper prefix of Apostle-Works that does not conflict with another books

Re: [sword-devel] Hyphens in book names

2010-09-30 Thread Weston Ruter
So there would have to be a tokenizer and parser that determines the meaning of the token based on context. On Thu, Sep 30, 2010 at 1:16 PM, DM Smith dmsm...@crosswire.org wrote: It's not quite as simple as working with the fully spelled out names. SWORD allows other alternates as well. For

Re: [sword-devel] Hyphens in book names

2010-09-30 Thread Greg Hellings
On Thu, Sep 30, 2010 at 3:24 PM, Weston Ruter westonru...@gmail.com wrote: So there would have to be a tokenizer and parser that determines the meaning of the token based on context. A superior method of doing this task is what DM suggested with constructing a trie. Then, character by

Re: [sword-devel] Hyphens in book names

2010-09-30 Thread Ben Morgan
On Thu, Sep 30, 2010 at 8:42 AM, Troy A. Griffitts scr...@crosswire.orgwrote: :) In principle I agree with Chris, but I can't decide what people do with names. One of my colleagues in this country (England) is named Instone-Brewer (sorry to use you as an example David). We've been wanting

[sword-devel] Hyphens in book names

2010-09-29 Thread Robert Hunt
New Zealand. Hello all, I am spending today studying the documentation on the Crosswire Sword wiki so I'm likely to have a few questions. Please let me know if this is not the right forum to ask questions. I see in http://www.crosswire.org/wiki/DevTools:SWORD that localised book

Re: [sword-devel] Hyphens in book names

2010-09-29 Thread Daniel Owens
On 09/29/2010 03:55 PM, Robert Hunt wrote: New Zealand. Hello all, I am spending today studying the documentation on the Crosswire Sword wiki so I'm likely to have a few questions. Please let me know if this is not the right forum to ask questions. I see in

Re: [sword-devel] Hyphens in book names

2010-09-29 Thread Greg Hellings
OP was not talking about a transliteration from the sounds of his email, but rather the original language where the hyphen is a letter. You are equivalently proposing an English speaker to not use the letter s in the Bible names list. It might be comprehensible but it would be horrible usability

Re: [sword-devel] Hyphens in book names

2010-09-29 Thread Robert Hunt
On 30/09/10 10:17, Greg Hellings wrote: OP was not talking about a transliteration from the sounds of his email, but rather the original language where the hyphen is a letter. You are equivalently proposing an English speaker to not use the letter s in the Bible names list. It might be

Re: [sword-devel] Hyphens in book names

2010-09-29 Thread Weston Ruter
Is this limitation in SWORD due to the OSIS requirement that book names not have hyphens? OSIS defines that a book (first segment) in an osisID must match ((\p{L}|\p{N}|_|(\\[^\s]))+). This means that a book must contain only letters or numbers or underscores... or it may contain another character

Re: [sword-devel] Hyphens in book names

2010-09-29 Thread Chris Little
Seriously? Someone designed an orthography in which \x2D (hyphen-minus) is used as an alphabetic character? What's its phonemic value? Hyphen-minus is simply not a letter. You can see its character properties at http://www.fileformat.info/info/unicode/char/2d/index.htm. Scroll down to see

Re: [sword-devel] Hyphens in book names

2010-09-29 Thread Chris Little
These are completely unrelated. You're talking about OSIS identifiers, which are non-linguistic data, which is obviously unlocalizable. Robert is talking about book name localization. --Chris On 9/29/10 2:47 PM, Weston Ruter wrote: Is this limitation in SWORD due to the OSIS requirement

Re: [sword-devel] Hyphens in book names

2010-09-29 Thread Ben Morgan
On Thu, Sep 30, 2010 at 7:17 AM, Greg Hellings greg.helli...@gmail.comwrote: Perhaps allowing each locale to define its own numerals and hyphen-like character would be a good solution? This is exactly what BPBible allows. Numerals are defined in the text section with the identifier 0123456789

Re: [sword-devel] Hyphens in book names

2010-09-29 Thread Troy A. Griffitts
:) In principle I agree with Chris, but I can't decide what people do with names. One of my colleagues in this country (England) is named Instone-Brewer (sorry to use you as an example David). We've been wanting to internationalize the numerals and action symbols in our verse parser for a while

Re: [sword-devel] Hyphens in book names

2010-09-29 Thread Robert Hunt
On 30/09/10 11:06, Chris Little wrote: Seriously? Someone designed an orthography in which \x2D (hyphen-minus) is used as an alphabetic character? What's its phonemic value? Oh! I guess I've been using hyphenated words in English since I learnt to write. I unthinkingly used it in the word

Re: [sword-devel] Hyphens in book names

2010-09-29 Thread Jonathan Marsden
All the Filipino languages I came across when I was living there consistently used a Spanish-derived orthography, and I don't remember any of them treating - as a letter. Of course, I didn't deal with the huge majority of the little tribal languages out there! On 9/29/2010 2:28 PM, Robert Hunt

Re: [sword-devel] Hyphens in book names

2010-09-29 Thread Jonathan Marsden
Robert, On 9/29/2010 3:57 PM, Robert Hunt wrote: Oh! I guess I've been using hyphenated words in English since I learnt to write. I unthinkingly used it in the word work-around ... That does not make it a letter. It just makes it a symbol used during writing. Letters are what make up the

Re: [sword-devel] Hyphens in book names

2010-09-29 Thread Weston Ruter
In English, a hyphen is a orthographic convention required when spelling various compound words: http://en.wikipedia.org/wiki/English_compound#Hyphenated_compound_adjectives I imagine the Philippine language Robert is working with has a book name like Apostle-Works (i.e. Acts) On Wed, Sep 29,

Re: [sword-devel] Hyphens in book names

2010-09-29 Thread Jonathan Marsden
On 9/29/2010 5:46 PM, Weston Ruter wrote: In English, a hyphen is a orthographic convention required when spelling various compound words: http://en.wikipedia.org/wiki/English_compound#Hyphenated_compound_adjectives Therefore, in English, it is not a letter. Q.E.D. (since someone on this

Re: [sword-devel] Hyphens in book names

2010-09-29 Thread Robert Hunt
My apologies. I didn't expect such lectures on this list over picky definitions. :-( (As well as hyphenated names, consider the difference in meaning between English prayer and pray-er. Or load the SWORD Tagalog Ang Biblia module [that's the Philippine national language] and look at 1 Peter

Re: [sword-devel] Hyphens in book names

2010-09-29 Thread DM Smith
I've read the thread and I'd like to add my thoughts: I don't think the discussion regarding whether - is a letter is constructive. We have a problem to solve. Right now - is a meta-character indicating a range. I think we should extend the book name parser to work with Bible book names as they