Re: [sword-devel] osis2mod very happily generates modules with no text visible
Matěj Cepl wrote, - czap is an element for “text quoted by Czech apostrophe”, and I am not sure whether in all cases it means a quotation of the Old Testament text in the New Testament (as in this case). - All the instances of q marker=' ... /q are in the New Testament, so this seems very likely. Converting these to the milestoned form would be useful to prevent nesting errors if we wish to add other XML elements later. I have created a TextPipe subfilter that can do this. David -- View this message in context: http://sword-dev.350566.n4.nabble.com/osis2mod-very-happily-generates-modules-with-no-text-visible-tp4232587p4249115.html Sent from the SWORD Dev mailing list archive at Nabble.com. ___ sword-devel mailing list: sword-devel@crosswire.org http://www.crosswire.org/mailman/listinfo/sword-devel Instructions to unsubscribe/change your settings at above page
Re: [sword-devel] osis2mod very happily generates modules with no text visible
On 29/12/11 09:05, David Haslam wrote: Thanks for the detailed explanations. I guess this was a comment to Matej's last email rather than a facetious one to my one-liner? :-) But FWIW - I would suggest that cs-utf8.conf is looked through regarding any missed out abbreviations or abbreviations which do not make sense/are wrong in the context of this module. Peter ___ sword-devel mailing list: sword-devel@crosswire.org http://www.crosswire.org/mailman/listinfo/sword-devel Instructions to unsubscribe/change your settings at above page
Re: [sword-devel] osis2mod very happily generates modules with no text visible
Yes - threaded views in email clients (or Nabble) help to contextualize such one liners! It would be helpful to module developers to know where to obtain the conf files necessary for localization of book names, etc. The fledgeling wiki page does not yet cover this topic. http://crosswire.org/wiki/Localization As you're one of those who knows where they are, perhaps you might find time to expand on this topic in the wiki. This suggestion does not debar others from volunteering, of course. David -- View this message in context: http://sword-dev.350566.n4.nabble.com/osis2mod-very-happily-generates-modules-with-no-text-visible-tp4232587p4245779.html Sent from the SWORD Dev mailing list archive at Nabble.com. ___ sword-devel mailing list: sword-devel@crosswire.org http://www.crosswire.org/mailman/listinfo/sword-devel Instructions to unsubscribe/change your settings at above page
Re: [sword-devel] osis2mod very happily generates modules with no text visible
Matěj Cepl wrote, ⌈ and ⌉ are more complicated. They are originally bkzavorka/ and ekzavorka/ and they limit the text for which the appropriate translator note is used. - I think ⌜top corners⌝ are visually less obtrusive for this purpose than ⌈ceilings⌉, so I have included such a replacement in my derived copy of the XML file. I have also made the assumption that each left ⌜corner should be preceded by a space. Wherever a space is not already present, I have inserted one. The lef/right counts mismatch issue is still unresolved. I have put on hold the idea of replacing these marks by XML tags, at least until we have a better source text, one in which there would be no conflicts that lead to nesting errors. David -- View this message in context: http://sword-dev.350566.n4.nabble.com/osis2mod-very-happily-generates-modules-with-no-text-visible-tp4232587p4245971.html Sent from the SWORD Dev mailing list archive at Nabble.com. ___ sword-devel mailing list: sword-devel@crosswire.org http://www.crosswire.org/mailman/listinfo/sword-devel Instructions to unsubscribe/change your settings at above page
Re: [sword-devel] osis2mod very happily generates modules with no text visible
Matěj Cepl wrote, pomlcka/ stands for em-dash; and the original code snippet he cited from Mark.1.1 includes, 61,1pomlcka/3; yet upon examining Matěj's OSIS file, this snippet appears as 61,13; rather than 61,1—3; It would seem that his XSLT script has not converted any of the pomlcka/ to an em-dash; Certainly, there are no — found anywhere in the file. To repair such errors of omission is impossible at this end of things, not having the original text. It requires a fix to the XSLT script. So although I have a working module, many of the notes must include incorrect references. David -- View this message in context: http://sword-dev.350566.n4.nabble.com/osis2mod-very-happily-generates-modules-with-no-text-visible-tp4232587p4246189.html Sent from the SWORD Dev mailing list archive at Nabble.com. ___ sword-devel mailing list: sword-devel@crosswire.org http://www.crosswire.org/mailman/listinfo/sword-devel Instructions to unsubscribe/change your settings at above page
Re: [sword-devel] osis2mod very happily generates modules with no text visible
Thanks for the detailed explanations. David -- View this message in context: http://sword-dev.350566.n4.nabble.com/osis2mod-very-happily-generates-modules-with-no-text-visible-tp4232587p4242256.html Sent from the SWORD Dev mailing list archive at Nabble.com. ___ sword-devel mailing list: sword-devel@crosswire.org http://www.crosswire.org/mailman/listinfo/sword-devel Instructions to unsubscribe/change your settings at above page
Re: [sword-devel] osis2mod very happily generates modules with no text visible
There are many locations in the source text where spaces were omitted. I've been able to tackle most of these by searching for various patterns. The following questions remain: Is it correct for the left and right ceiling characters to be found within words? Or should the left ceiling always be preceded by a space (or a start quotation mark), and the right ceiling by either a space (or a suitable punctuation mark)? David -- View this message in context: http://sword-dev.350566.n4.nabble.com/osis2mod-very-happily-generates-modules-with-no-text-visible-tp4232587p4243953.html Sent from the SWORD Dev mailing list archive at Nabble.com. ___ sword-devel mailing list: sword-devel@crosswire.org http://www.crosswire.org/mailman/listinfo/sword-devel Instructions to unsubscribe/change your settings at above page
Re: [sword-devel] osis2mod very happily generates modules with no text visible
Matěj, I am not a Linux user. I keep putting off the day when I have to use Linux. David -- View this message in context: http://sword-dev.350566.n4.nabble.com/osis2mod-very-happily-generates-modules-with-no-text-visible-tp4232587p4239363.html Sent from the SWORD Dev mailing list archive at Nabble.com. ___ sword-devel mailing list: sword-devel@crosswire.org http://www.crosswire.org/mailman/listinfo/sword-devel Instructions to unsubscribe/change your settings at above page
Re: [sword-devel] osis2mod very happily generates modules with no text visible
Everyone, I have spent a good part of today using TextPipe to fix the pretty OSIS XML file, and have made substantial progress. Here's a bulleted list of fixes already done... * A second bookGroup division for the NT * Book divisions (66) * Chapter elements (1089) * Section titles (2084) * Verse end milestones (31169) At this stage the OSIS XML still validates. Currently working on the paragraph tags, each of which merely enclosed a pilcrow ¶ sign. Wrapped the actual paragraphs instead, and moved the pilcrow to paragraph start. btw. As with the KJV, the first verse of a chapter does not have a pilcrow. Hit complications in Revelation where some quote marks and paragraphs are in conflict. i.e. A nesting problem to be solved. David -- View this message in context: http://sword-dev.350566.n4.nabble.com/osis2mod-very-happily-generates-modules-with-no-text-visible-tp4232587p4240218.html Sent from the SWORD Dev mailing list archive at Nabble.com. ___ sword-devel mailing list: sword-devel@crosswire.org http://www.crosswire.org/mailman/listinfo/sword-devel Instructions to unsubscribe/change your settings at above page
Re: [sword-devel] osis2mod very happily generates modules with no text visible
Fixed the p q ... nesting bug in Revelation, only four locations needed tweaking. Beyond the OSIS, there are some differences in character counts for these three left/right pairs. 28 ( LEFT PARENTHESIS8,278 29 ) RIGHT PARENTHESIS 8,323 Difference =45 5B [ LEFT SQUARE BRACKET 6,321 5D ] RIGHT SQUARE BRACKET6,256 Difference = -65 002308 ⌈ LEFT CEILING6,663 002309 ⌉ RIGHT CEILING 6,635 Difference = -28 This may be more of concern to the translation team. David -- View this message in context: http://sword-dev.350566.n4.nabble.com/osis2mod-very-happily-generates-modules-with-no-text-visible-tp4232587p4240496.html Sent from the SWORD Dev mailing list archive at Nabble.com. ___ sword-devel mailing list: sword-devel@crosswire.org http://www.crosswire.org/mailman/listinfo/sword-devel Instructions to unsubscribe/change your settings at above page
Re: [sword-devel] osis2mod very happily generates modules with no text visible
Everyone, I now have a working CzeCSP module! A further question for Matěj I suspect that the following codepoints may need converting from a legacy font. 002045 ⁅ LEFT SQUARE BRACKET WITH QUILL 1,648 002046 ⁆ RIGHT SQUARE BRACKET WITH QUILL 1,648 002308 ⌈ LEFT CEILING6,663 002309 ⌉ RIGHT CEILING 6,635 If that's not the case, their use in the Czech language requires explaining. I have added the following line to the .conf file so that these strange characters are displayed. Font=Code2000 I have also disabled the following two filters, as I could not find any corresponding OSIS tags. ;GlobalOptionFilter=OSISVariants ;GlobalOptionFilter=OSISLemma NB. Not yet pursued Peter's suggestion to code the other cross references that are still plain text. It feels like a day well spent. David -- View this message in context: http://sword-dev.350566.n4.nabble.com/osis2mod-very-happily-generates-modules-with-no-text-visible-tp4232587p4240818.html Sent from the SWORD Dev mailing list archive at Nabble.com. ___ sword-devel mailing list: sword-devel@crosswire.org http://www.crosswire.org/mailman/listinfo/sword-devel Instructions to unsubscribe/change your settings at above page
Re: [sword-devel] osis2mod very happily generates modules with no text visible
On 28/12/11 19:40, David Haslam wrote: NB. Not yet pursued Peter's suggestion to code the other cross references that are still plain text. If you frame them with a reference tag, xreffix.pl can create proper OSIS refs. Peter ___ sword-devel mailing list: sword-devel@crosswire.org http://www.crosswire.org/mailman/listinfo/sword-devel Instructions to unsubscribe/change your settings at above page
Re: [sword-devel] osis2mod very happily generates modules with no text visible
On 28.12.2011 20:40, David Haslam wrote: I suspect that the following codepoints may need converting from a legacy font. 002045 ⁅ LEFT SQUARE BRACKET WITH QUILL 1,648 002046 ⁆ RIGHT SQUARE BRACKET WITH QUILL 1,648 002308 ⌈ LEFT CEILING6,663 002309 ⌉ RIGHT CEILING 6,635 If that's not the case, their use in the Czech language requires explaining. They are not relevant to Czech, but these were my attempts to somehow preserve apparatus of variants in the Czech text ... ... hmm, looking at the original text once more it seems like my bug here. See for example note 1 in 62_Mk.xml (the original source file of the Mark’s gospel): defpozn n=v1Řec. slovo italiceuangelion/italic se nachází už v#x00A0;Homérově Odyseji (14, 152.166), a#x00A0;to ve významu bczuv/odměna za dobrou zprávueczuv/. Zároveň tento výraz označoval bczuv/dobrou zprávueczuv/ samu. Základem užití tohoto slova v#x00A0;NS je Stará smlouva (LXX). Velmi důležité jsou texty, kde se vyskytuje příslušné odvozené sloveso italiceuangelizesthai/italic (bczuv/zvěstovat / hlásat dobrou zprávueczuv/). Viz zvl. Iz#x00A0;52,7 a#x00A0;61,1pomlcka/3; srv. Sk#x00A0;5,42p/defpozn Here ⁅ ⁆ are translations of bczuv/ and eczuv/ respectively which are just (beginning|end)-of-Czech-úvozovek (quotes), so I guess these should be translated to plain Unicode „ and “ (which is the way we use double quotes). I have to check once more why I didn't translated it to double-quotes in the first place. (The other tags here should be simple ... defpozn is “defition of poznámka” (note), italic is obvious, pomlcka/ stands for em-dash; preservation of unbreakable space is important for following of Czech grammar rules about breaking lines). italic and bczuv are just two examples of many cases of mixing semantic and typographic markup in the text. I haven't seen the code really for the last couple of months, so I have to recheck why I didn't go the obvious way and make bczuv/ and eczuv/ into proper double-quotes. --- ⌈ and ⌉ are more complicated. They are originally bkzavorka/ and ekzavorka/ and they limit the text for which the appropriate translator note is used. See again the first two verses of the Mark’s gospel: kap n=1/ vers n=1/Počátekodkazo n=o1/ evangeliaodkaz n=v1/ Ježíše Krista, hzavorkaSyna Božího/hzavorkaodkazo n=o2/ bkzavorka/. vers n=2/Jakekzavorka/odkaz n=t2/ je perfnapsáno/perf bkzavorka/v#x00A0;proroku Izaiášoviekzavorka/:odkaz n=t3/ czapHle, hzavorkajá/hzavorka posílám svého poslaodkaz n=t4/ před tvou tváří, který futupraví/fut tvou cestu italic[před tebou]/italic.odkazo n=o3/ Here odkaz n=t2/ (“odkaz” is “a reference” in Czech) referes to the text over the border of verse “. Jak” (and apparently, IMHO, this is a mistake in the original markup of the text, but it is so printed in the paper version of the Bible). Other codes, just to make more examples: - kap/ is a chapter milestone - vers/ is a verse milestone - odkazo is a “o-type” reference (meaning a reference to a footnote containing reference to the other verse of the Bible, the other one is odkaz/ for a footnote with the actual text of translator’s note). - hzavorka element is a markup for the text used in less important manuscripts (although John 7:59-8:11 is not marked up at all and there is only a odkaz at Jn 7:59; the same goes for Mk 16:9- which has also only a text note explaining that the rest of the chapter is not present in many important manuscripts), so they used it probably only for small portions of text. - perf is an element marking the verb in the Greek perfectum, which is difficult to translate correctly to Czech as we don't have perfectum as a special case for verbs. - czap is an element for “text quoted by Czech apostrophe”, and I am not sure whether in all cases it means a quotation of the Old Testament text in the New Testament (as in this case). - fut is another case of marking up a Greek verb which cannot be well translated into Czech; here it is the other way around ... Czech future tenses are more rich than Greek ones (there are grammatical aspects http://en.wikipedia.org/wiki/Grammatical_aspect in Czech, which are not in Greek; so a Czech translation could make a difference which isn’t in the original text. I hope I've shown you what's the battle I am fighting with. Guys, thank you so so much for helping with this! Blessings, Matěj ___ sword-devel mailing list: sword-devel@crosswire.org http://www.crosswire.org/mailman/listinfo/sword-devel Instructions to unsubscribe/change your settings at above page
Re: [sword-devel] osis2mod very happily generates modules with no text visible
On 28/12/11 23:10, David Haslam wrote: Thanks Peter, Do we have a list of Czech bookname abbreviations for use in the locale for running xreffix.pl ? Yes. cs-utf8.conf Peter ___ sword-devel mailing list: sword-devel@crosswire.org http://www.crosswire.org/mailman/listinfo/sword-devel Instructions to unsubscribe/change your settings at above page
Re: [sword-devel] osis2mod very happily generates modules with no text visible
Replying to the entire thread: Debugging: The -d flag is very helpful to see what osis2mod is doing. You can use the -d flag multiple times as in -d 1 -d 4 or add them together as in -d 5 or both -d 5 -d 8 To get full debugging -d 1023 SIGSEGV I think Troy and I fixed this in SVN, revision 2669 in versekey.cpp and 2671 in osis2mod.cpp. You are using 2562. This bug is hit when a chapter is not in the chosen versification. Since the output is not flushed with every verse, the raw dat file is not helpful in determining where the problem occurs, but only that it occurs after that point. David's suggestion for raw text: During development, it is best to use raw text as you can open the dat file(s) and see what is present. Pretty much it should be everything except verse markers. If you use -d 2, it will put verse markers into the output. Sub-identifiers: SWORD engine cannot handle ! in the osisID, so osis2mod strips them off of the osisIDs for verses. I did not see any for verses, but only for notes. Markup of cross references: Peter mentioned that references should be marked up. In your module you frequently have something like: note osisRef=Gen.1.1 osisID=Gen.1.1!o1 type=crossReference n=o1 14,19; Ex 20,11; Iz 37,16; Jr 32,17; 1Pa 16,26; Neh 9,6; Ž 8,4; 115,15; Sk 4,24 /note The problem with this is that the SWORD and JSword engines won't create clickable references for such a note. You should wrap each reference with: reference osisRef=Gen.14.1914,19/reference; reference osisRef=Exod.20.11Ex 20,11/reference; While OSIS allows for more than one reference in an osisRef, SWORD does not handle the linking for it, but only the first listed. Unless the osisRef is for adjacent verses in the same chapter, split each reference into its own tag. Sub-identifiers: osis2mod handles sub-identifiers on a verse's osisID by stripping it off. Are there any? I didn't see any. -d 128 will output DEBUG(REF): Found a grain suffix when it finds one. If it finds Gen.1.1!a and later Gen.1.1!b, it will append Gen.1.1!b to what was already found for Gen.1.1. To generalize: osis2mod does not care what the order of verses are in the module. It will quite happily re-order them. And if it finds the same verse reference a subsequent time, it will append the text to the verse. However, an osisID=Gen.1.1 Gen.1.2!a followed by osisID=Gen.1.2!b Gen.1.3 will not be handled as you might expect. All the text will be stored in the dat file, but the idx files might not point properly. Book tags: Can be either(in order of recommendation): div type=book osisID=Gen sID=Gen/ ... div eID=Gen/ div type=book osisID=Gen.../div Osis2mod uses the book divs to determine what titles are book titles. Osis2mod also uses them to determine whether the book is in the v11n. If they are not present, it assumes they are. This could cause SIGSEGV/ Osis2mod does not require book divs. But you should have them. Chapter tags: Can be one of (in order of recommendation): chapter osisID=Gen.1 sID=Gen.1/ ... chapter eID=Gen.1/ div type=chapter sID=Gen.1/...div eID=Gen.1/ chapter osisID=Gen.1.../chapter div type=chapter osisID=Gen.1.../div Osis2mod uses the chapter tags to help determine what titles before verse 1 are chapter titles. Osis2mod does not require chapter tags, but you should have them. Verse tags: Can be either (in order of recommendation): verse osisID=Gen.1.1 sID=Gen.1.1/ ... verse eID=Gen.1.1/ verse osisID=Gen.1.1.../verse The milestoned version is probably required when you have robust structural markup (e.g. poetry, section divs, quotations). eID: For each sID, you must have a corresponding eID. The use of sID and eID must be properly nested. The value of the sID/eID pair can be anything, but typically it matches the osisID. I didn't look for other problems in the module. Hope this helps. In His Service, DM On Dec 27, 2011, at 3:27 AM, Matej Cepl wrote: On 27.12.2011 01:48, Peter von Kaehne wrote: BTW the text contains an enormous number of references which are not marked up. Once you got the above corrected please attend to this too. It would make the module a lot better. Yes, amount of notes and references is one of the reasons I really want to make this to work. However, I am not sure what you mean exactly by “references which are not marked up”. Could you
Re: [sword-devel] osis2mod very happily generates modules with no text visible
On 27.12.2011 01:48, Peter von Kaehne wrote: BTW the text contains an enormous number of references which are not marked up. Once you got the above corrected please attend to this too. It would make the module a lot better. Yes, amount of notes and references is one of the reasons I really want to make this to work. However, I am not sure what you mean exactly by “references which are not marked up”. Could you elaborate a bit, please? Also, so far (I may relax this in the future) the OSIS file is generated by a XSLT transformation from the original XML files used in translation (https://gitorious.org/sword/czekms-csp_bible/blobs/master/CEP2OSIS.xsl). I am not the best XSLT (try the other end of spectrum) so some things may take some time to do. Maybe I rewrite the script in Python (although XSLT works so far too well for me to be tempted) or I may relax this requirement. We’ll see. More after New Year! Blessings on all of you, Matěj ___ sword-devel mailing list: sword-devel@crosswire.org http://www.crosswire.org/mailman/listinfo/sword-devel Instructions to unsubscribe/change your settings at above page
Re: [sword-devel] osis2mod very happily generates modules with no text visible
I downloaded it with Firefox 3.6.25, though I could just as easily have used Chrome or IE. I doubt that the browser makes any changes to what is within a binary file such as .gz I regularly download all sorts of things, and this has never happened before. I used 7-Zip to extract things locally. I could just as easily have used IZArc. I merely guessed that after the first extraction, the file bible.xml should have been bible.xml.tar After all, even a Windows user with many friends who are Linux users gets to know what sort of things to expect from that system. David -- View this message in context: http://sword-dev.350566.n4.nabble.com/osis2mod-very-happily-generates-modules-with-no-text-visible-tp4232587p4238119.html Sent from the SWORD Dev mailing list archive at Nabble.com. ___ sword-devel mailing list: sword-devel@crosswire.org http://www.crosswire.org/mailman/listinfo/sword-devel Instructions to unsubscribe/change your settings at above page
Re: [sword-devel] osis2mod very happily generates modules with no text visible
DM wrote, Sub-identifiers: SWORD engine cannot handle ! in the osisID, so osis2mod strips them off of the osisIDs for verses. I did not see any for verses, but only for notes. You are quite correct. When I stripped out all the note elements (using TextPipe) there were no sub-identifiers left within the resulting XML file. David -- View this message in context: http://sword-dev.350566.n4.nabble.com/osis2mod-very-happily-generates-modules-with-no-text-visible-tp4232587p4238153.html Sent from the SWORD Dev mailing list archive at Nabble.com. ___ sword-devel mailing list: sword-devel@crosswire.org http://www.crosswire.org/mailman/listinfo/sword-devel Instructions to unsubscribe/change your settings at above page
Re: [sword-devel] osis2mod very happily generates modules with no text visible
I suppose that strictly speaking a SWORD module can be made without having bookGroup or x-testament division[s]. Otherwise the append option -a in osis2mod could not be used to build a successful module. i.e. If you append each book (apart from the first) from a separate OSIS file (as I have on other occasions), the resulting module could not have any testament divs, could it? Yet it still works. Has anyone else made a pretty-print version of Matěj's OSIS file? It certainly helps to see the wood from the trees. David -- View this message in context: http://sword-dev.350566.n4.nabble.com/osis2mod-very-happily-generates-modules-with-no-text-visible-tp4232587p4238187.html Sent from the SWORD Dev mailing list archive at Nabble.com. ___ sword-devel mailing list: sword-devel@crosswire.org http://www.crosswire.org/mailman/listinfo/sword-devel Instructions to unsubscribe/change your settings at above page
Re: [sword-devel] osis2mod very happily generates modules with no text visible
On 27.12.2011 21:58, David Haslam wrote: Has anyone else made a pretty-print version of Matěj's OSIS file? It certainly helps to see the wood from the trees. tidy -i -xml -utf8 -m bible.xml doesn't work for you? Matěj ___ sword-devel mailing list: sword-devel@crosswire.org http://www.crosswire.org/mailman/listinfo/sword-devel Instructions to unsubscribe/change your settings at above page
Re: [sword-devel] osis2mod very happily generates modules with no text visible
On the Mac it downloaded as bible.xml using chrome. I renamed that to bible.zip and double clicked that to get bible, to which I added the .xml extension. On the Mac it is the OS that handles the unpacking. This happens all the time on the Mac and I am used to it. It probably is tied w the mime type that the server provides for the download in the http header. Curl ignores that. Cent from my fone so theer mite be tipos. ;) On Dec 27, 2011, at 3:33 PM, David Haslam dfh...@googlemail.com wrote: I downloaded it with Firefox 3.6.25, though I could just as easily have used Chrome or IE. I doubt that the browser makes any changes to what is within a binary file such as .gz I regularly download all sorts of things, and this has never happened before. I used 7-Zip to extract things locally. I could just as easily have used IZArc. I merely guessed that after the first extraction, the file bible.xml should have been bible.xml.tar After all, even a Windows user with many friends who are Linux users gets to know what sort of things to expect from that system. David -- View this message in context: http://sword-dev.350566.n4.nabble.com/osis2mod-very-happily-generates-modules-with-no-text-visible-tp4232587p4238119.html Sent from the SWORD Dev mailing list archive at Nabble.com. ___ sword-devel mailing list: sword-devel@crosswire.org http://www.crosswire.org/mailman/listinfo/sword-devel Instructions to unsubscribe/change your settings at above page ___ sword-devel mailing list: sword-devel@crosswire.org http://www.crosswire.org/mailman/listinfo/sword-devel Instructions to unsubscribe/change your settings at above page
Re: [sword-devel] osis2mod very happily generates modules with no text visible
On Wed, 2011-12-28 at 00:10 +0100, Matěj Cepl wrote: On 27.12.2011 21:58, David Haslam wrote: Has anyone else made a pretty-print version of Matěj's OSIS file? It certainly helps to see the wood from the trees. tidy -i -xml -utf8 -m bible.xml doesn't work for you? David does not use Linux. He is only partially aware of how much he is missing out on. ;-) Peter ___ sword-devel mailing list: sword-devel@crosswire.org http://www.crosswire.org/mailman/listinfo/sword-devel Instructions to unsubscribe/change your settings at above page
Re: [sword-devel] osis2mod very happily generates modules with no text visible
On Wed, Dec 28, 2011, at 12:42 AM, Peter von Kaehne wrote: On Wed, 2011-12-28 at 00:10 +0100, Matěj Cepl wrote: On 27.12.2011 21:58, David Haslam wrote: Has anyone else made a pretty-print version of Matěj's OSIS file? It certainly helps to see the wood from the trees. tidy -i -xml -utf8 -m bible.xml doesn't work for you? David does not use Linux. He is only partially aware of how much he is missing out on. ;-) He doesn't have to miss out on tidy, there is a (fairly old) GNUwin32 port of it, for example at http://gd.tuwien.ac.at/pub/gnu/gnu-win32/release/tidy/ Jonathan -- Jonathan Marsden jmars...@fastmail.fm ___ sword-devel mailing list: sword-devel@crosswire.org http://www.crosswire.org/mailman/listinfo/sword-devel Instructions to unsubscribe/change your settings at above page
Re: [sword-devel] osis2mod very happily generates modules with no text visible
On Dec 27, 2011, at 8:32 AM, DM Smith dmsm...@crosswire.org wrote: Chapter tags: Can be one of (in order of recommendation): chapter osisID=Gen.1 sID=Gen.1/ ... chapter eID=Gen.1/ div type=chapter sID=Gen.1/...div eID=Gen.1/ chapter osisID=Gen.1.../chapter div type=chapter osisID=Gen.1.../div Osis2mod uses the chapter tags to help determine what titles before verse 1 are chapter titles. Osis2mod does not require chapter tags, but you should have them. div type=chapter should not be used in Bibles, only chapter. The latter alone refers to Bible chapters. The former would be used for chapters in all other types of books. --Chris ___ sword-devel mailing list: sword-devel@crosswire.org http://www.crosswire.org/mailman/listinfo/sword-devel Instructions to unsubscribe/change your settings at above page
Re: [sword-devel] osis2mod very happily generates modules with no text visible
What was the reason for not generating eID milestones for verses? These should be present as per the example shown in the wiki page. David -- View this message in context: http://sword-dev.350566.n4.nabble.com/osis2mod-very-happily-generates-modules-with-no-text-visible-tp4232587p4234233.html Sent from the SWORD Dev mailing list archive at Nabble.com. ___ sword-devel mailing list: sword-devel@crosswire.org http://www.crosswire.org/mailman/listinfo/sword-devel Instructions to unsubscribe/change your settings at above page
Re: [sword-devel] osis2mod very happily generates modules with no text visible
On 25.12.2011 22:57, David Haslam wrote: The file bible.xml within http://mcepl.fedorapeople.org/tmp/bible.xml.gz seems to be not a text file but a binary! In fact, it needs to be renamed as bible.xml.tar and decompressed yet again to get the real bible.xml mitmanek:~ $ curl -L -O http://mcepl.fedorapeople.org/tmp/bible.xml.gz % Total% Received % Xferd Average Speed TimeTime Time Current Dload Upload Total SpentLeft Speed 100 3708k 100 3708k0 0 767k 0 0:00:04 0:00:04 --:--:-- 792k mitmanek:~ $ file bible.xml.gz bible.xml.gz: gzip compressed data, was bible.xml, from Unix, last modified: Sun Dec 25 07:43:38 2011, max compression mitmanek:~ $ gunzip bible.xml.gz mitmanek:~ $ file bible.xml bible.xml: XML document text mitmanek:~ $ head bible.xml ?xml version=1.0 encoding=utf-8? osis xmlns=http://www.bibletechnologies.net/2003/OSIS/namespace; xmlns:xsi=http://www.w3.org/2001/XMLSchema-instance; xmlns:xi=http://www.w3.org/2003/XInclude; osisText osisIDWork=CzeKMS osisRefWork=bible xml:lang=cs canonical=true header revisionDesc resp=Matěj Cepl date2010.09.07/date pThis is just an information about the book./p /revisionDesc work osisWork=CzeKMS titleČeský studijní překlad Bible/title mitmanek:~ $ Sorry, that's your browser which did some repackaging behind the scenes. Matěj ___ sword-devel mailing list: sword-devel@crosswire.org http://www.crosswire.org/mailman/listinfo/sword-devel Instructions to unsubscribe/change your settings at above page
Re: [sword-devel] osis2mod very happily generates modules with no text visible
On 26.12.2011 08:18, David Haslam wrote: Thechapter ... tags should end with rather than / Also there are no/chapter tags. Meaning, chapter/ couldn't be a milestone? How does it align with this? (from http://crosswire.org/wiki/OSIS_Bibles#OSIS_Milestones): # It is allowable to use milestone elements for verses alone, or for # both verses and chapters. The body example above is for the former. Also, see the chapter 7.5 “The div element” (page 36) and example therein in “OSIS Users Manual” (http://www.bibletechnologies.net/utilities/fmtdocview.cfm?id=28871A67-D5F5-4381-B22EC4947601628Bmethod=title). What I am missing? Thanks for helping me. Merry Christmass (and thoughtful feast of St. Stephen, let’s we all have courage to stand up for what we believe!) Matěj ___ sword-devel mailing list: sword-devel@crosswire.org http://www.crosswire.org/mailman/listinfo/sword-devel Instructions to unsubscribe/change your settings at above page
Re: [sword-devel] osis2mod very happily generates modules with no text visible
On 26/12/11 19:55, Matěj Cepl wrote: On 26.12.2011 08:18, David Haslam wrote: Thechapter ... tags should end with rather than / Also there are no/chapter tags. Meaning, chapter/ couldn't be a milestone? Chapters are usually milestoned. Nothing wrong with that. Peter ___ sword-devel mailing list: sword-devel@crosswire.org http://www.crosswire.org/mailman/listinfo/sword-devel Instructions to unsubscribe/change your settings at above page
Re: [sword-devel] osis2mod very happily generates modules with no text visible
The indeces (ot.vss, nt.vss) are essentially all 0. The osis file, while validating contains not the correct info to create a module. Peter ___ sword-devel mailing list: sword-devel@crosswire.org http://www.crosswire.org/mailman/listinfo/sword-devel Instructions to unsubscribe/change your settings at above page
Re: [sword-devel] osis2mod very happily generates modules with no text visible
On 26/12/11 21:39, Peter von Kaehne wrote: The indeces (ot.vss, nt.vss) are essentially all 0. The osis file, while validating contains not the correct info to create a module. The terminal output of osis2mod is meant to show what osis2mod is doing. It does nothing. The file is somehow not correct - even if it validates. Peter ___ sword-devel mailing list: sword-devel@crosswire.org http://www.crosswire.org/mailman/listinfo/sword-devel Instructions to unsubscribe/change your settings at above page
Re: [sword-devel] osis2mod very happily generates modules with no text visible
mod2imp shows that all content is put into verse 0 of each chapter. Switching on headlines in xiphos makes a lot of text appear. Peter On 26/12/11 21:57, Peter von Kaehne wrote: On 26/12/11 21:39, Peter von Kaehne wrote: The indeces (ot.vss, nt.vss) are essentially all 0. The osis file, while validating contains not the correct info to create a module. The terminal output of osis2mod is meant to show what osis2mod is doing. It does nothing. The file is somehow not correct - even if it validates. Peter ___ sword-devel mailing list: sword-devel@crosswire.org http://www.crosswire.org/mailman/listinfo/sword-devel Instructions to unsubscribe/change your settings at above page ___ sword-devel mailing list: sword-devel@crosswire.org http://www.crosswire.org/mailman/listinfo/sword-devel Instructions to unsubscribe/change your settings at above page
Re: [sword-devel] osis2mod very happily generates modules with no text visible
I hope to get a look at it tomorrow. Im busy w av11n stuff for jsword and entertaining my new grandson. Cent from my fone so theer mite be tipos. ;) On Dec 26, 2011, at 6:01 PM, Peter von Kaehne ref...@gmx.net wrote: mod2imp shows that all content is put into verse 0 of each chapter. Switching on headlines in xiphos makes a lot of text appear. Peter On 26/12/11 21:57, Peter von Kaehne wrote: On 26/12/11 21:39, Peter von Kaehne wrote: The indeces (ot.vss, nt.vss) are essentially all 0. The osis file, while validating contains not the correct info to create a module. The terminal output of osis2mod is meant to show what osis2mod is doing. It does nothing. The file is somehow not correct - even if it validates. Peter ___ sword-devel mailing list: sword-devel@crosswire.org http://www.crosswire.org/mailman/listinfo/sword-devel Instructions to unsubscribe/change your settings at above page ___ sword-devel mailing list: sword-devel@crosswire.org http://www.crosswire.org/mailman/listinfo/sword-devel Instructions to unsubscribe/change your settings at above page ___ sword-devel mailing list: sword-devel@crosswire.org http://www.crosswire.org/mailman/listinfo/sword-devel Instructions to unsubscribe/change your settings at above page
Re: [sword-devel] osis2mod very happily generates modules with no text visible
On 27/12/11 00:33, DM Smith wrote: Im busy w av11n stuff for jsword Peter von Kaehne likes that. A lot. ___ sword-devel mailing list: sword-devel@crosswire.org http://www.crosswire.org/mailman/listinfo/sword-devel Instructions to unsubscribe/change your settings at above page
Re: [sword-devel] osis2mod very happily generates modules with no text visible
DMSmith is working towards making JSword capable to support av11n. Only when that's in place can the front-ends based on JSword begin to take advantage thereof. David -- View this message in context: http://sword-dev.350566.n4.nabble.com/osis2mod-very-happily-generates-modules-with-no-text-visible-tp4232587p4233404.html Sent from the SWORD Dev mailing list archive at Nabble.com. ___ sword-devel mailing list: sword-devel@crosswire.org http://www.crosswire.org/mailman/listinfo/sword-devel Instructions to unsubscribe/change your settings at above page
Re: [sword-devel] osis2mod very happily generates modules with no text visible
As a useful tip, it's generally a good idea to first to make a rawtext module with osis2mod. This avoids some known bugs in making a ztext module directly using mod2osis. You can easily use mod2zmod to compress it afterwards. David -- View this message in context: http://sword-dev.350566.n4.nabble.com/osis2mod-very-happily-generates-modules-with-no-text-visible-tp4232587p4233409.html Sent from the SWORD Dev mailing list archive at Nabble.com. ___ sword-devel mailing list: sword-devel@crosswire.org http://www.crosswire.org/mailman/listinfo/sword-devel Instructions to unsubscribe/change your settings at above page
Re: [sword-devel] osis2mod very happily generates modules with no text visible
The file bible.xml within http://mcepl.fedorapeople.org/tmp/bible.xml.gz seems to be not a text file but a binary! In fact, it needs to be renamed as bible.xml.tar and decompressed yet again to get the real bible.xml If you double compress something using tar then gzip, it's best to indicate that in the filename! David -- View this message in context: http://sword-dev.350566.n4.nabble.com/osis2mod-very-happily-generates-modules-with-no-text-visible-tp4232587p4233421.html Sent from the SWORD Dev mailing list archive at Nabble.com. ___ sword-devel mailing list: sword-devel@crosswire.org http://www.crosswire.org/mailman/listinfo/sword-devel Instructions to unsubscribe/change your settings at above page
Re: [sword-devel] osis2mod very happily generates modules with no text visible
Your OSIS XML file makes extensive use of sub-identifiers. This is the first time I've seen these in any OSIS file. I have not seen these discussed in any previous threads in the mailing list. This might turn out to be the real cause of why osis2mod doesn't produce a useful module. The relevant section in the OSIS Reference Manual is: 15.3. Sub-identifiers Translations also often split verses into parts, provided labels such as ‘a’ and ‘b’ for the separate parts. Encoders may freely add sub-identifiers below the lowest standardized level. They are set off from the standardized portion by the character ‘!’. For example: Rev.2.20!b Such subdivisions are not standard across different translations, so applications must be prepared to discard them when trying to locate a referenced location in a different edition. These extensions are not considered a formal part of the canonical reference scheme, and so a work that uses them need not claim it is using a different scheme. -- View this message in context: http://sword-dev.350566.n4.nabble.com/osis2mod-very-happily-generates-modules-with-no-text-visible-tp4232587p4233487.html Sent from the SWORD Dev mailing list archive at Nabble.com. ___ sword-devel mailing list: sword-devel@crosswire.org http://www.crosswire.org/mailman/listinfo/sword-devel Instructions to unsubscribe/change your settings at above page
Re: [sword-devel] osis2mod very happily generates modules with no text visible
If you make a temporary OSIS XML file with all the note elements removed, does that convert to a module with readable text? If all the sub-identifiers happen to lie within the study notes, then that might be a profitable course of investigation. David -- View this message in context: http://sword-dev.350566.n4.nabble.com/osis2mod-very-happily-generates-modules-with-no-text-visible-tp4232587p4233521.html Sent from the SWORD Dev mailing list archive at Nabble.com. ___ sword-devel mailing list: sword-devel@crosswire.org http://www.crosswire.org/mailman/listinfo/sword-devel Instructions to unsubscribe/change your settings at above page
Re: [sword-devel] osis2mod very happily generates modules with no text visible
The cause is much simpler! Your path for module creation is in ztext, yet your .conf file is for a rawtext module. Such a combination will always result in a module that appears to be empty of content. btw. I've made the same mistake myself on rare occasions. David -- View this message in context: http://sword-dev.350566.n4.nabble.com/osis2mod-very-happily-generates-modules-with-no-text-visible-tp4232587p4233919.html Sent from the SWORD Dev mailing list archive at Nabble.com. ___ sword-devel mailing list: sword-devel@crosswire.org http://www.crosswire.org/mailman/listinfo/sword-devel Instructions to unsubscribe/change your settings at above page
Re: [sword-devel] osis2mod very happily generates modules with no text visible
Though the OSIS validates, there is only one top level div element! osis2mod expects each book to be a div, and each testament to be also a div The chapter ... tags should end with rather than / Also there are no /chapter tags. See http://crosswire.org/wiki/OSIS_Bibles#Body David -- View this message in context: http://sword-dev.350566.n4.nabble.com/osis2mod-very-happily-generates-modules-with-no-text-visible-tp4232587p4233989.html Sent from the SWORD Dev mailing list archive at Nabble.com. ___ sword-devel mailing list: sword-devel@crosswire.org http://www.crosswire.org/mailman/listinfo/sword-devel Instructions to unsubscribe/change your settings at above page
Re: [sword-devel] osis2mod very happily generates modules with no text visible
It was easier for me to spot these issues in the body structure after I had used the *XML Tools* plugin for *Notepad++* to first linearize the XML and then 'pretty print' the XML. David -- View this message in context: http://sword-dev.350566.n4.nabble.com/osis2mod-very-happily-generates-modules-with-no-text-visible-tp4232587p4234014.html Sent from the SWORD Dev mailing list archive at Nabble.com. ___ sword-devel mailing list: sword-devel@crosswire.org http://www.crosswire.org/mailman/listinfo/sword-devel Instructions to unsubscribe/change your settings at above page