Hyphenation
If I have hyphenate=true property set on block containing a table-cell and if the cell is very narrow (cannot accommodate more than 4 characters), the word seems to be broken into small segments but I don't see any '-' at the end of each segment. I did not see this behavior when using AntennaHouse. Is there any known issue with this? Also I was getting a bunch of errors from FOP that I need to use the language property. I ended up using the language=en property but that did not solve the problem completely. In some cases I see the hyphenation happen but not in all cases. Thanks, Kumar Puppala - To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]
Re: hyphenation
robert frapples wrote: This leads me to believe that I should be using 7. With 7 or with 1 (and no other 1's), it still breaks: Frap- ples,Robert(Age- ncy) I also tried setting this in my {src-dir}/src/hyph/en.xml and recompiling fop.jar. Still no luck. Try running java -classpath ...fop.jar \ org.apache.fop.layout.hyphenation.HyphenationTree This class has a main function, which can be used for debugging (check the source for details). You can use this to check whether the problem is with the hyphenation librabry code and/or your pattern definitions or whether it is in the code invoking the hyphenator in the line layout. J.Pietschmann - To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]
Re: hyphenation
Odd numbers represent an acceptable location for a hyphen, with higher values overriding lower inhibiting values. . .Due to a bug currently patterns with values of 8 and greater don't have an effect, so don't wonder. (from http://xml.apache.org/fop/hyphenation.html) This leads me to believe that I should be using 7. With 7 or with 1 (and no other 1's), it still breaks: Frap- ples,Robert(Age- ncy) I also tried setting this in my {src-dir}/src/hyph/en.xml and recompiling fop.jar. Still no luck. Later on (probably tomorrow) I'll attach an example xml,xsl,en.xml snippet. On Tue, 14 Dec 2004 18:54:46 +0100, Andreas L. Delmelle [EMAIL PROTECTED] wrote: Just to make sure, if i added to classes: ) ( , and to patterns: 7( )7 ,7 And the text: Frapples,Robert(Agency) is split at 15 characters, I should get: Frapples,- Robert(Agency) Right? Not necessarily... '7' indicates an acceptable possibility for hyphenating, but as long as there's a possibility of breaking it at a '3', the latter will be chosen. Besides that: F-R-A-P-P-L-E-S-,-R-O-B-E-R-T is 15 characters, so I'd expect it to break between 'Robert' and '('. What result do you get? - To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]
RE: hyphenation
-Original Message- From: robert frapples [mailto:[EMAIL PROTECTED] Hi, Odd numbers represent an acceptable location for a hyphen, with higher values overriding lower inhibiting values. Sorry. Indeed! Don't know where my head was... Upside-down it seems :-) This leads me to believe that I should be using 7. With 7 or with 1 (and no other 1's), it still breaks: Frap- ples,Robert(Age- ncy) Beats me... Anyone who can shed some light on this? TIA! Greetz, Andreas - To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]
Re: hyphenation
On Mon, 13 Dec 2004 23:17:01 +0100, J.Pietschmann [EMAIL PROTECTED] wrote: robert frapples wrote: C:\fop\hyph\en.xml has: . . . patterns 5( ,5 )5 . . . Shouldn't this make it tend to break before and after parenthesis, and after commas? Am I missing something? You have to add parenthesis and comma to the classes entry. And you'll get a hyphen unless you redefined the hyphen character to be something like a ZWS. I added, to classes ( ) , Still no luck. - To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]
Re: hyphenation
I tried recompiling my fop.jar with the changes in {source-dir}/hyph/en.xml, still no luck. Just to make sure, if i added to classes: ) ( , and to patterns: 7( )7 ,7 And the text: Frapples,Robert(Agency) is split at 15 characters, I should get: Frapples,- Robert(Agency) Right? On Tue, 14 Dec 2004 07:24:39 -0500, robert frapples [EMAIL PROTECTED] wrote: On Mon, 13 Dec 2004 23:17:01 +0100, J.Pietschmann [EMAIL PROTECTED] wrote: robert frapples wrote: C:\fop\hyph\en.xml has: . . . patterns 5( ,5 )5 . . . Shouldn't this make it tend to break before and after parenthesis, and after commas? Am I missing something? You have to add parenthesis and comma to the classes entry. And you'll get a hyphen unless you redefined the hyphen character to be something like a ZWS. I added, to classes ( ) , Still no luck. - To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]
RE: hyphenation
-Original Message- From: robert frapples [mailto:[EMAIL PROTECTED] Hi, Just to make sure, if i added to classes: ) ( , and to patterns: 7( )7 ,7 And the text: Frapples,Robert(Agency) is split at 15 characters, I should get: Frapples,- Robert(Agency) Right? Not necessarily... '7' indicates an acceptable possibility for hyphenating, but as long as there's a possibility of breaking it at a '3', the latter will be chosen. Besides that: F-R-A-P-P-L-E-S-,-R-O-B-E-R-T is 15 characters, so I'd expect it to break between 'Robert' and '('. What result do you get? Greetz, Andreas - To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]
hyphenation
I call fop with fop.bat -c C:\fop\cfg.xml . . . C:\fop\cfg.xml has entrykeyhyphenation-dir/keyvalueC:\fop\hyph/value/entry C:\fop\hyph\en.xml has: . . . patterns 5( ,5 )5 . . . Shouldn't this make it tend to break before and after parenthesis, and after commas? Am I missing something? - To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]
Re: hyphenation
robert frapples wrote: C:\fop\hyph\en.xml has: . . . patterns 5( ,5 )5 . . . Shouldn't this make it tend to break before and after parenthesis, and after commas? Am I missing something? You have to add parenthesis and comma to the classes entry. And you'll get a hyphen unless you redefined the hyphen character to be something like a ZWS. J.Pietschmann - To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]
Re: Re: hyphenation
!!!ATTENTION CE COURRIER N'EST PAS ARRIVE A DESTINATION!!! Due a une très forte progression du SPAM sur notre boite aux lettres, nous vous prions de bien vouloir transférer votre courriel a notre nouvelle adresse: [EMAIL PROTECTED] Merci de votre compréhension Alacuisine.Net !!!ATTENTION CE COURRIER N'EST PAS ARRIVE A DESTINATION!!! - To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]
RE: hyphenation in single syllable language?
-Original Message- From: Hoàng Tuân Nguy?n Ð?ng [mailto:[EMAIL PROTECTED] Hi, i want to know about the capablity to disable spliting a word in fop ? hyphenation as i know!! i've build a Vietnamese pdf document with fop, but there is some problem with the result. Vietnamese is a single syllable language! Well... I see the document in question uses Vietnamese as well as English. I suspect you have hyphenation turned on for all of the document (?) So most likely, FOP is using English hyphenation patterns for the Vietnamese parts as well? If that is the case, I see two options: - activate hyphenation for all of the document, and disable it (hyphenate=false) for the Vietnamese parts - activate hyphenation (hyphenate=true) only for the English parts Hope this helps! Greetz, Andreas - To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]
RE: hyphenation in single syllable language?
-Original Message- From: Hoang Tuan Nguy?n D?ng [mailto:[EMAIL PROTECTED] Hi, snip / i've try your suggest but there is no change :(. fist at all, i've change my locale in docbook source from 'vi' to 'en', the hyphenation is work well, my word isn't split ! But when i try to copy en.xml to vi.xml and define hyphenation-dir in userconfig.xml; when fop-ing, there is no error messages about building hyphenation tree for language vi; but the word is split in PDF document I'm a bit confused here... So, you are 'copying' the XML containing the English hyphenation patterns to a new file 'vi.xml'? :-/ As I see it, this will have exactly the effect I described as 'using English hyph patterns for Vietnamese text'. Unless the 'vi.xml' is made to contain information specifically about hyphenation in Vietnamese, it's always going to lead to undesired results. When i add those lines: xsl:attribute-set name='hypenateText' xsl:attribute name='hyphenate'false/xsl:attribute xsl:attribute name='language'vi/xsl:attribute /xsl:attribute-set in my xsl stylesheet ang generate fo file with this; nothing change!! All depends on where this attribute-set is used. If this set is in fact being used on the Vietnamese parts, and hyphenation is still not disabled for those blocks of text, this would be a bug. If you were using two attribute-sets: xsl:attribute-set name=hyph-English xsl:attribute name=languageen/xsl:attribute xsl:attribute name=hyphenatetrue/xsl:attribute /xsl:attribute-set and xsl:attribute-set name=hyph-Vietnamese xsl:attribute name=languagevi/xsl:attribute xsl:attribute name=hyphenatefalse/xsl:attribute /xsl:attribute-set The XSL should look like: fo:block use-attribute-sets=hyph-English !-- For English Text -- /fo:block fo:block use-attribute-sets=hyph-Vietnamese !-- For Vietnamese Text -- /fo:block IIC, it would even be sufficient to only use the first set, and for the Vietnamese parts, just omit the hyphenation properties. Hope this helps! Greetz, Andreas - To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]
Re: hyphenation in single syllable language?
On Thu, Nov 11, 2004 at 05:13:37PM +0100, Andreas L. Delmelle wrote: From: Hoang Tuan Nguy?n D?ng [mailto:[EMAIL PROTECTED] Hi, snip / i've try your suggest but there is no change :(. fist at all, i've change my locale in docbook source from 'vi' to 'en', the hyphenation is work well, my word isn't split ! But when i try to copy en.xml to vi.xml and define hyphenation-dir in userconfig.xml; when fop-ing, there is no error messages about building hyphenation tree for language vi; but the word is split in PDF document I'm a bit confused here... So, you are 'copying' the XML containing the English hyphenation patterns to a new file 'vi.xml'? :-/ As I see it, this will have exactly the effect I described as 'using English hyph patterns for Vietnamese text'. Unless the 'vi.xml' is made to contain information specifically about hyphenation in Vietnamese, it's always going to lead to undesired results. When i add those lines: xsl:attribute-set name='hypenateText' xsl:attribute name='hyphenate'false/xsl:attribute xsl:attribute name='language'vi/xsl:attribute /xsl:attribute-set in my xsl stylesheet ang generate fo file with this; nothing change!! If you would have no file vi.xml and declare the language of the vietnamese text parts to be 'vi', those parts should not be hyphenated. Regards, Simon -- Simon Pepping home page: http://www.leverkruid.nl - To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]
hyphenation
Hi, how i can activate the hyphenation for geman! Regards! Thava! - To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]
Re: hyphenation
See here: http://xml.apache.org/fop/hyphenation.html http://offo.sourceforge.net/ http://nagoya.apache.org/eyebrowse/[EMAIL PROTECTED]msgId=1881555 On 23.09.2004 14:09:34 Anton Packianathan wrote: how i can activate the hyphenation for geman! Jeremias Maerki - To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]
Project offo distributes hyphenation pattern files for FOP
Dear FOP users, In February 2004 a large number of hyphenation pattern files were removed from FOP's CVS repository due to licensing issues. These hyphenation patterns were contributed to FOP under licenses which allowed their free distribution, but under conditions which were felt to be in contradiction with the Apache license, under which FOP is distributed. Most files are licensed under the LaTeX Project Public License (http://www.latex-project.org/lppl/index.html), which requires that no modified version be published under the name of the original source file. For details see the FOP wiki page (http://nagoya.apache.org/wiki/apachewiki.cgi?FOPAudits/March2003). I am please to announce that the hyphenation pattern files for FOP are now made available by the project `Objects for Formatting Objects' (offo) (http://offo.sourceforge.net/). They can be downloaded from offo's project page (http://sourceforge.net/projects/offo/). At this moment the homepage of the project is not yet ready. Therefore the overview of the hyphenation pattern files and their licenses is available from my web site, http://www.leverkruid.nl/FOP/hyphenation.html. It is also contained in the package file. Regards, Simon -- Simon Pepping home page: http://www.leverkruid.nl - To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]
Re: Hyphenation foundry
Jörg, On Wed, Jun 16, 2004 at 11:15:42PM +0200, J.Pietschmann wrote: Simon Pepping wrote: I think it is time to create a project for the hyphenation files at Sourceforge. The project should be a home for all sorts of accessories to FOP, or even to FO processors in general. Do you want to participate? Do you know a nice name? Well, sf.net would appeal to a larger body of developers, I think, and is certainly easier to menage for small projects, but we can also ask on jakarta-commons, xml-commons and even declare it a FOP (or XML graphics) subproject. It is up to you to decide where you want your efforts to be host it. I am happy to present it in a sf.net project, but there is little more that I can do. Do you want to pursue these efforts further, or do you want to make them available to others who might be interested? I am afraid that is always a difficult proposition without active recruitment. Anyway, I just uploaded http://cvs.apache.org/~pietsch/t.tar.gz which contains several unfinished stuff I produced the last year: - Utilities to generate tables for the Unicode line break property - A class keeping a line break state according to TR14, which should be easier to usee than the java.text.BreakIterator for FOP - A Java port of MySpell - An attempt at providing a layered hierarchy for spell checking and hyphenation interfaces. - A Java port of the link grammar parser (incomplete, badly designed, buggy and without approvement of the original authors, *please* use only for personal study, don't redistribute). - An attempt at a morphological analyzer for german words. Somehow, the simple port of patgen as well as other attempts at simplifying the current FOP hyphenator are missing, I hope I remember to upload them tomorrow. I see in the tar file two parts: java code and data. Would it be useful to indicate which parts of the code belong to each of the above items? buggy and without approvement of the original authors, *please* use only for personal study, don't redistribute). This would not exclude making it available on a public web page? Regarding your items to chew on, would you want these to be part of a web page for this code? Regards, Simon -- Simon Pepping home page: http://www.leverkruid.nl - To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]
Re: Hyphenation foundry
I would like some rather different ideas: FOX or FO-extra AmFOra: an amphora for FO goodies EuFOria: FO goodies that make FO users euphoric Well-FO-Armed Simon On Wed, Jun 16, 2004 at 01:40:21PM -0700, Clay Leeds wrote: Sure! I'd love to participate! I don't know how yet, though... Such a project brings always more work with it than I care to do :-) -- Simon Pepping home page: http://www.leverkruid.nl - To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]
Re: Hyphenation foundry
On Jun 20, 2004, at 12:14 PM, Simon Pepping wrote: Jörg, On Wed, Jun 16, 2004 at 11:15:42PM +0200, J.Pietschmann wrote: Simon Pepping wrote: I think it is time to create a project for the hyphenation files at Sourceforge. The project should be a home for all sorts of accessories to FOP, or even to FO processors in general. Do you want to participate? Do you know a nice name? Well, sf.net would appeal to a larger body of developers, I think, and is certainly easier to menage for small projects, but we can also ask on jakarta-commons, xml-commons and even declare it a FOP (or XML graphics) subproject. It is up to you to decide where you want your efforts to be host it. I am happy to present it in a sf.net project, but there is little more that I can do. Do you want to pursue these efforts further, or do you want to make them available to others who might be interested? I am afraid that is always a difficult proposition without active recruitment. Echoing Simon... Bring it on! The more (components, peripherals toys) the merrier!... However if active recruitment is required be prepared for... well, active recruiting... as for a project name, how about calling it: The XSL-FO Toybox = http://xslfo-toy-box.sourceforge.net/ or simply: The Toybox = http://toybox.sourceforge.net/ or: Toys http://toys.sourceforge.net/ Toys for Geeks == http://toys-for-geeks.sourceforge.net/ XML Toys http://xmltoys.sourceforge.net/ Java XML Toys = http://javaxmltoys.sourceforge.net/ or something... I think I like XMLToys... Anyway, I just uploaded http://cvs.apache.org/~pietsch/t.tar.gz which contains several unfinished stuff I produced the last year: - Utilities to generate tables for the Unicode line break property - A class keeping a line break state according to TR14, which should be easier to usee than the java.text.BreakIterator for FOP - A Java port of MySpell - An attempt at providing a layered hierarchy for spell checking and hyphenation interfaces. - A Java port of the link grammar parser (incomplete, badly designed, buggy and without approvement of the original authors, *please* use only for personal study, don't redistribute). - An attempt at a morphological analyzer for german words. Somehow, the simple port of patgen as well as other attempts at simplifying the current FOP hyphenator are missing, I hope I remember to upload them tomorrow. I see in the tar file two parts: java code and data. Would it be useful to indicate which parts of the code belong to each of the above items? Perhaps a classification system such as this for the project-name/applications/[insert apps here] project-name/peripherals/[insert peripherals here] project-name/examples/[insert examples here] project-name/so-on/[insert so-on here] project-name/docs/[insert docs here] project-name/wiki/[insert wiki here] buggy and without approvement of the original authors, *please* use only for personal study, don't redistribute). This would not exclude making it available on a public web page? Regarding your items to chew on, would you want these to be part of a web page for this code? Regards, Simon -- Simon Pepping home page: http://www.leverkruid.nl Sounds llike something of a non-trivial undertaking... :-) Sounds like fun! Web Maestro Clay - [EMAIL PROTECTED] --- My religion is simple. My religion is kindness. - His Holiness the 14th Dalai Lama of Tibet - To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]
Re: Hyphenation foundry
Simon Pepping wrote: Do you want to pursue these efforts further, or do you want to make them available to others who might be interested? I wanted to make my current code available to others, in particular the PatGen port (which unfortunately went missing), in order to avoid unnecessary duplication of work. This would not exclude making it available on a public web page? Yes. I'll replace the tarball with another one without the link parser but including the PatGen code. Unfortunately I detected the problem only after I uploaded the file and started composing the message, and I thought I'd leave it for now lest the other work go unnoticed. Regarding your items to chew on, would you want these to be part of a web page for this code? Why not? J.Pietschmann - To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]
Re: Hyphenation foundry [was: Re: proposed font project]
Simon Pepping wrote: Hi Clay, On Sat, May 29, 2004 at 10:02:37PM -0700, Clay Leeds wrote: It would also be good to develop some sort of hyphenation foundry... I think it is time to create a project for the hyphenation files at Sourceforge. The project should be a home for all sorts of accessories to FOP, or even to FO processors in general. Do you want to participate? Do you know a nice name? Hy-pe Hy-Phi Peter -- Peter B. West http://www.powerup.com.au/~pbwest/resume.html - To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]
Re: Hyphenation foundry [was: Re: proposed font project]
J.Pietschmann wrote: Simon Pepping wrote: I think it is time to create a project for the hyphenation files at Sourceforge. The project should be a home for all sorts of accessories to FOP, or even to FO processors in general. Do you want to participate? Do you know a nice name? Well, sf.net would appeal to a larger body of developers, I think, and is certainly easier to menage for small projects, but we can also ask on jakarta-commons, xml-commons and even declare it a FOP (or XML graphics) subproject. Anyway, I just uploaded http://cvs.apache.org/~pietsch/t.tar.gz which contains several unfinished stuff I produced the last year: - Utilities to generate tables for the Unicode line break property Does Character.UnicodeBlock provide any of this functionality? Peter -- Peter B. West http://www.powerup.com.au/~pbwest/resume.html - To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]
Hyphenation foundry [was: Re: proposed font project]
Hi Clay, On Sat, May 29, 2004 at 10:02:37PM -0700, Clay Leeds wrote: It would also be good to develop some sort of hyphenation foundry... I think it is time to create a project for the hyphenation files at Sourceforge. The project should be a home for all sorts of accessories to FOP, or even to FO processors in general. Do you want to participate? Do you know a nice name? Regards, Simon -- Simon Pepping home page: http://www.leverkruid.nl - To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]
Re: Hyphenation foundry [was: Re: proposed font project]
On Jun 16, 2004, at 12:20 PM, Simon Pepping wrote: Hi Clay, Hi Simon! On Sat, May 29, 2004 at 10:02:37PM -0700, Clay Leeds wrote: It would also be good to develop some sort of hyphenation foundry... I think it is time to create a project for the hyphenation files at Sourceforge. The project should be a home for all sorts of accessories to FOP, or even to FO processors in general. Do you want to participate? Do you know a nice name? Regards, Simon Sure! I'd love to participate! I don't know how yet, though... Ideas for names? I guess it depends on how 'we' want to position this foundry. Is the foundry geared toward FOP users? * fopstuff * fop-stuff * fostuff * fo-stuff * xslfostuff * xsl-fo-stuff * foptoys * fop-toys * fotoys * fo-toys * xslfotoys * xsl-fo-toys * fopaccessories * fop-accessories * foaccessories * fo-accessories * xslfoaccessories * xsl-fo-accessories * fopperipherals * fop-peripherals * foperipherals * fo-peripherals * xslfoperipherals * xsl-fo-peripherals I don't have a particular favorite, although since there are so many, it wouldn't be very helpful if I didn't 'choose' one or two. I like the ones *with* the hyphen (no pun intended! ;-) -- which makes it easier to read): * xsl-fo-toys * xsl-fo-stuff In addition, since we want it to be of broader use (i.e., not just FOP), I would think we'd want to use one of the 'fo' or 'xsl-fo' prefixes (with or without hyphens) over the 'fop' based ones. Hope this helps! Web Maestro Clay [EMAIL PROTECTED] --- There are only 10 kinds of people in the world: those who understand binary and those who don't. - To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]
Re: Hyphenation foundry [was: Re: proposed font project]
Simon Pepping wrote: I think it is time to create a project for the hyphenation files at Sourceforge. The project should be a home for all sorts of accessories to FOP, or even to FO processors in general. Do you want to participate? Do you know a nice name? Well, sf.net would appeal to a larger body of developers, I think, and is certainly easier to menage for small projects, but we can also ask on jakarta-commons, xml-commons and even declare it a FOP (or XML graphics) subproject. Anyway, I just uploaded http://cvs.apache.org/~pietsch/t.tar.gz which contains several unfinished stuff I produced the last year: - Utilities to generate tables for the Unicode line break property - A class keeping a line break state according to TR14, which should be easier to usee than the java.text.BreakIterator for FOP - A Java port of MySpell - An attempt at providing a layered hierarchy for spell checking and hyphenation interfaces. - A Java port of the link grammar parser (incomplete, badly designed, buggy and without approvement of the original authors, *please* use only for personal study, don't redistribute). - An attempt at a morphological analyzer for german words. Somehow, the simple port of patgen as well as other attempts at simplifying the current FOP hyphenator are missing, I hope I remember to upload them tomorrow. If someone want some problems to chew on: - Implementation of an optimized trie or ternary or PATRICIA tree. Issues here: The FOP implementation packs both tree construction and retrieval into a single class, while the data structure is WORM. Furthermore, while it is fast, it could be implemented with much less memory, especially peak memory during construction. I ultimately concluded compiling the data into Java bytecode would be the best. Consider inserting the words WORD and WORM. A PATRICIA tree would collapse this to root: WOR - leaf D - leaf M In order to map this, the root node gets an operation match string with the string WOR leading to the subtree. Statistical compression could optimize the necessary operation, like switch array, match 2char string, match 3char string, match n-char string etc. May utilize BCEL. - Institutionalized alphabet transformation. This is somewhat of a generalization of the hyphenation character classes. Java uses 16bit characters, but in many languages it is rare that more than 256 characters are actually used in words. TeX/PatGen also map the characters onto the numbers 1..N (256), folding character classification into the process. Mapping chars onto bytes saves almost half the memory. Because there are languages which requires more than 256 characters, at least two implementation of the trie/whatever holding the patterns are necessary, one where the keys are byte sequences, another with char sequences. Too bad generics aren't ready yet, but if the data is byte compiled into a Java class, the compiler may analyze the patterns and decide whether bytes are sufficient. Stuff like Unicode character normalization should probably be folded into the classification/alphabet transformation too. It would be too bad if hyphenation failed because someone decided to use unnormalized characters like FI LIGATURE. - API design. Need a hierarchy of interfaces which allow polymorphy at various levels: + Hyphenator implementations: pattern hyphenator, dictionary hyphenator, composite hyphenator: delegate to a collection of child hyphenators + Pattern hyphenator - pattern storage implementations: HashTable (very easy to understand but slow), R/W-trie, optimized WORM class, ... + Dictionary hyphenator - dictionary ... For reuse in interactive applications, R/W storage may be useful (user dictionaries) - Generalized line breaking strategies. Possible strategies + naive, break before the first non-space after a space + TR14 + break before any character + pattern, regexp or dictionary pased - Other ideas: API for processing the Unicode data files. Optimized compile for Unicode properties into Java class data: select the properties you want, get it. Use this to get the latest Unicode data into your Java applications rather than the outdated stuff in the JRE. J.Pietschmann - To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]
Re: FOP Hyphenation question
Andreas L. Delmelle wrote: Hi, I guess my former mail was not clear enough. As I said the email address is inserted in a text provided by an user. I may not expect of them that they put the fo:inline around the email address themselves. So to add the fo:inline around the email address I'll have to do a search and replace afterwards, if it's not possible to prevent the hyphenation of email addresses in the hyphenation patterns. Can you be more specific on the way you're constructing the FO? If it is an option to use XML+XSLT, you wouldn't need sr. Provided that the email addresses are separate elements in the source XML, you could just write a template rule to transform those elements into fo:inline's... However, as Joerg already indicated, hyphenation props are block props, so they wouldn't do you much good here :( I'm using XML for input data and a XSLT Style Sheet to create the FO. Unfortenately the emailadresses are put in a common block in the input data (a free text field provided by the users of the application). But reading that the fo:inline trick won't work, I'll have to do something with hypnenation patterns. Over to the bright(er) side then... Take a look at http://xml.apache.org/fop/hyphenation.html#patterns to get an idea of how to use hyphenation patterns for this. Taking into account a limitation mentioned over there, a pattern like: '[EMAIL PROTECTED]' would indicate that hyphenating before or after the @ character is highly undesirable. Thanks for this example. I'll take a look at this page and see if I can at least get the desired result for most of the PDF's generated. As for preventing hyphenation on the name and domain, this seems a little more complicated, but then again, I'm no expert in this matter. Hope someone with more experience in hyphenation patterns can comment in on this. (A combination of the classes and exceptions might just provide you with a way to achieve what you're after...) As the warning on the page indicates, any feedback regarding errors/problems that could help to make it function better is most welcome. Hope this helps! Greetz, Andreas I hope it as well, I'll get back on this subject if I've succeeded. Thanks, Wessel van Norel - To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]
FOP Hyphenation question
Hi all, I've a FOP hyphenation question. I need to stop the hyphenation of email addresses in a text provided by an user. I could put a fo:inline hyphenate=false around the email address. But I am hoping there is a way to do this with the hyphenation patterns, since that would prevent a search and replace for the email addresses. My question is, does anyone know what I should add in my hyphenation XML file to stop the hyphenation of email addresses. Regards Wessel van Norel - To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]
Re: FOP Hyphenation question
On May 28, 2004, at 7:14 AM, Wessel van Norel wrote: Hi all, I've a FOP hyphenation question. I need to stop the hyphenation of email addresses in a text provided by an user. I could put a fo:inline hyphenate=false around the email address. But I am hoping there is a way to do this with the hyphenation patterns, since that would prevent a search and replace for the email addresses. My question is, does anyone know what I should add in my hyphenation XML file to stop the hyphenation of email addresses. Regards Wessel van Norel How would adding fo:inline hyphenate=false[EMAIL PROTECTED]/fo:inline around the email address affect search replace? It wouldn't be affecting the text-string '[EMAIL PROTECTED]'... Web Maestro Clay - To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]
Re: FOP Hyphenation question
Clay Leeds wrote: On May 28, 2004, at 7:14 AM, Wessel van Norel wrote: Hi all, I've a FOP hyphenation question. I need to stop the hyphenation of email addresses in a text provided by an user. I could put a fo:inline hyphenate=false around the email address. But I am hoping there is a way to do this with the hyphenation patterns, since that would prevent a search and replace for the email addresses. My question is, does anyone know what I should add in my hyphenation XML file to stop the hyphenation of email addresses. Regards Wessel van Norel How would adding fo:inline hyphenate=false[EMAIL PROTECTED]/fo:inline around the email address affect search replace? It wouldn't be affecting the text-string '[EMAIL PROTECTED]'... Web Maestro Clay I guess my former mail was not clear enough. As I said the email address is inserted in a text provided by an user. I may not expect of them that they put the fo:inline around the email address themselves. So to add the fo:inline around the email address I'll have to do a search and replace afterwards, if it's not possible to prevent the hyphenation of email addresses in the hyphenation patterns. Regards, Wessel van Norel - To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]
Re: FOP Hyphenation question
Clay Leeds wrote: How would adding fo:inline hyphenate=false[EMAIL PROTECTED]/fo:inline around the email address affect search replace? It wouldn't disable hyphenation for the string though. Hyphenation properties are block properties and implemented as such. J.Pietschmann - To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]
RE: FOP Hyphenation question
-Original Message- From: Wessel van Norel [mailto:[EMAIL PROTECTED] Hi, I guess my former mail was not clear enough. As I said the email address is inserted in a text provided by an user. I may not expect of them that they put the fo:inline around the email address themselves. So to add the fo:inline around the email address I'll have to do a search and replace afterwards, if it's not possible to prevent the hyphenation of email addresses in the hyphenation patterns. Can you be more specific on the way you're constructing the FO? If it is an option to use XML+XSLT, you wouldn't need sr. Provided that the email addresses are separate elements in the source XML, you could just write a template rule to transform those elements into fo:inline's... However, as Joerg already indicated, hyphenation props are block props, so they wouldn't do you much good here :( Over to the bright(er) side then... Take a look at http://xml.apache.org/fop/hyphenation.html#patterns to get an idea of how to use hyphenation patterns for this. Taking into account a limitation mentioned over there, a pattern like: '[EMAIL PROTECTED]' would indicate that hyphenating before or after the @ character is highly undesirable. As for preventing hyphenation on the name and domain, this seems a little more complicated, but then again, I'm no expert in this matter. Hope someone with more experience in hyphenation patterns can comment in on this. (A combination of the classes and exceptions might just provide you with a way to achieve what you're after...) As the warning on the page indicates, any feedback regarding errors/problems that could help to make it function better is most welcome. Hope this helps! Greetz, Andreas - To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]
Re: Hyphenation Patterns
On Thu, Mar 25, 2004 at 07:00:45AM -0800, Clay Leeds wrote: On Mar 25, 2004, at 1:39 AM, Chris Bowditch wrote: Waggershauser, Thomas wrote: Hello, I need Hyphenation Patterns for german and french language. These are not available at the fop-homepage, but a reference to a tex source. Its mentioned that its possible to convert the tex-format to the fop xml-format. I tried really hard to find out how to do this, but I got no idea. Maybe here is someone who can help me... I believe that, due to licensing issues, the hyphenation patterns have been removed from old binaries (hence the 'a' in 0.20.4a, 0.20.3a, etc.). I suspect that the best course of action is to follow the instructions on the FOP Hyphenation page[1] to convert TeX hyphenation patterns. The FOP developers sincerely regret that the hyphenation patterns had to be removed. If you have problems with the conversion, please don't hesitate to ask for help on this list. http://xml.apache.org/fop/hyphenation.html I am about to make the hyphenation patterns for German and French available on my homepage, www.leverkruid.nl/FOP/index.html. I intend to find a home for all hyphenation pattern files that have a license that is not compatible with the Apache license, but which allows their free distribution. I am thinking of the TeX archive CTAN, SourceForge or Savannah. Enjoy, Simon Pepping -- Simon Pepping home page: http://www.leverkruid.nl - To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]
Hyphenation Patterns
Hello, I need Hyphenation Patterns for german and french language. These are not available at the fop-homepage, but a reference to a tex source. Its mentioned that its possible to convert the tex-format to the fop xml-format. I tried really hard to find out how to do this, but I got no idea. Maybe here is someone who can help me... Thomas - To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]
Re: Hyphenation Patterns
Waggershauser, Thomas wrote: Hello, I need Hyphenation Patterns for german and french language. These are not available at the fop-homepage, but a reference to a tex source. Its mentioned that its possible to convert the tex-format to the fop xml-format. I tried really hard to find out how to do this, but I got no idea. Maybe here is someone who can help me... I have never done the conversion myself, but heard that it is actually quite tricky. The french and german hypenatation patterns used to be distributed with older versions of FOP, 0.20.4. Older versions of FOP binaries can be downloaded from the website. So I suggest you download and install FOP 0.20.4 and then extract the hypenatation files from there. Chris - To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]
Re: Hyphenation Patterns
On Mar 25, 2004, at 1:39 AM, Chris Bowditch wrote: Waggershauser, Thomas wrote: Hello, I need Hyphenation Patterns for german and french language. These are not available at the fop-homepage, but a reference to a tex source. Its mentioned that its possible to convert the tex-format to the fop xml-format. I tried really hard to find out how to do this, but I got no idea. Maybe here is someone who can help me... I have never done the conversion myself, but heard that it is actually quite tricky. The french and german hypenatation patterns used to be distributed with older versions of FOP, 0.20.4. Older versions of FOP binaries can be downloaded from the website. So I suggest you download and install FOP 0.20.4 and then extract the hypenatation files from there. Chris I believe that, due to licensing issues, the hyphenation patterns have been removed from old binaries (hence the 'a' in 0.20.4a, 0.20.3a, etc.). I suspect that the best course of action is to follow the instructions on the FOP Hyphenation page[1] to convert TeX hyphenation patterns. The FOP developers sincerely regret that the hyphenation patterns had to be removed. If you have problems with the conversion, please don't hesitate to ask for help on this list. http://xml.apache.org/fop/hyphenation.html Good luck! Web Maestro Clay - To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]
RE : hyphenation problem
I try the rebuild method proposed and both points of last mail are ok. Thank you very much. Ro -- -Message d'origine- -- De : Peter Menzel [mailto:[EMAIL PROTECTED] -- Envoyé : mardi 27 janvier 2004 10:06 -- À : [EMAIL PROTECTED] -- Objet : Re: hyphenation problem -- -- -- Hi -- -- If you want to include new hyphenation patterns into fop, -- it's not enough to zip them into the fop.jar.. You have to -- download the source, copy the patterns in the hyph dir and -- then run the build script, to compile a new jar file.. -- -- Peter -- -- -- -- Hy, -- -- I upgrade from 0.20.4 to 0.20.5 and I have the following -- error in the -- log : -- -- [ERROR] Couldn't find hyphenation pattern fr_fr -- using general language pattern fr instead. -- -- Even I've add to fop.jar the fr.hyp file. -- -- Maibe it's not the right solution, but french hyphenation was not -- support in the 0.20.5 I beleave. -- -- Could someone help please. -- -- -- -- -- -- [EMAIL PROTECTED] -- Institut für Medizinische Informatik, Statistik und -- Epidemiologie (IMISE) Universität Leipzig -- -- -- - -- To unsubscribe, e-mail: [EMAIL PROTECTED] -- For additional commands, e-mail: [EMAIL PROTECTED] -- -- - To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]
RE : hyphenation problem
Do you think the error below can be generated by the absence of a fr.xml hyphenation file in the 0.20.5 version of FOP ? So the two problems (add hyphenation file and this error ) maibe the same one . If not, does someone know how I can fix this please : java.io.InvalidClassException: org.apache.fop.layout.hyphenation.HyphenationTree; Local class not compatible: stream cla ssdesc serialVersionUID=-7784516835472871478 local class serialVersionUID=6956085100635965297 at java.io.ObjectStreamClass.validateLocalClass(Unknown Source) at java.io.ObjectStreamClass.setClass(Unknown Source) at java.io.ObjectInputStream.inputClassDescriptor(Unknown Source) at java.io.ObjectInputStream.readObject(Unknown Source) at java.io.ObjectInputStream.readObject(Unknown Source) at java.io.ObjectInputStream.inputObject(Unknown Source) at java.io.ObjectInputStream.readObject(Unknown Source) at java.io.ObjectInputStream.readObject(Unknown Source) at org.apache.fop.layout.hyphenation.Hyphenator.getFopHyphenationTree(Hyphenator.java:165) at org.apache.fop.layout.hyphenation.Hyphenator.getHyphenationTree(Hyphenator.java:91) at org.apache.fop.layout.hyphenation.Hyphenator.hyphenate(Hyphenator.java:251) at org.apache.fop.layout.LineArea.doHyphenation(LineArea.java:1208) at org.apache.fop.layout.LineArea.addText(LineArea.java:857) at org.apache.fop.fo.FOText.addRealText(FOText.java:278) at org.apache.fop.fo.FOText.addText(FOText.java:252) at org.apache.fop.fo.FOText.layout(FOText.java:161) at org.apache.fop.fo.FObjMixed.layout(FObjMixed.java:139) at org.apache.fop.fo.flow.Block.layout(Block.java:257) at org.apache.fop.fo.flow.TableCell.layout(TableCell.java:326) at org.apache.fop.fo.flow.TableRow.layout(TableRow.java:360) at org.apache.fop.fo.flow.AbstractTableBody.layout(AbstractTableBody.java:236) at org.apache.fop.fo.flow.Table.layout(Table.java:302) at org.apache.fop.fo.flow.Block.layout(Block.java:257) at org.apache.fop.fo.flow.Block.layout(Block.java:257) at org.apache.fop.fo.flow.Block.layout(Block.java:257) at org.apache.fop.fo.flow.Block.layout(Block.java:257) at org.apache.fop.fo.flow.Block.layout(Block.java:257) at org.apache.fop.fo.flow.Block.layout(Block.java:257) at org.apache.fop.fo.flow.AbstractFlow.layout(AbstractFlow.java:154) at org.apache.fop.fo.flow.AbstractFlow.layout(AbstractFlow.java:110) at org.apache.fop.fo.pagination.PageSequence.makePage(PageSequence.java:400) at org.apache.fop.fo.pagination.PageSequence.format(PageSequence.java:338) at org.apache.fop.apps.StreamRenderer.render(StreamRenderer.java:262) at org.apache.fop.fo.FOTreeBuilder.endElement(FOTreeBuilder.java:223) at org.apache.xerces.parsers.AbstractSAXParser.endElement(Unknown Source) at org.apache.xerces.impl.XMLNSDocumentScannerImpl.scanEndElement(Unknown Source) at org.apache.xerces.impl.XMLDocumentFragmentScannerImpl$FragmentContentDispatcher.dispatch(Unknown Source) at org.apache.xerces.impl.XMLDocumentFragmentScannerImpl.scanDocument(Unknown Source) at org.apache.xerces.parsers.DTDConfiguration.parse(Unknown Source) at org.apache.xerces.parsers.DTDConfiguration.parse(Unknown Source) at org.apache.xerces.parsers.XMLParser.parse(Unknown Source) at org.apache.xerces.parsers.AbstractSAXParser.parse(Unknown Source) at org.apache.fop.apps.Driver.render(Driver.java:498) at org.apache.fop.apps.CommandLineStarter.run(CommandLineStarter.java:106) at org.apache.fop.apps.Fop.main(Fop.java:62) -- -Message d'origine- -- De : Peter Menzel [mailto:[EMAIL PROTECTED] -- Envoyé : mardi 27 janvier 2004 10:06 -- À : [EMAIL PROTECTED] -- Objet : Re: hyphenation problem -- -- -- Hi -- -- If you want to include new hyphenation patterns into fop, -- it's not enough to zip them into the fop.jar.. You have to -- download the source, copy the patterns in the hyph dir and -- then run the build script, to compile a new jar file.. -- -- Peter -- -- -- -- Hy, -- -- I upgrade from 0.20.4 to 0.20.5 and I have the following -- error in the -- log : -- -- [ERROR] Couldn't find hyphenation pattern fr_fr -- using general language pattern fr instead. -- -- Even I've add to fop.jar the fr.hyp file. -- -- Maibe it's not the right solution, but french hyphenation was not -- support in the 0.20.5 I beleave. -- -- Could someone help please. -- -- -- -- -- -- [EMAIL PROTECTED] -- Institut für Medizinische Informatik, Statistik und -- Epidemiologie (IMISE) Universität Leipzig -- -- -- - -- To unsubscribe, e-mail: [EMAIL PROTECTED
Re: RE : hyphenation problem
VAGNER Rodolphe wrote: Do you think the error below can be generated by the absence of a fr.xml hyphenation file in the 0.20.5 version of FOP ? So the two problems (add hyphenation file and this error ) maibe the same one . If not, does someone know how I can fix this please : java.io.InvalidClassException: ...Local class not compatible: This happens if you try to load a compiled hyphenation pattern instance from an incompatible class. Try the following: - get a FOP source distribution - add your fr.xml to the source/hyph - run build.sh package J.Pietschmann - To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]
Re: configuration problem: hyphenation
Kyle Kotwica wrote: Using either form of the URL suggested. The comment meant: 1. file:///\fop\fop-0.20.5\conf While it is a syntatcically valid URL, it probably wont work as expected, i.e. it does not address the file system object \fop\fop-0.20.5\conf 2. file:///\fop\fop-0.20.5\conf This will address the file system object \fop\fop-0.20.5\conf, but it's still no use because 3. The config setting is not an URL, but realy a directory name (path name). You should use \fop\fop-0.20.5\conf or C:\fop\fop-0.20.5\conf or whatever drive you use. Running under -d I do get [DEBUG] user configuration file: \fop\fop-0.20.5\conf\userconfig.xml but no more about en_US.xml other then the message: [ERROR] Couldn't find hyphenation pattern en_US using general language pattern en instead. Oops, debug dumps are controlled by a static variable in the hyphenator, and are by default disabled. You can enable it by doing the following: 1. Get a JDK and the FOP source distro. 2. Open the file src/org/apache/fop/layout/hyphenation/Hyphenator.java in a text editor 3. Change the initialization of the errorDump variable to true. 4. Recompile by running build.bat. I still believe the cause of the problem is malformed XML, perhaps an encoding problem. Use another XML tool (not XML Spy) to verify your XML is free of this problems. J.Pietschmann - To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]
Re: configuration problem: hyphenation
On Tue, 2004-01-13 at 19:14, J.Pietschmann wrote: Kyle Kotwica wrote: Using either form of the URL suggested. The comment meant: 1. file:///\fop\fop-0.20.5\conf While it is a syntatcically valid URL, it probably wont work as expected, i.e. it does not address the file system object \fop\fop-0.20.5\conf 2. file:///\fop\fop-0.20.5\conf This will address the file system object \fop\fop-0.20.5\conf, but it's still no use because 3. The config setting is not an URL, but realy a directory name (path name). I thought that Java correctly translates forward slash to the System property file-separator setting, in the appropriate places. -- John Austin [EMAIL PROTECTED] - To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]
Re: configuration problem: hyphenation
J.Pietschmann wrote: 2. file:///\fop\fop-0.20.5\conf Darn, should be file:///fop/fop-0.20.5/conf of course. J.Pietschmann - To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]
RE: configuration problem: hyphenation
-Original Message- From: J.Pietschmann [mailto:[EMAIL PROTECTED] J.Pietschmann wrote: 2. file:///\fop\fop-0.20.5\conf Darn, should be file:///fop/fop-0.20.5/conf of course. Sorry if it's contagious :) Cheers, Andreas - To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]
Question about hungarian hyphenation
Hello! I have just tried to use hungarian hyphenation, but it didn't work with the hungarian a' letter... It can hyphenate words with the another accentuated letters (e', i', o', o:, u', u:) except the u and U letters... Has anyone any experience with this thema? I have seen the in the fop-0.20.5-src.zip\fop-0.20.5\src\hyph\ directory the hu.xml file, and there isn't any a' or A' in the classes field, but there are the o and O letters (as o~ and O~)... What can be the problem? Can it help if I add the a' and A' letters? I have find out, that the public int find(char[] key, int start) in TernaryTree gives me -1 and after thet the public Hyphenation hyphenate(char[] w, int offset, int len, int remainCharCount, int pushCharCount) methon in HyphenationTree.java says int nc = classmap.find(c, 0); if (nc 0) {// found a non-letter character, abort return null; } also abort... Thx! Sandor - To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]
configuration problem: hyphenation
I'm trying to add my own hyphenation file, en_US.xml. I get the following error message: [ERROR] Couldn't find hyphenation pattern en_US using general language pattern en instead. Fop seems to know the configuration directory. [INFO] Using org.apache.xerces.parsers.SAXParser as SAX2 Parser [INFO] Using org.apache.xerces.parsers.SAXParser as SAX2 Parser Dumping configuration: [INFO] -- [INFO] key: debugMode [INFO] key: version [INFO]value: FOP 0.20.5 [INFO] key: dumpConfiguration [INFO]value: true [INFO] key: quiet [INFO]value: false [INFO] key: hyphenation-dir [INFO]value: \fop\fop-0.20.5\conf [INFO] key: fonts [INFO]values: [INFO] [INFO] -- [INFO] key: stream-filter-list [INFO]values: [INFO] flate - [INFO] ascii-85 - [INFO] Directory of C:\fop\fop-0.20.5\conf 01/12/2004 12:48 PMDIR . 01/12/2004 12:48 PMDIR .. 07/14/2003 06:04 PM 193 config.dtd 01/07/2004 05:42 PM 1,547 config.xml 01/07/2004 04:40 PM63,848 en.xml 01/07/2004 05:24 PM63,848 en_US.xml 01/12/2004 11:45 AM 3,235 userconfig.xml From userconfig.xml entry keyhyphenation-dir/key value\fop\fop-0.20.5\conf/value /entry - To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]
RE: configuration problem: hyphenation
-Original Message- From: Kyle Kotwica [mailto:[EMAIL PROTECTED] I'm trying to add my own hyphenation file, en_US.xml. I get the following error message: snip / From userconfig.xml entry keyhyphenation-dir/key value\fop\fop-0.20.5\conf/value Make this path a valid URL: file:///\fop\fop-0.20.5\conf Should work. Hope this helps! Cheers, Andreas - To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]
Re: Question about hungarian hyphenation
ECKER Sandor wrote: I have just tried to use hungarian hyphenation, but it didn't work with the hungarian a' letter... It can hyphenate words with the another accentuated letters (e', i', o', o:, u', u:) except the u and U letters... Has anyone any experience with this thema? I have seen the in the fop-0.20.5-src.zip\fop-0.20.5\src\hyph\ directory the hu.xml file, and there isn't any a' or A' in the classes field, but there are the o and O letters (as o~ and O~)... What can be the problem? Can it help if I add the a' and A' letters? Yes, a bit. This doesnt mean words with such characters are hyphenated properly afterwards. Maybe you have to map them to plain a or something. J.Pietschmann - To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]
Re: configuration problem: hyphenation
Andreas L. Delmelle wrote: Make this path a valid URL: file:///\fop\fop-0.20.5\conf Using file:///fop/fop-0.20.5/conf would be an even more valid URL. However, the hyphenation-dir is really meant to be a directory, *not* an URL. The actual data is read from a java.io.File. Running FOP with -d should either give reading ... en_US.xml which means the file was found (but probably is not well-formed XML) or Tried to load ... which means the file was not found. J.Pietschmann - To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]
RE: configuration problem: hyphenation
Thanks for the help, but still no luck. Using either form of the URL suggested. Running under -d I do get [DEBUG] user configuration file: \fop\fop-0.20.5\conf\userconfig.xml but no more about en_US.xml other then the message: [ERROR] Couldn't find hyphenation pattern en_US using general language pattern en instead. -Original Message- From: J.Pietschmann [mailto:[EMAIL PROTECTED] Sent: Monday, January 12, 2004 2:08 PM To: [EMAIL PROTECTED] Subject: Re: configuration problem: hyphenation Andreas L. Delmelle wrote: Make this path a valid URL: file:///\fop\fop-0.20.5\conf Using file:///fop/fop-0.20.5/conf would be an even more valid URL. However, the hyphenation-dir is really meant to be a directory, *not* an URL. The actual data is read from a java.io.File. Running FOP with -d should either give reading ... en_US.xml which means the file was found (but probably is not well-formed XML) or Tried to load ... which means the file was not found. J.Pietschmann - To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]
Re: Hyphenation question
I've a problem with the hyphenation in fop. There is a certain word that contains a minus character that should not be hyphenated. Is there a way to add this word in the hyphenation.xml file, so that it won't get hyphenated? Or do I have to translate the minus character to a special minus character, and if so, which one? You could simply exclude this word from hyphenation in the fo code like this: fo:inline hyphenate=falsesome-word/fo:inline As far as i understand the documentation at http://xml.apache.org/fop/hyphenation.html#patterns there is no way to exclude words with characters in them that equal the hyphen character. From the documentation: - exceptions contains whitespace-separated words, each of which has either explicit hyphen characters to denote acceptable breakage points, or no hyphen characters, to indicate that this word should never be hyphenated, or contain explicit hyp elements for specifying changes of spelling due to hyphenation (like backen - bak-ken or Stoffarbe - Stoff-farbe in the old german spelling). Exceptions override the patterns described below. Explicit hyp declarations don't work yet (patches welcome). Exceptions are generally a bit brittle, test carefully. - But if you replace the hyphen in the word with a similar looking character other than the hyphenation character, you should be able to exclude this word from hyphenation generally by putting it into the exceptions section of your hyphenation xml file as described in the documentation. Which character to use instead depends on the font you use. At least i think so. I do not know, which character is used for hyphenation by default. In Arial for example there is a minus (U+002D) and a hyphen (U+2013). So if the hyphen is used for hyphenation you could be able to use the minus for your special word, if these two characters are not treated the same (but that may be the case). But this is only what i think. I don't have tried this or have any experience with that. Just have a look into the character table for your font and you should find characters you might be able to use alternatively. Maybe someone else can give some more hints on that. Kind regards, Roland - To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]
Re: How to change FOP hyphenation points?
J.Pietschmann [EMAIL PROTECTED]: Steinar Bang wrote: Chris Bowditch [EMAIL PROTECTED]: I tried inserting #200b; after http://;. The result was like this: http:// some- machine/path/to/somewhere You'll have to turn off hyphenation too Is using #x200b; and turning hyphenation off, better than inserting fo:block/? (in addition to using the correct character). I used the correct character in the document. I just mistyped it in the message to the mailing list. - To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]
Re: How to change FOP hyphenation points?
Steinar Bang wrote: Is using #x200b; and turning hyphenation off, better than inserting fo:block/? It depends. From the code, I'd say fo:block/ could yield strange layout artefacts due to a variety of hacks, but then, people rarely complained, and I conclude such odd things don't happen often in practice. J.Pietschmann - To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]
How to change FOP hyphenation points?
Platform: DocBook XML 4.2, DocBook XSL 1.62.4, xsltproc 1.0.27, fop 0.20.5 Is there a way to introduce soft line breaks into FOP formatting? I'm using the DocBook XSL style sheets to create XSL:FO files from DocBook XML documents, and feed the XSL:FO files through FOP to create PDF files. I have two URLs in table cells, that breaks in this manner: ht- ttp://somemachine/path/to/some- where And that looks a bit silly. I can't make the table columns wider, and I can't make the URLs shorter, and I don't want to change the table. It's a table that lists PUT and GET URLs for different services. Ie. like this: +--+--+-+ | | GET URL | PUT URL | +--+--+-+ | Some desc. | | | +--+--+-+ | Some other desc. | | | +--+--+-+ Is there eg. a character I could insert that would tell FOP to break at a particular point? I tried using shy; after http://;. But the result was that the URLs were broken in the same way as before in the PDF output, and a hyphen character was inserted where I had put in shy;. The numerical character reference inserted into the XSL:FO, was xAD;. I've looked at http://xml.apache.org/fop/hyphenation.html, but it looked a bit complicated, for what I wanted to do. I was looking for a way to find a file I could drop in the document's directory, where I could put the URLs, with proper break points inserted. Thanx! - Steinar - To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]
Re: How to change FOP hyphenation points?
From: Steinar Bang [EMAIL PROTECTED] snip/ Is there a way to introduce soft line breaks into FOP formatting? I'm using the DocBook XSL style sheets to create XSL:FO files from DocBook XML documents, and feed the XSL:FO files through FOP to create PDF files. I have two URLs in table cells, that breaks in this manner: ht- ttp://somemachine/path/to/some- where snip/ Take a look at http://xml.apache.org/fop/faq.html#cells-overflow Specifically the bit about inserting zero-width spaces. That will allow FOP to break your http where you have inserted the spaces. Chris _ Find a cheaper internet access deal - choose one to suit you. http://www.msn.co.uk/internetaccess - To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]
Re: How to change FOP hyphenation points?
Chris Bowditch [EMAIL PROTECTED]: From: Steinar Bang [EMAIL PROTECTED] snip/ I have two URLs in table cells, that breaks in this manner: ht- ttp://somemachine/path/to/some- where Take a look at http://xml.apache.org/fop/faq.html#cells-overflow Specifically the bit about inserting zero-width spaces. That will allow FOP to break your http where you have inserted the spaces. Thanx for the tip! I tried inserting #200b; after http://;. The result was like this: http:// some- machine/path/to/somewhere The result I was hoping for, was: http:// somemachine/path/to/somewhere There is room enough on the second line to fit everything after http://;, but there may be some special handling for the last lines of paragraphs? Thanx! - Steinar - To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]
Re: How to change FOP hyphenation points?
Steinar Bang [EMAIL PROTECTED]: Chris Bowditch [EMAIL PROTECTED]: Specifically the bit about inserting zero-width spaces. That will allow FOP to break your http where you have inserted the spaces. Thanx for the tip! I tried inserting #200b; after http://;. The result was like this: [snip!] I tried using line feeds instead, with the same result. I ended up using fo:block/ to force the line break at the desired place. I put in the PI ?fo-table-cell-break? where I wanted the break, and put the following template into the XSLT style sheet generating the XSL:FO: xsl:template match=processing-instruction('fo-table-cell-break') fo:block/ /xsl:template This is a bit clunky, but it got me desired results. Thanx! - Steinar - To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]
Re: How to change FOP hyphenation points?
From: Steinar Bang [EMAIL PROTECTED] Thanx for the tip! I tried inserting #200b; after http://;. The result was like this: you've inserted the wrong character, the link I provided says to use #x200b. Notice the x which indicates the number is in hex rather than decimal snip/ Chris _ Sign-up for a FREE BT Broadband connection today! http://www.msn.co.uk/specials/btbroadband - To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]
Re: How to change FOP hyphenation points?
Steinar Bang wrote: Chris Bowditch [EMAIL PROTECTED]: I tried inserting #200b; after http://;. The result was like this: http:// some- machine/path/to/somewhere You'll have to turn off hyphenation too (in addition to using the correct character). J.Pietschmann - To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]
Re: How to change FOP hyphenation points?
Chris Bowditch [EMAIL PROTECTED]: From: Steinar Bang [EMAIL PROTECTED] I tried inserting #200b; after http://;. The result was like this: you've inserted the wrong character, the link I provided says to use #x200b. Notice the x which indicates the number is in hex rather than decimal I inserted the correct character into the document. I mistyped it in the mailing list message you quote. - To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]
Hyphenation on free text
Hi all, I am using 20.4 version in Win 2K. I am trying to use en_GB to hyphenate long free text without spacing. However, I find there is a problem with the output pdf. I have made a sample.xml with a long free text as shown in the attached document (sample3.xml) with continuous character - 'AABBCCC..' and so on. I find that the appearance of the characters become dis-order after the line breaking (please find in the output.pdf). I don't know whether it is due to the xslt problem or fo problem. Please give me some suggestions. Thank for your help in advance. Sherlane __ Do you Yahoo!? Yahoo! SiteBuilder - Free, easy-to-use web site design software http://sitebuilder.yahoo.com10 Sep 200310/09/200310-09-20032003-09-1010-SEP-2003PPPriority payment13 Dec 200213/12/200213-12-20022002-12-1313-DEC-2002 1234567 1234 12345678HKD100.0012 12 12 1212 1 212 121 121 121 121HKHBAP 222 999411B00NXQ111 112003-09-10-02.00.15.077785AABBCCDDEEFFGGHHIIJJKKLLMMNNOOPPQQRRSSTTUUVVWWXXYYZZaabbccddeeffgghhiijjkkllmmnnooppqqrrssttuuvvwwxxyy?xml version=1.0 encoding=UTF-8? xsl:stylesheet version=1.0 xmlns:xsl=http://www.w3.org/1999/XSL/Transform; xmlns:fo=http://www.w3.org/1999/XSL/Format; xsl:template match=/ fo:root !-- defines page layout -- fo:layout-master-set fo:simple-page-master master-name=simple page-height=27.9cm page-width=21cm margin-top=1.5cm margin-bottom=1.5cm margin-left=1.5cm margin-right=1.5cm fo:region-body margin-top=2.75cm margin-bottom=3cm/ fo:region-before extent=5cm/ fo:region-after extent=3cm/ /fo:simple-page-master /fo:layout-master-set fo:page-sequence master-reference=simple initial-page-number=1 fo:flow flow-name=xsl-region-body fo:table table-layout=fixed padding-after=0.5cm fo:table-column column-width=17cm/ fo:table-body fo:table-row line-height=12pt fo:table-cell fo:block text-align=left font-size=08pt font-family=Times fo:wrapperTEST 8y xslt + xml - pdf /fo:wrapper /fo:block /fo:table-cell /fo:table-row fo:table-row fo:table-cell fo:block hyphenate=true language=en_GB text-align=left font-size=08pt font-family=Times white-space-collapse=false fo:wrapper xsl:for-each select=Advice/FreeText xsl:apply-templates/xsl:apply-templates /xsl:for-each /fo:wrapper !--xsl:value-of select=Advice/FreeText/-- /fo:block /fo:table-cell /fo:table-row /fo:table-body /fo:table /fo:flow /fo:page-sequence /fo:root /xsl:template /xsl:stylesheet Output.pdf Description: Output.pdf - To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]
Re: Hyphenation on free text
From: Sherlane Lam [EMAIL PROTECTED] Hi all, snip/ I have made a sample.xml with a long free text as shown in the attached document (sample3.xml) with continuous character - 'AABBCCC..' and so on. I find that the appearance of the characters become dis-order after the line breaking (please find in the output.pdf). I don't know whether it is due to the xslt problem or fo problem. Please give me some suggestions. I believe this behaviour was a bug in 0.20.4. Please upgrade to 0.20.5, as this problem should be fixed. Chris _ Stay in touch with absent friends - get MSN Messenger http://www.msn.co.uk/messenger 10 Sep 200310/09/200310-09-20032003-09-1010-SEP-2003PPPriority payment13 Dec 200213/12/200213-12-20022002-12-1313-DEC-2002 1234567 1234 12345678HKD100.0012 12 12 1212 1 212 121 121 121 121HKHBAP 222 999411B00NXQ111 112003-09-10-02.00.15.077785AABBCCDDEEFFGGHHIIJJKKLLMMNNOOPPQQRRSSTTUUVVWWXXYYZZaabbccddeeffgghhiijjkkllmmnnooppqqrrssttuuvvwwxxyy - To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]
AW: exceptions in Hyphenation
Thank you, Is there any roadmap for fop, where i can see when it will be implemented? A.Weck -Ursprüngliche Nachricht- Von: J.Pietschmann [mailto:[EMAIL PROTECTED] Gesendet: Mittwoch, 6. August 2003 20:57 An: [EMAIL PROTECTED] Betreff: Re: exceptions in Hyphenation Weck, Andreas wrote: I did an entry into the exceptions-block for the german hyphenation (de.xml) like this: Hyphenation exceptions aren't well supported currently. it may well be possible that supressing pattern based hyphenation for certain words is the only working mode. J.Pietschmann - To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED] - To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]
exceptions in Hyphenation
Hallo, I'm using Fop 0.20.5rc3a. I did an entry into the exceptions-block for the german hyphenation (de.xml) like this: hyphen-char value=-/ exceptions Denkm-al-pfle-ge /exceptions After a successful building of fob.jar i created a pdf, but the hyphenation algorithm hyphen the word like this: Denk-malpflege So what i do wrong? Functions hyph at all? Thank you Andreas Weck eMail: [EMAIL PROTECTED] - To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]
Re: Thai Hyphenation
Many thanks! It is very hard to find any distribution of hyphenation pattern for Thai. I can do hyphenation for Chinese, Japanese and some other Asian languages correctly with FOP even I have not used a specific hyphenation pattern. However, only Thai, I cannot do it correctly. Therefore, I want to find some hyphenation pattern in FOP site. However, I cannot find (or I don't understand) any information in both FOP and CTAN sites. Any suggestion/direction for me? Thanks thanks and thanks!!! Sherlane --- J.Pietschmann [EMAIL PROTECTED] wrote: Sherlane Lam wrote: Anyone knows how can I make the Thai hyphenation? Where can I get the Thai hyphenation xml or related file? I don't know much about Thai except that typesetting thai script is a major PITA. From all what I read hyphenation or even proper line breaking requires thorough morphological analysis, something which is unlikely to be achievable with the knowledge available on this list. I heard a TeX distribution localized for Thai mentioned somewhere, but it doesn't seem to be available on CTAN. If you can track this down it may be a sourec for proper hyphenation patterns, but I believe there will be additional modifications necessary, like for chinese. I use en_US with some 3 bytes Thai word wrongly hyphenated. Any comment? I can't parse the sentence. If you mean you get wrong hyphenations with an en_US language setting but actual text in Thai, well, this is hardly a surprise. J.Pietschmann - To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED] __ Do you Yahoo!? SBC Yahoo! DSL - Now only $29.95 per month! http://sbc.yahoo.com - To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]
RE: Thai Hyphenation
Sherlane Lam wrote: It is very hard to find any distribution of hyphenation pattern for Thai. I can do hyphenation for Chinese, Japanese and some other Asian languages correctly with FOP even I have not used a specific hyphenation pattern. However, only Thai, I cannot do it correctly. Therefore, I want to find some hyphenation pattern in FOP site. However, I cannot find (or I don't understand) any information in both FOP and CTAN sites. Any suggestion/direction for me? As Joerg has already mentioned, they don't exist (in FOP anyway), and we don't really know enough to build them. However, we are in the process of beefing up the hyphenation documentation to include information about how to create/modify the patterns files. If you are interested in sorting through the linguistic issues, we'll do our best to support you on the technical side. If you can find an electronic Thai dictionary that includes hyphenation, that should be a big help. Do you have any interest in helping with this? Victor Mote - To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]
AW: hyphenation again
Hi! J.Pietschmann Robert P. J. Day - thanks for your answers! I can test the thing on the eving, but i'm sure, that they will work. My first problem with the path was solved for itself after rebuilding FOP with only packages, that i use. And the both others (# as hyphanation sign and warnings about not foundig de) are in fact DocBook, specially Stylesheet related difficulties. Here are the links which describe, how to solve it. Perhaps somebody has the same troubles: http://lists.oasis-open.org/archives/docbook-apps/200305/msg00211.html http://lists.oasis-open.org/archives/docbook-apps/200305/msg00233.html http://www.sagehill.net/xml/docbookxsl/CustomMethods.html#CustomGentext CU Anton - To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]
Re: AW: hyphenation again
On Wed, 11 Jun 2003, Anton Wardaschko (PA) wrote: Hi! J.Pietschmann Robert P. J. Day - thanks for your answers! I can test the thing on the eving, but i'm sure, that they will work. i'm not sure what the rationale is for having the default hyphenation character be a #, but if you want to just manually change it, edit the common/en.xml file that's part of the docbook stylesheets and change that hyphenation character explicitly to a -. either that, or copy that line, change it and add the new line to your own customization layer. rday - To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]
hyphenation again
Hi! I get the TeX hyphenation file and the XML version too, i set the path in the userconf.xml, but i read on the command line again and again, that they are not found. The path to the hyph files is correct, couse i'm able to embed the ttf fonts. Any suggestions? entry keyhyphenation-dir/key value/fop/build/classes/hyph/value /entry this can i see when i generate the fo-file with saxon: Making portrait pages on A4 paper (210mmx297mm) No de localization of hyphenation-character exists; using en. No de localization of hyphenation-push-character-count exists; using en. No de localization of hyphenation-remain-character-count exists; using en. No de localization of hyphenation-character exists; using en. No de localization of hyphenation-push-character-count exists; using en. No de localization of hyphenation-remain-character-count exists; using en. No de localization of hyphenation-character exists; using en. No de localization of hyphenation-push-character-count exists; using en. No de localization of hyphenation-remain-character-count exists; using en. No de localization of hyphenation-character exists; using en. No de localization of hyphenation-push-character-count exists; using en. No de localization of hyphenation-remain-character-count exists; using en. And these errors came from FOP: [ERROR] Couldn't find hyphenation pattern de [ERROR] Error building hyphenation tree for language de java.io.StreamCorruptedException: invalid stream header at java.io.ObjectInputStream.readStreamHeader(ObjectInputStream.java:737) at java.io.ObjectInputStream.init(ObjectInputStream.java:253) at org.apache.fop.layout.hyphenation.Hyphenator.getFopHyphenationTree(Hyphenator.j ava:164) at org.apache.fop.layout.hyphenation.Hyphenator.getHyphenationTree(Hyphenator.java :91) at org.apache.fop.layout.hyphenation.Hyphenator.hyphenate(Hyphenator.java:251) at org.apache.fop.layout.LineArea.doHyphenation(LineArea.java:1206) at org.apache.fop.layout.LineArea.addText(LineArea.java:856) at org.apache.fop.fo.FOText.addRealText(FOText.java:294) at org.apache.fop.fo.FOText.addText(FOText.java:252) at org.apache.fop.fo.FOText.layout(FOText.java:161) at org.apache.fop.fo.flow.Block.layout(Block.java:257) at org.apache.fop.fo.flow.Block.layout(Block.java:257) at org.apache.fop.fo.flow.AbstractFlow.layout(AbstractFlow.java:154) at org.apache.fop.fo.flow.AbstractFlow.layout(AbstractFlow.java:110) at org.apache.fop.fo.pagination.PageSequence.makePage(PageSequence.java:400) at org.apache.fop.fo.pagination.PageSequence.format(PageSequence.java:338) at org.apache.fop.apps.StreamRenderer.render(StreamRenderer.java:262) at org.apache.fop.fo.FOTreeBuilder.endElement(FOTreeBuilder.java:221) at org.apache.xerces.parsers.AbstractSAXParser.endElement(Unknown Source) at org.apache.xerces.impl.XMLNSDocumentScannerImpl.scanEndElement(Unknown Source) at org.apache.xerces.impl.XMLDocumentFragmentScannerImpl$FragmentContentDispatcher .dispatch(Unknown Source) at org.apache.xerces.impl.XMLDocumentFragmentScannerImpl.scanDocument(Unknown Sour ce) at org.apache.xerces.parsers.DTDConfiguration.parse(Unknown Source) at org.apache.xerces.parsers.DTDConfiguration.parse(Unknown Source) at org.apache.xerces.parsers.XMLParser.parse(Unknown Source) at org.apache.xerces.parsers.AbstractSAXParser.parse(Unknown Source) at org.apache.fop.apps.Driver.render(Driver.java:498) at org.apache.fop.apps.CommandLineStarter.run(CommandLineStarter.java:106) at org.apache.fop.apps.Fop.main(Fop.java:62) - To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]
Re: hyphenation again
The easiest way is to put the XML version in the src/hyph directory and recompile FOP (build clean, followed by build). The XML file is being converted to an internal format (serialized java object) as part of this build process. On 10.06.2003 09:42:38 Anton Wardaschko (PA) wrote: I get the TeX hyphenation file and the XML version too, i set the path in the userconf.xml, but i read on the command line again and again, that they are not found. The path to the hyph files is correct, couse i'm able to embed the ttf fonts. Any suggestions? Jeremias Maerki - To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]
RE: hyphenation again
Hi! The easiest way is to put the XML version in the src/hyph directory and recompile FOP (build clean, followed by build). The XML file is being converted to an internal format (serialized java object) as part of this build process. OK, it seems to work, thanks! I have in this case some questions, but i fear they are not 100% FOP related. 1. Further i were able to write the path to my True Types without drive letter: /aelitha-docbook/fonts/ and now FOP don't find the fonts: [ERROR] Failed to read a font metrics file: Invalid font metrics file: arial.xml (Invalid fontBaseDir specified: /aelitha-docbook/fonts/ (no protocol: /aelitha-docbook/fonts/)) only with the full path c:/aelitha-docbook/fonts/ the fonts will be found. Why? Is it possible to compile some TTFs with the FOP, so i don't have to use the config.xml? 2. While generating FO-file i still see Making portrait pages on A4 paper (210mmx297mm) No de localization of hyphenation-character exists; using en. No de localization of hyphenation-push-character-count exists; using en. ... But when FOP renders the FO-file there are no warnings and the generated PDF has hyphenation. 3. The separator sign for hyphention in the preface is # and in the chapters correct -. I use DocBook with Stylesheet version 1.61.2. CU Anton smime.p7s Description: S/MIME cryptographic signature
Re: hyphenation again
anton wrote: 1. Further i were able to write the path to my True Types without drive letter: /aelitha-docbook/fonts/ and now FOP don't find the fonts: [ERROR] Failed to read a font metrics file: Invalid font metrics file: arial.xml (Invalid fontBaseDir specified: /aelitha-docbook/fonts/ (no protocol: /aelitha-docbook/fonts/)) only with the full path c:/aelitha-docbook/fonts/ the fonts will be found. Why? Is it possible to compile some TTFs with the FOP, so i don't have to use the config.xml? See http://xml.apache.org/fop/faq.html#MalformedURLException The value must be a valuid URL. In some cases the RTL will fall back to interpret this as a file name, but this doesn't work consistently. 2. While generating FO-file i still see Making portrait pages on A4 paper (210mmx297mm) No de localization of hyphenation-character exists; using en. No de localization of hyphenation-push-character-count exists; using en. *Shrug* Ask the DocBook folks. 3. The separator sign for hyphention in the preface is # and in the See http://xml.apache.org/fop/faq.html#pdf-characters Check the docbook XSL which character they actually use. - To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]
Re: hyphenation again
On Tue, 10 Jun 2003, J.Pietschmann wrote: anton wrote: 3. The separator sign for hyphention in the preface is # and in the See http://xml.apache.org/fop/faq.html#pdf-characters Check the docbook XSL which character they actually use. sadly, it's a #, so anton will have to override that in his stylesheet. rday - To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]
Error building hyphenation tree for language de
Hello @ all! I'm using fop v. 0.20.5rc2. The transformation of my xml-file with a xsl-stylesheet is running with the provided xalan.jar and xerces.jar. The transformation into a fo-file works stable, but fop throws an exception as following: Error building hyphenation tree for language de Couldn't find hyphenation pattern de Greetz suse - To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]
hyphenation pattern
I'm using 0.20.5rc2. I'm getting: Couldn't find hyphenation pattern en_US Error building hyphenation tree for language en I didn't get this with version 0.20.4. Can anyone help? Thanks, Derrick This electronic transmission is strictly confidential to Smith Nephew and intended solely for the addressee. It may contain information which is covered by legal, professional or other privilege. If you are not the intended addressee, or someone authorized by the intended addressee to receive transmissions on behalf of the addressee, you must not retain, disclose in any form, copy or take any action in reliance on this transmission. If you have received this transmission in error, please notify the sender as soon as possible and destroy this message. - To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]
Re: hyphenation pattern
Derrick, 0.20.5rc2 does not include the hyphenation pattern en_US. It only includes the hyphenation pattern en_GB. Sorry, I don't know where to get the other one (you might find it in the SOURCE for 0.20.5rc or 0.20.4, but then you'd have to integrate it into the SOURCE for 0.20.5rc2 and then BUILD FOP yourself). Good luck! -- Clay Leeds - [EMAIL PROTECTED] Web Developer - Medata, Inc. - http://www.medata.com PGP Public Key: https://mail.medata.com/pgp/cleeds.asc - To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]
Re: hyphenation pattern
Most of the hyphenation files had to be removed because of license reasons. We are looking for alternatives so we can once again provide hyphenation patterns in many languages. Simply copying them from an older distribution doesn't make them legal. Sorry for the inconvenience, but this is a serious issue. On 27.02.2003 17:41:16 Clay Leeds wrote: 0.20.5rc2 does not include the hyphenation pattern en_US. It only includes the hyphenation pattern en_GB. Sorry, I don't know where to get the other one (you might find it in the SOURCE for 0.20.5rc or 0.20.4, but then you'd have to integrate it into the SOURCE for 0.20.5rc2 and then BUILD FOP yourself). Jeremias Maerki - To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]
Re: hyphenation pattern
Sorry for leading a list memeber astray. Jeremias is completely correct, and I should've left it at 0.20.5rc2 does not include the hyphenation pattern en_US. My bad. Jeremias Maerki wrote: Most of the hyphenation files had to be removed because of license reasons. We are looking for alternatives so we can once again provide hyphenation patterns in many languages. Simply copying them from an older distribution doesn't make them legal. Sorry for the inconvenience, but this is a serious issue. On 27.02.2003 17:41:16 Clay Leeds wrote: 0.20.5rc2 does not include the hyphenation pattern en_US. It only includes the hyphenation pattern en_GB. Sorry, I don't know where to get the other one (you might find it in the SOURCE for 0.20.5rc or 0.20.4, but then you'd have to integrate it into the SOURCE for 0.20.5rc2 and then BUILD FOP yourself). Jeremias Maerki -- Clay Leeds - [EMAIL PROTECTED] Web Developer - Medata, Inc. - http://www.medata.com PGP Public Key: https://mail.medata.com/pgp/cleeds.asc - To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]
Re: More newbie hyphenation and option flag issues with docbook.xml -- fo --pdf
Aaron Weber wrote: [ERROR] Couldn't find hyphenation pattern en Ah! Set the language to en_GB. J.Pietschmann - To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]
Re: More newbie hyphenation and option flag issues with docbook.xml -- fo --pdf
Excellent! A quick search and replace has gotten rid of the hyphenation problems, (language=en -- language=en_GB) and deleting a couple more unsupported tags has gotten rid of all my errors! Now, I'm down to [WARNING] table-layout=auto is not supported, using fixed! [WARNING] current implementation of tables requires a table-column for each column, indicating column-width [WARNING] table-layout=auto is not supported, using fixed! [WARNING] current implementation of tables requires a table-column for each column, indicating column-width a couple of.. [INFO] area contents overflows area [INFO] area contents overflows area [INFO] area contents overflows area and of course at the end: [ERROR] null and failure. :( Really, I want to be able to automate the Docbook XML -- PDF transition, and avoid editing the autogenerated .fo files by hand as much as possible (autogenerated is almost always synonomous with hard to edit by hand, and any sort of manual intervention means extra opportunity for confusion, error, and trouble when we need to regenerate after a change to the original). We've got the XML--HTML transition down fine, fortunately... Any tips? I'll try get rid of the offending table-layout issues, but... I don't know about adding table-column width attributes to every table in the document. Yours, Aaron. On Thu, 2003-02-20 at 19:44, J.Pietschmann wrote: Aaron Weber wrote: [ERROR] Couldn't find hyphenation pattern en Ah! Set the language to en_GB. J.Pietschmann - To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED] - To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]
Re: Table's warning (was More newbie hyphenation and option flag issues ...)
[WARNING] table-layout=auto is not supported, using fixed! For this one, just use the attribute table-layout of fo:table tags : fo:table table-layout=fixed [WARNING] current implementation of tables requires a table-column for each column, indicating column-width To avoid this one, put a table-column tag for each column in your table after the fo:table tag : fo:table table-layout=fixed fo:table-column column-width=10mm / fo:table-column column-width=87.5mm / fo:table-column column-width=15mm / fo:table-column column-width=87.5mm / fo:table-column column-width=10mm / fo:table-body fo:table-row height=5mm !-- End of the table -- Cheers, Simon - To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]
Stack Trace (was: newbie hyphenation...)
I'll see about getting newer Docbook XSL setups and customizing them for table widths. Funny thing is, I don't know that I even *have* any tables, at least not that I put in myself... Anyway, here's the stack trace: [ERROR] null org.apache.fop.apps.FOPException at org.apache.fop.apps.CommandLineStarter.run(CommandLineStarter.java:74) at org.apache.fop.apps.Fop.main(Fop.java:19) - java.lang.NullPointerException at org.apache.fop.fo.PropertyManager.getTextDecoration(PropertyManager.java:328) at org.apache.fop.fo.FObjMixed.init(FObjMixed.java:28) at org.apache.fop.fo.flow.Block.init(Block.java:72) at org.apache.fop.fo.flow.Block$Maker.make(Block.java:37) at org.apache.fop.fo.FOTreeBuilder.startElement(FOTreeBuilder.java:255) at org.apache.xerces.parsers.AbstractSAXParser.startElement(Unknown Source) at org.apache.xerces.impl.XMLNSDocumentScannerImpl.scanStartElement(Unknown Source) at org.apache.xerces.impl.XMLDocumentFragmentScannerImpl$FragmentContentDispatcher.dispatch(Unknown Source) at org.apache.xerces.impl.XMLDocumentFragmentScannerImpl.scanDocument(Unknown Source) at org.apache.xerces.parsers.DTDConfiguration.parse(Unknown Source) at org.apache.xerces.parsers.DTDConfiguration.parse(Unknown Source) at org.apache.xerces.parsers.XMLParser.parse(Unknown Source) at org.apache.xerces.parsers.AbstractSAXParser.parse(Unknown Source) at org.apache.fop.apps.Driver.render(Driver.java:457) at org.apache.fop.apps.CommandLineStarter.run(CommandLineStarter.java:69) at org.apache.fop.apps.Fop.main(Fop.java:19) - java.lang.NullPointerException at org.apache.fop.fo.PropertyManager.getTextDecoration(PropertyManager.java:328) at org.apache.fop.fo.FObjMixed.init(FObjMixed.java:28) at org.apache.fop.fo.flow.Block.init(Block.java:72) at org.apache.fop.fo.flow.Block$Maker.make(Block.java:37) at org.apache.fop.fo.FOTreeBuilder.startElement(FOTreeBuilder.java:255) at org.apache.xerces.parsers.AbstractSAXParser.startElement(Unknown Source) at org.apache.xerces.impl.XMLNSDocumentScannerImpl.scanStartElement(Unknown Source) at org.apache.xerces.impl.XMLDocumentFragmentScannerImpl$FragmentContentDispatcher.dispatch(Unknown Source) at org.apache.xerces.impl.XMLDocumentFragmentScannerImpl.scanDocument(Unknown Source) at org.apache.xerces.parsers.DTDConfiguration.parse(Unknown Source) at org.apache.xerces.parsers.DTDConfiguration.parse(Unknown Source) at org.apache.xerces.parsers.XMLParser.parse(Unknown Source) at org.apache.xerces.parsers.AbstractSAXParser.parse(Unknown Source) at org.apache.fop.apps.Driver.render(Driver.java:457) at org.apache.fop.apps.CommandLineStarter.run(CommandLineStarter.java:69) at org.apache.fop.apps.Fop.main(Fop.java:19) On Fri, 2003-02-21 at 13:35, J.Pietschmann wrote: Aaron Weber wrote: [INFO] area contents overflows area This happens if, well, the contents overflows the allocated area. Usually caused by long words and word-like strings in table cells. [ERROR] null Possibilities - Most often, a mandatory (for FOP) property was not set. - Footnotes on the last page which did not fit that page. - Unspecified problems with markers. Run FOP with the -d flag to get a stacktrace, and post the first few lines here. Any tips? I'll try get rid of the offending table-layout issues, but... I don't know about adding table-column width attributes to every table in the document. Upgrade to the latest DocBook XSL, which fixes many issues. Also, you can specify table column widths in DocBook, check the DocBook documentation. J.Pietschmann - To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]
Re: Stack Trace (was: newbie hyphenation...)
Aaron Weber wrote: Funny thing is, I don't know that I even *have* any tables, at least not that I put in myself... DocBook might have put in something for you. Anyway, here's the stack trace: [ERROR] null ... java.lang.NullPointerException at org.apache.fop.fo.PropertyManager.getTextDecoration(PropertyManager.java:328) at org.apache.fop.fo.FObjMixed.init(FObjMixed.java:28) at org.apache.fop.fo.flow.Block.init(Block.java:72) at org.apache.fop.fo.flow.Block$Maker.make(Block.java:37) at org.apache.fop.fo.FOTreeBuilder.startElement(FOTreeBuilder.java:255) This means you have content outside a flow, which is illegal. This should not happen. Upgrade your DocBook XSL, be sure your DocBook docs validate, and complain on the DocBook list if this persists. J.Pietschmann - To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]
More newbie hyphenation and option flag issues with docbook.xml -- fo --pdf
Hello all, I've checked through the archives and FAQ, and sure enough, hyphenation looks like a pretty common issue but I'm still having a little trouble with it. I'm using a docbook XML file which I converted to .fo using xmlto, and I'm now trying to take that file and turn it into a PDF. I've got a Java runtime installed (j2re) and JAVA_HOME is set correctly, and I've downloaded Jimi and taken the class file and put it in my fop directory, and now I've hit a brick wall. When I run the program like so: [EMAIL PROTECTED] fop-0.20.5rc2]$ ./fop.sh -fop mydocument.fo -pdf mydocument.pdf I get a complete fop usage help message, then the output: [INFO] Using org.apache.xerces.parsers.SAXParser as SAX2 Parser [INFO] FOP 0.20.5rc2 [INFO] Using org.apache.xerces.parsers.SAXParser as SAX2 Parser [ERROR] null at that point the program apparently crashes, returning me to the command line and not leaving me a pdf. (I tried this with one and two dashes in front of the flags, in case I'd read wrong and it was --pdf and not -pdf). It should be the same to run it with just the file names, not the flags: ./fop.sh rcserver.fo rcserver.pdf But it isn't: it crashes in a completely different way, repeating this warning several hundred times: [ERROR] Error building hyphenation tree for language en [ERROR] Couldn't find hyphenation pattern en Which baffles me completely. Isn't it supposed to default to en (English) when it can't find any other hyphenation patterns? Shouldn't it give up and just not hyphenate? Anyway, if you've got any clue about it, I'd appreciate it. - To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]
Strange hyphenation
Hi, all! Some weeks ago I complained to the wrong hyphenation in Russian pdf-output from fop. I have refused from hyphenation at all and all was all right till today. Now I have text with 2-lines title and sometime the first word on the page is truncated. It doesn't happen if I have 1-line title. Of course I can short the title but this case looks very similar to the wrong hyphenation. And it happens with English words too! For example if the word database must be on the next page, I have atabase as the first word on the next page. O.T. asked me to show fo-file but I don't know how to produce it with fop from xml and xsl files. May be somebody else have met this feature with 2-lines article title? -- Best regards Alexander Lepekhine - To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]
Re: Strange hyphenation
lepekhine wrote: Some weeks ago I complained to the wrong hyphenation in Russian pdf-output from fop. I have refused from hyphenation at all and all was all right till today. Now I have text with 2-lines title and sometime the first word on the page is truncated. It doesn't happen if I have 1-line title. Of course I can short the title but this case looks very similar to the wrong hyphenation. And it happens with English words too! For example if the word database must be on the next page, I have atabase as the first word on the next page. O.T. asked me to show fo-file but I don't know how to produce it with fop from xml and xsl files. Ok, open a bug and attach xml+xsl, we'll see what's going on. btw, what is FOP version are you talking about? Have you tried 0.20.5rc? -- Oleg Tkachenko eXperanto team Multiconn Technologies, Israel - To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]
Re: Strange hyphenation
Thank you very much! On Monday 25 November 2002 17:31, Oleg Tkachenko wrote: lepekhine wrote: Could you help me with the very strange hyphenation of Russian words? I use fop 0.20.4 and docbook 1.55.0 In the multysyllable russian words sometimes the middle syllable is left at the and of line, then goes hyphen mark and the rest of word (without middle syllable) goes on the next line. In English it corresponds to the next division of word hyphenation for example: phen- hynation. How does generated fo look like? Did you set language property? I have set language in book lang=ru I did not generate fo file but if you are interested in I will. Now I am fully satisfied with the absence of hyphenation. Thank you once more. Alexander Lepekhine - To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]
Strange hyphenation
Hi all! Could you help me with the very strange hyphenation of Russian words? I use fop 0.20.4 and docbook 1.55.0 In the multysyllable russian words sometimes the middle syllable is left at the and of line, then goes hyphen mark and the rest of word (without middle syllable) goes on the next line. In English it corresponds to the next division of word hyphenation for example: phen- hynation. Can I refuse of auto hyphenation at all? -- Best regards Alexander Lepekhine - To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]
Re: Strange hyphenation
lepekhine wrote: Could you help me with the very strange hyphenation of Russian words? I use fop 0.20.4 and docbook 1.55.0 In the multysyllable russian words sometimes the middle syllable is left at the and of line, then goes hyphen mark and the rest of word (without middle syllable) goes on the next line. In English it corresponds to the next division of word hyphenation for example: phen- hynation. How does generated fo look like? Did you set language property? Can I refuse of auto hyphenation at all? hyphenate=false, see http://www.w3.org/TR/xsl/slice7.html#hyphenate -- Oleg Tkachenko eXperanto team Multiconn Technologies, Israel - To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]
Re: Hyphenation
Is there a way to enforce TeX's behavior? Not prepackaged. You are invited to hack around in the files in layout/hyphenation. OK, I did: It is all in org.apache.fop.layout.LineArea.java. I altered the hyphenation strategie as follows: If there is a hyphen in a word and the character data before the hyphen will fit into the current line, hyphenation will be done at the existing hyphen and nowhere else in the word. If there is a hyphen but not enough space left do not hyphenate at all except for the case when this word is the first in a line (-the column/cell is not wide enough to hold the word in one piece). In the latter case fall back to the standard hyphenation strategie. There is also a bug (I think) in LineArea: If the word to be hyphenated is the first in a line an additional space is put in front of it. I tried to solve it by not assigning a 'space' as a spacer but a zero width space (\u200B) in this case. But I still end up with a visible space in the PDF document. I will post my LineArea.java as soon as I have it cleaned up a bit. :-) Ralf
Hyphenation
Hi everybody. Is FOP supposed to behave the same as (La)TeX does when evaluating the hyphenation patterns? The problem I have is with words already containing a hyphen character. Such words are hyphenated in TeX at no other position. For example 'Leinfelden-Echterdingen', 'Leinfelden--Echterdingen' and 'Leinfelden Echterdingen' are hyphenated: Leinfelden-Echterdingen; Leinfelden--Echterdingen; Lein-fel-den Ech-ter-din-gen. FOP seems to ignore the hyphen in 'Leinfelden-Echterdingen' and hyphenates like 'Leinfelden-Echter-dingen', for example. Is there a way to enforce TeX's behavior? TIA Ralf Steppacher
Re: Hyphenation
Ralf Steppacher wrote: Is FOP supposed to behave the same as (La)TeX does when evaluating the hyphenation patterns? I don't think so. Is there a way to enforce TeX's behavior? Not prepackaged. You are invited to hack around in the files in layout/hyphenation. J.Pietschmann
Hi all, the second Release Candidate for 0.20.3 (Maintenance release) is finally available at http://xml.apache.org/dist/fop for downloading and testing. There will be no new functionality added and if no new bugs show up FOP 0.20.3 will be released next week (target date is 27. Februar ;-) Changes from previous Release Candidate: - Proper use of font encodings for native fonts Submitted by: Peter S. Housel (housel@acm.org) and Rainer Garus (rainer.garus@arcor.de) - jimi.jar removed for license reasons - Added Printing from a servlet example Submitted by: Ralph LaChance Ralph_LaChance@compuserve.com - make sure label-end() value gets calculated (fix for bug #6094, endless loop while processing list-item-label)) - Improved i18n support for AWT viewer (Japanese dialogs) Submitted by: Satoshi Ishigami (ishigami@victokai.co.jp) - Generated text is written in UTF-8 encoding instead of ISO-8859-1 - MIF output strings escaping (see bug #1332) Submitted by: Vicente Salvador (vicentesalvador@netscape.net) - Fixed NoSuchElementException in MIF renderer (Christian Geisert) - Added Polish Hyphenation based on TeX hypenation Submitted by: Marcin Kuthan [M.Kuthan@elka.pw.edu.pl] - Added greek Hyphenation Submitted by: Dimitris Kamenopoulos el97146@mail.ntua.gr - Added hungarian Hyphenation Submitted by: Simon Geza simon@mail.drotposta.hu Just a reminder: This version has been updated to XSL 1.0 Recommendation syntax and requires JDK 1.2 or better. [ANNOUNCEMENT] Fop 0.20.3 released
Hi all, just in case someone hasn't noticed yet ;-) FOP 0.20.3 is finally available at http://xml.apache.org/dist/fop for downloading. The main reason for this release is the conformance to the XSL-FO Version 1.0 W3C Recommendation. Other changes include: - support for CMYK and embedded ICC profiles in jpeg images - support for EPS images - improved font encodings for native (Acrobat) fonts - i18n improvments for AWT viewer - support for letter-spacing - Polish, Greek, and Hungarian hyphenation - and of course a lot of bugfixes... Changes from previous Release Candidate (rc2): - added missing japanese resource for AWT viewer - fixed Markers - updated logkit from 1.0b4 to 1.0 Enjoy, Christian