Hello,
By the way, I wonder if hyphens should count as split characters? The 
default iText implementation recognizes '-' (hyphen-minus, \u002D) as a 
split character, but this character is commonly used as a minus sign and 
since you don't want a negative number to become separated from its 
sign, there should be no a break after a '-' character.

So, right now I'm using a slightly modified version of the default 
SplitCharacter implementation :
...
// note : '–' is not the same as '-' : this is a dash.
if (c <= ' ' || c == '–') {
return true;
}
...
I've considered making it more universal, but Wikipedia gave me at least 
5 different dashes, three hyphens and a minus sign, and I'm afraid the 
list is not yet complete.
The Unicode documentation on the matter is not a model of simplicity ( 
http://www.unicode.org/unicode/reports/tr14 ) but it contains a 
reference algorithm for breaking lines. If I have time, I might try and 
implement it in iText, but I'm not sure if it's worth it...

Damien Drix

-------------------------------------------------------------------------
Using Tomcat but need to do more? Need to support web services, security?
Get stuff done quickly with pre-integrated technology to make your job easier
Download IBM WebSphere Application Server v.1.0.1 based on Apache Geronimo
http://sel.as-us.falkag.net/sel?cmd=lnk&kid=120709&bid=263057&dat=121642
_______________________________________________
iText-questions mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/itext-questions

Reply via email to