Re: hyphenation

2004-12-17 Thread J.Pietschmann
robert frapples wrote:
This leads me to believe that I should be using 7.  With 7 or with 1
(and no other 1's), it still breaks:
Frap-
ples,Robert(Age-
ncy)
I also tried setting this in my {src-dir}/src/hyph/en.xml and
recompiling fop.jar.  Still no luck.
Try running
 java -classpath ...fop.jar \
   org.apache.fop.layout.hyphenation.HyphenationTree
This class has a main function, which can be used for debugging
(check the source for details). You can use this to check whether
the problem is with the hyphenation library code and/or your
pattern definitions or whether it is in the code invoking the
hyphenator in the line layout.
J.Pietschmann
-
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]


RE: hyphenation

2004-12-15 Thread Andreas L. Delmelle
> -----Original Message-----
> From: robert frapples [mailto:[EMAIL PROTECTED]
> 

Hi,

> "Odd numbers represent an acceptable location for a hyphen, with
> higher values overriding lower inhibiting values. 

Sorry. Indeed! Don't know where my head was... Upside-down it seems :-)

> 
> This leads me to believe that I should be using 7.  With 7 or with 1
> (and no other 1's), it still breaks:
> Frap-
> ples,Robert(Age-
> ncy)
> 

Beats me... Anyone who can shed some light on this?

TIA!

Greetz,

Andreas




Re: hyphenation

2004-12-15 Thread robert frapples
"Odd numbers represent an acceptable location for a hyphen, with
higher values overriding lower inhibiting values. . .Due to a bug
currently patterns with values of 8 and greater don't have an effect,
so don't wonder." (from http://xml.apache.org/fop/hyphenation.html)

This leads me to believe that I should be using 7.  With 7 or with 1
(and no other 1's), it still breaks:
Frap-
ples,Robert(Age-
ncy)

I also tried setting this in my {src-dir}/src/hyph/en.xml and
recompiling fop.jar.  Still no luck.  Later on (probably tomorrow)
I'll attach an example xml,xsl,en.xml snippet.

On Tue, 14 Dec 2004 18:54:46 +0100, Andreas L. Delmelle
<[EMAIL PROTECTED]> wrote:
> > Just to make sure, if I added to <classes>:
> > )
> > (
> > ,
> >
> > and to <patterns>:
> > 7(
> > )7
> > ,7
> >
> > And the text: "Frapples,Robert(Agency)" is split at 15
> > characters, I should get:
> > "Frapples,-
> > Robert(Agency)"
> >
> > Right?
> 
> Not necessarily... '7' indicates an acceptable possibility for hyphenating,
> but as long as there's a possibility of breaking it at a '3', the latter
> will be chosen.
> 
> Besides that:
> F-R-A-P-P-L-E-S-,-R-O-B-E-R-T
> 
> is 15 characters, so I'd expect it to break between 'Robert' and '('.
> 
> What result do you get?




RE: hyphenation

2004-12-14 Thread Andreas L. Delmelle
> -----Original Message-----
> From: robert frapples [mailto:[EMAIL PROTECTED]
>

Hi,

> Just to make sure, if I added to <classes>:
> )
> (
> ,
>
> and to <patterns>:
> 7(
> )7
> ,7
>
> And the text: "Frapples,Robert(Agency)" is split at 15
> characters, I should get:
> "Frapples,-
> Robert(Agency)"
>
> Right?

Not necessarily... '7' indicates an acceptable possibility for hyphenating,
but as long as there's a possibility of breaking it at a '3', the latter
will be chosen.
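
A minimal sketch of how Liang-style patterns are usually described as combining, in case it helps with debugging: every matching pattern contributes its digits to the gaps between letters, the highest digit at each gap wins, and a gap with an odd winning value is a permitted break. The class name LiangSketch and its methods are hypothetical and are not FOP's HyphenationTree API. The sketch skips character classes, word-boundary dots and minimum remaining character counts, and it does not model which of several permitted breaks the line layout finally picks.

```java
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

/** Illustration only: a tiny Liang-style matcher, not FOP's implementation. */
final class LiangSketch {
    // Pattern letters -> inter-letter values (array length = letters.length() + 1).
    private final Map<String, int[]> patterns = new LinkedHashMap<>();

    /** Registers a pattern such as ",7" or "7(": digits are values, anything else is a letter. */
    void addPattern(String pattern) {
        StringBuilder letters = new StringBuilder();
        List<Integer> values = new ArrayList<>();
        values.add(0);
        for (char c : pattern.toCharArray()) {
            if (Character.isDigit(c)) {
                values.set(values.size() - 1, c - '0');
            } else {
                letters.append(c);
                values.add(0);
            }
        }
        int[] v = new int[values.size()];
        for (int i = 0; i < v.length; i++) {
            v[i] = values.get(i);
        }
        patterns.put(letters.toString(), v);
    }

    /** result[i] is the highest value any matching pattern assigns to the gap before word.charAt(i). */
    int[] values(String word) {
        int[] result = new int[word.length() + 1];
        for (int start = 0; start < word.length(); start++) {
            for (Map.Entry<String, int[]> e : patterns.entrySet()) {
                if (word.startsWith(e.getKey(), start)) {
                    int[] v = e.getValue();
                    for (int k = 0; k < v.length; k++) {
                        result[start + k] = Math.max(result[start + k], v[k]);
                    }
                }
            }
        }
        return result;
    }

    /** Offsets inside the word where a break is permitted (winning value is odd). */
    List<Integer> breakOffsets(String word) {
        int[] v = values(word);
        List<Integer> out = new ArrayList<>();
        for (int i = 1; i < word.length(); i++) {
            if (v[i] % 2 == 1) {
                out.add(i);
            }
        }
        return out;
    }

    public static void main(String[] args) {
        LiangSketch h = new LiangSketch();
        h.addPattern(",7");
        h.addPattern("7(");
        h.addPattern(")7");
        System.out.println(h.breakOffsets("Frapples,Robert(Agency)"));
    }
}
```

With only these three patterns the sketch permits breaks after the comma and before the opening parenthesis. If a real hyphenator still breaks elsewhere, the other patterns in en.xml are the first thing to check.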

Besides that:
F-R-A-P-P-L-E-S-,-R-O-B-E-R-T

is 15 characters, so I'd expect it to break between 'Robert' and '('.

What result do you get?

Greetz,

Andreas





Re: hyphenation

2004-12-14 Thread robert frapples
I tried recompiling my fop.jar with the changes in
{source-dir}/hyph/en.xml, still no luck.

Just to make sure, if I added to <classes>:
)
(
,

and to <patterns>:
7(
)7
,7

And the text: "Frapples,Robert(Agency)" is split at 15 characters, I should get:
"Frapples,-
Robert(Agency)"

Right?


On Tue, 14 Dec 2004 07:24:39 -0500, robert frapples
<[EMAIL PROTECTED]> wrote:
> On Mon, 13 Dec 2004 23:17:01 +0100, J.Pietschmann <[EMAIL PROTECTED]> wrote:
> > robert frapples wrote:
> > > C:\fop\hyph\en.xml has:
> > > . . .
> > > 
> > > 5(
> > > ,5
> > > )5
> > > . . .
> > >
> > > Shouldn't this make it tend to break before and after parenthesis, and
> > > after commas? Am I missing something?
> >
> > You have to add parenthesis and comma to the "classes" entry. And
> > you'll get a hyphen unless you redefined the hyphen character to
> > be something like a ZWS.
> > 
> I added, to <classes>:
> (
> )
> ,
> 
> Still no luck.




Re: hyphenation

2004-12-14 Thread robert frapples
On Mon, 13 Dec 2004 23:17:01 +0100, J.Pietschmann <[EMAIL PROTECTED]> wrote:
> robert frapples wrote:
> > C:\fop\hyph\en.xml has:
> > . . .
> > 
> > 5(
> > ,5
> > )5
> > . . .
> >
> > Shouldn't this make it tend to break before and after parenthesis, and
> > after commas? Am I missing something?
> 
> You have to add parenthesis and comma to the "classes" entry. And
> you'll get a hyphen unless you redefined the hyphen character to
> be something like a ZWS.
> 
I added, to <classes>:
(
)
,

Still no luck.







Re: hyphenation

2004-12-13 Thread J.Pietschmann
robert frapples wrote:
C:\fop\hyph\en.xml has:
. . .

5(
,5
)5
. . .
Shouldn't this make it tend to break before and after parenthesis, and
after commas? Am I missing something?
You have to add parenthesis and comma to the "classes" entry. And
you'll get a hyphen unless you redefined the hyphen character to
be something like a ZWS.
J.Pietschmann


Re: hyphenation in single syllable language?

2004-11-11 Thread Simon Pepping
On Thu, Nov 11, 2004 at 05:13:37PM +0100, Andreas L. Delmelle wrote:
> > From: Hoang Tuan Nguy?n D?ng [mailto:[EMAIL PROTECTED]
> >
> 
> Hi,
> 
> 
> > i've try your suggest but there is no change :(.
> > fist at all, i've change my locale in docbook source from 'vi' to
> > 'en', the hyphenation is work well, my word isn't split !
> >
> > But when i try to copy en.xml to vi.xml and define hyphenation-dir in
> > userconfig.xml; when fop-ing, there is no error messages about
> > building hyphenation tree for language vi; but the word is split in
> > PDF document 
> 
> I'm a bit confused here... So, you are 'copying' the XML containing the
> English hyphenation patterns to a new file 'vi.xml'? :-/
> As I see it, this will have exactly the effect I described as 'using English
> hyph patterns for Vietnamese text'. Unless the 'vi.xml' is made to contain
> information specifically about hyphenation in Vietnamese, it's always going
> to lead to undesired results.
> 
> >
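> > When i add those lines:
> > <xsl:attribute-set name="...">
> >   <xsl:attribute name="hyphenate">false</xsl:attribute>
> >   <xsl:attribute name="language">vi</xsl:attribute>
> > </xsl:attribute-set>
> > in my xsl stylesheet and generate fo file with this; nothing change!!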
 
If you would have no file vi.xml and declare the language of the
vietnamese text parts to be 'vi', those parts should not be
hyphenated.

Regards, Simon

-- 
Simon Pepping
home page: http://www.leverkruid.nl





RE: hyphenation in single syllable language?

2004-11-11 Thread Andreas L. Delmelle
> -----Original Message-----
> From: Hoang Tuan Nguy?n D?ng [mailto:[EMAIL PROTECTED]
>

Hi,


> i've try your suggest but there is no change :(.
> fist at all, i've change my locale in docbook source from 'vi' to
> 'en', the hyphenation is work well, my word isn't split !
>
> But when i try to copy en.xml to vi.xml and define hyphenation-dir in
> userconfig.xml; when fop-ing, there is no error messages about
> building hyphenation tree for language vi; but the word is split in
> PDF document 

I'm a bit confused here... So, you are 'copying' the XML containing the
English hyphenation patterns to a new file 'vi.xml'? :-/
As I see it, this will have exactly the effect I described as 'using English
hyph patterns for Vietnamese text'. Unless the 'vi.xml' is made to contain
information specifically about hyphenation in Vietnamese, it's always going
to lead to undesired results.

>
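> When i add those lines:
> <xsl:attribute-set name="...">
>   <xsl:attribute name="hyphenate">false</xsl:attribute>
>   <xsl:attribute name="language">vi</xsl:attribute>
> </xsl:attribute-set>
> in my xsl stylesheet and generate fo file with this; nothing change!!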

All depends on where this attribute-set is used. If this set is in fact
being used on the Vietnamese parts, and hyphenation is still not disabled
for those blocks of text, this would be a bug.

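If you were using two attribute-sets:

<xsl:attribute-set name="...">
  <xsl:attribute name="language">en</xsl:attribute>
  <xsl:attribute name="hyphenate">true</xsl:attribute>
</xsl:attribute-set>

and

<xsl:attribute-set name="...">
  <xsl:attribute name="language">vi</xsl:attribute>
  <xsl:attribute name="hyphenate">false</xsl:attribute>
</xsl:attribute-set>

The XSL should look like:

<fo:block xsl:use-attribute-sets="...">
  ...
</fo:block>

<fo:block xsl:use-attribute-sets="...">
  ...
</fo:block>
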

IIC, it would even be sufficient to only use the first set, and for the
Vietnamese parts, just omit the hyphenation properties.

Hope this helps!

Greetz,

Andreas





RE: hyphenation in single syllable language?

2004-11-11 Thread Andreas L. Delmelle
> -----Original Message-----
> From: Hoàng Tuân Nguy?n Ð?ng [mailto:[EMAIL PROTECTED]
>

Hi,

> i want to know about the capablity to disable spliting a word in fop ?
> hyphenation as i know!!
>
> i've build a Vietnamese pdf document with fop, but there is some
> problem with the result. Vietnamese is a single syllable language!

Well... I see the document in question uses Vietnamese as well as English. I
suspect you have hyphenation turned on for all of the document (?) So most
likely, FOP is using English hyphenation patterns for the Vietnamese parts
as well?

If that is the case, I see two options:
- activate hyphenation for all of the document, and disable it
(hyphenate="false") for the Vietnamese parts
- activate hyphenation (hyphenate="true") only for the English parts

Hope this helps!

Greetz,

Andreas





Re: hyphenation

2004-09-23 Thread Jeremias Maerki
See here:
http://xml.apache.org/fop/hyphenation.html
http://offo.sourceforge.net/
http://nagoya.apache.org/eyebrowse/[EMAIL PROTECTED]&msgId=1881555

On 23.09.2004 14:09:34 Anton Packianathan wrote:
> how can I activate hyphenation for German?


Jeremias Maerki





Re: Hyphenation foundry

2004-06-21 Thread Simon Pepping
Jörg,

On Sun, Jun 20, 2004 at 10:08:46PM +0200, J.Pietschmann wrote:
> Simon Pepping wrote:
> >This would not exclude making it available on a public web page?
> Yes. I'll replace the tarball with another one without the link
> parser but including the PatGen code. Unfortunately I detected the
> problem only after I uploaded the file and started composing the
> message, and I thought I'd leave it for now lest the other work
> go unnoticed.

OK. I am not in a hurry.
 
> >Regarding your items to chew on, would you want these to be part of a
> >web page for this code?
> Why not?

Sure. Just wanted to confirm.

Regards, Simon

-- 
Simon Pepping
home page: http://www.leverkruid.nl





Re: Hyphenation foundry

2004-06-20 Thread J.Pietschmann
Simon Pepping wrote:
Do you want to pursue these efforts further, or do you
want to make them available to others who might be interested?
I wanted to make my current code available to others, in particular
the PatGen port (which unfortunately went missing), in order to
avoid unnecessary duplication  of work.
This would not exclude making it available on a public web page?
Yes. I'll replace the tarball with another one without the link
parser but including the PatGen code. Unfortunately I detected the
problem only after I uploaded the file and started composing the
message, and I thought I'd leave it for now lest the other work
go unnoticed.
Regarding your items to chew on, would you want these to be part of a
web page for this code?
Why not?
J.Pietschmann


Re: Hyphenation foundry

2004-06-20 Thread Clay Leeds
On Jun 20, 2004, at 12:14 PM, Simon Pepping wrote:
Jörg,
On Wed, Jun 16, 2004 at 11:15:42PM +0200, J.Pietschmann wrote:
Simon Pepping wrote:
I think it is time to create a project for the hyphenation files at
Sourceforge. The project should be a home for all sorts of accessories
to FOP, or even to FO processors in general. Do you want to
participate? Do you know a nice name?
Well, sf.net would appeal to a larger body of developers, I think,
and is certainly easier to manage for small projects, but we
can also ask on jakarta-commons, xml-commons and even declare it
a FOP (or XML graphics) subproject.
It is up to you to decide where you want your efforts to be hosted. I
am happy to present it in a sf.net project, but there is little more
that I can do. Do you want to pursue these efforts further, or do you
want to make them available to others who might be interested? I am
afraid that is always a difficult proposition without active
recruitment.
Echoing Simon... Bring it on! The more (components, peripherals & toys) 
the merrier!... However if active recruitment is required be prepared 
for... well, active recruiting...

as for a project name, how about calling it:

The XSL-FO Toybox
=================
http://xslfo-toy-box.sourceforge.net/

or simply:

The Toybox
==========
http://toybox.sourceforge.net/

or:

Toys
====
http://toys.sourceforge.net/

Toys for Geeks
==============
http://toys-for-geeks.sourceforge.net/

XML Toys
========
http://xmltoys.sourceforge.net/

Java XML Toys
=============
http://javaxmltoys.sourceforge.net/

or something... I think I like XMLToys...
Anyway, I just uploaded
 http://cvs.apache.org/~pietsch/t.tar.gz
which contains several unfinished stuff I produced the last year:
- Utilities to generate tables for the Unicode line break property
- A class keeping a line break state according to TR14, which should
  be easier to use than the java.text.BreakIterator for FOP
- A Java port of MySpell
- An attempt at providing a layered hierarchy for spell checking
 and hyphenation interfaces.
- A Java port of the link grammar parser (incomplete, badly designed,
 buggy and without approval of the original authors, *please* use
 only for personal study, don't redistribute).
- An attempt at a morphological analyzer for german words.
Somehow, the simple port of patgen as well as other attempts at
simplifying the current FOP hyphenator are missing, I hope I
remember to upload them tomorrow.
I see in the tar file two parts: java code and data. Would it be
useful to indicate which parts of the code belong to each of the above
items?
Perhaps a classification system such as this for the
project-name/applications/[insert apps here]
project-name/peripherals/[insert peripherals here]
project-name/examples/[insert examples here]
project-name/so-on/[insert so-on here]
project-name/docs/[insert docs here]
project-name/wiki/[insert wiki here]

 buggy and without approval of the original authors, *please* use
 only for personal study, don't redistribute).
This would not exclude making it available on a public web page?
Regarding your items to chew on, would you want these to be part of a
web page for this code?
Regards, Simon
--
Simon Pepping
home page: http://www.leverkruid.nl
Sounds like something of a non-trivial undertaking... :-) Sounds like 
fun!

Web Maestro Clay - <[EMAIL PROTECTED]>
---
My religion is simple. My religion is kindness.
- His Holiness the 14th Dalai Lama of Tibet


Re: Hyphenation foundry

2004-06-20 Thread Simon Pepping
I would like some rather different ideas:

FOX or FO-extra
AmFOra: an amphora for FO goodies
EuFOria: FO goodies that make FO users euphoric
Well-FO-Armed

Simon

On Wed, Jun 16, 2004 at 01:40:21PM -0700, Clay Leeds wrote:
> Sure! I'd love to participate! I don't know how yet, though...

Such a project brings always more work with it than I care to do :-)

-- 
Simon Pepping
home page: http://www.leverkruid.nl





Re: Hyphenation foundry

2004-06-20 Thread Simon Pepping
Jörg,

On Wed, Jun 16, 2004 at 11:15:42PM +0200, J.Pietschmann wrote:
> Simon Pepping wrote:
> >I think it is time to create a project for the hyphenation files at
> >Sourceforge. The project should be a home for all sorts of accessories
> >to FOP, or even to FO processors in general. Do you want to
> >participate? Do you know a nice name?
> 
> Well, sf.net would appeal to a larger body of developers, I think,
> and is certainly easier to manage for small projects, but we
> can also ask on jakarta-commons, xml-commons and even declare it
> a FOP (or XML graphics) subproject.

It is up to you to decide where you want your efforts to be hosted. I
am happy to present it in a sf.net project, but there is little more
that I can do. Do you want to pursue these efforts further, or do you
want to make them available to others who might be interested? I am
afraid that is always a difficult proposition without active
recruitment.
 
> Anyway, I just uploaded
>  http://cvs.apache.org/~pietsch/t.tar.gz
> which contains several unfinished stuff I produced the last year:
> - Utilities to generate tables for the Unicode line break property
> - A class keeping a line break state according to TR14, which should
>   be easier to use than the java.text.BreakIterator for FOP
> - A Java port of MySpell
> - An attempt at providing a layered hierarchy for spell checking
>  and hyphenation interfaces.
> - A Java port of the link grammar parser (incomplete, badly designed,
>  buggy and without approval of the original authors, *please* use
>  only for personal study, don't redistribute).
> - An attempt at a morphological analyzer for german words.
> Somehow, the simple port of patgen as well as other attempts at
> simplifying the current FOP hyphenator are missing, I hope I
> remember to upload them tomorrow.

I see in the tar file two parts: java code and data. Would it be
useful to indicate which parts of the code belong to each of the above
items?

>  buggy and without approval of the original authors, *please* use
>  only for personal study, don't redistribute).

This would not exclude making it available on a public web page?

Regarding your items to chew on, would you want these to be part of a
web page for this code?

Regards, Simon

-- 
Simon Pepping
home page: http://www.leverkruid.nl





Re: Hyphenation foundry [was: Re: proposed font project]

2004-06-17 Thread J.Pietschmann
Peter B. West wrote:
Does Character.UnicodeBlock provide any of this functionality?
I'm not aware of any Java RTL class which reads the original
Unicode data files. Most of the data found its way into the
Character and Character.UnicodeBlock classes though, and
as of Unicode 4.0 the data seems to be stable enough that
outdated data isn't a problem (it never was for the vast
majority of the Java developers). However, the line breaking
properties are not directly accessible, only through the
BreakIterator class. I think BIDI properties aren't directly
accessible either, so there may be still some need to get
Unicode data into some form into the program.
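
A small sketch of what the standard library offers here, using only java.lang.Character and java.text.BreakIterator. It shows the block lookup, the indirect route to line break opportunities through BreakIterator, and Character.getDirectionality, which (as far as I can tell, since Java 1.4) does expose a per-character bidi class, so that part may be worth re-checking. The class name UnicodeLookupSketch and its method names are made up for the illustration.

```java
import java.text.BreakIterator;
import java.util.ArrayList;
import java.util.List;
import java.util.Locale;

/** Illustration only: what the JDK exposes directly versus only through BreakIterator. */
final class UnicodeLookupSketch {

    /** Name of the Unicode block of c, e.g. BASIC_LATIN. This is block data, not the line break class. */
    static String blockOf(char c) {
        Character.UnicodeBlock block = Character.UnicodeBlock.of(c);
        return block == null ? "unassigned" : block.toString();
    }

    /** Boundaries reported by the line BreakIterator, including the start and end of the text. */
    static List<Integer> lineBreaks(String text) {
        BreakIterator it = BreakIterator.getLineInstance(Locale.ROOT);
        it.setText(text);
        List<Integer> out = new ArrayList<>();
        for (int b = it.first(); b != BreakIterator.DONE; b = it.next()) {
            out.add(b);
        }
        return out;
    }

    /** True if the JDK classifies c as a strong right-to-left character. */
    static boolean isRightToLeft(char c) {
        return Character.getDirectionality(c) == Character.DIRECTIONALITY_RIGHT_TO_LEFT;
    }

    public static void main(String[] args) {
        System.out.println(blockOf('a'));
        System.out.println(lineBreaks("hello world"));
        System.out.println(isRightToLeft('\u05D0')); // HEBREW LETTER ALEF
    }
}
```

The raw line break class of a single character is still not available this way, which is the gap the generated tables are meant to fill.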
J.Pietschmann


Re: Hyphenation foundry [was: Re: proposed font project]

2004-06-17 Thread Peter B. West
J.Pietschmann wrote:
Simon Pepping wrote:
I think it is time to create a project for the hyphenation files at
Sourceforge. The project should be a home for all sorts of accessories
to FOP, or even to FO processors in general. Do you want to
participate? Do you know a nice name?

Well, sf.net would appeal to a larger body of developers, I think,
and is certainly easier to manage for small projects, but we
can also ask on jakarta-commons, xml-commons and even declare it
a FOP (or XML graphics) subproject.
Anyway, I just uploaded
 http://cvs.apache.org/~pietsch/t.tar.gz
which contains several unfinished stuff I produced the last year:
- Utilities to generate tables for the Unicode line break property
Does Character.UnicodeBlock provide any of this functionality?
Peter
--
Peter B. West 


Re: Hyphenation foundry [was: Re: proposed font project]

2004-06-17 Thread Peter B. West
Simon Pepping wrote:
Hi Clay,
On Sat, May 29, 2004 at 10:02:37PM -0700, Clay Leeds wrote:
It would also be good to develop some sort of hyphenation foundry...

I think it is time to create a project for the hyphenation files at
Sourceforge. The project should be a home for all sorts of accessories
to FOP, or even to FO processors in general. Do you want to
participate? Do you know a nice name?
Hy-pe
Hy-Phi
Peter
--
Peter B. West 


Re: Hyphenation foundry [was: Re: proposed font project]

2004-06-16 Thread J.Pietschmann
Simon Pepping wrote:
I think it is time to create a project for the hyphenation files at
Sourceforge. The project should be a home for all sorts of accessories
to FOP, or even to FO processors in general. Do you want to
participate? Do you know a nice name?
Well, sf.net would appeal to a larger body of developers, I think,
and is certainly easier to manage for small projects, but we
can also ask on jakarta-commons, xml-commons and even declare it
a FOP (or XML graphics) subproject.
Anyway, I just uploaded
 http://cvs.apache.org/~pietsch/t.tar.gz
which contains several unfinished pieces I produced over the last year:
- Utilities to generate tables for the Unicode line break property
- A class keeping a line break state according to TR14, which should
  be easier to use than the java.text.BreakIterator for FOP
- A Java port of MySpell
- An attempt at providing a layered hierarchy for spell checking
 and hyphenation interfaces.
- A Java port of the link grammar parser (incomplete, badly designed,
 buggy and without approval of the original authors, *please* use
 only for personal study, don't redistribute).
- An attempt at a morphological analyzer for german words.
Somehow, the simple port of patgen as well as other attempts at
simplifying the current FOP hyphenator are missing, I hope I
remember to upload them tomorrow.
If someone wants some problems to chew on:
- Implementation of an optimized trie or ternary or PATRICIA tree.
 Issues here: The FOP implementation packs both tree construction and
 retrieval into a single class, while the data structure is WORM.
 Furthermore, while it is fast, it could be implemented with much
 less memory, especially peak memory during construction. I ultimately
 concluded compiling the data into Java bytecode would be the best.
 Consider inserting the words WORD and WORM. A PATRICIA tree would
 collapse this to
   root: WOR -> leaf D
             -> leaf M
 In order to map this, the root node gets an operation "match string"
 with the string "WOR" leading to the subtree. Statistical compression
 could optimize the necessary operation, like "switch array", match
 2char string, match 3char string, match n-char string etc. May utilize
 BCEL.
- Institutionalized alphabet transformation. This is somewhat of a
 generalization of the hyphenation character classes. Java uses 16bit
 characters, but in many languages it is rare that more than 256
 characters are actually used in words. TeX/PatGen also map the
 characters onto the numbers 1..N (<256), folding character
 classification into the process. Mapping chars onto bytes saves almost
 half the memory. Because there are languages which require more than
 256 characters, at least two implementations of the trie/whatever
 holding the patterns are necessary, one where the keys are byte
 sequences, another with char sequences. Too bad generics aren't ready
 yet, but if the data is byte compiled into a Java class, the compiler
 may analyze the patterns and decide whether bytes are sufficient.
 Stuff like Unicode character normalization should probably be folded
 into the classification/alphabet transformation too. It would be too
 bad if hyphenation failed because someone decided to use unnormalized
 characters like FI LIGATURE.
- API design. Need a hierarchy of interfaces which allow polymorphism
 at various levels:
  + Hyphenator
    implementations: pattern hyphenator, dictionary hyphenator,
    composite hyphenator (delegates to a collection of child
    hyphenators)
  + Pattern hyphenator - pattern storage
    implementations: HashTable (very easy to understand but slow),
    R/W-trie, optimized WORM class, ...
  + Dictionary hyphenator - dictionary ...
 For reuse in interactive applications, R/W storage may be useful (user
 dictionaries)
- Generalized line breaking strategies. Possible strategies
 + naive, break before the first non-space after a space
 + TR14
 + break before any character
 + pattern, regexp or dictionary based
- Other ideas: API for processing the Unicode data files. Optimized
 compile for Unicode properties into Java class data: select the
 properties you want, get it. Use this to get the latest Unicode data
 into your Java applications rather than the outdated stuff in the
 JRE.
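
To make the WORD/WORM example from the first item concrete, here is a toy radix tree in plain Java whose edges carry whole strings, so that inserting both words leaves a root edge "WOR" with the two leaves "D" and "M". It is only a sketch under my own assumptions: the names RadixSketch, Node, insert and contains are hypothetical, nothing here is taken from FOP's current tree classes, and no bytecode generation or BCEL is involved.

```java
import java.util.Map;
import java.util.TreeMap;

/** Illustration only: a PATRICIA-style tree with string-labelled edges. */
final class RadixSketch {
    static final class Node {
        final Map<String, Node> edges = new TreeMap<>();
        boolean terminal; // true if a stored word ends at this node
    }

    final Node root = new Node();

    private static int commonPrefix(String a, String b) {
        int n = Math.min(a.length(), b.length());
        int i = 0;
        while (i < n && a.charAt(i) == b.charAt(i)) {
            i++;
        }
        return i;
    }

    void insert(String word) {
        Node node = root;
        String rest = word;
        while (!rest.isEmpty()) {
            // At most one edge can share a first character with rest.
            String match = null;
            int common = 0;
            for (String label : node.edges.keySet()) {
                common = commonPrefix(label, rest);
                if (common > 0) {
                    match = label;
                    break;
                }
            }
            if (match == null) {
                // Nothing shared: hang the remainder below this node as a new leaf.
                Node leaf = new Node();
                node.edges.put(rest, leaf);
                node = leaf;
                rest = "";
            } else if (common == match.length()) {
                // The whole edge label matches: follow it.
                node = node.edges.get(match);
                rest = rest.substring(common);
            } else {
                // Partial match: split the edge at the shared prefix.
                Node mid = new Node();
                mid.edges.put(match.substring(common), node.edges.remove(match));
                node.edges.put(match.substring(0, common), mid);
                node = mid;
                rest = rest.substring(common);
            }
        }
        node.terminal = true;
    }

    boolean contains(String word) {
        Node node = root;
        String rest = word;
        while (!rest.isEmpty()) {
            Node next = null;
            for (Map.Entry<String, Node> e : node.edges.entrySet()) {
                if (rest.startsWith(e.getKey())) {
                    next = e.getValue();
                    rest = rest.substring(e.getKey().length());
                    break;
                }
            }
            if (next == null) {
                return false;
            }
            node = next;
        }
        return node.terminal;
    }

    public static void main(String[] args) {
        RadixSketch t = new RadixSketch();
        t.insert("WORD");
        t.insert("WORM");
        System.out.println(t.root.edges.keySet());
        System.out.println(t.root.edges.get("WOR").edges.keySet());
    }
}
```

Following an edge here is the "match string" operation described above. A write-once variant could build this structure once and then freeze it into arrays or generated code.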
J.Pietschmann


Re: Hyphenation foundry [was: Re: proposed font project]

2004-06-16 Thread Clay Leeds
On Jun 16, 2004, at 12:20 PM, Simon Pepping wrote:
Hi Clay,
Hi Simon!
On Sat, May 29, 2004 at 10:02:37PM -0700, Clay Leeds wrote:
It would also be good to develop some sort of hyphenation foundry...
I think it is time to create a project for the hyphenation files at
Sourceforge. The project should be a home for all sorts of accessories
to FOP, or even to FO processors in general. Do you want to
participate? Do you know a nice name?
Regards, Simon
Sure! I'd love to participate! I don't know how yet, though...
Ideas for names? I guess it depends on how 'we' want to position this 
foundry. Is the foundry geared toward FOP users?

* fopstuff
* fop-stuff
* fostuff
* fo-stuff
* xslfostuff
* xsl-fo-stuff
* foptoys
* fop-toys
* fotoys
* fo-toys
* xslfotoys
* xsl-fo-toys
* fopaccessories
* fop-accessories
* foaccessories
* fo-accessories
* xslfoaccessories
* xsl-fo-accessories
* fopperipherals
* fop-peripherals
* foperipherals
* fo-peripherals
* xslfoperipherals
* xsl-fo-peripherals
I don't have a particular favorite, although since there are so many, 
it wouldn't be very helpful if I didn't 'choose' one or two. I like the 
ones *with* the hyphen (no pun intended! ;-) -- which makes it easier 
to read):

* xsl-fo-toys
* xsl-fo-stuff
In addition, since we want it to be of broader use (i.e., not just 
FOP), I would think we'd want to use one of the 'fo' or 'xsl-fo' 
prefixes (with or without hyphens) over the 'fop' based ones.

Hope this helps!
Web Maestro Clay <[EMAIL PROTECTED]>
---
There are only 10 kinds of people in the world: those who understand 
binary and those who don't.



Re: Hyphenation Patterns

2004-03-29 Thread Clay Leeds
On Mar 29, 2004, at 11:51 AM, Simon Pepping wrote:
I am about to make the hyphenation patterns for German and French
available on my homepage, www.leverkruid.nl/FOP/index.html.
I intend to find a home for all hyphenation pattern files that have a
license that is not compatible with the Apache license, but which
allows their free distribution. I am thinking of the TeX archive CTAN,
SourceForge or Savannah.
Enjoy,
Simon Pepping
--
Simon Pepping
home page: http://www.leverkruid.nl
You're the bomb, Simon! (TRANSLATION: I was hoping someone might step 
up to this challenge... Thank you very much! :-))

Web Maestro Clay


Re: Hyphenation Patterns

2004-03-29 Thread Simon Pepping
On Thu, Mar 25, 2004 at 07:00:45AM -0800, Clay Leeds wrote:
> On Mar 25, 2004, at 1:39 AM, Chris Bowditch wrote:
> >Waggershauser, Thomas wrote:
> >
> >>Hello,
> >>I need Hyphenation Patterns for german and french language. These are 
> >>not available at the fop-homepage, but a reference to a tex source. 
> >>Its mentioned that its possible to convert the tex-format to the fop 
> >>xml-format. I tried really hard to find out how to do this, but I got 
> >>no idea.
> >>Maybe here is someone who can help me...
> >
> I believe that, due to licensing issues, the hyphenation patterns have 
> been removed from old binaries (hence the 'a' in 0.20.4a, 0.20.3a, 
> etc.). I suspect that the best course of action is to follow the 
> instructions on the FOP Hyphenation page[1] to convert TeX hyphenation 
> patterns. The FOP developers sincerely regret that the hyphenation 
> patterns had to be removed. If you have problems with the conversion, 
> please don't hesitate to ask for help on this list.
> 
>   http://xml.apache.org/fop/hyphenation.html

I am about to make the hyphenation patterns for German and French
available on my homepage, www.leverkruid.nl/FOP/index.html.

I intend to find a home for all hyphenation pattern files that have a
license that is not compatible with the Apache license, but which
allows their free distribution. I am thinking of the TeX archive CTAN,
SourceForge or Savannah.

Enjoy,
Simon Pepping

-- 
Simon Pepping
home page: http://www.leverkruid.nl





Re: Hyphenation Patterns

2004-03-25 Thread Clay Leeds
On Mar 25, 2004, at 1:39 AM, Chris Bowditch wrote:
Waggershauser, Thomas wrote:
Hello,
I need Hyphenation Patterns for german and french language. These are 
not available at the fop-homepage, but a reference to a tex source. 
Its mentioned that its possible to convert the tex-format to the fop 
xml-format. I tried really hard to find out how to do this, but I got 
no idea.
Maybe here is someone who can help me...
I have never done the conversion myself, but heard that it is actually 
quite tricky. The French and German hyphenation patterns used to be 
distributed with older versions of FOP, 0.20.4. Older versions of FOP 
binaries can be downloaded from the website. So I suggest you download 
and install FOP 0.20.4 and then extract the hyphenation files from 
there.

Chris
I believe that, due to licensing issues, the hyphenation patterns have 
been removed from old binaries (hence the 'a' in 0.20.4a, 0.20.3a, 
etc.). I suspect that the best course of action is to follow the 
instructions on the FOP Hyphenation page[1] to convert TeX hyphenation 
patterns. The FOP developers sincerely regret that the hyphenation 
patterns had to be removed. If you have problems with the conversion, 
please don't hesitate to ask for help on this list.

  http://xml.apache.org/fop/hyphenation.html
Good luck!
Web Maestro Clay


Re: Hyphenation Patterns

2004-03-25 Thread Chris Bowditch
Waggershauser, Thomas wrote:
Hello,
I need hyphenation patterns for German and French. These are not
available on the FOP homepage, only a reference to a TeX source. It is mentioned
that it is possible to convert the TeX format to the FOP XML format. I tried
really hard to find out how to do this, but I have no idea.
Maybe here is someone who can help me...
I have never done the conversion myself, but heard that it is actually 
quite tricky. The french and german hypenatation patterns used to be 
distributed with older versions of FOP, 0.20.4. Older versions of FOP 
binaries can be downloaded from the website. So I suggest you download 
and install FOP 0.20.4 and then extract the hypenatation files from there.

Chris


Re: RE : hyphenation problem

2004-01-27 Thread J.Pietschmann
VAGNER Rodolphe wrote:
> Do you think the error below could be caused by the absence of an fr.xml
> hyphenation file in version 0.20.5 of FOP? If so, the two problems (adding
> the hyphenation file and this error) may be the same one. If not, does
> someone know how I can fix this, please?
> java.io.InvalidClassException: ...Local class not compatible:
This happens if you try to load a compiled (serialized) hyphenation
pattern instance that was written by an incompatible version of the class.
Try the following:
- get a FOP source distribution
- add your fr.xml to the src/hyph directory
- run
   build.sh package
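
The exception compares the serialVersionUID stored in the compiled pattern file with the one of the class in the local fop.jar. As a rough sketch of how to see which UID a local class carries, the following uses java.io.ObjectStreamClass from the JDK. The class PatternHolder is a hypothetical stand-in, not FOP code; to inspect the real class you would pass org.apache.fop.layout.hyphenation.HyphenationTree.class with fop.jar on the classpath.

```java
import java.io.ObjectStreamClass;
import java.io.Serializable;

public class SerialUidCheck {

    // Hypothetical stand-in for a serialized pattern class; this is NOT FOP code.
    static class PatternHolder implements Serializable {
        private static final long serialVersionUID = 42L;
        String[] patterns = new String[0];
    }

    /** Returns the serialVersionUID the local JVM associates with the class. */
    static long uidOf(Class<?> cls) {
        // lookup() returns null for classes that are not serializable.
        ObjectStreamClass desc = ObjectStreamClass.lookup(cls);
        if (desc == null) {
            throw new IllegalArgumentException(cls + " is not serializable");
        }
        return desc.getSerialVersionUID();
    }

    public static void main(String[] args) {
        System.out.println("PatternHolder UID: " + uidOf(PatternHolder.class));
    }
}
```

If the number printed for the local class differs from the "stream classdesc" value in the error message, the .hyp file was compiled against a different build of the class and has to be regenerated.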
J.Pietschmann


RE : hyphenation problem

2004-01-27 Thread VAGNER Rodolphe
Do you think the error below could be caused by the absence of an fr.xml
hyphenation file in version 0.20.5 of FOP? If so, the two problems (adding
the hyphenation file and this error) may be the same one.
If not, does someone know how I can fix this, please?

java.io.InvalidClassException: org.apache.fop.layout.hyphenation.HyphenationTree; Local class not compatible: stream classdesc serialVersionUID=-7784516835472871478 local class serialVersionUID=6956085100635965297
    at java.io.ObjectStreamClass.validateLocalClass(Unknown Source)
    at java.io.ObjectStreamClass.setClass(Unknown Source)
    at java.io.ObjectInputStream.inputClassDescriptor(Unknown Source)
    at java.io.ObjectInputStream.readObject(Unknown Source)
    at java.io.ObjectInputStream.readObject(Unknown Source)
    at java.io.ObjectInputStream.inputObject(Unknown Source)
    at java.io.ObjectInputStream.readObject(Unknown Source)
    at java.io.ObjectInputStream.readObject(Unknown Source)
    at org.apache.fop.layout.hyphenation.Hyphenator.getFopHyphenationTree(Hyphenator.java:165)
    at org.apache.fop.layout.hyphenation.Hyphenator.getHyphenationTree(Hyphenator.java:91)
    at org.apache.fop.layout.hyphenation.Hyphenator.hyphenate(Hyphenator.java:251)
    at org.apache.fop.layout.LineArea.doHyphenation(LineArea.java:1208)
    at org.apache.fop.layout.LineArea.addText(LineArea.java:857)
    at org.apache.fop.fo.FOText.addRealText(FOText.java:278)
    at org.apache.fop.fo.FOText.addText(FOText.java:252)
    at org.apache.fop.fo.FOText.layout(FOText.java:161)
    at org.apache.fop.fo.FObjMixed.layout(FObjMixed.java:139)
    at org.apache.fop.fo.flow.Block.layout(Block.java:257)
    at org.apache.fop.fo.flow.TableCell.layout(TableCell.java:326)
    at org.apache.fop.fo.flow.TableRow.layout(TableRow.java:360)
    at org.apache.fop.fo.flow.AbstractTableBody.layout(AbstractTableBody.java:236)
    at org.apache.fop.fo.flow.Table.layout(Table.java:302)
    at org.apache.fop.fo.flow.Block.layout(Block.java:257)
    at org.apache.fop.fo.flow.Block.layout(Block.java:257)
    at org.apache.fop.fo.flow.Block.layout(Block.java:257)
    at org.apache.fop.fo.flow.Block.layout(Block.java:257)
    at org.apache.fop.fo.flow.Block.layout(Block.java:257)
    at org.apache.fop.fo.flow.Block.layout(Block.java:257)
    at org.apache.fop.fo.flow.AbstractFlow.layout(AbstractFlow.java:154)
    at org.apache.fop.fo.flow.AbstractFlow.layout(AbstractFlow.java:110)
    at org.apache.fop.fo.pagination.PageSequence.makePage(PageSequence.java:400)
    at org.apache.fop.fo.pagination.PageSequence.format(PageSequence.java:338)
    at org.apache.fop.apps.StreamRenderer.render(StreamRenderer.java:262)
    at org.apache.fop.fo.FOTreeBuilder.endElement(FOTreeBuilder.java:223)
    at org.apache.xerces.parsers.AbstractSAXParser.endElement(Unknown Source)
    at org.apache.xerces.impl.XMLNSDocumentScannerImpl.scanEndElement(Unknown Source)
    at org.apache.xerces.impl.XMLDocumentFragmentScannerImpl$FragmentContentDispatcher.dispatch(Unknown Source)
    at org.apache.xerces.impl.XMLDocumentFragmentScannerImpl.scanDocument(Unknown Source)
    at org.apache.xerces.parsers.DTDConfiguration.parse(Unknown Source)
    at org.apache.xerces.parsers.DTDConfiguration.parse(Unknown Source)
    at org.apache.xerces.parsers.XMLParser.parse(Unknown Source)
    at org.apache.xerces.parsers.AbstractSAXParser.parse(Unknown Source)
    at org.apache.fop.apps.Driver.render(Driver.java:498)
    at org.apache.fop.apps.CommandLineStarter.run(CommandLineStarter.java:106)
    at org.apache.fop.apps.Fop.main(Fop.java:62)





--  -Original Message-
--  From: Peter Menzel [mailto:[EMAIL PROTECTED]
--  Sent: Tuesday, 27 January 2004 10:06
--  To: [EMAIL PROTECTED]
--  Subject: Re: hyphenation problem
--
--
--  Hi
--
--  If you want to include new hyphenation patterns in FOP,
--  it's not enough to zip them into the fop.jar. You have to
--  download the source, copy the patterns into the hyph dir and
--  then run the build script to compile a new jar file.
--
--  Peter
--
--  > Hi,
--  >
--  > I upgraded from 0.20.4 to 0.20.5 and I get the following
--  > error in the log:
--  >
--  > "[ERROR] Couldn't find hyphenation pattern  fr_fr
--  > using general language pattern fr instead."
--  >
--  > I get this even though I've added the fr.hyp file to fop.jar.
--  >
--  > Maybe it's not the right solution, but I believe French
--  > hyphenation was not supported in 0.20.5.
--  >
--  > Could someone help, please?
--  >
--  --
--  [EMAIL PROTECTED]
--  Institut für Medizinische Informatik, Statistik und
--  Epidemiologie (IMISE) Universität Leipzig

RE : hyphenation problem

2004-01-27 Thread VAGNER Rodolphe
I tried the proposed rebuild method, and both points from my last mail are now OK.
Thank you very much.

Ro




Re: hyphenation problem

2004-01-27 Thread Peter Menzel
Hi

If you want to include new hyphenation patterns in FOP, it's not enough
to zip them into the fop.jar. You have to download the source, copy the
patterns into the hyph dir and then run the build script to compile a new
jar file.

Peter



> Hi,
>
> I upgraded from 0.20.4 to 0.20.5 and I get the following error in the
> log:
>
> "[ERROR] Couldn't find hyphenation pattern  fr_fr
> using general language pattern fr instead."
>
> I get this even though I've added the fr.hyp file to fop.jar.
>
> Maybe it's not the right solution, but I believe French hyphenation
> was not supported in 0.20.5.
>
> Could someone help, please?
>
>
>
--
[EMAIL PROTECTED]
Institut für Medizinische Informatik, Statistik und Epidemiologie (IMISE)
Universität Leipzig




Re: hyphenation problem

2004-01-26 Thread J.Pietschmann
VAGNER Rodolphe wrote:
> "[ERROR] Couldn't find hyphenation pattern  fr_fr
> using general language pattern fr instead."
>
> I get this even though I've added the fr.hyp file to fop.jar.
Well, the hyphenator looks for a fr_fr.hyp file, probably because
you have both language="fr" and country="fr" defined. Either omit
the country, or rename the file, or just ignore the message.
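
The lookup order implied by that log message can be sketched roughly as follows. This is an illustration only: the method resolve and the set of available file names are hypothetical, and the fallback order is inferred from the message above rather than copied from FOP's Hyphenator source.

```java
import java.util.Arrays;
import java.util.HashSet;
import java.util.Set;

public class PatternLookup {

    /**
     * Picks a pattern file name for a language/country pair.
     * Tries "language_country.hyp" first, then falls back to "language.hyp".
     * Returns null when neither file is available.
     */
    static String resolve(String language, String country, Set<String> available) {
        if (country != null && !country.isEmpty()) {
            String specific = language + "_" + country + ".hyp";
            if (available.contains(specific)) {
                return specific;
            }
            // This is the point where a "couldn't find ... using general
            // language pattern" message would be logged.
        }
        String general = language + ".hyp";
        return available.contains(general) ? general : null;
    }

    public static void main(String[] args) {
        Set<String> available = new HashSet<>(Arrays.asList("fr.hyp", "en_GB.hyp"));
        System.out.println(resolve("fr", "fr", available));
        System.out.println(resolve("fr", null, available));
    }
}
```

With only fr.hyp present, both calls end up at the same file; the difference is just whether the more specific name was tried first, which is why the message can be ignored.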
J.Pietschmann


Re: Hyphenation question

2003-12-11 Thread Roland Schroth
> I've a problem with the hyphenation in fop. There is a certain word that
> contains a minus character that should not be hyphenated. Is there a way
> to add this word in the hyphenation.xml file, so that it won't get
> hyphenated? Or do I have to translate the minus character to a special
> minus character, and if so, which one?

You could simply exclude this word from hyphenation in the FO code like this:
<fo:inline hyphenate="false">some-word</fo:inline>

As far as I understand the documentation at
http://xml.apache.org/fop/hyphenation.html#patterns
there is no way to exclude words that contain a character equal to the
hyphen character.

From the documentation:
-
<exceptions> contains whitespace-separated words, each of which has either
explicit hyphen characters to denote acceptable breakage points, or no
hyphen characters, to indicate that this word should never be hyphenated,
or contain explicit <hyph> elements for specifying changes of spelling due
to hyphenation (like backen -> bak-ken or Stoffarbe -> Stoff-farbe in the
old german spelling). Exceptions override the patterns described below.
Explicit <hyph> declarations don't work yet (patches welcome). Exceptions
are generally a bit brittle, test carefully.
-

But if you replace the hyphen in the word with a similar-looking character
other than the hyphenation character, you should be able to exclude this
word from hyphenation altogether by putting it into the <exceptions>
section of your hyphenation XML file, as described in the documentation.

Which character to use instead depends on the font you use, at least I
think so. I do not know which character is used for hyphenation by
default. In Arial, for example, there is a hyphen-minus (U+002D) and an
en dash (U+2013). So if one of them is used for hyphenation, you might be
able to use the other for your special word, provided these two characters
are not treated the same (but that may be the case). This is only what I
think; I have not tried it and have no experience with it.
Just have a look at the character table for your font and you should find
characters you might be able to use as alternatives.
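
If the text is generated by a program, the substitution could be done before the FO file is written. The sketch below swaps the hyphen-minus inside one specific word for U+2011 (NON-BREAKING HYPHEN). The choice of U+2011 is an assumption made for this example and is not taken from the FOP documentation; whether the font has a glyph for it and whether FOP leaves it alone is untested here. The class and method names are hypothetical.

```java
public class HyphenSwap {

    static final char HYPHEN_MINUS = '-';
    static final char NON_BREAKING_HYPHEN = '\u2011';

    /**
     * Replaces the hyphen-minus inside every occurrence of `word` in `text`.
     * Hyphens elsewhere in the text are left untouched.
     */
    static String protect(String text, String word) {
        String replacement = word.replace(HYPHEN_MINUS, NON_BREAKING_HYPHEN);
        return text.replace(word, replacement);
    }

    public static void main(String[] args) {
        String out = protect("x-y and some-word", "some-word");
        // Print code points so the swapped character is visible.
        for (char c : out.toCharArray()) {
            System.out.printf("U+%04X ", (int) c);
        }
        System.out.println();
    }
}
```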

Maybe someone else can give some more hints on that.

Kind regards,
Roland






Re: Hyphenation on free text

2003-09-17 Thread Chris Bowditch
From: Sherlane Lam <[EMAIL PROTECTED]>
> Hi all,
>
> I have made a sample.xml containing a long run of free text, as
> shown in the attached document (sample3.xml), made up of
> continuous characters: 'AABBCCC..'
> and so on.  I find that the characters appear out of order after
> the line break (see output.pdf).  I don't know whether
> this is an XSLT problem or an FO problem.  Please
> give me some suggestions.
I believe this behaviour was a bug in 0.20.4. Please upgrade to 0.20.5,
where this problem should be fixed.

Chris

Re: hyphenation again

2003-06-10 Thread Robert P. J. Day
On Tue, 10 Jun 2003, J.Pietschmann wrote:

> anton wrote:

> > 3. The separator sign for hyphenation in the preface is "#" and in the
> 
> See
>   http://xml.apache.org/fop/faq.html#pdf-characters
> Check the docbook XSL which character they actually use.

sadly, it's a "#", so anton will have to override that in his
stylesheet.

rday





Re: hyphenation again

2003-06-10 Thread J.Pietschmann
anton wrote:
> 1. Previously I was able to write the path to my TrueType fonts without
> a drive letter: /aelitha-docbook/fonts/ and now FOP doesn't find the fonts:
>
> [ERROR] Failed to read a font metrics file: Invalid font metrics file:
> arial.xml (Invalid
> fontBaseDir specified: /aelitha-docbook/fonts/ (no protocol:
> /aelitha-docbook/fonts/))
> Only with the full path c:/aelitha-docbook/fonts/ are the fonts
> found. Why? Is it possible to compile some TTFs into FOP, so I don't
> have to use the config.xml?
See
 http://xml.apache.org/fop/faq.html#MalformedURLException
The value must be a valid URL. In some cases the RTL will fall
back to interpreting it as a file name, but this doesn't work
consistently.
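
The "no protocol" part of the error is what java.net.URL reports for a bare path. A small sketch, using only JDK classes, that shows the difference between the bare path and a file: URL built from it (the class and method names are made up for this example):

```java
import java.io.File;
import java.net.MalformedURLException;
import java.net.URL;

public class FontBaseUrl {

    /** True if the string parses as an absolute URL, i.e. it has a protocol. */
    static boolean isValidUrl(String value) {
        try {
            new URL(value);
            return true;
        } catch (MalformedURLException e) {
            return false;
        }
    }

    /** Converts a plain file-system path into a file: URL string. */
    static String toFileUrl(String path) {
        return new File(path).toURI().toString();
    }

    public static void main(String[] args) {
        String raw = "/aelitha-docbook/fonts/";
        System.out.println(raw + " valid URL? " + isValidUrl(raw));
        System.out.println("as URL: " + toFileUrl(raw));
    }
}
```

Writing the value in the configuration as a file: URL rather than as a bare path should avoid relying on the inconsistent fallback.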
> 2. While generating the FO file I still see
>
> Making portrait pages on A4 paper (210mmx297mm)
> No "de" localization of "hyphenation-character" exists; using "en".
> No "de" localization of "hyphenation-push-character-count" exists; using
> "en".
*Shrug* Ask the DocBook folks.
> 3. The separator sign for hyphenation in the preface is "#" and in the
See
 http://xml.apache.org/fop/faq.html#pdf-characters
Check the DocBook XSL to see which character they actually use.



RE: hyphenation again

2003-06-10 Thread anton
Hi!

>>>
The easiest way is to put the XML version in the src/hyph directory and
recompile FOP ("build clean", followed by "build"). The XML file is
being converted to an internal format (serialized java object) as part
of this build process.
>>>

OK, it seems to work, thanks!

I have some further questions in this case, but I fear they are not 100%
FOP-related.

1. Previously I was able to write the path to my TrueType fonts without a
drive letter: /aelitha-docbook/fonts/ and now FOP doesn't find the fonts:

[ERROR] Failed to read a font metrics file: Invalid font metrics file:
arial.xml (Invalid
fontBaseDir specified: /aelitha-docbook/fonts/ (no protocol:
/aelitha-docbook/fonts/))

Only with the full path c:/aelitha-docbook/fonts/ are the fonts
found. Why? Is it possible to compile some TTFs into FOP, so I don't
have to use the config.xml?


2. While generating the FO file I still see

Making portrait pages on A4 paper (210mmx297mm)
No "de" localization of "hyphenation-character" exists; using "en".
No "de" localization of "hyphenation-push-character-count" exists; using
"en".

...

But when FOP renders the FO file there are no warnings and the generated
PDF has hyphenation.


3. The separator sign for hyphenation in the preface is "#" and in the
chapters it is correctly "-". I use DocBook with stylesheet version 1.61.2.


CU
Anton




Re: hyphenation again

2003-06-10 Thread Jeremias Maerki
The easiest way is to put the XML version in the src/hyph directory and
recompile FOP ("build clean", followed by "build"). The XML file is
being converted to an internal format (serialized java object) as part
of this build process.
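
To illustrate what "serialized Java object" means here, the following sketch writes a small pattern container with ObjectOutputStream and reads it back with ObjectInputStream. The class Patterns and the method names compile and load are hypothetical stand-ins; FOP's real class is HyphenationTree and its build step is not reproduced here.

```java
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.io.ObjectInputStream;
import java.io.ObjectOutputStream;
import java.io.Serializable;
import java.io.UncheckedIOException;
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

public class PatternRoundTrip {

    // Hypothetical pattern container; this is NOT FOP's HyphenationTree.
    static class Patterns implements Serializable {
        private static final long serialVersionUID = 1L;
        final List<String> entries;

        Patterns(List<String> entries) {
            this.entries = new ArrayList<>(entries);
        }
    }

    /** Build-time step: turn the in-memory patterns into bytes. */
    static byte[] compile(Patterns patterns) {
        ByteArrayOutputStream bytes = new ByteArrayOutputStream();
        try (ObjectOutputStream out = new ObjectOutputStream(bytes)) {
            out.writeObject(patterns);
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
        return bytes.toByteArray();
    }

    /** Run-time step: read the bytes back into an object. */
    static Patterns load(byte[] data) {
        try (ObjectInputStream in =
                 new ObjectInputStream(new ByteArrayInputStream(data))) {
            return (Patterns) in.readObject();
        } catch (IOException | ClassNotFoundException e) {
            throw new IllegalStateException("cannot load compiled patterns", e);
        }
    }

    public static void main(String[] args) {
        Patterns source = new Patterns(Arrays.asList("a1b", "2c3d"));
        System.out.println(load(compile(source)).entries);
    }
}
```

Because the reader checks the class description stored in the stream, a file compiled by one build is only guaranteed to load with a compatible build of the class, which is why recompiling FOP together with the XML file is the safe route.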

On 10.06.2003 09:42:38 Anton Wardaschko (PA) wrote:
> I got the TeX hyphenation file and the XML version too, and I set the path
> in the userconf.xml, but I read on the command line again and again
> that they are not found. The path to the hyph files is correct, because
> I'm able to embed the TTF fonts. Any suggestions?


Jeremias Maerki





Re: hyphenation pattern

2003-02-27 Thread Clay Leeds
Sorry for leading a list member astray. Jeremias is completely correct,
and I should've left it at "0.20.5rc2 does not include the hyphenation
pattern en_US."

My bad.
Jeremias Maerki wrote:
> Most of the hyphenation files had to be removed because of license
> reasons. We are looking for alternatives so we can once again provide
> hyphenation patterns in many languages. Simply copying them from an
> older distribution doesn't make them legal. Sorry for the inconvenience,
> but this is a serious issue.
> On 27.02.2003 17:41:16 Clay Leeds wrote:
>> 0.20.5rc2 does not include the hyphenation pattern en_US. It only
>> includes the hyphenation pattern en_GB. Sorry, I don't know where to get
>> the other one (you might find it in the SOURCE for 0.20.5rc or 0.20.4,
>> but then you'd have to integrate it into the SOURCE for 0.20.5rc2 and
>> then BUILD FOP yourself).
> Jeremias Maerki
--
Clay Leeds - [EMAIL PROTECTED]
Web Developer - Medata, Inc. - http://www.medata.com
PGP Public Key: https://mail.medata.com/pgp/cleeds.asc


Re: hyphenation pattern

2003-02-27 Thread Jeremias Maerki
Most of the hyphenation files had to be removed because of license
reasons. We are looking for alternatives so we can once again provide
hyphenation patterns in many languages. Simply copying them from an
older distribution doesn't make them legal. Sorry for the inconvenience,
but this is a serious issue.

On 27.02.2003 17:41:16 Clay Leeds wrote:
> 0.20.5rc2 does not include the hyphenation pattern en_US. It only 
> includes the hyphenation pattern en_GB. Sorry, I don't know where to get 
> the other one (you might find it in the SOURCE for 0.20.5rc or 0.20.4, 
> but then you'd have to integrate it into the SOURCE for 0.20.5rc2 and 
> then BUILD FOP yourself).


Jeremias Maerki





Re: hyphenation pattern

2003-02-27 Thread Clay Leeds
Derrick,
0.20.5rc2 does not include the hyphenation pattern en_US. It only 
includes the hyphenation pattern en_GB. Sorry, I don't know where to get 
the other one (you might find it in the SOURCE for 0.20.5rc or 0.20.4, 
but then you'd have to integrate it into the SOURCE for 0.20.5rc2 and 
then BUILD FOP yourself).

Good luck!
--
Clay Leeds - [EMAIL PROTECTED]
Web Developer - Medata, Inc. - http://www.medata.com
PGP Public Key: https://mail.medata.com/pgp/cleeds.asc


Re: Hyphenation

2002-06-24 Thread Ralf Steppacher
> I will post my LineArea.java as soon as I have it cleaned up a bit. :-)
Here it is.
The problem concerning additional space is solved, too.
Ralf
/*
 * $Id: LineArea.java,v 1.53.2.5 2002/02/11 00:43:45 chrisg Exp $
 * Copyright (C) 2001 The Apache Software Foundation. All rights reserved.
 * For details on use and redistribution please refer to the
 * LICENSE file included with these sources.
 */

package org.apache.fop.layout;

// fop
import org.apache.fop.render.Renderer;
import org.apache.fop.messaging.MessageHandler;
import org.apache.fop.layout.inline.*;
import org.apache.fop.datatypes.IDNode;
import org.apache.fop.fo.properties.WrapOption;
import org.apache.fop.fo.properties.WhiteSpaceCollapse;
import org.apache.fop.fo.properties.TextAlign;
import org.apache.fop.fo.properties.TextAlignLast;
import org.apache.fop.fo.properties.LeaderPattern;
import org.apache.fop.fo.properties.Hyphenate;
import org.apache.fop.fo.properties.CountryMaker;
import org.apache.fop.fo.properties.LanguageMaker;
import org.apache.fop.fo.properties.LeaderAlignment;
import org.apache.fop.fo.properties.VerticalAlign;
import org.apache.fop.layout.hyphenation.Hyphenation;
import org.apache.fop.layout.hyphenation.Hyphenator;
import org.apache.fop.configuration.Configuration;

// java
import java.util.Vector;
import java.util.Enumeration;
import java.util.StringTokenizer;
import java.awt.Rectangle;

public class LineArea extends Area {

    protected int lineHeight;
    protected int halfLeading;
    protected int nominalFontSize;
    protected int nominalGlyphHeight;

    protected int allocationHeight;
    protected int startIndent;
    protected int endIndent;

    private int placementOffset;

    private FontState currentFontState;    // not the nominal, which is
                                           // in this.fontState
    private float red, green, blue;
    private int wrapOption;
    private int whiteSpaceCollapse;
    int vAlign;

    /* hyphenation */
    HyphenationProps hyphProps;

    /*
     * the width of text that has definitely made it into the line
     * area
     */
    protected int finalWidth = 0;

    /* the position to shift a link rectangle in order to compensate for links
       embedded within a word */
    protected int embeddedLinkStart = 0;

    /* the width of the current word so far */
    // protected int wordWidth = 0;

    /* values that prev (below) may take */
    protected static final int NOTHING = 0;
    protected static final int WHITESPACE = 1;
    protected static final int TEXT = 2;
    protected static final int MULTIBYTECHAR = 3;

    /* the character type of the previous character */
    protected int prev = NOTHING;

    /* the position in data[] of the start of the current word */
    // protected int wordStart;

    /* the length (in characters) of the current word */
    // protected int wordLength = 0;

    /* width of spaces before current word */
    protected int spaceWidth = 0;

    /*
     * the inline areas that have not yet been added to the line
     * because subsequent characters to come (in a different addText)
     * may be part of the same word
     */
    protected Vector pendingAreas = new Vector();

    /* the width of the pendingAreas */
    protected int pendingWidth = 0;

    /* text-decoration of the previous text */
    protected boolean prevUlState = false;
    protected boolean prevOlState = false;
    protected boolean prevLTState = false;

    public LineArea(FontState fontState, int lineHeight, int halfLeading,
                    int allocationWidth, int startIndent, int endIndent,
                    LineArea prevLineArea) {
        super(fontState);

        this.currentFontState = fontState;
        this.lineHeight = lineHeight;
        this.nominalFontSize = fontState.getFontSize();
        this.nominalGlyphHeight = fontState.getAscender()
                                  - fontState.getDescender();

        this.placementOffset = fontState.getAscender();
        this.contentRectangleWidth = allocationWidth - startIndent
                                     - endIndent;
        this.fontState = fontState;

        this.allocationHeight = this.nominalGlyphHeight;
        this.halfLeading = this.lineHeight - this.allocationHeight;

        this.startIndent = startIndent;
        this.endIndent = endIndent;

        if (prevLineArea != null) {
            Enumeration e = prevLineArea.pendingAreas.elements();
            Box b = null;
            // There might be InlineSpaces at the beginning
            // that should not be there - eat them
            boolean eatMoreSpace = true;
            int eatenWidth = 0;

            while (eatMoreSpace) {
                if (e.hasMoreElements()) {
                    b = (Box)e.nextElement();
                    if (b instanceof InlineSpace) {
                        InlineSpace is = (InlineSpace)b;
                        if (is.isEatable())
                            eatenWidth += is.getSize();
                        else
   

Re: Hyphenation

2002-06-21 Thread Ralf Steppacher
>> Is there a way to enforce TeX's behavior?
> Not prepackaged. You are invited to hack around in the files
> in layout/hyphenation.
OK, I did:
It is all in org.apache.fop.layout.LineArea.java. I altered the
hyphenation strategy as follows:
- If there is a hyphen in a word and the character data before the hyphen
  fits into the current line, hyphenation is done at the existing
  hyphen and nowhere else in the word.
- If there is a hyphen but not enough space left, the word is not
  hyphenated at all, except when it is the first word in a line (i.e. the
  column/cell is not wide enough to hold the word in one piece).
  In that case, fall back to the standard hyphenation strategy.
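
The decision described above can be sketched independently of LineArea.java as follows. This is not the posted patch: the class, the constants and the method chooseBreak are hypothetical, available space is counted in characters instead of real glyph widths, only the first hyphen in the word is considered, and words without any hyphen are assumed to go through the normal pattern-based path.

```java
public class HyphenStrategy {

    /** Result marker: do not break the word; move it to the next line. */
    static final int NO_BREAK = -1;
    /** Result marker: use the regular pattern-based hyphenation. */
    static final int USE_PATTERNS = -2;

    /**
     * @param word        the word that does not fit on the current line
     * @param roomInChars how many characters still fit (stand-in for widths)
     * @param firstInLine whether the word is the first one on the line
     * @return the index just after the hyphen to break at,
     *         or NO_BREAK / USE_PATTERNS
     */
    static int chooseBreak(String word, int roomInChars, boolean firstInLine) {
        int hyphen = word.indexOf('-');
        if (hyphen < 0) {
            return USE_PATTERNS;        // assumption: unchanged default path
        }
        if (hyphen + 1 <= roomInChars) {
            return hyphen + 1;          // break at the existing hyphen only
        }
        // Hyphen present but the part before it does not fit.
        return firstInLine ? USE_PATTERNS : NO_BREAK;
    }

    public static void main(String[] args) {
        System.out.println(chooseBreak("well-known", 6, false));
        System.out.println(chooseBreak("well-known", 3, false));
        System.out.println(chooseBreak("well-known", 3, true));
    }
}
```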

There is also a bug (I think) in LineArea:
If the word to be hyphenated is the first in a line an additional space 
is put in front of it.
I tried to solve it by not assigning a 'space' as a spacer but a zero 
width space (\u200B) in this case. But I still end up with a visible 
space in the PDF document.

I will post my LineArea.java as soon as I have it cleaned up a bit. :-)
Ralf


Re: Hyphenation

2002-06-19 Thread J.Pietschmann
Ralf Steppacher wrote:
> Is FOP supposed to behave the same as (La)TeX does when evaluating the
> hyphenation patterns?
I don't think so.
> Is there a way to enforce TeX's behavior?
Not prepackaged. You are invited to hack around in the files
in layout/hyphenation.
J.Pietschmann