Re: RE: Does FOP support non-breaking hyphens?

2009-04-30 Thread kkapelon

 
 May be post a fo snippet showing what you give to fop and then describe your
 expected and the actual output.
 

Thank you for your answer!

I am using FOP 0.95 (latest stable at the time of writing).

I am trying the following fo snippet.

fo:block
This is the example with 2011 (as char) non 
breaking hyphen: 
This#8209;is#8209;a#8209;line#8209;with#8209;8209#8209;characters 
/fo:block
fo:block
This is the example with 2011 (as hex) non 
breaking hyphen: 
This#x2011;is#x2011;a#x2011;line#x2011;with#x2011;8209#x2011;characters 
/fo:block
fo:block
This is the example with 2011 (as number) non 
breaking hyphen: 
This#8209;is#8209;a#8209;line#8209;with#8209;8209#8209;characters 
/fo:block
fo:block This is the example ZZ with an 
HTML shy hyphen: 
Bigand­long­word­that­should­break­in­a­logical­place­and­not­go­to­the­next­line
 /fo:block
fo:block This is the example Z with non 
breaking space : 
Big#160;and#160;long#160;word#160;that#160;should#160;break#160;in#160;a#160;logical#160;place#160;and#160;not#160;go#160;to#160;the#160;next#160;line
 /fo:block

Comments:

- The hyphens in the first block are hex 2011 chars inserted directly into the 
FO file from the VIM editor
- The hyphens in second block are entities with number 8209
- The hyphens in the third block are entities with hex 2011
- There are hyphens in 4th block with hex AD inserted directly by VIM
- The non-breaking-space in the 5th block are entities with number 160

I run this snippet with FOP using
fop -fo hyphens.xml hyphens.pdf

Expected output

First Block: This-is-a-line-with-8209-characters sentence appearing with 
hyphens but it should not break at any point.
Second block: The same as first
Third block: The same as first
Fourth block: The sentence Bigandlong... with no visible hyphens. Only one 
visible hyphen when it breaks
Fifth block: The sentence Big and long... with spaces but no break at all

Actual output
First block:The sentence appearing with # characters instead of hyphens
Second block: Same as first
Third block same as first
Fourth block: As expected (so this works OK)
Fifth block: As expeected (so this works OK)

Thus the problem is that I get the # (sharp symbol) instead of a 
non-breaking-hyphen in the PDF output.

I hope this is more clear.








-
To unsubscribe, e-mail: fop-users-unsubscr...@xmlgraphics.apache.org
For additional commands, e-mail: fop-users-h...@xmlgraphics.apache.org



Using keep-together.within-column and break-after results in missing text

2009-04-30 Thread kkapelon

Hello 

I am using FOP 0.95 (latest stable at the time of writing) and trying the 
following FO file.

?xml version=1.0 encoding=utf-8?
fo:root xmlns:fo=http://www.w3.org/1999/XSL/Format;

fo:layout-master-set
fo:simple-page-master page-width=210mm page-height=297mm 
font-family=LiberationSerif master-name=pim-page
fo:region-body margin-left=2.5cm margin-bottom=2cm 
margin-right=2.5cm margin-top=2cm/
/fo:simple-page-master
/fo:layout-master-set

fo:page-sequence master-reference=pim-page
fo:flow flow-name=xsl-region-body
fo:table border=1px
fo:table-body
fo:table-row 
keep-together.within-column=always
fo:table-cell
fo:block

fo:block 
break-after=page  
Text on 
the first page. A page break should follow.
/fo:block
fo:blockText 
on the next page after the break/fo:block

/fo:block
/fo:table-cell
/fo:table-row
/fo:table-body
/fo:table
/fo:flow
/fo:page-sequence
/fo:root

I then convert it to PDF with

fop -fo minimal.xml minimal.PDF

Expected output

Either (option 1):

A PDF file with two pages with Text on the first .. shown in the first page 
and Text on the next... shown in the second (this means that break-after 
prevails)

Or (option 2):
A PDF file with only one page that contains both of the above sentences (the 
keep-together prevails)

Actual Output

A PDF file with two pages. Only the Text on the first.. appears on the 
document. The rest of Text on the next... is completely missing (not shown at 
all).

What am I missing?
Is this an issue or my mistake?

Thank you.




-
To unsubscribe, e-mail: fop-users-unsubscr...@xmlgraphics.apache.org
For additional commands, e-mail: fop-users-h...@xmlgraphics.apache.org



RE: RE: Does FOP support non-breaking hyphens?

2009-04-30 Thread Manuel Mall
Thank you - yes it does make it more clear.

The # is an indication that the font your are using has no matching glyph
for the non breaking hyphen.

From your initial post I incorrectly assumed that the problem was incorrect
line breaking behaviour of the non breaking hyphen. Obviously that is not
the case.

For the time being the only solution I can offer is to find a font that
contains a glyph for that code point. FOP is not doing anything wrong as
such. Although an argument could be made that FOP should substitute the
glyph for a normal hyphen when there is no glyph for the non breaking hyphen
in the selected font. However, 'intelligent' glyph substitution is a more
generic problem which I believe FOP has not tried to tackle so far.

Manuel

-Original Message-
From: kkape...@freemail.gr [mailto:kkape...@freemail.gr] 
Sent: Thursday, 30 April 2009 4:23 PM
To: fop-users@xmlgraphics.apache.org
Subject: Re: RE: Does FOP support non-breaking hyphens?


 
 May be post a fo snippet showing what you give to fop and then describe
your
 expected and the actual output.
 

Thank you for your answer!

I am using FOP 0.95 (latest stable at the time of writing).

I am trying the following fo snippet.

fo:block
This is the example with 2011 (as char) non
breaking hyphen:
This#8209;is#8209;a#8209;line#8209;with#8209;8209#8209;characters 
/fo:block
fo:block
This is the example with 2011 (as hex) non
breaking hyphen:
This#x2011;is#x2011;a#x2011;line#x2011;with#x2011;8209#x2011;character
s 
/fo:block
fo:block
This is the example with 2011 (as number)
non breaking hyphen:
This#8209;is#8209;a#8209;line#8209;with#8209;8209#8209;characters 
/fo:block
fo:block This is the example ZZ with
an HTML shy hyphen:
Bigand-long-word-that-should-break-in-a-logical-place-and-not-go-to-the-next
-line /fo:block
fo:block This is the example Z with non
breaking space :
Big#160;and#160;long#160;word#160;that#160;should#160;break#160;in#1
60;a#160;logical#160;place#160;and#160;not#160;go#160;to#160;the#160
;next#160;line /fo:block

Comments:

- The hyphens in the first block are hex 2011 chars inserted directly into
the FO file from the VIM editor
- The hyphens in second block are entities with number 8209
- The hyphens in the third block are entities with hex 2011
- There are hyphens in 4th block with hex AD inserted directly by VIM
- The non-breaking-space in the 5th block are entities with number 160

I run this snippet with FOP using
fop -fo hyphens.xml hyphens.pdf

Expected output

First Block: This-is-a-line-with-8209-characters sentence appearing with
hyphens but it should not break at any point.
Second block: The same as first
Third block: The same as first
Fourth block: The sentence Bigandlong... with no visible hyphens. Only one
visible hyphen when it breaks
Fifth block: The sentence Big and long... with spaces but no break at all

Actual output
First block:The sentence appearing with # characters instead of hyphens
Second block: Same as first
Third block same as first
Fourth block: As expected (so this works OK)
Fifth block: As expeected (so this works OK)

Thus the problem is that I get the # (sharp symbol) instead of a
non-breaking-hyphen in the PDF output.

I hope this is more clear.








-
To unsubscribe, e-mail: fop-users-unsubscr...@xmlgraphics.apache.org
For additional commands, e-mail: fop-users-h...@xmlgraphics.apache.org

-
To unsubscribe, e-mail: fop-users-unsubscr...@xmlgraphics.apache.org
For additional commands, e-mail: fop-users-h...@xmlgraphics.apache.org



Re: Using keep-together.within-column and break-after results in missing text

2009-04-30 Thread Vincent Hennebert
Hi,

kkapelon freemail gr wrote:
 Hello 
 
 I am using FOP 0.95 (latest stable at the time of writing) and trying the 
 following FO file.
 
snip/
 
 I then convert it to PDF with
 
 fop -fo minimal.xml minimal.PDF
 
 Expected output
 
 Either (option 1):
 
 A PDF file with two pages with Text on the first .. shown in the first page 
 and Text on the next... shown in the second (this means that break-after 
 prevails)

This is the expected output. Breaks must always be honoured, even if
there are surrounding keeps.


 Or (option 2):
 A PDF file with only one page that contains both of the above sentences (the 
 keep-together prevails)
 
 Actual Output
 
 A PDF file with two pages. Only the Text on the first.. appears on the 
 document. The rest of Text on the next... is completely missing (not shown 
 at all).
 
 What am I missing?
 Is this an issue or my mistake?

This is a bug. Can you please file a bug report on Bugzilla, attaching
your simple example?
https://issues.apache.org/bugzilla/enter_bug.cgi?product=Fop


Thanks,
Vincent

-
To unsubscribe, e-mail: fop-users-unsubscr...@xmlgraphics.apache.org
For additional commands, e-mail: fop-users-h...@xmlgraphics.apache.org



Incorrect display of accented characters in plain font

2009-04-30 Thread Griffin,Sean
Fellow FOP users,
I am looking for some help in an issue I am investigating for one of my clients 
that actually doesn't appear to have much to do with FOP at all.  However, 
since there's a good chance that someone on this list knows more than I do 
about how fonts map to glyphs in PostScript, I figure this might be a good pool 
to ask.

The issue is that a PDF generated from FOP with accented characters stored in 
the FO documented as Windows-1252 encoding, in either Helvetica or Times fonts, 
will display just fine when viewed in a PDF viewer, and they'll print just fine 
when printed from the PDF viewer.  However, when that PDF is ran through a 
PostScript interpreter (Preview SDK from Liberty Systems), the accented 
characters are mangled.  For example, é converts to D.  What's strange is that 
this only happens if it's plain font -- not bold, italic, or bold-italic.

Now I understand that plain, bold, italic, and bold-italic are 4 different 
fonts, and I've looked through the PostScript specification to see how the 
interpreter is supposed to map the font to the glyph, but I still can't figure 
out what's going on.  And finally, when trying this same code execution here in 
house, in the US (the client is in France), I cannot reproduce the problem.

Does anyone know if PostScript interpreters look anywhere on the PC itself 
(this is Windows) to find the font glyph information, or does anyone know why 
the decoding of characters would be different for one font than the other, even 
within the same font family?

Thanks ahead of time for any clues you can offer.

Sean Griffin | MSVC Architect | Cerner Corporation | 816.201.1599 | 
sgrif...@cerner.com | www.cerner.com http://www.cerner.com/ 


--
CONFIDENTIALITY NOTICE This message and any included attachments are from 
Cerner Corporation and are intended only for the addressee. The information 
contained in this message is confidential and may constitute inside or 
non-public information under international, federal, or state securities laws. 
Unauthorized forwarding, printing, copying, distribution, or use of such 
information is strictly prohibited and may be unlawful. If you are not the 
addressee, please promptly delete this message and notify the sender of the 
delivery error by e-mail or you may call Cerner's corporate offices in Kansas 
City, Missouri, U.S.A at (+1) (816)221-1024.


RE: Incorrect display of accented characters in plain font

2009-04-30 Thread Griffin,Sean
I need to clarify something I have below.  It should be this:

 

when that PDF is ran through a PostScript interpreter (Preview SDK from Liberty 
Systems) and sent to the printer, the accented characters are mangled on the 
printed output.




 

From: Griffin,Sean [mailto:sgrif...@cerner.com] 
Sent: Thursday, April 30, 2009 9:04 AM
To: fop-users@xmlgraphics.apache.org
Subject: Incorrect display of accented characters in plain font

 

Fellow FOP users,

I am looking for some help in an issue I am investigating for one of my clients 
that actually doesn't appear to have much to do with FOP at all.  However, 
since there's a good chance that someone on this list knows more than I do 
about how fonts map to glyphs in PostScript, I figure this might be a good pool 
to ask.

The issue is that a PDF generated from FOP with accented characters stored in 
the FO documented as Windows-1252 encoding, in either Helvetica or Times fonts, 
will display just fine when viewed in a PDF viewer, and they'll print just fine 
when printed from the PDF viewer.  However, when that PDF is ran through a 
PostScript interpreter (Preview SDK from Liberty Systems), the accented 
characters are mangled.  For example, é converts to D.  What's strange is that 
this only happens if it's plain font -- not bold, italic, or bold-italic.

Now I understand that plain, bold, italic, and bold-italic are 4 different 
fonts, and I've looked through the PostScript specification to see how the 
interpreter is supposed to map the font to the glyph, but I still can't figure 
out what's going on.  And finally, when trying this same code execution here in 
house, in the US (the client is in France), I cannot reproduce the problem.

Does anyone know if PostScript interpreters look anywhere on the PC itself 
(this is Windows) to find the font glyph information, or does anyone know why 
the decoding of characters would be different for one font than the other, even 
within the same font family?

Thanks ahead of time for any clues you can offer.

Sean Griffin | MSVC Architect | Cerner Corporation | 816.201.1599 | 
sgrif...@cerner.com mailto:sgrif...@cerner.com  | www.cerner.com 
http://www.cerner.com/ 



CONFIDENTIALITY NOTICE This message and any included attachments are from 
Cerner Corporation and are intended only for the addressee. The information 
contained in this message is confidential and may constitute inside or 
non-public information under international, federal, or state securities laws. 
Unauthorized forwarding, printing, copying, distribution, or use of such 
information is strictly prohibited and may be unlawful. If you are not the 
addressee, please promptly delete this message and notify the sender of the 
delivery error by e-mail or you may call Cerner's corporate offices in Kansas 
City, Missouri, U.S.A at (+1) (816)221-1024.