Re: Re: RTF, nested tables, context enhancement - status

2006-05-30 Thread b . ohnsorg

- original Nachricht 

Betreff: Re: RTF, nested tables, context enhancement - status
Gesendet: Fr 26 Mai 2006 16:42:13 CEST
Von: Jeremias Maerki[EMAIL PROTECTED]

 
 On 23.05.2006 08:02:50 b.ohnsorg wrote:
  
  - original Nachricht 
  
  Betreff: Re: RTF, nested tables, context enhancement - status
  Gesendet: Mo 22 Mai 2006 22:30:07 CEST
  Von: Jeremias Maerki[EMAIL PROTECTED]
  
   page-number-citation and page numbering: Both work to at least a
 certain
   degree. Can you elaborate on the problems you're seeing?
  RTF does not know any «citation», so you need to write a character '2',
  if you want a link to display a page information refering to page '2'.
  For example an index at the end of the document does not know, which
  page explains «Barcode» and all links will show a question mark.
 
 That's not correct. RTF has support for page number citations. You
 should take a lot at the RTF specification and create an RTF file in MS
 Word with references so you see how this should be implemented.
If someone has a reliable resource, let me know. I don't own any Micros~1 
component and took OpenOffice-documents. They contain fields like MS-Office, 
but page numbers are hard coded. (tried cross referencing and TOC-mechanism) 

 
  I thought about that last night (while cruising through the
  PDF-renderer to compare my table width «guessing» with the already
 implemented one)
  and draw the following conclusion:
  
  Fact 1: RTF knows page breaks
 
 Wrong. RTF can contain hard page breaks but doesn't have to.
That's what I say: «knows page breaks» (and it's obvious, that I mean 
Insert-Page break-«Manual page break»)

 
  Fact 2: RTF is page-oriented
 
 Wrong. RTF is a flow-oriented format. Microsoft Word is responsible for
 the page break decisions unless you add hard page breaks.
Same intention, other words. Mkay, it's a «stream» of format attributes, but 
it's intended to render to pages, therefore you may insert a page break. If it 
would be a drawing program, intended to render to large sheets of paper to 
stick to walls, it would not include page breaking. Depends on point of view. 
(You may set the page size to any value one may think of, mkay, but that's not 
the point. Another example would be an index or cross reference with the page 
number appended)

 
  Fact 3: RTF aligns elements absolute
 
 Yes.
 
  Fact 4: RTF does not know anything about the document's structuring
 
 Depends on what you mean by structure. RTF supports styles which allow
 some kind of structuring.
Telling a block of text to be Arial, 22pt, underlined and bold is not 
structure, but formatting. You may assume that this is heading level 1 but it 
could also be a capitalized letter or dadaistic poem. What I mean with 
structure is something like: h1May there be light/h2, so it's a heading, 
nothing else (I exclude mistakenly used tags by HTML-beginners).

 
  Conclusion: RTF differs not that much from PDF-rendering (only written
 tokens are a bit different)
  - Use the PDF-layout management system and add a RTF-writer. So RTF
  would look exactly like all PDFs. And there's only a slight difference
  from Office-generated RTFs: page breaks after every page.
 
 That would defeat the purpose of the current RTF output support. The
 clue is that you can edit the generated file and then print it. If you
 predefine the page breaks you make the editing a lot harder for the
 end-user. If you don't need RTF editing, then don't generate RTF because
 PDF offers much better quality.

And that's what I wanted to read: There's no need for all that TOC-ing, page 
number citation and whatsoever. Therefore also indenting makes no sense, esp. 
when nesting blocks, tables and lists. Mkay, there should be a small amount of 
space between a list item and it's bullet, but for editing you don't need any 
formatting attributes - what makes it a lot easier.

 
   
   What's exactly the problem with graphics? We've got pretty good support
   for many cases, even SVG now.
  I was only talking about the current RTF export. AFAIK GIFs are not
 rendered (there was an annoying exception string inside the document).
  
   GIF: The LZW patents are expired but still it's a good idea to forget
   GIF. The legal side is still less than clear.
  But PDF «understands» GIFs, so I can use them. Should be the same with
  RTF, and the only thing I'll do is reading them and transform into a
  RTF-compatible format...if it's not legal I won't do this...
 
 PDF doesn't understand GIFs. Images are converted from GIF to a generic
 bitmap format supported by PDF. In the case of RTF output this code
 hasn't been written, yet. Patches welcome.
That's what I wanted to hear, too.

 
   RTF does support referenced images.
  So there ought to be a switch (render to the document, use references)
 
 ...if someone implements that switch.

me got list - ugh!

 
   Over all, your post seems to address topics which, as far as I know,
 are
   mostly solved, so I'm

Re: RTF, nested tables, context enhancement - status

2006-05-30 Thread Jeremias Maerki
It would be good if we could migrate this thread to fop-dev. Please
subscribe there if you're not already. Thanks.

On 30.05.2006 08:13:50 b.ohnsorg wrote:
 
 - original Nachricht 
 
 Betreff: Re: RTF, nested tables, context enhancement - status
 Gesendet: Fr 26 Mai 2006 16:42:13 CEST
 Von: Jeremias Maerki[EMAIL PROTECTED]
 
  
  On 23.05.2006 08:02:50 b.ohnsorg wrote:
   
   - original Nachricht 
   
   Betreff: Re: RTF, nested tables, context enhancement - status
   Gesendet: Mo 22 Mai 2006 22:30:07 CEST
   Von: Jeremias Maerki[EMAIL PROTECTED]
   
page-number-citation and page numbering: Both work to at least a
  certain
degree. Can you elaborate on the problems you're seeing?
   RTF does not know any «citation», so you need to write a character '2',
   if you want a link to display a page information refering to page '2'.
   For example an index at the end of the document does not know, which
   page explains «Barcode» and all links will show a question mark.
  
  That's not correct. RTF has support for page number citations. You
  should take a lot at the RTF specification and create an RTF file in MS
  Word with references so you see how this should be implemented.
 If someone has a reliable resource, let me know. I don't own any
 Micros~1 component and took OpenOffice-documents. They contain fields
 like MS-Office, but page numbers are hard coded. (tried cross
 referencing and TOC-mechanism) 

You can get it from M$:
RTF 1.7 (MS Word 2002):
http://www.microsoft.com/downloads/details.aspx?FamilyID=e5b8ebc2-6ad6-49f0-8c90-e4f763e3f04fDisplayLang=en
(that's what I currently use)

RTF 1.8 (MS Word 2003):
http://www.microsoft.com/downloads/details.aspx?familyid=AC57DE32-17F0-4B46-9E4E-467EF9BC5540displaylang=en

Don't forget to have an MS Word (or Word Viewer) installed somewhere.
The specification alone doesn't really help. OpenOffice is useless for
this.

  
   I thought about that last night (while cruising through the
   PDF-renderer to compare my table width «guessing» with the already
  implemented one)
   and draw the following conclusion:
   
   Fact 1: RTF knows page breaks
  
  Wrong. RTF can contain hard page breaks but doesn't have to.
 That's what I say: «knows page breaks» (and it's obvious, that I mean 
 Insert-Page break-«Manual page break»)

So, we're talking a completely different language. :-( Someone want to
write a FOP glossary? ;-)

  
   Fact 2: RTF is page-oriented
  
  Wrong. RTF is a flow-oriented format. Microsoft Word is responsible for
  the page break decisions unless you add hard page breaks.
 Same intention, other words. Mkay, it's a «stream» of format attributes,
 but it's intended to render to pages, therefore you may insert a page
 break. If it would be a drawing program, intended to render to large
 sheets of paper to stick to walls, it would not include page breaking.
 Depends on point of view. (You may set the page size to any value one
 may think of, mkay, but that's not the point. Another example would be
 an index or cross reference with the page number appended)
 
  
   Fact 3: RTF aligns elements absolute
  
  Yes.
  
   Fact 4: RTF does not know anything about the document's structuring
  
  Depends on what you mean by structure. RTF supports styles which allow
  some kind of structuring.
 Telling a block of text to be Arial, 22pt, underlined and bold is not
 structure, but formatting. You may assume that this is heading level 1
 but it could also be a capitalized letter or dadaistic poem. What I
 mean with structure is something like: h1May there be light/h2, so it's a
 heading, nothing else (I exclude mistakenly used tags by HTML-beginners).

Again the language problem maybe. My styles refers not only to the
group of formatting attributes but also to the attributes indicating
whether the heading level. However, it seems this is not written down in
the RTF specification. If you look at a Word-generated RTF file you'll
see the private attribute soutlvl, for example. While this may not be
structure by itself but at least an indication of structure. But the
argument probably makes no sense as XSL-FO doesn't support structure,
either. You can't distinguish a header from a normal paragraph.

  
   Conclusion: RTF differs not that much from PDF-rendering (only written
  tokens are a bit different)
   - Use the PDF-layout management system and add a RTF-writer. So RTF
   would look exactly like all PDFs. And there's only a slight difference
   from Office-generated RTFs: page breaks after every page.
  
  That would defeat the purpose of the current RTF output support. The
  clue is that you can edit the generated file and then print it. If you
  predefine the page breaks you make the editing a lot harder for the
  end-user. If you don't need RTF editing, then don't generate RTF because
  PDF offers much better quality.
 
 And that's what I wanted to read: There's no need for all that TOC-ing,
 page number citation and whatsoever

Re: RTF, nested tables, context enhancement - status

2006-05-26 Thread Jeremias Maerki

On 23.05.2006 08:02:50 b.ohnsorg wrote:
 
 - original Nachricht 
 
 Betreff: Re: RTF, nested tables, context enhancement - status
 Gesendet: Mo 22 Mai 2006 22:30:07 CEST
 Von: Jeremias Maerki[EMAIL PROTECTED]
 
  page-number-citation and page numbering: Both work to at least a certain
  degree. Can you elaborate on the problems you're seeing?
 RTF does not know any «citation», so you need to write a character '2',
 if you want a link to display a page information refering to page '2'.
 For example an index at the end of the document does not know, which
 page explains «Barcode» and all links will show a question mark.

That's not correct. RTF has support for page number citations. You
should take a lot at the RTF specification and create an RTF file in MS
Word with references so you see how this should be implemented.

 I thought about that last night (while cruising through the
 PDF-renderer to compare my table width «guessing» with the already 
 implemented one)
 and draw the following conclusion:
 
 Fact 1: RTF knows page breaks

Wrong. RTF can contain hard page breaks but doesn't have to.

 Fact 2: RTF is page-oriented

Wrong. RTF is a flow-oriented format. Microsoft Word is responsible for
the page break decisions unless you add hard page breaks.

 Fact 3: RTF aligns elements absolute

Yes.

 Fact 4: RTF does not know anything about the document's structuring

Depends on what you mean by structure. RTF supports styles which allow
some kind of structuring.

 Conclusion: RTF differs not that much from PDF-rendering (only written tokens 
 are a bit different)
 - Use the PDF-layout management system and add a RTF-writer. So RTF
 would look exactly like all PDFs. And there's only a slight difference
 from Office-generated RTFs: page breaks after every page.

That would defeat the purpose of the current RTF output support. The
clue is that you can edit the generated file and then print it. If you
predefine the page breaks you make the editing a lot harder for the
end-user. If you don't need RTF editing, then don't generate RTF because
PDF offers much better quality.

  
  What's exactly the problem with graphics? We've got pretty good support
  for many cases, even SVG now.
 I was only talking about the current RTF export. AFAIK GIFs are not rendered 
 (there was an annoying exception string inside the document).
 
  GIF: The LZW patents are expired but still it's a good idea to forget
  GIF. The legal side is still less than clear.
 But PDF «understands» GIFs, so I can use them. Should be the same with
 RTF, and the only thing I'll do is reading them and transform into a
 RTF-compatible format...if it's not legal I won't do this...

PDF doesn't understand GIFs. Images are converted from GIF to a generic
bitmap format supported by PDF. In the case of RTF output this code
hasn't been written, yet. Patches welcome.

  RTF does support referenced images.
 So there ought to be a switch (render to the document, use references)

...if someone implements that switch.

  Over all, your post seems to address topics which, as far as I know, are
  mostly solved, so I'm not really sure what you're after. Maybe if you'd
  show the problems you're trying to address with examples
 I only wanted to check any progress, maybe I forgot some important fact
 or any discussion. But the referencing of page-number-citation is one
 problem (we already discussed this in February), nesting tables,
 indenting...everything which needs structures inside structures (a
 block inside a block, a table inside a block, a block inside a table...)

You can see that there is still some work ahead. Some features will
probably never be implemented because RTF is such a crappy format. I
doubt we will ever get 100% functionality on nested tables.

  Anyway, we're looking forward to any patches against FOP SVN Trunk you
  might have. At any rate, you'd have problems committing anything to the
  repository without write access. That privilege will need to be earned
  first. ;-)
 I know, therefore the review-strategy and «restructuring» the
 RTF-renderer will break it for a while (compiles, but does not produce
 any usable document).

Before you take the RTF handler (not renderer!) apart you should
understand RTF first. It seems you have a few things to catch up. As I
said above, converting the RTFHandler into a Renderer defeats the
purpose of the RTF format in the first place.

 By now it's just transforming FOP-events to  RTF-tags and I ran into
 problems which were solved with the new  FOP-code (more effective
 memory usage) but seemed to be necessary (e.g. a table's
 width can only be determined if all columns were parsed, therefore you
 need to save previous column objects and by nesting tables it gets
 quite heavy [tried to experiment a bit with auto-table-layout, so you need all
 the rows, too])

Yes, the layout engine (for page-oriented documents) contains code that
more or less needs to be duplicated for RTF. Maybe some

Re: Re: RTF, nested tables, context enhancement - status

2006-05-23 Thread b . ohnsorg

- original Nachricht 

Betreff: Re: RTF, nested tables, context enhancement - status
Gesendet: Mo 22 Mai 2006 22:30:07 CEST
Von: Jeremias Maerki[EMAIL PROTECTED]

 page-number-citation and page numbering: Both work to at least a certain
 degree. Can you elaborate on the problems you're seeing?
RTF does not know any «citation», so you need to write a character '2', if you 
want a link to display a page information refering to page '2'. For example an 
index at the end of the document does not know, which page explains «Barcode» 
and all links will show a question mark.

I thought about that last night (while cruising through the PDF-renderer to 
compare my table width «guessing» with the already implemented one) and draw 
the following conclusion:

Fact 1: RTF knows page breaks
Fact 2: RTF is page-oriented
Fact 3: RTF aligns elements absolute
Fact 4: RTF does not know anything about the document's structuring

Conclusion: RTF differs not that much from PDF-rendering (only written tokens 
are a bit different)
- Use the PDF-layout management system and add a RTF-writer. So RTF would look 
exactly like all PDFs. And there's only a slight difference from 
Office-generated RTFs: page breaks after every page.

 
 What's exactly the problem with graphics? We've got pretty good support
 for many cases, even SVG now.
I was only talking about the current RTF export. AFAIK GIFs are not rendered 
(there was an annoying exception string inside the document).

 GIF: The LZW patents are expired but still it's a good idea to forget
 GIF. The legal side is still less than clear.
But PDF «understands» GIFs, so I can use them. Should be the same with RTF, and 
the only thing I'll do is reading them and transform into a RTF-compatible 
format...if it's not legal I won't do this...

 RTF does support referenced images.
So there ought to be a switch (render to the document, use references)

 Over all, your post seems to address topics which, as far as I know, are
 mostly solved, so I'm not really sure what you're after. Maybe if you'd
 show the problems you're trying to address with examples
I only wanted to check any progress, maybe I forgot some important fact or any 
discussion. But the referencing of page-number-citation is one problem (we 
already discussed this in February), nesting tables, indenting...everything 
which needs structures inside structures (a block inside a block, a table 
inside a block, a block inside a table...)

 Anyway, we're looking forward to any patches against FOP SVN Trunk you
 might have. At any rate, you'd have problems committing anything to the
 repository without write access. That privilege will need to be earned
 first. ;-)
I know, therefore the review-strategy and «restructuring» the RTF-renderer will 
break it for a while (compiles, but does not produce any usable document). By 
now it's just transforming FOP-events to RTF-tags and I ran into problems which 
were solved with the new FOP-code (more effective memory usage) but seemed to 
be necessary (e.g. a table's width can only be determined if all columns were 
parsed, therefore you need to save previous column objects and by nesting 
tables it gets quite heavy [tried to experiment a bit with auto-table-layout, 
so you need all the rows, too])





--
Verlieben Sie sich mit Deutschlands größter Partneragentur
für langfristige Beziehungen. Machen Sie den ersten Schritt.
www.freenet.parship.de/?source=webmailfooter


-
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]



Re: RTF, nested tables, context enhancement - status

2006-05-22 Thread Jeremias Maerki
You're right: RTF is probably not the most important part of FOP for
everyone, but for some people it is. Everyone has a chance to help
improve the parts he/she is most interested in. That's the cool thing
about open source.

I'm a little confused about your post. Let's look at the details:

page-number-citation and page numbering: Both work to at least a certain
degree. Can you elaborate on the problems you're seeing?

What's exactly the problem with graphics? We've got pretty good support
for many cases, even SVG now.

GIF: The LZW patents are expired but still it's a good idea to forget
GIF. The legal side is still less than clear.

RTF does support referenced images.

Over all, your post seems to address topics which, as far as I know, are
mostly solved, so I'm not really sure what you're after. Maybe if you'd
show the problems you're trying to address with examples

Anyway, we're looking forward to any patches against FOP SVN Trunk you
might have. At any rate, you'd have problems committing anything to the
repository without write access. That privilege will need to be earned
first. ;-)

On 22.05.2006 16:56:11 b.ohnsorg wrote:
 Hi there,
 
 I was quite busy with other tasks and followed the mailing list only if
 there were RTF-questions. I'm still working on that RTF-thingy, to get
 it working again (local *g*) and there'll be some major changes and
 serious limitations. Page numbering, linking and tables for layout won't
 work for fancy formatting. I assume, that RTF is not the most important
 part of FOP and only for post processing with Micros~1-tools.
 
 If there are other opinions, let me know, before planning major changes.
 Basically RTF is a bastard between outlining and streaming format,
 that's why some neet features cannot be done (page-number-citation,
 pages are rendered in the viewer, not in the document) and I'll remove
 that code (additional warning-option can be switched on). Nested tables
 (and graphics) are definitely hard work. All graphics will be embedded
 bitmaps (it should be possible to convert GIFs, any copyright issues? I
 won't write GIF images but convert them like PNG-files) and maybe
 there's an refer-option, I'll have to find out, if RTF supports
 referenced images (another additional parameter).
 
 Nested tables will be a quite dirty hack. RTF does not support nested
 tables and there must be a transformation from nested tables to a table
 with additional columns. Another interesting fact is RTF's limitation to
 fixed widths, no proportional widths are allowed. Here I'll have to
 insert some code (maybe some remember the nested PercentageContexts)
 which basically does the same as PDF's LayoutManager (talked about more
 abstract PercentageContext being able to have a parent to be able to
 calculate remaing width, indenting and margins up to this context). I
 also found some comments like «maybe configurable?» with some
 Office-version-dependant corrections. I'll add some more configuration
 parameters for RTF rendering.
 
 I dunno when this will be finished and in status «testing», but end of
 June is quite realistic. So there's no necessity to branch anything (and
 merge again). And I won't commit it to the repository but forward it as
 tar.gz to others who want to test it (kinda review).


Jeremias Maerki


-
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]