RE: DITA-OT and FOP

2014-04-01 Thread Jan Tosovsky
On 2014-03-30 Ron Wheeler wrote:
 On 30/03/2014 5:19 PM, Jan Tosovsky wrote:
  
  I personally think it would be much easier to attract developers 
  to create a completely new paged media CSS engine than to add 
  several niche XSL-FO features into FOP. But when mentioning the 
  new engine, I don't think it is a good idea. Sooner or later 
  these CSS features will be adopted by major browsers and in that 
  time it won't be necessary to produce PDFs at all :-)

 The supposes that paper documentation will disappear. There are
 regulatory issues, industry practices, etc. that need to change.
 There will still be face to face meetings where someone wants to hand a
 piece of paper to someone for the next few years.

Slightly off-topic, but a short comment...

I didn't mean paper-less way, but http://en.wikipedia.org/wiki/MHTML way, where 
all dependencies are embedded in a single file which you can open in a browser 
(it acts as PDF, but it is HTML/CSS based).

Jan



Re: DITA-OT and FOP

2014-04-01 Thread Ron Wheeler

On 01/04/2014 1:33 PM, Jan Tosovsky wrote:

On 2014-03-30 Ron Wheeler wrote:

On 30/03/2014 5:19 PM, Jan Tosovsky wrote:

I personally think it would be much easier to attract developers
to create a completely new paged media CSS engine than to add
several niche XSL-FO features into FOP. But when mentioning the
new engine, I don't think it is a good idea. Sooner or later
these CSS features will be adopted by major browsers and in that
time it won't be necessary to produce PDFs at all :-)

The supposes that paper documentation will disappear. There are
regulatory issues, industry practices, etc. that need to change.
There will still be face to face meetings where someone wants to hand a
piece of paper to someone for the next few years.

Slightly off-topic, but a short comment...

I didn't mean paper-less way, but http://en.wikipedia.org/wiki/MHTML way, where 
all dependencies are embedded in a single file which you can open in a browser 
(it acts as PDF, but it is HTML/CSS based).

Jan


Good point. PDF is not the only form that moves nicely to paper. Issues 
like signatures and version control are still open issues for me.


DITA is already ahead in this area with a single set of source documents 
being able to be processed into PDF, HTML and other output formats.


My feeling is that bringing this standard XML source document format 
with its editing tools into XMLGraphics will accelerate the development 
of better processes for producing these new formats and put the 
XMLGraphics tools into the mainstream of document production in for 
organizations that want a process that is supported  by standards and a 
strong technical open source organization such as Apache.


Ron

--
Ron Wheeler
President
Artifact Software Inc
email: rwhee...@artifact-software.com
skype: ronaldmwheeler
phone: 866-970-2435, ext 102



RE: DITA-OT and FOP

2014-03-30 Thread Jan Tosovsky
On 2014-03-26 Ron Wheeler wrote:
 On 26/03/2014 2:59 PM, Jan Tosovsky wrote:
  On 2014-03-26 Christopher R. Maden wrote:
  Although I don’t get a vote, I completely agree with Glenn that DITA
  integration into FOP is completely inappropriate.
  +1
 
  FOP consumes standardized XSL-FO.
 
  DITA-OT should produce standardized XSL-FO. Period ;-)

 It does but FOP does not support all PDF features so some of the things
 that people can express in DITA's XML can not produce the PDF output
 that people want/need.
 

There are two possible interpretations:
(1) user's intent cannot be achieved by XSL-FO
(2) user's intent can be achieved by XSL-FO, but the given functionality is not 
supported by FOP

Anyway, it would be nice to collect these requests (wish list) and create a 
poll targetting DITA community to get real demand for them. If there is 
something FOP related, FOP devs could roughly estimate mandays (and thereof 
costs). The final part is to find sponsors or involve crowdfunding.

But keep in mind that XSL-FO is loosing its attraction. Its standardization has 
been discontinued and no one can expect new features in the near future. I 
strongly suggest joining already mentioned W3C PPL group 
http://www.w3.org/community/ppl/ which is trying to move these things forward.

Standardization has moved primarily into the CSS. There are many proposals 
trying to enhance this standard with paged media related stuff. It is clear 
that this is considered as the future. Check also this article:
http://alistapart.com/article/building-books-with-css3

I personally think it would be much easier to attract developers to create a 
completely new paged media CSS engine than to add several niche XSL-FO features 
into FOP. But when mentioning the new engine, I don't think it is a good idea. 
Sooner or later these CSS features will be adopted by major browsers and in 
that time it won't be necessary to produce PDFs at all :-) 

PDF engines (XSL-FO, CSS) will survive only if they offer any added value. I 
can think of an export into other formats (PS, AFP), a sophisticated index 
rendering (XSL-FO 1.1) or microtypography features 
http://en.wikipedia.org/wiki/Microtypography (not covered by standards yet). 
But AFAIK the latter would require major FOP redesign...

Jan




Re: DITA-OT and FOP

2014-03-30 Thread Ron Wheeler
These are great ideas and I think that getting the DITA community which 
is skilled at document management and document authoring connected with 
the people here who are looking at these issues from a technical point 
of view could energize the modernization of document production.
DITA provides an authoring language that is independent of the output 
format and media. The language is XML so the processing technology has 
to start with XML but could have the output in any form (currently HTML 
and PDF but there is no theoretical restrictions).


More comments in-line.

On 30/03/2014 5:19 PM, Jan Tosovsky wrote:

On 2014-03-26 Ron Wheeler wrote:

On 26/03/2014 2:59 PM, Jan Tosovsky wrote:

On 2014-03-26 Christopher R. Maden wrote:

Although I don’t get a vote, I completely agree with Glenn that DITA
integration into FOP is completely inappropriate.

+1

FOP consumes standardized XSL-FO.

DITA-OT should produce standardized XSL-FO. Period ;-)

It does but FOP does not support all PDF features so some of the things
that people can express in DITA's XML can not produce the PDF output
that people want/need.


There are two possible interpretations:
(1) user's intent cannot be achieved by XSL-FO
(2) user's intent can be achieved by XSL-FO, but the given functionality is not 
supported by FOP

Another possible reason
(3) the DITA-OT developers have not been able how to build the XSLT to 
get the required XSL-FO.


I think that it will take a collaboration between the DITA-OT developers 
and the developers of FOP to determine which features are blocked by 
each of these reasons.



Anyway, it would be nice to collect these requests (wish list) and create a 
poll targetting DITA community to get real demand for them. If there is 
something FOP related, FOP devs could roughly estimate mandays (and thereof 
costs). The final part is to find sponsors or involve crowdfunding.

But keep in mind that XSL-FO is loosing its attraction. Its standardization has 
been discontinued and no one can expect new features in the near future. I 
strongly suggest joining already mentioned W3C PPL group 
http://www.w3.org/community/ppl/ which is trying to move these things forward.
I can not imagine that PDF will disappear in the intermediate term but 
mobile is likely to shift the emphasis on DITA-HTML and CSS.

Standardization has moved primarily into the CSS. There are many proposals 
trying to enhance this standard with paged media related stuff. It is clear 
that this is considered as the future. Check also this article:
http://alistapart.com/article/building-books-with-css3




I personally think it would be much easier to attract developers to create a 
completely new paged media CSS engine than to add several niche XSL-FO features 
into FOP. But when mentioning the new engine, I don't think it is a good idea. 
Sooner or later these CSS features will be adopted by major browsers and in 
that time it won't be necessary to produce PDFs at all :-)
The supposes that paper documentation will disappear. There are 
regulatory issues, industry practices, etc. that need to change.
There will still be face to face meetings where someone wants to hand a 
piece of paper to someone for the next few years.

PDF engines (XSL-FO, CSS) will survive only if they offer any added value. I 
can think of an export into other formats (PS, AFP), a sophisticated index 
rendering (XSL-FO 1.1) or microtypography features 
http://en.wikipedia.org/wiki/Microtypography (not covered by standards yet). 
But AFAIK the latter would require major FOP redesign...
Collaboration with the largest authoring community might help keep the 
XMLGraphics group at the forefront of these ideas.

Jan






--
Ron Wheeler
President
Artifact Software Inc
email: rwhee...@artifact-software.com
skype: ronaldmwheeler
phone: 866-970-2435, ext 102



Re: DITA-OT and FOP

2014-03-28 Thread Ron Wheeler

On 27/03/2014 6:26 PM, Vincent Hennebert wrote:

(CC-ing to general@ as it’s the place where I believe the conversation
should take place.)

I too would be in favour of welcoming DITA-OT as a sub-project of XML
Graphics. I think it matches rather well the purpose of this project.

A few comments:

On 27/03/14 18:32, Ron Wheeler wrote:

I think that you are right about being a sister project to FOP within
XMLGraphics.

The discussion from everyone has been very helpful even if the writer 
was

negative to the idea.

//I would be happy to see some interaction in a number of areas.
a) Clearly, the DITA group should be pushing for the FOP features 
that are

required to produce the documents that can be described in DITA.


Pushing for having some features implemented is one thing; actually
implementing them is another. Correct me if I’m wrong, but I don’t
believe the skill sets of a DITA developer and a FOP developer
intersect, do they?
The DITA-OT uses XSLT to transform DITA xml to other forms so there is a 
certain amount of XSLT expertise.
The DITA-OT includes a Maven plug-in and some Ant so there is expertise 
in this area.
The biggest contribution might be at the specification stage and if you 
can get some DITA users interested in the other XMLGraphics actvities, 
you might get some help in documentation since most of the users are 
Technical Writers by profession.


So IIUC the goal of this would be to attract sponsors who use the
DITA-OT to fund specific developments? 


I think that there is pent-up demand for customization and improvements 
to FOP.



Or are DITA developers willing to
learn what is necessary to contribute new features to FOP?
I can not speak to that since I do not know the developers that well to 
comment on the range of their abilities and interests nor do I know what 
is required to make any specific upgrade to FOP.
I suspect that it will depend on the difficulty and the technology 
involved in the specific feature.
They certainly can provide input on functional details and provide test 
suites to test the new features as incorporated into DITA-OT.


It may also attract some of the companies that develop portals, CMSs and 
editors for DITA (proprietary and open source) that see the XMLGraphics 
group as a place to acquire big chunks of functionality for their projects.
They may have an interest in participating in the XMLGraphics 
development efforts in order to get features that they need or to 
contribute functionality that they have developed to existing projects.
They may also want to draw on the expertise here to find better ways to 
process XML in their products.





b)I think that the current Apache members could make positive 
suggestions

about the processing and structure of the DITA toolkit.


Again, because of the different skill sets you might have put your hopes
a bit high I’m afraid.

I have a lot of respect for the Apache processes and products and have 
used FOP for quite a few years in a very minor way.
I suspect that there are people here that are very good at XSLT and 
software architecture in general.




c) The DITA language specification is constantly undergoing updates 
and there
are likely things that current FOP users could suggest as additions 
to allow

them to use a standard language to author documents.


FWIW you might want to become involved in the W3C Print ang Page Layout
Community Group:
http://www.w3.org/community/ppl/



d) The users of the DITA tool kit need some customization of FOP
configuration  to produce nice looking documents and there is likely 
a lot of

experts here that could be engaged to do these projects.


We can certainly increase the collaboration between the two projects,
although I don’t think becoming part of XML Graphics is a requirement
for that. Just out of curiosity, what are your reasons for wanting to
become part of Apache?

This is a bit of a personal exploration and anything that I find needs 
to go back to the DITA-OT group as a suggestion for further 
investigation by the key people.
I am a minor user of DITA (2 small projects - one internal software 
manual and one to create a study for a client on modernizing their 
Learning and Development processes). Apache software is an integral part 
of everything that we develop.
I see a lot of frustration in the DITA group about the limitations of 
DITA production caused by features missing from FOP (perhaps some are 
just misunderstandings about how to output XSL-FO in a way that will get 
the job done).


DITA has a huge community of writers and documentation managers who are 
committed to producing reusable documentation using XML.
It seems to me that putting these two communities together on the 
technical side would have a lot of mutual advantages.
The document writers and front-end people who are using FOP directly 
might find that the DITA community has a lot of the answers for their 
document management questions and the DITA community would benefit from 
a 

Re: DITA-OT and FOP

2014-03-27 Thread Ron Wheeler
I think that you are right about being a sister project to FOP within 
XMLGraphics.


The discussion from everyone has been very helpful even if the writer 
was negative to the idea.


//I would be happy to see some interaction in a number of areas.
a) Clearly, the DITA group should be pushing for the FOP features that 
are required to produce the documents that can be described in DITA.
b)I think that the current Apache members could make positive 
suggestions about the processing and structure of the DITA toolkit.
c) The DITA language specification is constantly undergoing updates and 
there are likely things that current FOP users could suggest as 
additions to allow them to use a standard language to author documents.
d) The users of the DITA tool kit need some customization of FOP 
configuration  to produce nice looking documents and there is likely a 
lot of experts here that could be engaged to do these projects.


Ron

On 27/03/2014 12:18 PM, Chris Bowditch wrote:

Hi Ron,

Thanks for your e-mail. I can certainly see why joining the 2 
communities seems like a good idea to help improve funding for FOP, 
and deliver the features DITA requires. I'd certainly be interested in 
seeing what the list of missing XSL-FO features are from a DITA 
perspective.


The challenge with joining multiple communities together is that it 
makes the merged project too complex and it scares users/developers 
away. This is a problem faced by FOP already. The codebase is so vast 
and complex, many people struggle to get to grips with it.


I would be against DITA and FOP merging for this reason, plus the 
separation of concerns already stated by others. That said, if DITA 
wanted to join Apache, adding DITA as a sub project of XMLGraphics 
would seem like a logical place for the DITA community. That way there 
would be some links between the 2 projects and an oversight of both 
from a common committee. This should help to improve interoperability 
a little although perhaps not to the extent you had hoped for.


Thanks,

Chris

On 26/03/2014 14:42, Ron Wheeler wrote:
The DITA-OT (DITA Open Toolkit) (http://www.ditaopentoolkit.org/) is 
an open source project that depends on FOP.
It takes DITA 
(http://en.wikipedia.org/wiki/Darwin_Information_Typing_Architecture) 
XML input and produces a number of different document types.
One of the main output types is PDF and it uses FOP to do this. It 
takes DITA input (xml) and produces an intermediate set of files that 
are processed by FOP to produce a PDF. It can also produce HTML
There is a Maven plug-in to control the production in an IDE 
environment. It calls all the bits and pieces required to take the 
raw DITA XML files and output a document in PDF in the target folder.


There is some interest in adding DITA-OT to the Apache family and in 
my opinion, the XMLGraphics group seems like a natural home and the 
FOP sub-project might be a good place for DITA-OT to reside.


DITA-OT is a large community of users but a small community of 
developers.
There are also a few enhancement ideas that would require FOP 
enhancements to complete.
In addition, my own belief is that the FOP community could add some 
technical advice to the DITA-OT community that would be helpful.


http://sourceforge.net/projects/dita-ot/ is the download page. 
DITA-OT is currently distributed under the Apache License V2.0.


I am not one of the main players in the group but have taken on the 
task of seeing if there is a possibility of opening the discussion 
with the FOP group.


Would there be any interest in considering adding front-end XML 
processing to the FOP production project?


Ron







--
Ron Wheeler
President
Artifact Software Inc
email: rwhee...@artifact-software.com
skype: ronaldmwheeler
phone: 866-970-2435, ext 102



DITA-OT and FOP

2014-03-26 Thread Ron Wheeler
The DITA-OT (DITA Open Toolkit) (http://www.ditaopentoolkit.org/) is an 
open source project that depends on FOP.
It takes DITA 
(http://en.wikipedia.org/wiki/Darwin_Information_Typing_Architecture) 
XML input and produces a number of different document types.
One of the main output types is PDF and it uses FOP to do this. It takes 
DITA input (xml) and produces an intermediate set of files that are 
processed by FOP to produce a PDF. It can also produce HTML
There is a Maven plug-in to control the production in an IDE 
environment. It calls all the bits and pieces required to take the raw 
DITA XML files and output a document in PDF in the target folder.


There is some interest in adding DITA-OT to the Apache family and in my 
opinion, the XMLGraphics group seems like a natural home and the FOP 
sub-project might be a good place for DITA-OT to reside.


DITA-OT is a large community of users but a small community of developers.
There are also a few enhancement ideas that would require FOP 
enhancements to complete.
In addition, my own belief is that the FOP community could add some 
technical advice to the DITA-OT community that would be helpful.


http://sourceforge.net/projects/dita-ot/ is the download page. DITA-OT 
is currently distributed under the Apache License V2.0.


I am not one of the main players in the group but have taken on the task 
of seeing if there is a possibility of opening the discussion with the 
FOP group.


Would there be any interest in considering adding front-end XML 
processing to the FOP production project?


Ron

--
Ron Wheeler
President
Artifact Software Inc
email: rwhee...@artifact-software.com
skype: ronaldmwheeler
phone: 866-970-2435, ext 102



Re: DITA-OT and FOP

2014-03-26 Thread Glenn Adams
On Wed, Mar 26, 2014 at 8:42 AM, Ron Wheeler rwhee...@artifact-software.com
 wrote:

 The DITA-OT (DITA Open Toolkit) (http://www.ditaopentoolkit.org/) is an
 open source project that depends on FOP.
 It takes DITA (http://en.wikipedia.org/wiki/Darwin_Information_Typing_
 Architecture) XML input and produces a number of different document types.
 One of the main output types is PDF and it uses FOP to do this. It takes
 DITA input (xml) and produces an intermediate set of files that are
 processed by FOP to produce a PDF. It can also produce HTML
 There is a Maven plug-in to control the production in an IDE environment.
 It calls all the bits and pieces required to take the raw DITA XML files
 and output a document in PDF in the target folder.

 There is some interest in adding DITA-OT to the Apache family and in my
 opinion, the XMLGraphics group seems like a natural home and the FOP
 sub-project might be a good place for DITA-OT to reside.

 DITA-OT is a large community of users but a small community of developers.
 There are also a few enhancement ideas that would require FOP enhancements
 to complete.
 In addition, my own belief is that the FOP community could add some
 technical advice to the DITA-OT community that would be helpful.

 http://sourceforge.net/projects/dita-ot/ is the download page. DITA-OT is
 currently distributed under the Apache License V2.0.

 I am not one of the main players in the group but have taken on the task
 of seeing if there is a possibility of opening the discussion with the FOP
 group.

 Would there be any interest in considering adding front-end XML processing
 to the FOP production project?


FOP already has a front-end XML processor. It's called XSLT. It is provided
solely as a convenience function. Some of us have doubts about whether that
was a good idea or not, since it tends to generate a lot of traffic
unrelated to FOP's core function.

Why do you want *more* front end processing and why isn't XSLT sufficient?




 Ron

 --
 Ron Wheeler
 President
 Artifact Software Inc
 email: rwhee...@artifact-software.com
 skype: ronaldmwheeler
 phone: 866-970-2435, ext 102




Re: DITA-OT and FOP

2014-03-26 Thread Ron Wheeler

On 26/03/2014 11:46 AM, Glenn Adams wrote:



On Wed, Mar 26, 2014 at 8:42 AM, Ron Wheeler 
rwhee...@artifact-software.com 
mailto:rwhee...@artifact-software.com wrote:


The DITA-OT (DITA Open Toolkit) (http://www.ditaopentoolkit.org/)
is an open source project that depends on FOP.
It takes DITA
(http://en.wikipedia.org/wiki/Darwin_Information_Typing_Architecture)
XML input and produces a number of different document types.
One of the main output types is PDF and it uses FOP to do this. It
takes DITA input (xml) and produces an intermediate set of files
that are processed by FOP to produce a PDF. It can also produce HTML
There is a Maven plug-in to control the production in an IDE
environment. It calls all the bits and pieces required to take the
raw DITA XML files and output a document in PDF in the target folder.

There is some interest in adding DITA-OT to the Apache family and
in my opinion, the XMLGraphics group seems like a natural home and
the FOP sub-project might be a good place for DITA-OT to reside.

DITA-OT is a large community of users but a small community of
developers.
There are also a few enhancement ideas that would require FOP
enhancements to complete.
In addition, my own belief is that the FOP community could add
some technical advice to the DITA-OT community that would be helpful.

http://sourceforge.net/projects/dita-ot/ is the download page.
DITA-OT is currently distributed under the Apache License V2.0.

I am not one of the main players in the group but have taken on
the task of seeing if there is a possibility of opening the
discussion with the FOP group.

Would there be any interest in considering adding front-end XML
processing to the FOP production project?


FOP already has a front-end XML processor. It's called XSLT. It is 
provided solely as a convenience function. Some of us have doubts 
about whether that was a good idea or not, since it tends to generate 
a lot of traffic unrelated to FOP's core function.


Why do you want *more* front end processing and why isn't XSLT sufficient?


More means a set of XSLT stylesheets that validates a standard 
language called DITA (Docbook supported as well) and transforms it to 
FOP input (or HTML).


You do raise a good point about the user group traffic and splitting the 
conversation between FOP concerns and the more diverse discussions about 
how to get FOP input from the various sources of text and graphics.


The addition of the DITA group to the project might relieve(or redirect) 
some of that traffic.
It will, at least, add quite a few people to the project who are used to 
dealing with the front-end requirements of creating the original DITA 
XML with editors, converting non-XML source text to XML and writing XSLT 
stylesheets, who can respond to questions that are more about 
documentation management than document construction.


Ron



--
Ron Wheeler
President
Artifact Software Inc
email: rwhee...@artifact-software.com
skype: ronaldmwheeler
phone: 866-970-2435, ext 102



Re: DITA-OT and FOP

2014-03-26 Thread Glenn Adams
On Wed, Mar 26, 2014 at 10:14 AM, Ron Wheeler 
rwhee...@artifact-software.com wrote:

  On 26/03/2014 11:46 AM, Glenn Adams wrote:



 On Wed, Mar 26, 2014 at 8:42 AM, Ron Wheeler 
 rwhee...@artifact-software.com wrote:

 The DITA-OT (DITA Open Toolkit) (http://www.ditaopentoolkit.org/) is an
 open source project that depends on FOP.
 It takes DITA (
 http://en.wikipedia.org/wiki/Darwin_Information_Typing_Architecture) XML
 input and produces a number of different document types.
 One of the main output types is PDF and it uses FOP to do this. It takes
 DITA input (xml) and produces an intermediate set of files that are
 processed by FOP to produce a PDF. It can also produce HTML
 There is a Maven plug-in to control the production in an IDE environment.
 It calls all the bits and pieces required to take the raw DITA XML files
 and output a document in PDF in the target folder.

 There is some interest in adding DITA-OT to the Apache family and in my
 opinion, the XMLGraphics group seems like a natural home and the FOP
 sub-project might be a good place for DITA-OT to reside.

 DITA-OT is a large community of users but a small community of developers.
 There are also a few enhancement ideas that would require FOP
 enhancements to complete.
 In addition, my own belief is that the FOP community could add some
 technical advice to the DITA-OT community that would be helpful.

 http://sourceforge.net/projects/dita-ot/ is the download page. DITA-OT
 is currently distributed under the Apache License V2.0.

 I am not one of the main players in the group but have taken on the task
 of seeing if there is a possibility of opening the discussion with the FOP
 group.

 Would there be any interest in considering adding front-end XML
 processing to the FOP production project?


  FOP already has a front-end XML processor. It's called XSLT. It is
 provided solely as a convenience function. Some of us have doubts about
 whether that was a good idea or not, since it tends to generate a lot of
 traffic unrelated to FOP's core function.

  Why do you want *more* front end processing and why isn't XSLT
 sufficient?


 More means a set of XSLT stylesheets that validates a standard language
 called DITA (Docbook supported as well) and transforms it to FOP input (or
 HTML).


IMO, FOP should remain neutral to pre-FOP input formats. I'm one of those
who would like to remove the XSLT front-end functions from FOP entirely,
whereas you propose to take it further towards the pre-transformation XML
domain. So, I would vote against such a proposal.



 You do raise a good point about the user group traffic and splitting the
 conversation between FOP concerns and the more diverse discussions about
 how to get FOP input from the various sources of text and graphics.

 The addition of the DITA group to the project might relieve(or redirect)
 some of that traffic.
 It will, at least, add quite a few people to the project who are used to
 dealing with the front-end requirements of creating the original DITA XML
 with editors, converting non-XML source text to XML and writing XSLT
 stylesheets, who can respond to questions that are more about documentation
 management than document construction.


 Ron



 --
 Ron Wheeler
 President
 Artifact Software Inc
 email: rwhee...@artifact-software.com
 skype: ronaldmwheeler
 phone: 866-970-2435, ext 102




Re: DITA-OT and FOP

2014-03-26 Thread Ron Wheeler

On 26/03/2014 12:25 PM, Glenn Adams wrote:



On Wed, Mar 26, 2014 at 10:14 AM, Ron Wheeler 
rwhee...@artifact-software.com 
mailto:rwhee...@artifact-software.com wrote:


On 26/03/2014 11:46 AM, Glenn Adams wrote:



On Wed, Mar 26, 2014 at 8:42 AM, Ron Wheeler
rwhee...@artifact-software.com
mailto:rwhee...@artifact-software.com wrote:

The DITA-OT (DITA Open Toolkit)
(http://www.ditaopentoolkit.org/) is an open source project
that depends on FOP.
It takes DITA
(http://en.wikipedia.org/wiki/Darwin_Information_Typing_Architecture)
XML input and produces a number of different document types.
One of the main output types is PDF and it uses FOP to do
this. It takes DITA input (xml) and produces an intermediate
set of files that are processed by FOP to produce a PDF. It
can also produce HTML
There is a Maven plug-in to control the production in an IDE
environment. It calls all the bits and pieces required to
take the raw DITA XML files and output a document in PDF in
the target folder.

There is some interest in adding DITA-OT to the Apache family
and in my opinion, the XMLGraphics group seems like a natural
home and the FOP sub-project might be a good place for
DITA-OT to reside.

DITA-OT is a large community of users but a small community
of developers.
There are also a few enhancement ideas that would require FOP
enhancements to complete.
In addition, my own belief is that the FOP community could
add some technical advice to the DITA-OT community that would
be helpful.

http://sourceforge.net/projects/dita-ot/ is the download
page. DITA-OT is currently distributed under the Apache
License V2.0.

I am not one of the main players in the group but have taken
on the task of seeing if there is a possibility of opening
the discussion with the FOP group.

Would there be any interest in considering adding front-end
XML processing to the FOP production project?


FOP already has a front-end XML processor. It's called XSLT. It
is provided solely as a convenience function. Some of us have
doubts about whether that was a good idea or not, since it tends
to generate a lot of traffic unrelated to FOP's core function.

Why do you want *more* front end processing and why isn't XSLT
sufficient?


More means a set of XSLT stylesheets that validates a standard
language called DITA (Docbook supported as well) and transforms it
to FOP input (or HTML).


IMO, FOP should remain neutral to pre-FOP input formats. I'm one of 
those who would like to remove the XSLT front-end functions from FOP 
entirely, whereas you propose to take it further towards the 
pre-transformation XML domain. So, I would vote against such a proposal.



You do raise a good point about the user group traffic and
splitting the conversation between FOP concerns and the more
diverse discussions about how to get FOP input from the various
sources of text and graphics.

The addition of the DITA group to the project might relieve(or
redirect) some of that traffic.
It will, at least, add quite a few people to the project who are
used to dealing with the front-end requirements of creating the
original DITA XML with editors, converting non-XML source text to
XML and writing XSLT stylesheets, who can respond to questions
that are more about documentation management than document
construction.


Ron


I can certainly understand your concern about supporting the front-end 
activities but without a way to get FOP input, FOP is a niche product 
for software developers (my company's initial use for which we are 
grateful for the work that the FOP team has done).


Can you suggest a solution that would integrate the current front-end 
support with the DITA group while still tightening the link between FOP 
and DITA?


I think that the DITA-OT community is the largest user group for FOP.
If you look at LinkedIn, the DITA awareness group has 552,722 members, 
the DITA for Small Teams has 64,551 members, DITA Metrics has 74,388.

There are country-specific DITA groups with 15-20,000 members.
The Tools for Change for Publishing has 3,175,059 members.
DITA Machine Industry for technical docs for automotive and and 
machinery has 18,463 members.


DITA-OT including FOP has been downloaded 344 times this week and over 
4,100 times in the last 12 months.
How does this compare to the direct downloads of FOP from the Apache 
mirrors?


This represents a lot of opportunity to get support for FOP 
functionality improvements as well as consulting work for people who can 
configure/customize FOP to meet specific needs of corporate and 
government clients for internal and external documentation.


It might be worth the pain

Re: DITA-OT and FOP

2014-03-26 Thread Christopher R. Maden
Although I don’t get a vote, I completely agree with Glenn that DITA
integration into FOP is completely inappropriate.

I also challenge Ron’s assertion that DITA users are the biggest users
of FOP; DocBook is way up there too, along with an uncountable number of
other document types.

If the DITA team wants to turn ownership over to the Apache Foundation,
that’s one thing; there are several projects that coöperate with FOP
(and some on which FOP depends), but they are not and should not be
*part* of FOP.

“Tightening the link between FOP and DITA” is a bad thing.  Separation
of concerns makes semantic markup work and makes Free/Open Source
Software work.  DITA (and other good XML vocabularies) are all about
describing information in a presentation-independent kind of way, and
then applying one or more presentation specifications to produce output.
 The presentation layer should be separate.  While many (most?) DITA
users depend on FOP, not all do,[*] and there are certainly many FOP
users who do not need to be saddled with DITA (or DocBook, or CALS, or
TEI, or ...).

~Chris

[*] My entire experience with DITA has been around an MS-Word-centric
publication system, going into and out of Office Open XML, with no XSL
in sight.
-- 
Chris Maden, text nerd  URL: http://crism.maden.org/ 
Surround hate and force it to surrender.
GnuPG fingerprint: DB08 CF6C 2583 7F55 3BE9  A210 4A51 DBAC 5C5C 3D5E


Re: DITA-OT and FOP

2014-03-26 Thread Ron Wheeler

On 26/03/2014 2:14 PM, Christopher R. Maden wrote:

Although I don’t get a vote, I completely agree with Glenn that DITA
integration into FOP is completely inappropriate.
Thanks for taking the time to comment. I appreciate the input - voting 
or not.

I also challenge Ron’s assertion that DITA users are the biggest users
of FOP; DocBook is way up there too, along with an uncountable number of
other document types.



You certainly have reason to challenge my assertion!
I could not determine the number of downloads of FOP from Apache last 
week but FOP was downloaded 344 times last week as part of DITA-OT.


There could be other ways to measure biggest user of FOP but I could 
not find any way to determine these.

Is there some data from FOP about usage?

DITA-OT does DocBook as well but I am not sure how many DITA-OT users 
are still working with DocBook.





If the DITA team wants to turn ownership over to the Apache Foundation,


I think that this would be the result and the current license is Apache V2.


that’s one thing; there are several projects that coöperate with FOP
(and some on which FOP depends), but they are not and should not be
*part* of FOP.
Part of the issue that I think needs to be addressed is the difficulty 
in using DITA-OT caused by missing FOP functionality.
These are not getting fixed or even addressed for a number of reasons 
that relate to communication and probably resources where users of 
DITA-OT need FOP support but the people supporting DITA-OT don't have a 
connection to the FOP team  and the end-user with the financial 
resources does not see the connection between FOP and DITA-OT. They just 
get told that DITA-OT can't do X because FOP can't support it - end of 
story.


The lack of FOP consultants in the DITA-OT community also makes it 
difficult to get customization/configuration of FOP done.
This makes it difficult to get nice looking output or dynamic creation 
of documentation on demand.



“Tightening the link between FOP and DITA” is a bad thing.  Separation
of concerns makes semantic markup work and makes Free/Open Source
Software work.  DITA (and other good XML vocabularies) are all about
describing information in a presentation-independent kind of way, and
then applying one or more presentation specifications to produce output.
  The presentation layer should be separate.  While many (most?) DITA
users depend on FOP, not all do,[*] and there are certainly many FOP
users who do not need to be saddled with DITA (or DocBook, or CALS, or
TEI, or ...).


I would not suggest that the code be integrated but it would be nice to 
have a single group that can say. if you want to support X in DITA-OT 
for PDF output, then you to add Y to DITA-OT and Z to FOP and these can 
be done through the release of FOP version zzz and version yyy of DITA-OT.


One alternative would be to fork FOP and do the extensions of FOP in the 
DITA-OT group but that is not anyone's preference



There are alternatives to processing DITA.  FOP is not used in DITA-OT 
for HTML or OOxml or rtf output.


I am not sure how many of the alternative DITA processors have forked 
the Apache FOP base and integrated it into the programs. It certainly 
would give one a head start on building a proprietary DITA to PDF 
process and would likely have a lot of bits and pieces that one could 
use for other XML output.


I think that there is a lot of expertise in the XMLGraphics group that 
could make a big difference in the usefulness and functionality of 
DITA-OT just by commenting on the internal processes and reviewing the 
enhancement plans.





~Chris

[*] My entire experience with DITA has been around an MS-Word-centric
publication system, going into and out of Office Open XML, with no XSL
in sight.



--
Ron Wheeler
President
Artifact Software Inc
email: rwhee...@artifact-software.com
skype: ronaldmwheeler
phone: 866-970-2435, ext 102



RE: DITA-OT and FOP

2014-03-26 Thread Jan Tosovsky
On 2014-03-26 Christopher R. Maden wrote:
 Although I don’t get a vote, I completely agree with Glenn that DITA
 integration into FOP is completely inappropriate.

+1

FOP consumes standardized XSL-FO.

DITA-OT should produce standardized XSL-FO. Period ;-)

From my POV it has nothing to do with FOP, it is rather XSLT (or whatever) 
part.

 
But I agree FOP could be enhanced to be more conformant to the standard. It 
requires engaging developers and sponsors. Some ideas appear in this mailing 
list from time to time.

Jan




Re: DITA-OT and FOP

2014-03-26 Thread Ron Wheeler

On 26/03/2014 2:59 PM, Jan Tosovsky wrote:

On 2014-03-26 Christopher R. Maden wrote:

Although I don’t get a vote, I completely agree with Glenn that DITA
integration into FOP is completely inappropriate.

+1

FOP consumes standardized XSL-FO.

DITA-OT should produce standardized XSL-FO. Period ;-)
It does but FOP does not support all PDF features so some of the things 
that people can express in DITA's XML can not produce the PDF output 
that people want/need.


From my POV it has nothing to do with FOP, it is rather XSLT (or whatever) 
part.

  
But I agree FOP could be enhanced to be more conformant to the standard. It requires engaging developers and sponsors.


I would hope that engaging the DITA community that has the need and the 
resources will get the support for these features.
The separation into tool-making and document making communities makes it 
harder for the tool guys to get funded or staffed.

The document makers don't have the knowhow to fix the tools.
There is a lot of potential locked up in the current situation.

Some ideas appear in this mailing list from time to time.

The DITA community has not been very good at pestering the FOP team to 
get things fixed.
They have tended to accept that FOP is static and can not respond to the 
needs of the document makers.



Jan






--
Ron Wheeler
President
Artifact Software Inc
email: rwhee...@artifact-software.com
skype: ronaldmwheeler
phone: 866-970-2435, ext 102