Re: [docbook-apps] html2docbook: issue with <h2/>

2010-02-10 Thread Mathieu Malaterre
On Thu, Jan 28, 2010 at 9:43 AM, Michael Fuchs ml...@dbdoclet.org wrote:
 Hello Mathieu,

 herold produces DocBook5, which has no ulink element at all. Therefore
 the linking is based on xlink.

Thanks!

BTW, I found out that herold does not support <table/>:

<!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.0 Strict//EN"
    "http://www.w3.org/TR/xhtml1/DTD/xhtml1-strict.dtd">
<html xmlns="http://www.w3.org/1999/xhtml">
<head>
<title>t</title>
</head>
<body>
<table border="0" cellspacing="0" cellpadding="0" width="800">
<tr>
<td>
hello
</td>
</tr>
</table>
</body>
</html>


leads to:

<?xml version="1.0" encoding="UTF-8"?>
<article version="1.0" xmlns="http://docbook.org/ns/docbook"
xmlns:xl="http://www.w3.org/1999/xlink">
  <info>
    <title>t</title>
  </info>
  <row>
    <entry><para> hello </para></entry>
  </row>
</article>

I am forwarding that to the dbdoclet mailing list...oh well


-- 
Mathieu

-
To unsubscribe, e-mail: docbook-apps-unsubscr...@lists.oasis-open.org
For additional commands, e-mail: docbook-apps-h...@lists.oasis-open.org



Re: [docbook-apps] html2docbook: issue with <h2/>

2010-02-10 Thread Michael Fuchs
Hello Mathieu,

This is a bug in herold: I built a test Groovy script into the jar by
mistake. If you remove it from the jar file (/scripts/table.groovy),
everything should work. I will release a fixed version, with full support
for customization via Groovy and accompanying documentation, as soon as
possible.
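
One way to drop a single entry without unpacking and repacking the whole archive is to copy every other entry into a new jar, which leaves META-INF/MANIFEST.MF untouched. This is only a sketch, not part of herold: the function name copy_jar_without is made up for illustration, and the entry name "scripts/table.groovy" (without the leading slash) is an assumption about how the path is stored inside the jar.

```python
import zipfile


def copy_jar_without(src_path, dst_path, unwanted):
    """Copy every entry of src_path into dst_path except `unwanted`.

    A jar is a zip archive, so the standard zipfile module is enough.
    Entries are copied as-is, including META-INF/MANIFEST.MF.
    """
    with zipfile.ZipFile(src_path) as src, \
            zipfile.ZipFile(dst_path, "w", zipfile.ZIP_DEFLATED) as dst:
        for info in src.infolist():
            if info.filename == unwanted:
                continue  # skip the entry we want to get rid of
            # Passing the ZipInfo keeps name, timestamp and compression type.
            dst.writestr(info, src.read(info.filename))
```

Under those assumptions, a call would look like copy_jar_without("herold_5.2.2.jar", "herold_5.2.3.jar", "scripts/table.groovy").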

Regards
Michael




Re: [docbook-apps] html2docbook: issue with <h2/>

2010-02-10 Thread Mathieu Malaterre
Here is what I did:

  wget http://www.dbdoclet.org/archives/herold_5.2.2.jar
  jar xvf herold_5.2.2.jar
  rm ./scripts/table.groovy
  rm herold_5.2.2.jar
  jar cvf herold_5.2.3.jar fonts groovy groovyjarjarantlr \
      groovyjarjarasm groovyjarjarcommonscli images META-INF org scripts
  java -jar herold_5.2.3.jar -i t.xhtml -o o.xml
  Failed to load Main-Class manifest attribute from herold_5.2.3.jar

What am I missing?

Thanks

-- 
Mathieu




Re: [docbook-apps] html2docbook: issue with <h2/>

2010-02-10 Thread Michael Fuchs
Hello,

You have to specify the manifest file. Go to the directory where you
expanded the jar file and call:

jar cvmf META-INF/MANIFEST.MF ../herold_5.2.3.jar .

I'll send a fixed version to your mail account, and I will release a
new version today.
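
To check whether a rebuilt jar still carries the attribute that "java -jar" complains about, the manifest can be read directly, since a jar is a zip archive. The following is a minimal sketch: the helper name main_class is hypothetical, and it only looks for the Main-Class attribute after unfolding continuation lines.

```python
import zipfile


def main_class(jar_path):
    """Return the Main-Class value from the jar's manifest, or None."""
    with zipfile.ZipFile(jar_path) as jar:
        try:
            raw = jar.read("META-INF/MANIFEST.MF")
        except KeyError:
            return None  # the jar has no manifest at all
    text = raw.decode("utf-8").replace("\r\n", "\n")
    # Long manifest values continue on the next line after a single space.
    text = text.replace("\n ", "")
    for line in text.split("\n"):
        name, sep, value = line.partition(":")
        # Manifest attribute names are case-insensitive.
        if sep and name.strip().lower() == "main-class":
            return value.strip()
    return None
```

If main_class("herold_5.2.3.jar") returns None, the jar was most likely written with a freshly generated default manifest instead of the original one, which matches the error reported above.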

Regards
Michael




Re: [docbook-apps] html2docbook: issue with <h2/>

2010-01-28 Thread Michael Fuchs
Hello Mathieu,

herold produces DocBook5, which has no ulink element at all. Therefore
the linking is based on xlink.
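
For illustration, where DocBook 4 would use a ulink element with a url attribute, DocBook 5 uses a link element with an xlink:href attribute. The sketch below builds such an element with Python's standard library; the function name docbook5_link is made up, and the "xl" prefix simply mirrors the prefix that appears in herold's output.

```python
import xml.etree.ElementTree as ET

DOCBOOK_NS = "http://docbook.org/ns/docbook"
XLINK_NS = "http://www.w3.org/1999/xlink"

# Serialize DocBook as the default namespace and XLink with the "xl" prefix.
ET.register_namespace("", DOCBOOK_NS)
ET.register_namespace("xl", XLINK_NS)


def docbook5_link(href, text):
    """Build a DocBook 5 <link> element whose target is given via XLink."""
    link = ET.Element(f"{{{DOCBOOK_NS}}}link", {f"{{{XLINK_NS}}}href": href})
    link.text = text
    return link


if __name__ == "__main__":
    element = docbook5_link("http://www.docbook.org/", "DocBook")
    print(ET.tostring(element, encoding="unicode"))
```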

Regards
Michael
 
 Thanks for the info. Indeed, herold seems at first to give slightly
 better results (at least it parses h2 as I expected).
 However, it turns all links into href, instead of doing what I think is
 smarter:
 
 Absolute links (starting with "http://") remain absolute and become
 ulinks. Other links become xrefs.
 Ref:
 http://wiki.docbook.org/topic/Html2DocBook#head-f915b9937f1226e0abb325e6a6335e12d20be0c4
 







Re: [docbook-apps] html2docbook: issue with <h2/>

2010-01-27 Thread Michael Fuchs
Hello,

Perhaps it's an option for you to try the HTML to DocBook converter
herold (http://www.dbdoclet.org/archives/herold_5.2.2.jar). I ran

java -jar herold_5.2.2.jar -i input.xhtml -o output.xml

and the result looks like:

<?xml version="1.0" encoding="UTF-8"?>
<article version="1.0" xmlns="http://docbook.org/ns/docbook"
xmlns:xl="http://www.w3.org/1999/xlink">
  <info/>
  <section remap="h1">
    <title>Title1</title>
    <para>bla 1</para>
    <section remap="h2">
      <title>Title2</title>
      <para>bla 2</para>
    </section>
  </section>
</article>

Regards






Re: [docbook-apps] html2docbook: issue with <h2/>

2010-01-27 Thread Bob Stayton

Looking at the stylesheet, it appears to be unable to generate nested
sections from h1, h2, etc. headings.  It only generates a single section
element for the entire HTML file.

The basic challenge is that plain HTML has a linear structure, in which h2
is just another block element like para, while DocBook has a nested
structure where section contains title and para and other section elements.
Converting a linear structure to a nested structure requires a more complex
stylesheet than this one.
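
The linear-to-nested conversion described above can be sketched with a stack of open sections. This is only an illustration of the idea, not how the wiki stylesheet or herold works; the names Section, nest and to_docbook are invented for the example, and the input is assumed to be already reduced to (tag, text) pairs in document order.

```python
from dataclasses import dataclass, field
from xml.sax.saxutils import escape

# Heading tag name -> nesting level.
HEADINGS = {f"h{n}": n for n in range(1, 7)}


@dataclass
class Section:
    title: str
    level: int
    paras: list = field(default_factory=list)
    children: list = field(default_factory=list)


def nest(blocks):
    """Turn a flat list of (tag, text) pairs into a tree of sections."""
    root = Section(title="", level=0)
    stack = [root]  # currently open sections, outermost first
    for tag, text in blocks:
        if tag in HEADINGS:
            level = HEADINGS[tag]
            # A new heading closes every open section at the same or a
            # deeper level; the root (level 0) is never closed.
            while stack[-1].level >= level:
                stack.pop()
            section = Section(title=text, level=level)
            stack[-1].children.append(section)
            stack.append(section)
        else:
            # Any other block belongs to the innermost open section.
            stack[-1].paras.append(text)
    return root


def to_docbook(section, indent=0):
    """Render one section and its subsections as lines of DocBook markup."""
    pad = "  " * indent
    lines = [f"{pad}<section>", f"{pad}  <title>{escape(section.title)}</title>"]
    lines += [f"{pad}  <para>{escape(p)}</para>" for p in section.paras]
    for child in section.children:
        lines += to_docbook(child, indent + 1)
    lines.append(f"{pad}</section>")
    return lines
```

Fed with the h1, p, h2, p sequence from the original example, nest() puts the h2 section inside the h1 section instead of dropping its title.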

I would suggest you try the standalone HTML-to-DocBook conversion tool
herold, which used to be part of the dbdoclet Java app but is now a
standalone app.  It can be downloaded from:

http://www.dbdoclet.org/

Bob Stayton
Sagehill Enterprises
b...@sagehill.net


----- Original Message -----
From: Mathieu Malaterre <mathieu.malate...@gmail.com>
To: docbook-apps@lists.oasis-open.org
Sent: Tuesday, January 26, 2010 8:03 AM
Subject: [docbook-apps] html2docbook: issue with <h2/>



Hi there,

 I'd like to know if anyone is using the script from the page:
http://wiki.docbook.org/topic/Html2DocBook

 I tried it on a very tidy example:

$ cat input.xhtml
<!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.0 Transitional//EN"
   "http://www.w3.org/TR/xhtml1/DTD/xhtml1-transitional.dtd">
<html xmlns="http://www.w3.org/1999/xhtml">
<body>
<h1>Title1</h1>
<p>bla 1</p>
<h2>Title2</h2>
<p>bla 2</p>
</body>
</html>


Here is what I get as output:

$ cat output.xml
<?xml version="1.0"?>
<section>
 <title>Title1</title>
 <para>bla 1</para>
 <para>bla 2</para>
</section>

The title in the h2 element is lost during the conversion.

Any idea how to fix that?

Thanks,
--
Mathieu
