Re: [docbook-apps] html2docbook: issue with <h2/>

On Thu, Jan 28, 2010 at 9:43 AM, Michael Fuchs <ml...@dbdoclet.org> wrote:
> Hello Mathieu,
>
> herold produces DocBook5, which has no ulink element at all. Therefore
> the linking is based on xlink.

Thanks! BTW, I found out that herold does not support <table/>:

    <!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.0 Strict//EN" "http://www.w3.org/TR/xhtml1/DTD/xhtml1-strict.dtd">
    <html xmlns="http://www.w3.org/1999/xhtml">
      <head>
        <title>t</title>
      </head>
      <body>
        <table border="0" cellspacing="0" cellpadding="0" width="800">
          <tr>
            <td> hello </td>
          </tr>
        </table>
      </body>
    </html>

leads to:

    <?xml version="1.0" encoding="UTF-8"?>
    <article version="1.0" xmlns="http://docbook.org/ns/docbook" xmlns:xl="http://www.w3.org/1999/xlink">
      <info>
        <title>t</title>
      </info>
      <row>
        <entry><para> hello </para></entry>
      </row>
    </article>

I am forwarding that to the dbdoclet mailing list...oh well

--
Mathieu

---------------------------------------------------------------------
To unsubscribe, e-mail: docbook-apps-unsubscr...@lists.oasis-open.org
For additional commands, e-mail: docbook-apps-h...@lists.oasis-open.org
Re: [docbook-apps] html2docbook: issue with <h2/>

Hello Mathieu,

this is a bug in herold: I built a test Groovy script into the jar by mistake. If you remove it from the jar file (/scripts/table.groovy), everything should work. I will release a fixed version with full support for customization via Groovy, plus accompanying documentation, as soon as possible.

Regards
Michael

On Wednesday, 10.02.2010, at 10:25 +0100, Mathieu Malaterre wrote:
> BTW I found out herold does not support <table/>
> [...]
Re: [docbook-apps] html2docbook: issue with <h2/>

Here is what I did:

    $ wget http://www.dbdoclet.org/archives/herold_5.2.2.jar
    $ jar xvf herold_5.2.2.jar
    $ rm ./scripts/table.groovy
    $ rm herold_5.2.2.jar
    $ jar cvf herold_5.2.3.jar fonts groovy groovyjarjarantlr groovyjarjarasm groovyjarjarcommonscli images META-INF org scripts
    $ java -jar herold_5.2.3.jar -i t.xhtml -o o.xml
    Failed to load Main-Class manifest attribute from
    herold_5.2.3.jar

What am I missing?

Thanks

On Wed, Feb 10, 2010 at 12:48 PM, Michael Fuchs <ml...@dbdoclet.org> wrote:
> this is a bug in herold, because I build in a test groovy script by
> mistake. If you remove it from the jar file (/scripts/table.groovy)
> everything should work.
> [...]

--
Mathieu
Re: [docbook-apps] html2docbook: issue with <h2/>

Hello,

you have to specify the manifest file. Go to the directory where you expanded the jar file and call:

    jar cvmf META-INF/MANIFEST.MF ../herold_5.2.3.jar .

I'll send a fixed version to your mail account, and I will release a new version today.

Regards
Michael

On Wednesday, 10.02.2010, at 14:14 +0100, Mathieu Malaterre wrote:
> $ java -jar herold_5.2.3.jar -i t.xhtml -o o.xml
> Failed to load Main-Class manifest attribute from
> herold_5.2.3.jar
>
> What am I missing?
> [...]
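
For anyone who would rather script the repacking than run the jar tool by hand, here is a minimal sketch in Python. It is not part of herold or dbdoclet. It assumes that a jar is an ordinary zip archive and that the entry to drop is scripts/table.groovy as described in this thread; the demo archive and the class name example.Main are made up for illustration. Because every other entry is copied unchanged, META-INF/MANIFEST.MF and its Main-Class attribute survive, which is the same thing the m option achieves for the jar tool.

```python
import io
import zipfile
from typing import Optional


def main_class(jar_bytes: bytes) -> Optional[str]:
    """Return the Main-Class value from a jar's manifest, or None if absent.

    Simplification: manifest continuation lines (long values wrapped onto
    the next line) are not handled.
    """
    with zipfile.ZipFile(io.BytesIO(jar_bytes)) as jar:
        try:
            manifest = jar.read("META-INF/MANIFEST.MF").decode("utf-8")
        except KeyError:
            return None
    for line in manifest.splitlines():
        name, sep, value = line.partition(":")
        if sep and name.strip().lower() == "main-class":
            return value.strip()
    return None


def drop_entry(jar_bytes: bytes, unwanted: str) -> bytes:
    """Copy a jar entry by entry, skipping `unwanted` and keeping the rest."""
    out = io.BytesIO()
    with zipfile.ZipFile(io.BytesIO(jar_bytes)) as src, \
            zipfile.ZipFile(out, "w", zipfile.ZIP_DEFLATED) as dst:
        for info in src.infolist():
            if info.filename != unwanted:
                dst.writestr(info, src.read(info.filename))
    return out.getvalue()


def build_demo_jar() -> bytes:
    """Build a tiny in-memory archive; all names in it are hypothetical."""
    buf = io.BytesIO()
    with zipfile.ZipFile(buf, "w") as jar:
        jar.writestr("META-INF/MANIFEST.MF",
                     "Manifest-Version: 1.0\nMain-Class: example.Main\n")
        jar.writestr("scripts/table.groovy", "// test script")
        jar.writestr("org/Example.class", b"")
    return buf.getvalue()


if __name__ == "__main__":
    fixed = drop_entry(build_demo_jar(), "scripts/table.groovy")
    print(main_class(fixed))
```

Calling main_class on a repacked jar before running it is a quick way to check whether the "Failed to load Main-Class manifest attribute" error would come back.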
Re: [docbook-apps] html2docbook: issue with <h2/>

Hello Mathieu,

herold produces DocBook5, which has no ulink element at all. Therefore the linking is based on xlink.

Regards
Michael

> Thanks for the info. Indeed, herold seems at first to give slightly
> better results (at least it parses h2 as I expected). However, it turns
> all links into href, instead of doing what I think is smarter:
>
>     Absolute links (starting with "http://") remain absolute and become
>     ulinks. Other links become xrefs.
>
> Ref: http://wiki.docbook.org/topic/Html2DocBook#head-f915b9937f1226e0abb325e6a6335e12d20be0c4
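
The rule quoted from the wiki can be carried over to DocBook 5 roughly as follows. This is only a sketch of the idea in Python, not what herold or the wiki stylesheet does. It assumes that an absolute link becomes a link element with an XLink href (using the xl prefix that herold's output binds), and that any other link becomes an xref whose linkend is the URL fragment if there is one. How relative links without a fragment should map to IDs is left open here; the function convert_link is a hypothetical name.

```python
from urllib.parse import urlsplit
from xml.sax.saxutils import escape, quoteattr


def convert_link(href: str, text: str) -> str:
    """Map an HTML link to DocBook 5 markup (illustrative rule only)."""
    parts = urlsplit(href)
    if parts.scheme in ("http", "https"):
        # Absolute links stay absolute and are expressed via XLink.
        return f"<link xl:href={quoteattr(href)}>{escape(text)}</link>"
    # Assumption: the fragment names the target ID; otherwise the raw
    # href is used, which a real converter would have to map to an ID.
    target = parts.fragment or href
    return f"<xref linkend={quoteattr(target)}/>"


if __name__ == "__main__":
    print(convert_link("http://www.docbook.org/", "DocBook"))
    print(convert_link("#intro", "Introduction"))
```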
Re: [docbook-apps] html2docbook: issue with <h2/>

Hello,

perhaps it's an option for you to try the HTML to DocBook converter herold (http://www.dbdoclet.org/archives/herold_5.2.2.jar). I ran

    java -jar herold_5.2.2.jar -i input.xhtml -o output.xml

and the result looks like:

    <?xml version="1.0" encoding="UTF-8"?>
    <article version="1.0" xmlns="http://docbook.org/ns/docbook" xmlns:xl="http://www.w3.org/1999/xlink">
      <info/>
      <section remap="h1">
        <title>Title1</title>
        <para>bla 1</para>
        <section remap="h2">
          <title>Title2</title>
          <para>bla 2</para>
        </section>
      </section>
    </article>

Regards

> Hi there,
>
> I'd like to know if anyone is using the script from the page:
>
> http://wiki.docbook.org/topic/Html2DocBook
>
> I tried on a very tidy example:
>
> $ cat input.xhtml
> <!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.0 Transitional//EN" "http://www.w3.org/TR/xhtml1/DTD/xhtml1-transitional.dtd">
> <html xmlns="http://www.w3.org/1999/xhtml">
> <body>
> <h1>Title1</h1>
> <p>bla 1</p>
> <h2>Title2</h2>
> <p>bla 2</p>
> </body>
> </html>
>
> Here is what I get as output:
>
> $ cat output.xml
> <?xml version="1.0"?>
> <section>
> <title>Title1</title>
> <para>bla 1</para>
> <para>bla 2</para>
> </section>
>
> The title in h2 element is lost during the conversion. Any idea on how
> to fix that?
>
> Thanks,
Re: [docbook-apps] html2docbook: issue with <h2/>

Looking at the stylesheet, it appears unable to generate nested sections from h1, h2, etc. headings. It only generates a single section element for the entire HTML file.

The basic challenge is that plain HTML has a linear structure, in which h2 is just another block element like a paragraph, while DocBook has a nested structure where section contains title and para and other section elements. Converting a linear structure to a nested structure requires a more complex stylesheet than this one.

I would suggest you try the standalone HTML-to-DocBook conversion tool herold, which used to be part of the dbdoclet Java app but is now a standalone app. It can be downloaded from:

http://www.dbdoclet.org/

Bob Stayton
Sagehill Enterprises
b...@sagehill.net

----- Original Message -----
From: Mathieu Malaterre <mathieu.malate...@gmail.com>
To: docbook-apps@lists.oasis-open.org
Sent: Tuesday, January 26, 2010 8:03 AM
Subject: [docbook-apps] html2docbook: issue with <h2/>

> The title in h2 element is lost during the conversion. Any idea on how
> to fix that?
> [...]
>
> --
> Mathieu
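
The linear-to-nested conversion described above can be sketched with a stack of open sections. The Python below is only an illustration of that idea under simplifying assumptions: it is neither the wiki stylesheet nor herold's implementation, it handles only h1 to h6 and p children of body, and the remap attribute simply mirrors the herold output shown earlier in the thread. The names nest_sections and SAMPLE are made up for this example.

```python
import xml.etree.ElementTree as ET

XHTML = "{http://www.w3.org/1999/xhtml}"
HEADINGS = {f"h{n}": n for n in range(1, 7)}

# The input from the original question, without the DOCTYPE line.
SAMPLE = """<html xmlns="http://www.w3.org/1999/xhtml"><body>
<h1>Title1</h1><p>bla 1</p><h2>Title2</h2><p>bla 2</p>
</body></html>"""


def nest_sections(body: ET.Element) -> ET.Element:
    """Turn the flat children of an XHTML body into nested sections."""
    article = ET.Element("article")
    # Stack of (heading level, container); the article sits at level 0.
    stack = [(0, article)]
    for child in body:
        tag = child.tag.replace(XHTML, "")
        if tag in HEADINGS:
            level = HEADINGS[tag]
            # Close every open section at the same or a deeper level.
            while stack[-1][0] >= level:
                stack.pop()
            section = ET.SubElement(stack[-1][1], "section", remap=tag)
            ET.SubElement(section, "title").text = "".join(child.itertext())
            stack.append((level, section))
        elif tag == "p":
            # Paragraphs go into whichever section is currently open.
            ET.SubElement(stack[-1][1], "para").text = "".join(child.itertext())
    return article


if __name__ == "__main__":
    body = ET.fromstring(SAMPLE).find(f"{XHTML}body")
    print(ET.tostring(nest_sections(body), encoding="unicode"))
```

The stack is what a simple one-pass template lacks: each heading has to know which sections are still open before it can decide where to attach itself.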