Re: [NTG-context] ConTeXt to RTF Conversion

2005-09-23 Thread Idris Samawi Hamid

Dear gang,

I have followed this discussion with interest. I edit a journal myself. 
Despite announcing loudly that it is TeX-friendly, the only person who 
writes articles in TeX for it is, you guessed it, myself.


I know next-to-nothing about xml, so I apologize if the next question is 
ignorant:


Would it be possible to define an xml format for the journal so that I 
could more easily process both ConTeXt/LaTeX articles as well as the docs 
and rtfs I generally receive? Is this more work than it's worth? It's a 
humanities journal, so little-to-no math.


Best
Idris

On Thu, 22 Sep 2005 22:54:47 +0200, Christopher Creutzig 
[EMAIL PROTECTED] wrote:



So if I understand wml,  I agree that xml is a format for filtering,
not a human writable format.
TeX, LaTeX or conTeXt is in input langage, which should be able to
be converted to the powerfull master XML format.


 No, sorry.  This only works for extremely simple TeX code.  forget
about any real-world mathematics.  Forget about 80% of what real-world
LaTeX users type into their computers.  TeX has simply never been
written to be easily parsed.


--
Professor Idris Samawi Hamid
Department of Philosophy
Colorado State University
Fort Collins, CO 80523

___
ntg-context mailing list
ntg-context@ntg.nl
http://www.ntg.nl/mailman/listinfo/ntg-context


Re: [NTG-context] ConTeXt to RTF Conversion

2005-09-23 Thread Christopher Creutzig
Idris Samawi Hamid wrote:
 Would it be possible to define an xml format for the journal so that I
 could more easily process both ConTeXt/LaTeX articles as well as the
 docs and rtfs I generally receive? Is this more work than it's worth?
 It's a humanities journal, so little-to-no math.

 Math is, in my experience, the worst part of it, so you an consider
yourself happy that you don't need it.

 The question is, what problems of the current process are you trying to
improve/solve with a possible move to xml?

 If your most pressing problem is the variety of data formats you
receive articles in, then no, xml won't help.  You'd still need some way
of transforming the articles to the format of your choice.  That being
said, XML may be a very good intermediate step from Word or rtf to
ConTeXt, if only because OpenOffice has pretty advanced import filters
and stores its data in a straightforward xml format that should be easy
to transform, assuming you start with a sufficiently rich set of
predefined formats and somehow get people to either use them (fat
chance, I know) or have them be sufficiently different that you can
automatically or at least semi-automatically classify the author's
formatting to your presets.  In really simple cases (e.g., pure prose)
you may get away with accepting HTML and converting that.

 If your most serious problem is a variety of output formats you want to
support (print/pdf, html, some eBook variants, ...), xml is a perfect
technique to develop a solution.

 If getting lots of different encodings is a problem of yours, xml
solves that nicely as well.  But just for that, there are simpler and
less intrusive ways.


 Other things xml may solve well:

- archivability (although your ConTeXt files are probably no worse)

- reusability: Almost everything in a file following a well-designed
  xml format is local and you can simply copy a (complete) block of
  text + markup and insert it into another file.

- consistency, enforcing rules: While it is possible to enforce things
  like “every article must start with an abstract containing one to
  three paragraphs” in TeX, it is way easier in xml.

- all sorts of conversions, including shuffling around or extracting
  data of interest


 Things xml won't do any magic for:

- layout.  You'd need to write a conversion to ConTeXt or whatever.
  Depending on your needs, this can be anything from trivial (say,
  two hours) to almost undoable (although this would mean the xml
  format is particularly badly designed for your journal).


 Both lists are certainly incomplete.  I hope you will get other answers
as well.


regards,
Christopher
___
ntg-context mailing list
ntg-context@ntg.nl
http://www.ntg.nl/mailman/listinfo/ntg-context


Re: [NTG-context] ConTeXt to RTF Conversion (tex4ht)

2005-09-22 Thread Maurice Diamantini


Le 21 sept. 05 à 17:12, Louis F.Springer a écrit :

What are the options for conversion from ConTeXt to other formats,  
if any? I'm particularly interested in rtf and/or html.


I saw there is a module tex4ht usable for converting conTeXt to html.

   http://wiki.contextgarden.net/Modules
   http://source.contextgarden.net/tex/context/base/m-tex4ht.tex

Does anyboby have any experience to share on it?
I didn't find anything about it in the wiki.

About the other answer on this subject, my (dummy) point of view is
that :

  - pdf in not an input format, but an output format,
(difficult to reuse for translating)
  - xml in not an input format but (a well structured) interchange  
format.


I don't want to enter xml version of:

   We know that $E=mC^2$ and also that $\delta = \sqrt{b^2-4ac}$

But I'd like to export it (with table of content, index, ...)
in html or xhtml.

-- Maurice

P.S.

___
ntg-context mailing list
ntg-context@ntg.nl
http://www.ntg.nl/mailman/listinfo/ntg-context


Re: [NTG-context] ConTeXt to RTF Conversion

2005-09-22 Thread Duncan Hothersall
- xml in not an input format but (a well structured) interchange  
 format.

XML is a very good master format from which to derive all outputs. For
example, I receive wordprocessor files from academics and convert them
to XML by a combination of automated processes and hand-tagging. The XML
is then stored and maintained as the master version of that document,
and when we need to produce a new release in HTML, PDF or eBook, a new
style of PDF, a text version optimised for screen-readers, etc. we take
a snapshot of the latest XML and run it through batch production
processes - including ConTeXt for the typesetting side of things.

That might be overkill for small projects (we're currently holding 25
million words in XML), but the principle applies no matter what size of
content you have.

Duncan
___
ntg-context mailing list
ntg-context@ntg.nl
http://www.ntg.nl/mailman/listinfo/ntg-context


Re: [NTG-context] ConTeXt to RTF Conversion

2005-09-22 Thread Maurice Diamantini


Le 22 sept. 05 à 12:23, Duncan Hothersall a écrit :


   - xml in not an input format but (a well structured) interchange
 format.


XML is a very good master format from which to derive all outputs. For
example, I receive wordprocessor files from academics and convert them
to XML by a combination of automated processes and hand-tagging.  
The XML

is then stored and maintained as the master version of that document,
and when we need to produce a new release in HTML, PDF or eBook, a new
style of PDF, a text version optimised for screen-readers, etc. we  
take

a snapshot of the latest XML and run it through batch production
processes - including ConTeXt for the typesetting side of things.


So if I understand wml,  I agree that xml is a format for filtering,
not a human writable format.
TeX, LaTeX or conTeXt is in input langage, which should be able to
be converted to the powerfull master XML format.

So we need something to convert ConTeXt to XML more than something.
I suppose this imply easy convertion to HTML too !

So my question was, is there any exeprience about the use of
the ConTeXt module m-tex4ht?


That might be overkill for small projects (we're currently holding 25
million words in XML), but the principle applies no matter what  
size of

content you have.


-- Maurice

___
ntg-context mailing list
ntg-context@ntg.nl
http://www.ntg.nl/mailman/listinfo/ntg-context


Re: [NTG-context] ConTeXt to RTF Conversion

2005-09-22 Thread Christopher Creutzig
Maurice Diamantini wrote:
 So if I understand wml,  I agree that xml is a format for filtering,
 not a human writable format.
 TeX, LaTeX or conTeXt is in input langage, which should be able to
 be converted to the powerfull master XML format.

 No, sorry.  This only works for extremely simple TeX code.  forget
about any real-world mathematics.  Forget about 80% of what real-world
LaTeX users type into their computers.  TeX has simply never been
written to be easily parsed.  Besides, our actual users are way too much
concerned with what their stuff looks like on their screens with their
settings to bother about structured information and the like.  Believe
me, I have almost finished the translation of our highly structured
program documentation files to some DocBook-based XML format, and I am
very happy that I had decided to make this a one-time conversion with
the automated process only trying to get some 95% or so correct.  My
experience with the new format (which is still limited, I've been
working with it the last four months or so) leads me to believe that it
is no more difficult to use than some TeX dialect.  The only slightly
awkward thing is that you have to explicitly mark all paragraphs.  I
don't mind, but if you do, that sort of thing can be scripted.

 Short summary: Define an xml format that embeds what you need at the
moment.  One mistake I made: I didn't go for short names, but used
DocBook names.  I probably should have started from XHTML, using p,
em, a etc.  Then use that format as your master and edit in this
format.  There are magnitudes more decent editors to help you with
editing all sorts of xml than you will ever find for any TeX variant.
(I know, one is sufficient, but finding one that does exactly what *you*
want is much easier with more editors to choose from.)

 So my question was, is there any exeprience about the use of
 the ConTeXt module m-tex4ht?

 I do have experience with using tex4ht in LaTeX, which is its native
setting.  It is definitely much better than all the alternatives I
tried, but it does have problems with formulas, it is rather difficult
to teach it your new local commands and the generated HTML code is
usable for exactly one thing: Rendering in a graphical browser, for us
lucky ones without visual impairments.  I would not dream of using this
pile of mess for anything else.  HTML generated by Word simply can't be
worse.


regards,
Christopher
___
ntg-context mailing list
ntg-context@ntg.nl
http://www.ntg.nl/mailman/listinfo/ntg-context


[NTG-context] ConTeXt to RTF Conversion

2005-09-21 Thread Louis F . Springer
What are the options for conversion from ConTeXt to other formats, if  
any? I'm particularly interested in rtf and/or html.


Lou

___
ntg-context mailing list
ntg-context@ntg.nl
http://www.ntg.nl/mailman/listinfo/ntg-context


Re: [NTG-context] ConTeXt to RTF Conversion

2005-09-21 Thread Pr. Erich Fickel

andrea valle wrote:

I've asked too, but it seems that that there are none to rtf.
This lack of conversion is a problem for me.


Would it be possible for you to use any of the software that converts 
PDF to RTF? Write in Context. When you need RTF you convert the PDF to 
RTF instead of the TEX.


-Erich Fickel
-- newbie to context and tex, reading TeXbook and Context manual


--
No virus found in this outgoing message.
Checked by AVG Anti-Virus.
Version: 7.0.344 / Virus Database: 267.10.25/102 - Release Date: 9/14/2005

___
ntg-context mailing list
ntg-context@ntg.nl
http://www.ntg.nl/mailman/listinfo/ntg-context


RE: [NTG-context] ConTeXt to RTF Conversion

2005-09-21 Thread Idris Samawi Hamid
Hi,

= Original Message From Pr. Erich Fickel [EMAIL PROTECTED] =
andrea valle wrote:
 I've asked too, but it seems that that there are none to rtf.
 This lack of conversion is a problem for me.

Would it be possible for you to use any of the software that converts
PDF to RTF? Write in Context. When you need RTF you convert the PDF to
RTF instead of the TEX.

Only works for the most trivial of documents, and if you have lots of accents 
and diacritics u r up the creek-)

Best
Idris


Professor Idris Samawi Hamid
Department of Philosophy
Colorado State University
Fort Collins, CO 80523

___
ntg-context mailing list
ntg-context@ntg.nl
http://www.ntg.nl/mailman/listinfo/ntg-context


Re: [NTG-context] ConTeXt to RTF Conversion

2005-09-21 Thread Mojca Miklavec

Louis F.Springer wrote:
What are the options for conversion from ConTeXt to other formats, if  
any? I'm particularly interested in rtf and/or html.


The main problem is: PDF is extremely complex format (and Hans tries to 
explore almost every single capability of it) and you can't grant a good 
conversion for obscure documents. You can't get html out of the box, but 
there are some options depending on what your documents look like, what 
effort are you ready to invest and what quality of HTML/RTF you expect.


(I can't imagine a tool which would satisfactory convert the ConTeXt 
manuals into HTML without manual intervention.)


1. The best quality can be achieved if you prepare all your stuff in XML 
and then write both a stylesheet for conversion into HTML and a couple 
of ConTeXt definitions to handle formating for output in PDF documents. 
I never did that (I consider it too complex and time consuming), but if 
you're ready to go that way and sacrifice some time, there are some 
people on the list who can help you.


2. exTeX is going to natively support HTML output. The question is when 
a stable version is going to appear and if ConTeXt will ever support it 
/ if they will support ConTeXt. (not a satisfactory answer yet)


3. latex2html, latex2word, tth - like

No, no such tool yet and I doubt that anyone is going to write it soon.

4. PDF - HTML / PDF - RTF conversion

Currently the best possibility if you have simple documents, you don't 
want to spend too much time on it and you don't mind too much about high 
quality of the produced HTML/RTF.


I often use pdftotext and then manually reformat everything (not for my 
own documents however), but I was impressed by the quality of what ABBYY 
PDF Transformer was able to do with sample documents that I saw (it 
converts a table in PDF into a table in Word, preserves images and page 
layout, ...).


About the accents that Idris mentioned: As long as the accented 
characters aren't faked, there is a way to get them out of PDF.


Mojca
___
ntg-context mailing list
ntg-context@ntg.nl
http://www.ntg.nl/mailman/listinfo/ntg-context


Re: [NTG-context] ConTeXt to RTF Conversion

2005-09-21 Thread Adam Lindsay
Mojca Miklavec said this at Wed, 21 Sep 2005 22:09:32 +0200:

1. The best quality can be achieved if you prepare all your stuff in XML 
and then write both a stylesheet for conversion into HTML and a couple 
of ConTeXt definitions to handle formating for output in PDF documents. 
I never did that (I consider it too complex and time consuming), but if 
you're ready to go that way and sacrifice some time, there are some 
people on the list who can help you.

very much agreed here. I'd personally advocate an intermediate XML
format (that matches your document needs), and transforming to Word-
friendly HTML or to a ConTeXt-friendly format.

For a taste of some XML workflows, see:
http://pragma-ade.com/general/manuals/example.pdf 
http://pragma-ade.com/general/magazines/mag-0008.pdf
and talk with the list!
-- 
=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=
 Adam T. Lindsay, Computing Dept. [EMAIL PROTECTED]
 Lancaster University, InfoLab21+44(0)1524/510.514
 Lancaster, LA1 4WA, UK Fax:+44(0)1524/510.492
-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-

___
ntg-context mailing list
ntg-context@ntg.nl
http://www.ntg.nl/mailman/listinfo/ntg-context


Re: [NTG-context] ConTeXt to RTF Conversion

2005-09-21 Thread andrea valle

Thanks, I' ve never thought about that.
Does anyone know about a free pdf2rtf converter for macosx?

Best
-a-

On 21 Sep 2005, at 17:58, Pr. Erich Fickel wrote:


andrea valle wrote:

I've asked too, but it seems that that there are none to rtf.
This lack of conversion is a problem for me.


Would it be possible for you to use any of the software that converts 
PDF to RTF? Write in Context. When you need RTF you convert the PDF to 
RTF instead of the TEX.


-Erich Fickel
-- newbie to context and tex, reading TeXbook and Context manual


--
No virus found in this outgoing message.
Checked by AVG Anti-Virus.
Version: 7.0.344 / Virus Database: 267.10.25/102 - Release Date: 
9/14/2005


___
ntg-context mailing list
ntg-context@ntg.nl
http://www.ntg.nl/mailman/listinfo/ntg-context



Andrea Valle
Laboratorio multimediale G. Quazza
Facoltà di Scienze della Formazione
Università degli Studi di Torino
[EMAIL PROTECTED]

___
ntg-context mailing list
ntg-context@ntg.nl
http://www.ntg.nl/mailman/listinfo/ntg-context