Re: [NTG-context] TexPaste alpha - my Win application converting Word/HTML to TeX

2009-06-08 Thread J.A.J. Pater

Hello Piotr

Sorry for the late reply, but could you post it to the net somewhere or 
to the list?


Thanks,

Adriaan.


Hello list,

Inevitably, it's a recurring subject. Here are my 2p. After playing
with all sorts of convertors to TeX, Latex, HTML and scraping the
output with Perl to obtain something useful for ConTeXt I found that
what I in fact really need to preserve from a Word file are italics
and footnotes. To make the long story short. IMHO the only reasonable
way to go is via XSL stylesheet for ooffice. Fortunately you don't
have to develop a new one from scratch which would be quite a task.
There is an excellent stylesheet converting odt to mediawiki by
Bernhard Haumacher odt2mediawiki.xsl  It took me less than an hour to
adapt it for ConTeXt output. Then you only add it as an xml filter to
Open Office and from then on can convert Word to ConTeXt straight from
ooffice as if it was one of its built-in export formats.

Piotr




2009/5/28 luigi scarso luigi.sca...@gmail.com:
  

I guess I should build a new converter suite (there's also a InDesign Tags
to ConTeXt converter anywhere on my harddisk).
But I won't make GUI apps, just scripts.
  

That's sound good !
If in python, even better !
If only scripts, the best !

Can we have more details ?

--
luigi


___
If your question is of interest to others as well, please add an entry to
the Wiki!

maillist : ntg-context@ntg.nl /
http://www.ntg.nl/mailman/listinfo/ntg-context
webpage  : http://www.pragma-ade.nl / http://tex.aanhet.net
archive  : https://foundry.supelec.fr/projects/contextrev/
wiki : http://contextgarden.net
___







  


___
If your question is of interest to others as well, please add an entry to the 
Wiki!

maillist : ntg-context@ntg.nl / http://www.ntg.nl/mailman/listinfo/ntg-context
webpage  : http://www.pragma-ade.nl / http://tex.aanhet.net
archive  : https://foundry.supelec.fr/projects/contextrev/
wiki : http://contextgarden.net
___


Re: [NTG-context] TexPaste alpha - my Win application converting Word/HTML to TeX

2009-05-28 Thread Henning Hraban Ramm

Am 2009-05-28 um 00:05 schrieb Vyatcheslav Yatskovsky:

I'm glad to report that I made a simple application (sorry, only  
forWindows at the moment) that coverts text from Ms Word (or other  
editors) or HTML pages (web sites) into TeX.


The app recognizes at the moment only following formats/tags:
Bold (b), Italic (i), Header 1 (h1), Header 2 (h2), Header 3  
(h3).



Sorry for stealing your thread, but it's related...

I just found there's still a collection of my old (2002) Perl scripts at
http://www.fiee.net/texnique/material/fiee-perl.zip
It contains simple converters from HTML, LaTeX and XPress Tags to  
ConTeXt.


While this one (2006):
http://www.fiee.net/texnique/material/mab2bib.zip
contains (besides a mab2bib bibliography converter) a simple Python  
script to convert arbitrary encodings - just rename it from  
utf8_to_latex.py to e.g. latin1_to_utf8.py: If the parts of its  
file name are encodings known to Python, it'll just work.
latex encoding is included, so latex_to_utf8.py can convert cruft  
like \c{C} to Ç.


I guess I should build a new converter suite (there's also a InDesign  
Tags to ConTeXt converter anywhere on my harddisk).

But I won't make GUI apps, just scripts.


Greetlings from Lake Constance!
Hraban
---
http://www.fiee.net/texnique/
http://wiki.contextgarden.net
https://www.cacert.org (I'm an assurer)

___
If your question is of interest to others as well, please add an entry to the 
Wiki!

maillist : ntg-context@ntg.nl / http://www.ntg.nl/mailman/listinfo/ntg-context
webpage  : http://www.pragma-ade.nl / http://tex.aanhet.net
archive  : https://foundry.supelec.fr/projects/contextrev/
wiki : http://contextgarden.net
___


Re: [NTG-context] TexPaste alpha - my Win application converting Word/HTML to TeX

2009-05-28 Thread luigi scarso


 I guess I should build a new converter suite (there's also a InDesign Tags
 to ConTeXt converter anywhere on my harddisk).
 But I won't make GUI apps, just scripts.

That's sound good !
If in python, even better !
If only scripts, the best !

Can we have more details ?

-- 
luigi
___
If your question is of interest to others as well, please add an entry to the 
Wiki!

maillist : ntg-context@ntg.nl / http://www.ntg.nl/mailman/listinfo/ntg-context
webpage  : http://www.pragma-ade.nl / http://tex.aanhet.net
archive  : https://foundry.supelec.fr/projects/contextrev/
wiki : http://contextgarden.net
___


Re: [NTG-context] TexPaste alpha - my Win application converting Word/HTML to TeX

2009-05-28 Thread Piotr Kopszak
Hello list,

Inevitably, it's a recurring subject. Here are my 2p. After playing
with all sorts of convertors to TeX, Latex, HTML and scraping the
output with Perl to obtain something useful for ConTeXt I found that
what I in fact really need to preserve from a Word file are italics
and footnotes. To make the long story short. IMHO the only reasonable
way to go is via XSL stylesheet for ooffice. Fortunately you don't
have to develop a new one from scratch which would be quite a task.
There is an excellent stylesheet converting odt to mediawiki by
Bernhard Haumacher odt2mediawiki.xsl  It took me less than an hour to
adapt it for ConTeXt output. Then you only add it as an xml filter to
Open Office and from then on can convert Word to ConTeXt straight from
ooffice as if it was one of its built-in export formats.

Piotr




2009/5/28 luigi scarso luigi.sca...@gmail.com:

 I guess I should build a new converter suite (there's also a InDesign Tags
 to ConTeXt converter anywhere on my harddisk).
 But I won't make GUI apps, just scripts.

 That's sound good !
 If in python, even better !
 If only scripts, the best !

 Can we have more details ?

 --
 luigi


 ___
 If your question is of interest to others as well, please add an entry to
 the Wiki!

 maillist : ntg-context@ntg.nl /
 http://www.ntg.nl/mailman/listinfo/ntg-context
 webpage  : http://www.pragma-ade.nl / http://tex.aanhet.net
 archive  : https://foundry.supelec.fr/projects/contextrev/
 wiki     : http://contextgarden.net
 ___





-- 
http://okle.pl
___
If your question is of interest to others as well, please add an entry to the 
Wiki!

maillist : ntg-context@ntg.nl / http://www.ntg.nl/mailman/listinfo/ntg-context
webpage  : http://www.pragma-ade.nl / http://tex.aanhet.net
archive  : https://foundry.supelec.fr/projects/contextrev/
wiki : http://contextgarden.net
___


[NTG-context] TexPaste alpha - my Win application converting Word/HTML to TeX

2009-05-27 Thread Vyatcheslav Yatskovsky

Hello,

I'm glad to report that I made a simple application (sorry, only 
forWindows at the moment) that coverts text from Ms Word (or other 
editors) or HTML pages (web sites) into TeX.


DOWNLOAD LINK (280 KB):
http://ul.to/hmpy60

The app recognizes at the moment only following formats/tags:
Bold (b), Italic (i), Header 1 (h1), Header 2 (h2), Header 3 (h3).

It coverts NOBREAK_SPACE (A0) into ~, nbsp into \enskip, quot; into , 
amp; into \, and lt; gt; into  .


It is UTF-8 ready.

USAGE: copy desired text fragment from Word or web page into clipboard, 
and click big Get... button and see the result in the bottom field. 
Click Copy Result to get TeX-formatted text back into the clipboard, 
and paste it into your editor.


KNOW ISSUES: Some crap from Word formatting like !-- -- tags happens 
to leak, but it is easier at the moment to delete it manually. And 
sorry... awful interface.


It is very-very first alpha, I want to show it just as proof-of-concept 
and to get some feedback. Actually, I did it for myself to simplyfy 
conversion from Word into TeX. I have some documents to be converted 
(e.g., lecture notes), and this happens to be easy task with my tool :).


Best,
Vyatcheslav
___
If your question is of interest to others as well, please add an entry to the 
Wiki!

maillist : ntg-context@ntg.nl / http://www.ntg.nl/mailman/listinfo/ntg-context
webpage  : http://www.pragma-ade.nl / http://tex.aanhet.net
archive  : https://foundry.supelec.fr/projects/contextrev/
wiki : http://contextgarden.net
___