Re: [NTG-context] TexPaste alpha - my Win application converting Word/HTML to TeX
Hello Piotr Sorry for the late reply, but could you post it to the net somewhere or to the list? Thanks, Adriaan. Hello list, Inevitably, it's a recurring subject. Here are my 2p. After playing with all sorts of convertors to TeX, Latex, HTML and scraping the output with Perl to obtain something useful for ConTeXt I found that what I in fact really need to preserve from a Word file are italics and footnotes. To make the long story short. IMHO the only reasonable way to go is via XSL stylesheet for ooffice. Fortunately you don't have to develop a new one from scratch which would be quite a task. There is an excellent stylesheet converting odt to mediawiki by Bernhard Haumacher odt2mediawiki.xsl It took me less than an hour to adapt it for ConTeXt output. Then you only add it as an xml filter to Open Office and from then on can convert Word to ConTeXt straight from ooffice as if it was one of its built-in export formats. Piotr 2009/5/28 luigi scarso : I guess I should build a new converter suite (there's also a InDesign Tags to ConTeXt converter anywhere on my harddisk). But I won't make GUI apps, just scripts. That's sound good ! If in python, even better ! If only scripts, the best ! Can we have more details ? -- luigi ___ If your question is of interest to others as well, please add an entry to the Wiki! maillist : ntg-context@ntg.nl / http://www.ntg.nl/mailman/listinfo/ntg-context webpage : http://www.pragma-ade.nl / http://tex.aanhet.net archive : https://foundry.supelec.fr/projects/contextrev/ wiki : http://contextgarden.net ___ ___ If your question is of interest to others as well, please add an entry to the Wiki! maillist : ntg-context@ntg.nl / http://www.ntg.nl/mailman/listinfo/ntg-context webpage : http://www.pragma-ade.nl / http://tex.aanhet.net archive : https://foundry.supelec.fr/projects/contextrev/ wiki : http://contextgarden.net ___
Re: [NTG-context] TexPaste alpha - my Win application converting Word/HTML to TeX
Hello list, Inevitably, it's a recurring subject. Here are my 2p. After playing with all sorts of convertors to TeX, Latex, HTML and scraping the output with Perl to obtain something useful for ConTeXt I found that what I in fact really need to preserve from a Word file are italics and footnotes. To make the long story short. IMHO the only reasonable way to go is via XSL stylesheet for ooffice. Fortunately you don't have to develop a new one from scratch which would be quite a task. There is an excellent stylesheet converting odt to mediawiki by Bernhard Haumacher odt2mediawiki.xsl It took me less than an hour to adapt it for ConTeXt output. Then you only add it as an xml filter to Open Office and from then on can convert Word to ConTeXt straight from ooffice as if it was one of its built-in export formats. Piotr 2009/5/28 luigi scarso : >> >> I guess I should build a new converter suite (there's also a InDesign Tags >> to ConTeXt converter anywhere on my harddisk). >> But I won't make GUI apps, just scripts. > > That's sound good ! > If in python, even better ! > If only scripts, the best ! > > Can we have more details ? > > -- > luigi > > > ___ > If your question is of interest to others as well, please add an entry to > the Wiki! > > maillist : ntg-context@ntg.nl / > http://www.ntg.nl/mailman/listinfo/ntg-context > webpage : http://www.pragma-ade.nl / http://tex.aanhet.net > archive : https://foundry.supelec.fr/projects/contextrev/ > wiki : http://contextgarden.net > ___ > > -- http://okle.pl ___ If your question is of interest to others as well, please add an entry to the Wiki! maillist : ntg-context@ntg.nl / http://www.ntg.nl/mailman/listinfo/ntg-context webpage : http://www.pragma-ade.nl / http://tex.aanhet.net archive : https://foundry.supelec.fr/projects/contextrev/ wiki : http://contextgarden.net ___
Re: [NTG-context] TexPaste alpha - my Win application converting Word/HTML to TeX
> > > I guess I should build a new converter suite (there's also a InDesign Tags > to ConTeXt converter anywhere on my harddisk). > But I won't make GUI apps, just scripts. > That's sound good ! If in python, even better ! If only scripts, the best ! Can we have more details ? -- luigi ___ If your question is of interest to others as well, please add an entry to the Wiki! maillist : ntg-context@ntg.nl / http://www.ntg.nl/mailman/listinfo/ntg-context webpage : http://www.pragma-ade.nl / http://tex.aanhet.net archive : https://foundry.supelec.fr/projects/contextrev/ wiki : http://contextgarden.net ___
Re: [NTG-context] TexPaste alpha - my Win application converting Word/HTML to TeX
Am 2009-05-28 um 00:05 schrieb Vyatcheslav Yatskovsky: I'm glad to report that I made a simple application (sorry, only forWindows at the moment) that coverts text from Ms Word (or other editors) or HTML pages (web sites) into TeX. The app recognizes at the moment only following formats/tags: Bold (), Italic (), Header 1 (), Header 2 (), Header 3 (). Sorry for stealing your thread, but it's related... I just found there's still a collection of my old (2002) Perl scripts at http://www.fiee.net/texnique/material/fiee-perl.zip It contains simple converters from HTML, LaTeX and XPress Tags to ConTeXt. While this one (2006): http://www.fiee.net/texnique/material/mab2bib.zip contains (besides a mab2bib bibliography converter) a simple Python script to convert arbitrary encodings - just rename it from "utf8_to_latex.py" to e.g. "latin1_to_utf8.py": If the parts of its file name are encodings known to Python, it'll just work. "latex" encoding is included, so "latex_to_utf8.py" can convert cruft like \c{C} to Ç. I guess I should build a new converter suite (there's also a InDesign Tags to ConTeXt converter anywhere on my harddisk). But I won't make GUI apps, just scripts. Greetlings from Lake Constance! Hraban --- http://www.fiee.net/texnique/ http://wiki.contextgarden.net https://www.cacert.org (I'm an assurer) ___ If your question is of interest to others as well, please add an entry to the Wiki! maillist : ntg-context@ntg.nl / http://www.ntg.nl/mailman/listinfo/ntg-context webpage : http://www.pragma-ade.nl / http://tex.aanhet.net archive : https://foundry.supelec.fr/projects/contextrev/ wiki : http://contextgarden.net ___
[NTG-context] TexPaste alpha - my Win application converting Word/HTML to TeX
Hello, I'm glad to report that I made a simple application (sorry, only forWindows at the moment) that coverts text from Ms Word (or other editors) or HTML pages (web sites) into TeX. DOWNLOAD LINK (280 KB): http://ul.to/hmpy60 The app recognizes at the moment only following formats/tags: Bold (), Italic (), Header 1 (), Header 2 (), Header 3 (). It coverts NOBREAK_SPACE (A0) into ~, into \enskip, " into ", & into \&, and < > into < >. It is UTF-8 ready. USAGE: copy desired text fragment from Word or web page into clipboard, and click big "Get..." button and see the result in the bottom field. Click "Copy Result" to get TeX-formatted text back into the clipboard, and paste it into your editor. KNOW ISSUES: Some crap from Word formatting like tags happens to leak, but it is easier at the moment to delete it manually. And sorry... awful interface. It is very-very first alpha, I want to show it just as proof-of-concept and to get some feedback. Actually, I did it for myself to simplyfy conversion from Word into TeX. I have some documents to be converted (e.g., lecture notes), and this happens to be easy task with my tool :). Best, Vyatcheslav ___ If your question is of interest to others as well, please add an entry to the Wiki! maillist : ntg-context@ntg.nl / http://www.ntg.nl/mailman/listinfo/ntg-context webpage : http://www.pragma-ade.nl / http://tex.aanhet.net archive : https://foundry.supelec.fr/projects/contextrev/ wiki : http://contextgarden.net ___