Re: extracting text from docx files

2011-08-11 Thread Polytropon
On Tue, 9 Aug 2011 21:16:11 +0200, Christian Barthel wrote: On Tue, Aug 09, 2011 at 02:36:32PM +0100, Anton Shterenlikht wrote: I often receive information in *.docx format from my MS using colleagues. Sometimes I can ask for a pdf (or similar) instead, but not always. You have a lot of

Re: extracting text from docx files

2011-08-11 Thread Anton Shterenlikht
On Thu, Aug 11, 2011 at 12:14:51PM +0200, Polytropon wrote: On Tue, 9 Aug 2011 21:16:11 +0200, Christian Barthel wrote: On Tue, Aug 09, 2011 at 02:36:32PM +0100, Anton Shterenlikht wrote: I often receive information in *.docx format from my MS using colleagues. Sometimes I can ask for

Re: extracting text from docx files

2011-08-11 Thread Ruben de Groot
There are several docx converters online (google docx2pdf). Haven't tried them though. LibreOffice handles docx quite well. On Thu, Aug 11, 2011 at 12:22:22PM +0100, Anton Shterenlikht typed: On Thu, Aug 11, 2011 at 12:14:51PM +0200, Polytropon wrote: On Tue, 9 Aug 2011 21:16:11 +0200,

Re: extracting text from docx files

2011-08-09 Thread Rod Person
On Tue, 9 Aug 2011 14:36:32 +0100 Anton Shterenlikht me...@bristol.ac.uk wrote: Usually I unzip a docx and then search through all *xml files to find the useful data. However, I can't find any xml styles to use, so I have to convert the relevant xml file(s) to plain text by hand. I wonder

Re: extracting text from docx files

2011-08-09 Thread Anton Shterenlikht
On Tue, Aug 09, 2011 at 09:40:26AM -0400, Rod Person wrote: On Tue, 9 Aug 2011 14:36:32 +0100 Anton Shterenlikht me...@bristol.ac.uk wrote: Usually I unzip a docx and then search through all *xml files to find the useful data. However, I can't find any xml styles to use, so I have to

Re: extracting text from docx files

2011-08-09 Thread Kurt Buff
On Tue, Aug 9, 2011 at 06:36, Anton Shterenlikht me...@bristol.ac.uk wrote: I often receive information in *.docx format from my MS using colleagues. Sometimes I can ask for a pdf (or similar) instead, but not always. Usually I unzip a docx and then search through all *xml  files to find the

Re: extracting text from docx files

2011-08-09 Thread Matthias Apitz
El día Tuesday, August 09, 2011 a las 10:25:30AM -0700, Kurt Buff escribió: My installation of OpenOffice 3.3 on my Win7 machine will open a Winword 2010 .docx file. I'm guessing it will do the same on FreeBSD, but I don't have an install with a GUI running at the moment. It does, using

Re: extracting text from docx files

2011-08-09 Thread Christian Barthel
On Tue, Aug 09, 2011 at 02:36:32PM +0100, Anton Shterenlikht wrote: I often receive information in *.docx format from my MS using colleagues. Sometimes I can ask for a pdf (or similar) instead, but not always. You have a lot of nice options: - Force them to use BSD/Linux ;) - explain them,

Re: extracting text from docx files

2011-08-09 Thread Antonio Olivares
But if you really, really need to read docx, you can try the web application from Microsoft. A few months ago, I got also a lot of docx and I opend it with the microsoft web app; this worked for me to extract the information... More information: http://office.microsoft.com/en-us/web-apps/

Re: extracting text from docx files

2011-08-09 Thread Christian Barthel
On Tue, Aug 09, 2011 at 02:57:51PM -0500, Antonio Olivares wrote: But if you really, really need to read docx, you can try the web application from Microsoft. A few months ago, I got also a lot of docx and I opend it with the microsoft web app; this worked for me to extract the

Re: extracting text from docx files

2011-08-09 Thread Alejandro Imass
On Tue, Aug 9, 2011 at 3:57 PM, Antonio Olivares olivares14...@gmail.com wrote: But if you really, really need to read docx, you can try the web application from Microsoft. A few months ago, I got also a lot of docx and I opend it with the microsoft web app; this worked for me to extract the

Re: extracting text from docx files

2011-08-09 Thread Anton Shterenlikht
On Tue, Aug 09, 2011 at 02:57:51PM -0500, Antonio Olivares wrote: But if you really, really need to read docx, you can try the web application from Microsoft. A few months ago, I got also a lot of docx and I opend it with the microsoft web app; this worked for me to extract the

Re: extracting text from docx files

2011-08-09 Thread Antonio Olivares
Well.. I don't really want to install anything just to read docx. So probably something as small as possible. libreoffice (even if it's in ports, which I dearly love) looks like a monster of a package, so I'm not sure. Thanks anyway -- abiword is a word processor that opens docx files,

Re: extracting text from docx files

2011-08-09 Thread Warren Block
On Tue, 9 Aug 2011, Anton Shterenlikht wrote: Well.. I don't really want to install anything just to read docx. So probably something as small as possible. libreoffice (even if it's in ports, which I dearly love) looks like a monster of a package, so I'm not sure. Although still relatively

Re: extracting text from docx files

2011-08-09 Thread Chris Hill
On Tue, 9 Aug 2011, Anton Shterenlikht wrote: On Tue, Aug 09, 2011 at 02:57:51PM -0500, Antonio Olivares wrote: But if you really, really need to read docx, you can try the web application from Microsoft. A few months ago, I got also a lot of docx and I opend it with the microsoft web app;