Bug#842117: file: detect MacWrite II documents

2016-10-30 Thread Christoph Biedl
# upstream commit FILE5_29-3-gd49f629
tags 842117 pending
thanks

Paul Wise wrote...

> On Fri, 2016-10-28 at 22:43 +0200, Christoph Biedl wrote:
> 
> > Some research reveals this is actually Word for Mac versions 4 or 5.
> 
> I see, sorry for the mis-diagnosis, I was mislead by this site:
> 
> http://mark0.net/onlinetrid.aspx

No worries, even https://www.w3.org/History/1989/proposal.html was
a bit uncertain about it.

In the meantime, I gathered enough information about the old Word for
Macintosh files, fix is already upstream.

If there are some *actual* MacWrite II documents around, I'll be
happy to include support for these as well.

Christoph


signature.asc
Description: Digital signature


Bug#842117: file: detect MacWrite II documents

2016-10-28 Thread Paul Wise
On Fri, 2016-10-28 at 22:43 +0200, Christoph Biedl wrote:

> Some research reveals this is actually Word for Mac versions 4 or 5.

I see, sorry for the mis-diagnosis, I was mislead by this site:

http://mark0.net/onlinetrid.aspx

> The libreoffice suite can open this file, although the rendering looks
> a bit weird. I however failed to derive the file format from the
> code in sw/source/filter/ww8/ww8scan.cxx.

The WW8Fib::GetFIBVersion function looks interesting and matches the
first two bytes of the file. 

https://sources.debian.net/src/libreoffice/1:5.2.3~rc1-4/sw/source/filter/ww8/ww8scan.cxx/#L5414

Interestingly, the file referenced by that function *is* detected by
`file` as a 'Microsoft Office Document':

http://cmsdoc.cern.ch/documents/docformat/CMS_CERN_LetterHead.word

The WW8Fib::WW8Fib function looks to be the place where the header is
read out of the file:

https://sources.debian.net/src/libreoffice/1:5.2.3~rc1-4/sw/source/filter/ww8/ww8scan.cxx/#L5459

-- 
bye,
pabs

https://wiki.debian.org/PaulWise


signature.asc
Description: This is a digitally signed message part


Bug#842117: file: detect MacWrite II documents

2016-10-28 Thread Christoph Biedl
Christoph Biedl wrote...

> Um, I can certainly prepare something for upstream to include, however
> the sample you provided does not match the description:
> 
>   fe 37 00 1c 00 00 00 00  00 00 28 00 00 19 00 00  |.7(.|
> 0010  00 00 00 00 00 00 01 00  00 00 75 35 00 01 0b 3f  |..u5...?|
> (...)

Some research reveals this is actually Word for Mac versions 4 or 5.
The libreoffice suite can open this file, although the rendering looks
a bit weird. I however failed to derive the file format from the
code in sw/source/filter/ww8/ww8scan.cxx.

Christoph



signature.asc
Description: Digital signature


Bug#842117: file: detect MacWrite II documents

2016-10-28 Thread Christoph Biedl
tags 842117 confirmed upstream moreinfo
thanks

Paul Wise wrote...

> Please detect MacWrite II documents:
> 
> Name: MacWrite II document
> Sample: https://www.w3.org/History/1989/proposal
> Info: http://fileformats.archiveteam.org/wiki/MacWrite

Um, I can certainly prepare something for upstream to include, however
the sample you provided does not match the description:

  fe 37 00 1c 00 00 00 00  00 00 28 00 00 19 00 00  |.7(.|
0010  00 00 00 00 00 00 01 00  00 00 75 35 00 01 0b 3f  |..u5...?|
(...)

does not match the documents that state the first two octets should be
00 03. Can you shed some light on this?

Christoph


signature.asc
Description: Digital signature


Bug#842117: file: detect MacWrite II documents

2016-10-25 Thread Paul Wise
Package: file
Severity: wishlist

Please detect MacWrite II documents:

Name: MacWrite II document
Sample: https://www.w3.org/History/1989/proposal
Info: http://fileformats.archiveteam.org/wiki/MacWrite

-- 
bye,
pabs

https://wiki.debian.org/PaulWise


signature.asc
Description: This is a digitally signed message part