I am sorry to say that I cannot see the attachment as I view the list through
Nabble so I need to ask a couple of questions please.

Firstly, are you absolutely certain that you have a binary Word document
file? Often other types of file are saved with a .doc extension and Word is
able to open them easilly and silently - html and rtf are the two obvious
candidates here. Also, if you have one of the nwere xml based files - with
an extension something like .docx - then you should use XWPF to process this
file rather than try to create a POIFSFileSystem object attached to it.

Secondly, have you tried to open the file using Word, resave it in the
binary format - as a .doc - and then try opening it using POI? It may be
that the application that created the file left some artifact that POI
cannot recognise and a simple resave will clear up this problem.

Finally, remember please that HWPF is immature and in need of further
development - the original developer had to discontinue work on it and
cannot now be approached for help and advice as he signed a non-disclosure
agreement with Microsoft. As a result HWPF has languished a little and, if
you are intending to use the library to perform extensive operations on Word
files, it may well be worthwhile considering joining the team of developers
working on POI to help with ongoing work.

Yours

Mark B


Bugzilla from [email protected] wrote:
> 
> https://issues.apache.org/bugzilla/show_bug.cgi?id=48854
> 
>            Summary: Word Document: Invalid Header Signature
>            Product: POI
>            Version: 3.5-FINAL
>           Platform: All
>         OS/Version: All
>             Status: NEW
>           Severity: critical
>           Priority: P2
>          Component: POIFS
>         AssignedTo: [email protected]
>         ReportedBy: [email protected]
> 
> 
> Hi there,
> 
> I am trying to extract the content of a word document by using
> POIFSFileSystem
> to open the document. 
> 
> POIFSFileSystem throws the following exception when trying to load the
> document:
> 
> Invalid header signature; read 0x615C316674725C7B, expected
> 0xE11AB1A1E011CFD0
> 
> I have attached the document which has the problem.
> 
> Please help resolve the issue.
> 
> Thanks and Regards,
> Gitu
> 
> -- 
> Configure bugmail:
> https://issues.apache.org/bugzilla/userprefs.cgi?tab=email
> ------- You are receiving this mail because: -------
> You are the assignee for the bug.
> 
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: [email protected]
> For additional commands, e-mail: [email protected]
> 
> 
> 

-- 
View this message in context: 
http://old.nabble.com/DO-NOT-REPLY--Bug-48854--New%3A-Word-Document%3A-Invalid-Header-Signature-tp27779216p27793265.html
Sent from the POI - Dev mailing list archive at Nabble.com.


---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to