Yes I tried using stripFields(). It strips some part of the unwanted text
(with the EMBED tag), but some part still remains.

I suspect the problem might be with the encoding format of the "embedded
object strings" (the ones starting with EMBED tag and ending with embedded
doc's progID).

The stripFields() does not strip all of the encoded text.


Regards
Som Ranjan


On Tue, Apr 28, 2009 at 2:44 PM, Nick Burch <[email protected]> wrote:

> On Tue, 28 Apr 2009, Som Satpathy wrote:
>
>> Is there any way by which we can *exclude* embedded ole information which
>> we
>> get on calling *wordExtractor.getText() ?
>>
>
> Did you try stripFields?
>
> http://poi.apache.org/apidocs/org/apache/poi/hwpf/extractor/WordExtractor.html#stripFields(java.lang.String)<http://poi.apache.org/apidocs/org/apache/poi/hwpf/extractor/WordExtractor.html#stripFields%28java.lang.String%29>
>
> Nick
>
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: [email protected]
> For additional commands, e-mail: [email protected]
>
>

Reply via email to