Hi, I have a Parent doc file with many attachments(children) into it. I need to extract text content of Parent doc file but do not need text extract of its children.
I have used AutoDetectParser.parse(inputStream, BodyContentHandler, metadata, ParseContext) method to extract text for Parent file. But the text extract has text of its children too, I do not want this. Has anyone done this before? If yes could you please provide me the code snippet? Regards, Shiv
