On 2/9/07, Yegor Kozlov <[EMAIL PROTECTED]> wrote:

4010 is TextSpecInfoAtom. It stores special format runs that can't be
described by normal style records. For example,
if a part of a string is a hyperlink this info is stored in TextSpecInfoAtom.
AFAIK it has nothing to do with the language information.

Yegor

Quote from ppt spec downloaded from wotsit.org describing the
TextSpecInfoAtom (4010)

"The special info runs contained in this text. "Special infos" are
character properties that don't follow styles, such as background
spelling info or language ID. Special parsing code is needed to parse
content of this atom."

End quote.

This led me to believe that this atom holds the information I seek. It
is also the only place that language information in mentioned in the
spec. Even if I am wrong, is there any knowledge as to where language
ID for the text runs is held? Right now I am experimenting with
StyleTextPropAtom. I will get back with more info and hopefully more
precise questions.

Thanks for your time,
Chris Gioran

CG> Hi all,

CG> I am searching for a way to extract language information from .ppt
CG> documents that corresponds to specific bits of text. Googling up and
CG> searching both through scratchpad hslf code and file format specs from
CG> wotsit gave me no useful information. The thing i learned however is
CG> that it is altogether different from .doc documents, where i have
CG> successfully carried out this task in the past. Any pointers at all
CG> would be appreciated.
CG> In an attempt to be more specific, from my understanding, I need to grab
CG> record with type 4010, which the code at Sheet.java in method
CG> findTextRuns() just skips as "Safe to ignore". Is this stored and parsed
CG> somewhere that i missed, or must i do all the parsing myself? In such a
CG> case, could you please provide me with some pointers as to where to
CG> begin?

CG> Thanks in advance for any help.

CG> Chris Gioran

---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
Mailing List:     http://jakarta.apache.org/site/mail2.html#poi
The Apache Jakarta Poi Project:  http://jakarta.apache.org/poi/

Reply via email to