Record type 4010 is TextSpecInfoAtom. It stores special formatting runs that
can't be described by the normal style records. For example,
if part of a string is a hyperlink, that information is stored in TextSpecInfoAtom.
AFAIK it has nothing to do with language information.
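
If you do end up inspecting the raw bytes yourself, here is a minimal sketch of
how records of a given type could be located. It assumes the usual 8-byte
PowerPoint record header (2 bytes version/instance, 2 bytes type, 4 bytes
length, all little-endian) and a flat run of sibling records. It does not
descend into containers. RecordScanner, RecordRef and findRecords are
hypothetical names made up for this example, not part of the POI API.

```java
import java.nio.ByteBuffer;
import java.nio.ByteOrder;
import java.util.ArrayList;
import java.util.List;

class RecordScanner {
    /** Record type 4010 (0x0FAA), i.e. TextSpecInfoAtom. */
    static final int TEXT_SPEC_INFO_ATOM = 4010;

    /** Hypothetical holder: where a record's payload starts and how long it is. */
    static final class RecordRef {
        final int type;
        final int offset;  // offset of the payload, just past the 8-byte header
        final int length;  // payload length in bytes

        RecordRef(int type, int offset, int length) {
            this.type = type;
            this.offset = offset;
            this.length = length;
        }
    }

    /**
     * Walks a flat sequence of sibling records and collects those whose
     * type matches wantedType. Assumption: every record starts with the
     * 8-byte little-endian header described above.
     */
    static List<RecordRef> findRecords(byte[] data, int wantedType) {
        List<RecordRef> hits = new ArrayList<>();
        ByteBuffer buf = ByteBuffer.wrap(data).order(ByteOrder.LITTLE_ENDIAN);
        int pos = 0;
        while (pos + 8 <= data.length) {
            int type = buf.getShort(pos + 2) & 0xFFFF;       // unsigned 16-bit type
            long len = buf.getInt(pos + 4) & 0xFFFFFFFFL;    // unsigned 32-bit length
            if (len > data.length - pos - 8) {
                break; // truncated or corrupt record, stop scanning
            }
            if (type == wantedType) {
                hits.add(new RecordRef(type, pos + 8, (int) len));
            }
            pos += 8 + (int) len; // jump to the next sibling record
        }
        return hits;
    }
}
```

Calling RecordScanner.findRecords(bytes, RecordScanner.TEXT_SPEC_INFO_ATOM)
on the bytes of a text container's children would give you the payload offsets
to start decoding from. What the payload actually contains is something you
would still need to check against the file format documentation.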

Yegor

CG> Hi all,

CG> I am searching for a way to extract the language information in .ppt
CG> documents that corresponds to specific bits of text. Googling and
CG> searching through both the scratchpad HSLF code and the file format
CG> specs from wotsit turned up nothing useful. The one thing I did learn,
CG> however, is that it is altogether different from .doc documents, where
CG> I have successfully carried out this task in the past. Any pointers at
CG> all would be appreciated.
CG> To be more specific: as I understand it, I need to grab the record
CG> with type 4010, which findTextRuns() in Sheet.java just skips as
CG> "Safe to ignore". Is this record stored and parsed somewhere that I
CG> missed, or must I do all the parsing myself? If so, could you please
CG> give me some pointers on where to begin?

CG> Thanks in advance for any help.

CG> Chris Gioran


CG> ---------------------------------------------------------------------
CG> To unsubscribe, e-mail: [EMAIL PROTECTED]
CG> Mailing List:     http://jakarta.apache.org/site/mail2.html#poi
CG> The Apache Jakarta Poi Project:  http://jakarta.apache.org/poi/


