Record type 4010 is TextSpecInfoAtom. It stores special format runs that can't be described by normal style records. For example, if part of a string is a hyperlink, that information is stored in TextSpecInfoAtom. AFAIK it has nothing to do with language information.
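If you do want to look at the raw record yourself, here is a rough sketch of how one could locate it by type. It is not POI code: the class name RecordHeaderSketch and both methods are made up for illustration. It assumes the usual 8-byte PowerPoint record header (2 bytes version/instance, 2 bytes type, 4 bytes length, all little-endian) and only walks sibling records in one byte range, without descending into containers.

```java
import java.nio.ByteBuffer;
import java.nio.ByteOrder;
import java.util.ArrayList;
import java.util.List;

// Hypothetical helper, not part of POI.
final class RecordHeaderSketch {
    static final int TEXT_SPEC_INFO_ATOM = 4010; // 0x0FAA

    // Returns the start offset of every sibling record of the wanted type.
    static List<Integer> findRecords(byte[] data, int wantedType) {
        List<Integer> offsets = new ArrayList<>();
        ByteBuffer buf = ByteBuffer.wrap(data).order(ByteOrder.LITTLE_ENDIAN);
        int pos = 0;
        while (pos + 8 <= data.length) {
            int type = buf.getShort(pos + 2) & 0xFFFF;     // bytes 2-3: record type
            long len = buf.getInt(pos + 4) & 0xFFFFFFFFL;  // bytes 4-7: payload length
            if (type == wantedType) {
                offsets.add(pos);
            }
            if (len > data.length - pos - 8) {
                break; // truncated or corrupt record, stop walking
            }
            pos += 8 + (int) len; // skip header plus payload
        }
        return offsets;
    }

    // Builds a record with an empty version/instance field, for experiments.
    static byte[] record(int type, byte[] payload) {
        return ByteBuffer.allocate(8 + payload.length)
                .order(ByteOrder.LITTLE_ENDIAN)
                .putShort((short) 0)
                .putShort((short) type)
                .putInt(payload.length)
                .put(payload)
                .array();
    }
}
```

Calling RecordHeaderSketch.findRecords(bytes, RecordHeaderSketch.TEXT_SPEC_INFO_ATOM) on the bytes of a text container's children would give you the offsets to start decoding from. Interpreting the payload is a separate job and is not covered here.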

Yegor

CG> Hi all,
CG> I am searching for a way to extract language information from .ppt
CG> documents that corresponds to specific bits of text. Googling up and
CG> searching both through scratchpad hslf code and file format specs from
CG> wotsit gave me no useful information. The thing i learned however is
CG> that it is altogether different from .doc documents, where i have
CG> successfully carried out this task in the past. Any pointers at all
CG> would be appreciated.
CG> In an attempt to be more specific, from my understanding, I need to grab
CG> record with type 4010, which the code at Sheet.java in method
CG> findTextRuns() just skips as "Safe to ignore". Is this stored and parsed
CG> somewhere that i missed, or must i do all the parsing myself? In such a
CG> case, could you please provide me with some pointers as to where to
CG> begin?
CG> Thanks in advance for any help.
CG> Chris Gioran

---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
Mailing List: http://jakarta.apache.org/site/mail2.html#poi
The Apache Jakarta Poi Project: http://jakarta.apache.org/poi/
