On 2/9/07, Yegor Kozlov <[EMAIL PROTECTED]> wrote:
4010 is TextSpecInfoAtom. It stores special format runs that can't be described by normal style records. For example, if a part of a string is a hyperlink this info is stored in TextSpecInfoAtom. AFAIK it has nothing to do with the language information. Yegor
Quote from ppt spec downloaded from wotsit.org describing the TextSpecInfoAtom (4010) "The special info runs contained in this text. "Special infos" are character properties that don't follow styles, such as background spelling info or language ID. Special parsing code is needed to parse content of this atom." End quote. This led me to believe that this atom holds the information I seek. It is also the only place that language information in mentioned in the spec. Even if I am wrong, is there any knowledge as to where language ID for the text runs is held? Right now I am experimenting with StyleTextPropAtom. I will get back with more info and hopefully more precise questions. Thanks for your time, Chris Gioran
CG> Hi all, CG> I am searching for a way to extract language information from .ppt CG> documents that corresponds to specific bits of text. Googling up and CG> searching both through scratchpad hslf code and file format specs from CG> wotsit gave me no useful information. The thing i learned however is CG> that it is altogether different from .doc documents, where i have CG> successfully carried out this task in the past. Any pointers at all CG> would be appreciated. CG> In an attempt to be more specific, from my understanding, I need to grab CG> record with type 4010, which the code at Sheet.java in method CG> findTextRuns() just skips as "Safe to ignore". Is this stored and parsed CG> somewhere that i missed, or must i do all the parsing myself? In such a CG> case, could you please provide me with some pointers as to where to CG> begin? CG> Thanks in advance for any help. CG> Chris Gioran
--------------------------------------------------------------------- To unsubscribe, e-mail: [EMAIL PROTECTED] Mailing List: http://jakarta.apache.org/site/mail2.html#poi The Apache Jakarta Poi Project: http://jakarta.apache.org/poi/
