https://issues.apache.org/bugzilla/show_bug.cgi?id=47875

           Summary: reading word written in Chinese, paragraph nums is not
                    correct.
           Product: POI
           Version: 3.2-FINAL
          Platform: PC
        OS/Version: Windows XP
            Status: NEW
          Severity: normal
          Priority: P2
         Component: HWPF
        AssignedTo: [email protected]
        ReportedBy: [email protected]


FileInputStream fileIn = new FileInputStream("D:\\111.doc"); 

WordExtractor extractor = new WordExtractor(fileIn); 

String[] paras =extractor.getParagraphText(); 
System.out.println(paras.length); 


why the paragraph nums is not correct? Reading in English looks like no
problem. But my word is written in Chinese.

thanks!

-- 
Configure bugmail: https://issues.apache.org/bugzilla/userprefs.cgi?tab=email
------- You are receiving this mail because: -------
You are the assignee for the bug.

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to