jinjiankang wrote: > There are many excellent open source third party(for example iText), each > publishes some E-documents freely:Tutorial.pdf, DevelopersGuide.pdf, > Spec.pdf and so on. Why not developing a tool that would translate RAPIDLY > those *.pdf in Enghish to Chinese.pdf, French.pdf, Korea.pdf ... with the > help of Google AJAX Language API? Indeed, the translating is NOT accurately. > > After contacting corresponding third party charger, I think legal issues are > not obstacle, and it can help enlarging power of influence for third party. > > I'm looking forwards to your reply.
Legal and translation problems set aside, then you still have the first obstacle: how to extract the text from the existing PDFs. I my book, I recommend PdfBox and maybe JPedal can help you get a long way to, but... due to the nature of PDF there are limitations: text in a PDF is 'drawn' on a page. The concept of paragraphs, chapters, sections, tables,... is lost. How are you going to solve that? br, Bruno ------------------------------------------------------------------------- This SF.net email is sponsored by: Microsoft Defy all challenges. Microsoft(R) Visual Studio 2008. http://clk.atdmt.com/MRT/go/vse0120000070mrt/direct/01/ _______________________________________________ iText-questions mailing list iText-questions@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/itext-questions Do you like iText? Buy the iText book: http://www.1t3xt.com/docs/book.php Or leave a tip: https://tipit.to/itexttipjar