Hi Albert, and Zhang Peng, On 10/11/2008, at 6:39 AM, Albert Astals Cid wrote:
A Diumenge 09 Novembre 2008, zhang peng va escriure:I have set CMap files to the poppler data directory. It seems that the font can't display! The file can't be displayed! Thanks!Poppler 0.10.0 works here.
pdftotext worked fine for me,
both with Poppler v0.8.2 and Poppler v0.10.0
However there were problems with readme.pdf
when using other software.
e.g., Adobe Reader v8.1.0 and v9.0.0
both showed just blank pages;
Adobe Acrobat Pro v8.1.2
displayed the PDF just fine
Preview (MacOS X, v10.4.11)
displayed the PDF just fine
pdftohtml translated the PDF to a 2-page HTML, with frames
*but* there were some errors:
(files attached)
readme.html has junk in the <TITLE> tag
looks like a 2-byte sequence hasn't been correctly
translated into UTF-8
readmes.html has junk in both <LI> tags
again it looks like some bytes (for page titles?)
were not properly translated to UTF8.
Title: ��
<FRAMESET cols="100,*"> <FRAME name="links" src="readme_ind.html"> <FRAME name="contents" src="readmes.html"> </FRAMESET>file:///C|/Documents and Settings/bob/æ¡é¢/readme.txt
å åµPDFçæ¹æ¡æ¯è¾ï¼
ç®åæäºè§£å°å ç±»å¯ä¾æä»¬ç´æ¥ä½¿ç¨çPDFæä»¶ï¼æåã产åï¼ä¸»è¦æä¸¤ä¸ªï¼
1ã YCanPDFï¼
YCanPDFæ¯å½å å·¥ä½å®¤å¼åçä¸ä¸ªPDFæä»¶ï¼ç½åæ¯ï¼http://www.ycanpdf.cn/ ï¼åºæ¬åè½æ¯ï¼
1. æ¯ææ¥æ¾ã缩æ¾ãæè½¬ãé¼ æ æå¨ãå页å¤é¡µåæ¢ãç®å½ãæå°çåè½ï¼
2.æ¯æå ååURLå½¢å¼çPDFæä»¶ï¼
3.æ¯æWEBè°ç¨ï¼æ éå®è£ ï¼ç´æ¥éè¿æµè§å¨é 读PDFæä»¶ï¼
4.æ¯æä¸æä»¥åå å¯(å æ¬è¯ä¹¦å å¯)çPDFï¼
5.æ§ä»¶å¯ç¬ç«è¿è¡ï¼æ éä»»ä½å ¶ä»ç¯å¢æ¯æã
ææ¢ç´¢ååç°çç¹ç¹æ¯ï¼
1ã ç¬¬ä¸æ¬¡å¦ææå¼çæ¯æä¸ªæä»¶ï¼å¯è½ä¼ææ¶æ¾ç¤ºä¸æ£å¸¸ï¼ä½ä»¥å就好äºãââç¨³å®æ§å¯è½
ç¨æä¸è¶³ã
2ã å½äººå¼åãææ¡£ã交æµå®¹æã
3ã åè½ç®æ´ç²¾æï¼åºæ¬åè½é½æäºã
4ã ä»·æ ¼å¨5000-8000ï¼å¯æä¾å°éçäºæ¬¡å¼åã
5ã 估计åçæµï¼è¿ä¸ªç³»ç»ä¹æ¯åºäºä¸äºå¼æºè½¯ä»¶ä¿®æ¹èæ¥ï¼å 为ç®åæ®äºè§£ï¼PDFçè§£æé¤äº
å¤§å ¬å¸å¤ï¼é½æ¯åºäºä¸äºåºä¿®æ¹èæ¥ã
2ã FoxitPDFï¼
FoxitPDFæ¯å½å¤ç䏿¬¾PDFæä»¶ï¼æ®è¯´å¾®è½¯çä¹é½å¨ç¨å®ãåºæ¬ç¹ç¹åæå¦ä¸ï¼
1ã ç¹å«ç¨³å®ï¼æ®è¯´æ¯å¯ä»¥å®å ¨æ¿ä»£Adobe PDFç䏿¬¾è½¯ä»¶
2ã é«ç«¯äº§åï¼è´¨éå¯é ï¼æä»¥å®¢æ·ä¼å¤ã
3ã åè½é常é½å ¨ï¼çè³å¯ä»¥æ¯æææºãåµå ¥å¼çå¹³å°ã
4ã 缺ç¹ä¹æ¯è¾ææ¾ï¼ä»·æ ¼è¾é«ï¼å ·ä½è¿æ²¡è°å®ï¼é®ä»¶åéåååºç¼æ ¢ï¼ä¼¼ä¹å½å ç代çè¿éè¦
询é®å½å¤æ»å ¬å¸ï¼é®ç®±æ¯æ»å ¬å¸çé®ä»¶ãä¼°è®¡ä»·æ ¼è¦1000ç¾å 以ä¸ã
5ã æå¡æ¯æå¯è½ååºç¼æ ¢ï¼ç¼ºå°ä¸æææ¡£ã
è¿æä¸ç§æ¹æ¡æ¯ï¼åºäºxPDFæè sumatrapdfè¿è¡èªå·±çå¼åï¼åæå¦ä¸ï¼
1ã sumatrapdfä»ä»¬æä¸å®çç¨³å®æ§ï¼ä½ä¼¼ä¹ç¨³å®æ§ä¸å¦YCanPDFï¼å 为YCanPDF对æµè¯çPDFæ
ä»¶åªæ¯ç¬¬ä¸æ¬¡æ¾ç¤ºä¸æ£å¸¸ï¼ä»¥åå°±æ£å¸¸äºãä½sumatrapdfä¸ç´æ æ³æ£å¸¸æ¾ç¤ºç¬¬17页ãæç对æäºæ
ä»¶ï¼ç¹å«æ¯ä¸ææä»¶ä¸å¾å ¼å®¹ã
2ã å¼å卿å¯è½è¾é¿ï¼çº¦é1ï¼2个æçæ¶é´ï¼ç¶åè¿éæµè¯å稳å®ãæä¸å®çå¼åé£é©ã
3ã ä¼ç¹æ¯ï¼ææºç ï¼å¯æ§å¶ã以åä¸ååéå¶ã
è¡¥å ï¼
YCanPDF
å¦ææ¯æ¬å°ä½¿ç¨æ§ä»¶ï¼éè¦çå¤é¨èµæºå°±ä¸ç¨ä»ç½ç»ä¸è½½äºï¼å¯ä»¥ç´æ¥åæ§ä»¶ä¸èµ·å®è£ ï¼å°±ä¸ä¼åºç°
ä¹±ç é®é¢äºã
Foxit
file:///C|/Documents and Settings/bob/æ¡é¢/readme.txtï¼ç¬¬ 1ï¼2 页ï¼2008-10-24 13:43:07
file:///C|/Documents and Settings/bob/æ¡é¢/readme.txt
缺ç¹ï¼æ æ³è§£æè¯ä¹¦å å¯çPDFæä»¶ã
ä¼ç¹ï¼æ¾ç¤ºæçæ¯è¿ä¸ä¸ªäº§å䏿好çï¼å ¬å¸ææ¯å®åå¾å¼ºã
Sumatrapdf
è¿æ¯ä¸ªçº¯ç²¹ç西æ¹è¯ç³»äº§åï¼æ²¡æCJKï¼ä¸æ¥é©è¯è¨ï¼å¤çæºå¶ï¼ä»ææ¯ä¸æ¥è¯´å°±æ¯å 鍿²¡ææ¯æCJKï¼
å¯¹äºæ²¡æå åµç䏿PDFï¼æ¾ç¤ºæ¯ä¹±ç ãï¼ä½æ¯å¯ä»¥æ¾ç¤ºå åµç䏿åä½ï¼è¦å¢å å®å ¨çä¸ææ¯æï¼é
è¦ææ¡PDFå é¨å¯¹type1ãtruetypeçå¤ç§åä½çå¤çæºå¶ï¼è¿è¦ææ¡ç´æ¥ä»å使件ç»å¶åä½çæ¹æ³ï¼
å 为PDFæ¾ç¤ºåä½ä¸æ¯ç¨windows apiï¼èæ¯ç´æ¥ä»åä½æä»¶è§£æï¼é¾åº¦æ¯è¾å¤§ã
file:///C|/Documents and Settings/bob/æ¡é¢/readme.txtï¼ç¬¬ 2ï¼2 页ï¼2008-10-24 13:43:07
Document Outline
- þÿg,W0xÁvØ
The bulk of the content in readmes.html looks fine. So it seems that the titles are not being subjected to the same translation routines as is the body of the document. Is this related to the CMap resources? Quite possibly.
Or at least seems to work as i don't know if the printed chinese charactersmake sense or not.
I've no reason to believe that there's any problem with these. It's just that there are some bytes that should have been translated to chinese characters, but were not.
Albert
I hope this report encourages someone to take a closer look
at pdftohtml .
Cheers,
Ross
------------------------------------------------------------------------
Ross Moore [EMAIL PROTECTED]
Mathematics Department office: E7A-419
Macquarie University tel: +61 (0)2 9850 8955
Sydney, Australia 2109 fax: +61 (0)2 9850 8114
------------------------------------------------------------------------
_______________________________________________ poppler mailing list [email protected] http://lists.freedesktop.org/mailman/listinfo/poppler
