https://bugzilla.wikimedia.org/show_bug.cgi?id=33430
--- Comment #39 from Ziyuan Yao yaoziy...@gmail.com 2012-01-19 23:50:38 UTC
---
Have the render servers been updated yet? I still see Chinese lines that do not
take up a page's full width (there's much space left on each Chinese line's right side).
https://bugzilla.wikimedia.org/show_bug.cgi?id=33430
--- Comment #37 from Volker Haas volker.h...@pediapress.com 2012-01-13
14:08:02 UTC ---
I updated to the latest reportlab version. The problem with mixing cjk and non-cjk
text should be fixed. The render servers will be updated sometime next week.
https://bugzilla.wikimedia.org/show_bug.cgi?id=33430
--- Comment #38 from Ziyuan Yao yaoziy...@gmail.com 2012-01-13 14:25:05 UTC
---
Volker: Appreciate your hard work!
https://bugzilla.wikimedia.org/show_bug.cgi?id=33430
--- Comment #34 from Volker Haas volker.h...@pediapress.com 2012-01-12
13:04:48 UTC ---
I just found out that the latest reportlab version seems to handle non-cjk text
inside cjk text (with wordWrap='CJK') correctly. Installation of the newest
https://bugzilla.wikimedia.org/show_bug.cgi?id=33430
--- Comment #35 from Ziyuan Yao yaoziy...@gmail.com 2012-01-12 13:14:08 UTC
---
Great to hear that. Eager to see a sample PDF of your latest finding.
https://bugzilla.wikimedia.org/show_bug.cgi?id=33430
--- Comment #36 from Ziyuan Yao yaoziy...@gmail.com 2012-01-12 16:40:16 UTC
---
I confirm. I downloaded and installed the latest snapshot
reportlab-20120111203740 successfully and ran your test script. It does wrap
both CJK and Western text
https://bugzilla.wikimedia.org/show_bug.cgi?id=33430
Volker Haas volker.h...@pediapress.com changed:
What|Removed |Added
Status|NEW |RESOLVED
https://bugzilla.wikimedia.org/show_bug.cgi?id=33430
--- Comment #24 from Ziyuan Yao yaoziy...@gmail.com 2012-01-11 10:31:56 UTC
---
Great news. Can you give me a PDF that demonstrates your smallest-size spaces?
https://bugzilla.wikimedia.org/show_bug.cgi?id=33430
--- Comment #25 from Ziyuan Yao yaoziy...@gmail.com 2012-01-11 10:36:18 UTC
---
I like your solution for non-cjk wikis (using tiny spaces). But you didn't
mention what to do with cjk wikis. I assume you will use wordwrap=CJK for them,
right?
https://bugzilla.wikimedia.org/show_bug.cgi?id=33430
--- Comment #26 from Ziyuan Yao yaoziy...@gmail.com 2012-01-11 10:40:57 UTC
---
I just tried out your tiniest space concept in LibreOffice. Perfect!
Virtually invisible spaces! You're a genius. No need to show me the PDF now.
https://bugzilla.wikimedia.org/show_bug.cgi?id=33430
--- Comment #27 from Ziyuan Yao yaoziy...@gmail.com 2012-01-11 10:46:51 UTC
---
One more question: Your tiny-space idea is a universal solution that can also
apply to cjk wikis, because a cjk wiki can also contain Western words (which
better
https://bugzilla.wikimedia.org/show_bug.cgi?id=33430
--- Comment #28 from Volker Haas volker.h...@pediapress.com 2012-01-11
13:03:00 UTC ---
For cjk wikis the built-in cjk word wrapping of reportlab is used. This
probably breaks non-cjk text that is embedded...But I am pretty sure that at
least
https://bugzilla.wikimedia.org/show_bug.cgi?id=33430
--- Comment #29 from Ziyuan Yao yaoziy...@gmail.com 2012-01-11 14:07:10 UTC
---
First, using ReportLab's cjk wordwrap algorithm will break English words into
two lines. This is well demonstrated by your own test script.
Second, also
https://bugzilla.wikimedia.org/show_bug.cgi?id=33430
--- Comment #30 from Ziyuan Yao yaoziy...@gmail.com 2012-01-11 14:07:48 UTC
---
s/also/although
https://bugzilla.wikimedia.org/show_bug.cgi?id=33430
--- Comment #31 from Ziyuan Yao yaoziy...@gmail.com 2012-01-11 14:22:29 UTC
---
Here are two Wikipedia links that talk about the so-called CJK wordwrap rules:
https://bugzilla.wikimedia.org/show_bug.cgi?id=33430
--- Comment #32 from Ziyuan Yao yaoziy...@gmail.com 2012-01-12 04:32:58 UTC
---
Found a problem with the tiny space approach: Chinese characters don't take up
the full space of a line; there is still much space left on the right side of
each
https://bugzilla.wikimedia.org/show_bug.cgi?id=33430
--- Comment #33 from Ziyuan Yao yaoziy...@gmail.com 2012-01-12 05:07:13 UTC
---
If we can't easily modify ReportLab to distinguish tiny space widths from
normal space widths, I'd rather see this arrangement:
For non-cjk wikis, insert a
https://bugzilla.wikimedia.org/show_bug.cgi?id=33430
Volker Haas volker.h...@pediapress.com changed:
What|Removed |Added
CC|
https://bugzilla.wikimedia.org/show_bug.cgi?id=33430
--- Comment #14 from Volker Haas volker.h...@pediapress.com 2012-01-10
14:46:35 UTC ---
Created attachment 9833
-- https://bugzilla.wikimedia.org/attachment.cgi?id=9833
test script for linebreak check for mixed cjk and non-cjk text
https://bugzilla.wikimedia.org/show_bug.cgi?id=33430
--- Comment #15 from Ziyuan Yao yaoziy...@gmail.com 2012-01-10 15:36:52 UTC
---
First, I don't have MediaWiki installed on my computer so I can't run your test
script.
If ReportLab doesn't support line wrapping for mixed cjk and non-cjk text
https://bugzilla.wikimedia.org/show_bug.cgi?id=33430
--- Comment #16 from Ziyuan Yao yaoziy...@gmail.com 2012-01-10 15:44:17 UTC
---
The line-wrapping rule for CJK/non-CJK mixed text is actually very simple: You
should either wrap the line at a whitespace (as in a Western text), or after a
CJK
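The rule in comment #16 can be sketched roughly as follows. This is only an illustration, assuming the truncated sentence continues with "CJK character". The helper `is_cjk`, its code point ranges, and `break_points` are hypothetical names and are not part of ReportLab.

```python
def is_cjk(ch):
    """Rough check (assumed ranges): CJK ideographs, Hiragana, Katakana."""
    cp = ord(ch)
    return (
        0x4E00 <= cp <= 0x9FFF      # CJK Unified Ideographs
        or 0x3400 <= cp <= 0x4DBF   # CJK Extension A
        or 0x3040 <= cp <= 0x30FF   # Hiragana and Katakana
    )


def break_points(text):
    """Return every index i where a line may break just before text[i].

    A break is allowed after whitespace (as in Western text)
    or after a CJK character.
    """
    points = []
    for i in range(1, len(text)):
        prev = text[i - 1]
        if prev.isspace() or is_cjk(prev):
            points.append(i)
    return points


if __name__ == "__main__":
    print(break_points("ab 中文c"))
```

A real implementation would also have to respect CJK punctuation rules (for example, not starting a line with a closing bracket), which this sketch ignores.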
https://bugzilla.wikimedia.org/show_bug.cgi?id=33430
--- Comment #17 from Ziyuan Yao yaoziy...@gmail.com 2012-01-10 15:51:36 UTC
---
OK, now I installed python-reportlab in my Fedora 16 and can run your test
script. I understand your problem. I'll test if I can insert U+200B after every
CJK
https://bugzilla.wikimedia.org/show_bug.cgi?id=33430
--- Comment #18 from Ziyuan Yao yaoziy...@gmail.com 2012-01-10 16:05:04 UTC
---
OK. I tried. U+200B doesn't work with ReportLab:
p1 =
https://bugzilla.wikimedia.org/show_bug.cgi?id=33430
--- Comment #19 from Ziyuan Yao yaoziy...@gmail.com 2012-01-10 16:08:13 UTC
---
I'll write to ReportLab's mailing list, suggesting that they create a new
wordWrap option, "mixed", so that ReportLab can directly support wrapping mixed
text.
https://bugzilla.wikimedia.org/show_bug.cgi?id=33430
--- Comment #20 from Ziyuan Yao yaoziy...@gmail.com 2012-01-11 03:34:56 UTC
---
ReportLab says working on this problem is not their priority. So I'm trying to
fix it personally in their source code.
I found (and they told me) their source
https://bugzilla.wikimedia.org/show_bug.cgi?id=33430
--- Comment #21 from Ziyuan Yao yaoziy...@gmail.com 2012-01-11 04:26:15 UTC
---
Cite http://en.wikipedia.org/wiki/TCPDF :
TCPDF is currently the only PHP-based library that includes complete support
for UTF-8 Unicode and right-to-left
https://bugzilla.wikimedia.org/show_bug.cgi?id=33430
--- Comment #22 from Ziyuan Yao yaoziy...@gmail.com 2012-01-11 06:11:47 UTC
---
OK, Volker Haas, I have come up with a simple way to fix all this:
We will first determine whether a wiki page is mostly Western (then we'll use
wordwrap=Western)
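The first step of the proposal in comment #22, deciding whether a page is mostly Western, could be approximated with a character-ratio heuristic. This is a sketch under stated assumptions: the 0.5 threshold, the code point ranges, and the names `is_cjk` and `choose_wordwrap` are all hypothetical, and the returned strings are just the labels used in this thread, not confirmed ReportLab option values.

```python
def is_cjk(ch):
    """Rough check (assumed ranges): CJK ideographs, Hiragana, Katakana."""
    cp = ord(ch)
    return (
        0x4E00 <= cp <= 0x9FFF
        or 0x3400 <= cp <= 0x4DBF
        or 0x3040 <= cp <= 0x30FF
    )


def choose_wordwrap(text, threshold=0.5):
    """Return "CJK" if more than `threshold` of the letters are CJK,
    otherwise "Western". The threshold is an arbitrary assumption."""
    letters = [ch for ch in text if ch.isalpha()]
    if not letters:
        return "Western"
    cjk = sum(1 for ch in letters if is_cjk(ch))
    return "CJK" if cjk / len(letters) > threshold else "Western"


if __name__ == "__main__":
    print(choose_wordwrap("Hello world"))
    print(choose_wordwrap("中文维基百科 abc"))
```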
https://bugzilla.wikimedia.org/show_bug.cgi?id=33430
--- Comment #10 from Ziyuan Yao yaoziy...@gmail.com 2012-01-02 11:10:35 UTC
---
It seems currently MediaWiki's Collection extension uses the ReportLab PDF
library to render PDF files
https://bugzilla.wikimedia.org/show_bug.cgi?id=33430
--- Comment #11 from Ziyuan Yao yaoziy...@gmail.com 2012-01-02 11:19:32 UTC
---
On ReportLab's Samples page
(http://www.reportlab.com/software/documentation/rml-samples/), there is a
test_031_japanese.pdf
https://bugzilla.wikimedia.org/show_bug.cgi?id=33430
--- Comment #12 from Ziyuan Yao yaoziy...@gmail.com 2012-01-02 12:05:17 UTC
---
Good news, everybody! The solution to this problem has been given by
ReportLab's personnel, as follows:
On 2 January 2012 11:33, Yao Ziyuan yaoziy...@gmail.com
https://bugzilla.wikimedia.org/show_bug.cgi?id=33430
--- Comment #3 from Ziyuan Yao yaoziy...@gmail.com 2011-12-30 11:47:20 UTC ---
Enabling the Chinese Wikipedia to provide ebook creation properly can help
spread Wikipedia knowledge in China freely.
https://bugzilla.wikimedia.org/show_bug.cgi?id=33430
Mark A. Hershberger m...@everybody.org changed:
What|Removed |Added
Priority|Unprioritized |High
https://bugzilla.wikimedia.org/show_bug.cgi?id=33430
Christoph Kepper christoph.kep...@pediapress.com changed:
What|Removed |Added
CC|
https://bugzilla.wikimedia.org/show_bug.cgi?id=33430
--- Comment #5 from Ziyuan Yao yaoziy...@gmail.com 2011-12-30 21:08:24 UTC ---
(In reply to comment #4)
Fixing this bug will probably be only a partial success. About 18 months ago we
(PediaPress) were experimenting a little bit with Japanese,
https://bugzilla.wikimedia.org/show_bug.cgi?id=33430
--- Comment #6 from Ziyuan Yao yaoziy...@gmail.com 2011-12-30 21:31:07 UTC ---
I have played around with some Chinese pages on mediawiki.org and so far the
only problem I have seen is no line wrapping. I don't see problems you
mentioned like
https://bugzilla.wikimedia.org/show_bug.cgi?id=33430
--- Comment #7 from Ziyuan Yao yaoziy...@gmail.com 2011-12-30 21:42:48 UTC ---
Although Chinese and Japanese don't use spaces to separate words, you can
think of there being an invisible space before and after every
Chinese/Japanese
https://bugzilla.wikimedia.org/show_bug.cgi?id=33430
--- Comment #8 from Ziyuan Yao yaoziy...@gmail.com 2011-12-30 21:46:06 UTC ---
Either of the above two rule sets can solve the line wrapping problem, although
in the long run I recommend using a mature HTML-to-PDF library instead of
reinventing
https://bugzilla.wikimedia.org/show_bug.cgi?id=33430
--- Comment #9 from Ziyuan Yao yaoziy...@gmail.com 2011-12-31 03:46:09 UTC ---
I just did a little research on what FOSS PDF libraries are available. Here's a
good list:
http://en.wikipedia.org/wiki/List_of_PDF_software#Development_libraries
https://bugzilla.wikimedia.org/show_bug.cgi?id=33430
p858snake p858sn...@gmail.com changed:
What|Removed |Added
CC|
https://bugzilla.wikimedia.org/show_bug.cgi?id=33430
shi zhao shiz...@gmail.com changed:
What|Removed |Added
Keywords||accessibility
https://bugzilla.wikimedia.org/show_bug.cgi?id=33430
Benjamin Chen cn.chenmi...@gmail.com changed:
What|Removed |Added
CC|
https://bugzilla.wikimedia.org/show_bug.cgi?id=33430
--- Comment #2 from Ziyuan Yao yaoziy...@gmail.com 2011-12-30 07:58:28 UTC ---
Benjamin Chen: AFAIK, only Chinese and Japanese apply. Korean uses square-like
characters but it does have spaces between words.