Re: [CODE4LIB] Unicode font for PDF generation?

2012-03-18 Thread Andrew Cunningham
There are no pan Unicode fonts. Last one I saw was for Unicode 2.0

There is a limit to the number of glyphs a font can contain.

It is possible to create a subset of unicode and place it in a single font,
but you need to be able to identify your current and future character
requirements.

But not sure why you need a single font, unless your xml to pdf conversion
can't process stylesheets.

Andrew

On Saturday, 17 March 2012, Mark Redar mark.re...@ucop.edu wrote:
 Hi All,

 We're having some fun with unicode characters in PDF generation. We have
a process that automatically generates a pdf from XML input. The tool stack
doesn't support multiple fonts for displaying different codepoints so we
need a good pan-unicode font to bundle with the pdfs.

 Currently, we use the DejaVu font family for creating the pdfs. This has
good coverage for latin  cyrillic characters but has no CJK
(chinese-japanese-korean) coverage. We've looked into licensing a
commercial fonts, but for web server use these require annual licensing
fees that are substantial (in the thousands of $).
 A number of our source documents contain CJK characters and some
contributors have noticed the lack of support for these characters.

 Does anyone know of a good pan-unicode free font that includes CJK
codepoints that looks good? Gnu unifont has the coverage, but it is not the
best looking font.

 Barring that, we're thinking of rolling our own pan-unicode font. There
are good open source fonts for portions of the unicode character sets.
We're hoping to find some way to take a number of open source fonts and
combine them into one large pan-unicode font.

 Does anyone have experience with font authoring and merging different
fonts?

 It looks as though FontForge can merge fonts, but it's not clear how to
deal with overlapping codepoints in the merged fonts.

 Thanks,

 Mark


-- 
Andrew Cunningham
Senior Project Manager, Research and Development
Vicnet
State Library of Victoria
Australia

andr...@vicnet.net.au
lang.supp...@gmail.com


Re: [CODE4LIB] Unicode font for PDF generation?

2012-03-18 Thread Andrew Cunningham
A couple of additional thoughts:

• The most complete cjk font projects require 2 fonts to handle all cjk
characters
• There are language specific glyph variations between chinese and
japanese, so ideal situation is to use diffetent fonts tailored for each
On Saturday, 17 March 2012, Mark Redar mark.re...@ucop.edu wrote:
 Hi All,

 We're having some fun with unicode characters in PDF generation. We have
a process that automatically generates a pdf from XML input. The tool stack
doesn't support multiple fonts for displaying different codepoints so we
need a good pan-unicode font to bundle with the pdfs.

 Currently, we use the DejaVu font family for creating the pdfs. This has
good coverage for latin  cyrillic characters but has no CJK
(chinese-japanese-korean) coverage. We've looked into licensing a
commercial fonts, but for web server use these require annual licensing
fees that are substantial (in the thousands of $).
 A number of our source documents contain CJK characters and some
contributors have noticed the lack of support for these characters.

 Does anyone know of a good pan-unicode free font that includes CJK
codepoints that looks good? Gnu unifont has the coverage, but it is not the
best looking font.

 Barring that, we're thinking of rolling our own pan-unicode font. There
are good open source fonts for portions of the unicode character sets.
We're hoping to find some way to take a number of open source fonts and
combine them into one large pan-unicode font.

 Does anyone have experience with font authoring and merging different
fonts?

 It looks as though FontForge can merge fonts, but it's not clear how to
deal with overlapping codepoints in the merged fonts.

 Thanks,

 Mark


-- 
Andrew Cunningham
Senior Project Manager, Research and Development
Vicnet
State Library of Victoria
Australia

andr...@vicnet.net.au
lang.supp...@gmail.com


Re: [CODE4LIB] Unicode font for PDF generation?

2012-03-18 Thread Mark A. Matienzo
On Fri, Mar 16, 2012 at 6:13 PM, Mark Redar mark.re...@ucop.edu wrote:

 Currently, we use the DejaVu font family for creating the pdfs. This has good 
 coverage for latin  cyrillic characters but has no CJK 
 (chinese-japanese-korean) coverage. We've looked into licensing a commercial 
 fonts, but for web server use these require annual licensing fees that are 
 substantial (in the thousands of $).
 A number of our source documents contain CJK characters and some contributors 
 have noticed the lack of support for these characters.

If you're looking for CJK support in a FLOSS font, I've heard the best
option in terms of CJK coverage are the WenQuanYi [0] fonts, which
have full coverage (if I recall correctly).

As Andrew suggests, you can combine a number of fonts as well.

[0] http://en.wikipedia.org/wiki/WenQuanYi

Mark


Re: [CODE4LIB] Unicode font for PDF generation?

2012-03-18 Thread Andrew Cunningham
For additional CJKV fonts look at:

http://en.wikipedia.org/wiki/List_of_CJK_fonts






-- 
Andrew Cunningham
Senior Project Manager, Research and Development
Vicnet
State Library of Victoria
Australia

andr...@vicnet.net.au
lang.supp...@gmail.com


Re: [CODE4LIB] Learning Microsoft SQL

2012-03-18 Thread David Friggens
 I generally find the w3schools stuff a pretty good starting point to help 
 wrap my head around something I don't know:

I've used w3schools a fair bit in the past too, as they rank pretty
highly in Google, but have recently been made aware of advice that
they aren't necessarily to be relied on, e.g.
http://w3fools.com/

David