Hi Richard,

Welcome to the list.

Richard Kelly <[email protected]> wrote on 03/24/2009 06:24:55 AM:

> Hi there,
>
>    I'm a Masters student in Australia and I'm interested
> in participating with Xerces as part of the Google Summer of
> Code.  I'm interested in working on the unicode normalization
> proposal that you've listed on the ideas page.
>
> I have some exposure to unicode normalization (i've written programs
> that needed to compose/decompose korean unicode characters:
> hangul <-> jamo) and I found it quite interesting and would like to learn
> more.  So this would be ideal for me.  I deal with Java and XML parsers
> regularly in my studies and looking at the codebase I think I have a fair
> understanding of what would be required.

Cool. I knew there was someone out there who knew these details. :-)

> After looking around I found the ICU4J implementation of unicode
> normalization (http://icu-project.
> org/apiref/icu4j/com/ibm/icu/text/Normalizer.html).
> Would this be a suitable starting place to base my proposal on?

I'm familiar with ICU4J. Was hoping if it's part of the solution that there
might be some way to build a smaller jar which only contains the
normalization support. The full jar is much larger than Xerces and I
suspect most of the rest of it (e.g. the calendar services) we wouldn't
use. Also curious if the Normalizer works on JDK 1.3. The docs [1] suggest
that it might but also mentions that certain parts of ICU4J require JDK
1.4. We only recently voted to move to JDK 1.3 as the lowest level of JDK
Xerces supports so hoping that ICU would work with that level.

Have you had much thought about the overall design? I was thinking that the
core support would live in an XNI component which could be shared between
the parser pipeline and the DOM normalizer.

> Thanks for your time,
> Richard Kelly
>
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: [email protected]
> For additional commands, e-mail: [email protected]

Look forward to hearing from you.

Thanks.

[1] http://icu-project.org/icu4j_faq.html#Common_2

Michael Glavassevich
XML Parser Development
IBM Toronto Lab
E-mail: [email protected]
E-mail: [email protected]

Reply via email to