Yes I agree. The pre-built postgresql 7.4 supports UNICODE. My question is
how to interpret the part of the Dspace installation doc quoted below:

http://dspace.org/technology/system-docs/install.html

"Unicode (specifically UTF-8) support must be enabled. This is enabled by
default in 8.0+. For 7.x, be sure to compile with the following options to
the 'configure' script:

--enable-multibyte --enable-unicode --with-java"

I compared postgresql doc 7.4 and 8.2 at:

http://www.postgresql.org/docs/7.4/interactive/multibyte.html

and

http://www.postgresql.org/docs/8.2/interactive/multibyte.html

The difference is that in 7.4 "If no -E or --encoding option is given,
SQL_ASCII is used." But in 8.2 "If no -E or --encoding option is given,
initdb attempts to determine the appropriate encoding to use based on the
specified or default locale."

My interpretation is that the pre-built 7.4 RPM does not automatically
create all database with UNICODE by default, that may be the reason why the
dspace installation doc asks us to re-compile from the source. But then the
installation doc also override this by specifying -E UNICODE when create the
dspace database so this should not be a problem. Perhaps this is just an
insurance policy in case someone writes a dpsace extension, creates another
database but forgets to add the encoding. If that's the case, initdb with -E
UNICODE is all I need because "initdb defines the default character set for
a PostgreSQL cluster".

Even for version 8.x, what if my default locale is US English but I still
want the UNICODE database? I guess in any case initdb or createdb with -E
UNICODE is the way to go regardless of the postgresql version, and
re-compiling from source does not seem necessary.

If that's the case, I think the dspace installation doc can use a bit
clarification on this. Apology if I'm stating the obvious.

Zhiwu Xie






-----Original Message-----
From: Jim Downing [mailto:[EMAIL PROTECTED] 
Sent: Monday, February 12, 2007 5:27 AM
To: Zhiwu Xie; [email protected]
Subject: Re: [Dspace-tech] install postgresql 7.4 without compiling?

Hi Zhiwu Xie,


Zhiwu Xie wrote:
> A newbie question:
>
> I'm installing Dspace on a minimum installation of CentOS 4.4. The
> installation doc says:
>
> "Unicode (specifically UTF-8) support must be enabled.  This is
> enabled by default in 8.0+.  For 7.x, be sure to compile with the
> following options to the 'configure' script:
> --enable-multibyte --enable-unicode --with-java "
>
> Since I can easily yum the postgresql 7.4 I wanted to avoid compiling
> from source. My later admin work can be a lot easier if I yum install
> postgresql. If the --enable-unicode is just to enable using Unicode in
> the database, can I just add -E UNICODE when I do initdb? The
> --with-java option is to build the jdbc, which I can also easily yum.
>   

I'd be surprised if the packaged postgresql didn't already have unicode 
support, in which case you'll be able to specify the encoding when you 
createdb.

Best regards,

jim


-------------------------------------------------------------------------
Using Tomcat but need to do more? Need to support web services, security?
Get stuff done quickly with pre-integrated technology to make your job easier.
Download IBM WebSphere Application Server v.1.0.1 based on Apache Geronimo
http://sel.as-us.falkag.net/sel?cmd=lnk&kid=120709&bid=263057&dat=121642
_______________________________________________
DSpace-tech mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/dspace-tech

Reply via email to