Hi Tupac, > Thanks Richard > > I apreciate your comment and i agree, the first case is imposible to be > taken as a solution, our site is www.saber.ula.ve it is really huge and > really old!! the download and visit stats are extremely large. and really > important sites like http://www.comunidadandina.org/bda/default.aspx point > to us, sometimes directly to a pdf file, we don't see a page linking to > our dspace like a solution. > > The second case is the one i post here: > > >>> the idea is >>> to keep utf-8 because of it's universality... i thougth about a few code >>> lines working as a cgi program in apache that translate or convert iso >>> to >>> utf, but i don't know if i have to do it manually or there is a function >>> or a library in any language that can be usefull to my problem. any >>> comment is greetly received... >>> >>> > > there is no difference if i do that with apache or with tomcat, the thing > is that i don't know if i have to do that conversion manually (make the > regular expressions and equivalence table by myself), or if any > programming language have a library with functions that are able to help > me!!! i have good knowledges in c++ programming, java, html, php and > others. also i can design efficient algorithms, but i have been handling > only a few libraries in my career, and certainly right there is the tool i > need right now to solve my problem! I suspect that the following will do what you need (in java):
URLDecoder.decode(incomingURL, "iso-8859-1"); URLEncoder.encode(outgoingURL, "UTF-8"); (although don't quote me on it, as I've not actually tried it). Cheers, Richard > Sorry but i'm not a professional, i'm still a pre-grade student making my > thesis in a really good theme, endorsed by a very good corporation. > > Thanks for the help and excuse my english. > > >> Hi Tupac, >> >>> I'm from Merida, Venezuela. I work at SABER-ULA, the largest >>> institutional >>> repository in my country. Right now we're trying to move from privative >>> SW >>> to DSpace 1.4.2, but we don't want to lose our visibility in the WWW. >>> that's why we are trying to redirect all the link's from our old site to >>> the new one via Apache mod_rewrite. we use spanish language in both, but >>> the encoding is iso-8859-1 in the old repository, and utf-8 in the new >>> one. that creates a problem while we try to redirect url's that contain >>> special characters like ñ, í, ó, ú... etc. for example: in iso-8859-1 ó >>> is >>> encoded to %f3, in utf-8 ó is encoded to %c3%b3, that means DSpace is >>> not >>> able to understand an url containing ó, i changed encoding settings in >>> tomcat from utf-8 to iso-8859-1 and it worked perfectly, but the idea is >>> to keep utf-8 because of it's universality... i thougth about a few code >>> lines working as a cgi program in apache that translate or convert iso >>> to >>> utf, but i don't know if i have to do it manually or there is a function >>> or a library in any language that can be usefull to my problem. any >>> comment is greetly received... >>> >>> >> There are also problems with other special characters in DSpace when you >> do not use UTF-8 url encoding which have come up on this list before, so >> you should definitely try to stick with that. >> >> Perhaps the "safest" solution is to offer a web page at the old urls to >> point to the new urls, and a slightly delayed http redirect? Not very >> nice solution - I guess it will depend on the volume of your data and >> how many visitors you get as to whether it's worth the suffering. >> >> Or: Can you run another web server (tomcat) which is set up to receive >> your iso-8859-1 encoded urls, which runs them through a java process >> which will literally just take the string tomcat gives it (correctly >> decoded) and re-codes it in UTF-8, and sends back an http redirect >> immediately to your dspace tomcat instance set up to receive utf-8? >> >> Cheers, >> >> Richard >> >> >>> Tupac Amaru Bastidas Celis >>> >>> SABER-ULA >>> Centro de Teleinformación - ULA >>> Corporación Parque Tecnológico de Mérida >>> Av. 4, Edif. Gral. Masini, Piso 3, Of. A-34 >>> Tlf: +58 274 2524192 ext. 21 >>> >>> >>> ------------------------------------------------------------------------- >>> SF.Net email is sponsored by: >>> Check out the new SourceForge.net Marketplace. >>> It's the best place to buy or sell services >>> for just about anything Open Source. >>> http://ad.doubleclick.net/clk;164216239;13503038;w?http://sf.net/marketplace >>> _______________________________________________ >>> DSpace-tech mailing list >>> [email protected] >>> https://lists.sourceforge.net/lists/listinfo/dspace-tech >>> >>> >> -- >> Richard Jones >> Research Engineer, HP Labs >> >> eml: [EMAIL PROTECTED] >> blg: http://chronicles-of-richard.blogspot.com/ >> >> >> > > > Tupac Amaru Bastidas Celis > > SABER-ULA > Centro de Teleinformación - ULA > Corporación Parque Tecnológico de Mérida > Av. 4, Edif. Gral. Masini, Piso 3, Of. A-34 > Tlf: +58 274 2524192 ext. 21 > > -- Richard Jones Research Engineer, HP Labs eml: [EMAIL PROTECTED] blg: http://chronicles-of-richard.blogspot.com/ ------------------------------------------------------------------------- SF.Net email is sponsored by: Check out the new SourceForge.net Marketplace. It's the best place to buy or sell services for just about anything Open Source. http://ad.doubleclick.net/clk;164216239;13503038;w?http://sf.net/marketplace _______________________________________________ DSpace-tech mailing list [email protected] https://lists.sourceforge.net/lists/listinfo/dspace-tech

