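Here is a simple fix for the JSP page from the previous post: pass the charset explicitly when creating the InputStreamReader, instead of relying on the platform default:

    BufferedReader in = new BufferedReader(new InputStreamReader(content, "ISO-8859-1"));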
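To make the effect of that charset argument easier to see, here is a minimal, offline sketch. It does not fetch anything: the byte array is a stand-in for a response body and is assumed to be UTF-8 purely for illustration, and the class and method names (ExplicitCharsetSketch, readFirstLine) are made up for this example. The charset you pass should match whatever encoding the remote page is actually served in, so treat the values below as placeholders rather than a recommendation.

```java
import java.io.BufferedReader;
import java.io.ByteArrayInputStream;
import java.io.IOException;
import java.io.InputStream;
import java.io.InputStreamReader;
import java.io.UncheckedIOException;
import java.nio.charset.Charset;
import java.nio.charset.StandardCharsets;

public class ExplicitCharsetSketch {

    // Hypothetical helper: reads the first line of a stream, decoding the
    // bytes with the charset given by the caller rather than the platform default.
    public static String readFirstLine(InputStream content, Charset charset) {
        try (BufferedReader in = new BufferedReader(new InputStreamReader(content, charset))) {
            return in.readLine();
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }

    public static void main(String[] args) {
        // Stand-in for a response body; ASSUMED to be UTF-8 for this demo.
        // \u00e1 is "a with acute accent", \u00e9 is "e with acute accent".
        byte[] body = "Adri\u00e1n Beltr\u00e9".getBytes(StandardCharsets.UTF_8);

        Charset[] candidates = {
            StandardCharsets.UTF_8,       // matches the bytes: accents survive
            StandardCharsets.ISO_8859_1,  // mismatch: each accent becomes two characters
            StandardCharsets.US_ASCII     // mismatch: non-ASCII bytes become U+FFFD
        };

        for (Charset cs : candidates) {
            String decoded = readFirstLine(new ByteArrayInputStream(body), cs);
            System.out.println(cs.name() + " -> " + decoded);
        }
    }
}
```

The same bytes come out differently depending only on the charset handed to InputStreamReader, which is consistent with identical code behaving differently on two machines whose default charsets differ.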
Thanks Yohann!

On Mon, Jun 8, 2009 at 10:04 PM, Alain de Raynal <[email protected]> wrote:

> Below is a short jsp page that illustrates the problem I'm facing
> the result on my local dev environment is Adrián Beltré
> the result on the gae servers is Adri��n Beltr��
>
> can somebody explain the difference ?
> is there a way to decode these string to get the accents back ?
>
> <%@ page contentType="text/html;charset=UTF-8" language="java" %>
> <%@ page import="java.io.BufferedReader" %>
> <%@ page import="java.io.InputStream" %>
> <%@ page import="java.io.InputStreamReader" %>
> <%@ page import="java.net.URL" %>
> <html>
> <body>
> <%
>     String urlString = "http://en.wikipedia.org/wiki/Adri%C3%A1n_Beltr%C3%A9";
>     InputStream content = new URL(urlString).openStream();
>     BufferedReader in = new BufferedReader(new InputStreamReader(content));
>     String line;
>     while ((line = in.readLine()) != null) {
>         if (line.contains("<h1 id=\"firstHeading\"")){
>             out.println(line);
>         }
>     }
> %>
> </body>
> </html>
>
> On Fri, Jun 5, 2009 at 9:41 PM, Alain de Raynal <[email protected]> wrote:
>
>> Hello again,
>> one week later, the problem is slightly different:
>> When I tried the same operation from the gae servers, urlFetch returns
>>
>> Adri�n B�ltre
>>
>> instead of the expected "Adrián Béltre"
>> The accentuated characters are no longer recognized...
>>
>> Does anybody know what has changed in the past week ?
>>
>> Thanks in advance,
>> Alain
>>
>> On Sun, May 31, 2009 at 2:41 PM, Alain de R <[email protected]> wrote:
>>
>>> Hi everyone,
>>>
>>> I've noticed that when I try to retrieve the content of a web page
>>> (for instance: http://sports.yahoo.com/mlb/players/6039)
>>> the same code returns
>>> "Adrián Béltre" from my dev/eclipse environment
>>> and
>>> "Adrian Beltre" from GAE servers
>>>
>>> It's important for me to retrieve the content of a web page with the
>>> accentuated characters.
>>> is there a way around this ?
>>>
>>> Thanks in advance,
>>> Alain
