Please ignore my previous email. I was forwarding Zheng's email to my
friend. Sorry for inconvenience caused.

Best,

Ping


On Wed, Jul 21, 2010 at 9:02 AM, Ping Zhu <[email protected]> wrote:

> There are built-in Hadoop UTF8 checker.
>
>
> ---------- Forwarded message ----------
> From: Zheng Shao <[email protected]>
> Date: Tue, Jul 20, 2010 at 11:40 PM
> Subject: Re: built-in UTF8 checker
> To: [email protected]
>
>
> No, but it's very simple to write one.
>
> public class MyUTF8StringChecker extends UDF {
>  public boolean evaluate(Text t) {
>    try {
>      Text.validateUTF8(t.getBytes(), 0, t.getLength());
>      return true;
>     } catch (MalformedInputException e) {
>       return false;
>     }
>  }
> }
>
>
> On Tue, Jul 20, 2010 at 12:03 PM, Ping Zhu <[email protected]> wrote:
> > Hi,
> >   Are there are any built-in functions in Hive to check whether a string
> is
> > UTF8-encoding? I did some research about this issue but did not find
> useful
> > resources. Thanks for your suggestions and help.
> >   Ping
>
>
>
> --
> Yours,
> Zheng
> http://www.linkedin.com/in/zshao
>
>

Reply via email to