I'm trying to convert sanitized HTML data into standard HTML for a query, and 
found that the Commons Lang project has existing methods to do this.  I 
downloaded the jar, with the thought of using the reflect() UDF to call the 
unescapeHtml function, but it doesn't work (java.lang.ClassNotFoundException: 
org.apache.commons.lang.StringEscapeUtils).

add jar commons-lang3-3.1.jar;
SELECT
    reflect("org.apache.commons.lang.StringEscapeUtils", "unescapeHtml", 
htmlData) htmlData,
    COUNT(*)
FROM TableName
WHERE
    calendar_date >= '2012-03-29' AND
    calendar_date <= '2012-04-02' AND
    LOWER(htmlData) LIKE '%see%more%'
GROUP BY reflect("org.apache.commons.lang.StringEscapeUtils", "unescapeHtml", 
htmlData);

I'm not yet familiar with reflect(), so I'm not sure if it's an issue with the 
jar that I'm adding or the reflect() statement.  I get the same error if I 
remove the aggregate function from the query.

Matt Tucker

Reply via email to