[ 
https://issues.apache.org/jira/browse/IMPALA-14576?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=18051091#comment-18051091
 ] 

ASF subversion and git services commented on IMPALA-14576:
----------------------------------------------------------

Commit c96b7b082dff6bb2fd0062e63b508e3924e14297 in impala's branch 
refs/heads/master from Csaba Ringhofer
[ https://gitbox.apache.org/repos/asf?p=impala.git;h=c96b7b082 ]

IMPALA-14576, IMPALA-14577: add rewrite rules for geospatial relations

Apply rewrites to geospatial relations such as st_intersects.
Three rewrites are added:

1. NormalizeGeospatialRelationsRule moves constant arguments
to the first position. This helps because the current Java
implementation optimizes for a constant first argument (see IMPALA-14575):

st_intersects(geom_col, ST_Polygon(1,1, 1,4, 4,4, 4,1))
  ->
st_intersects(ST_Polygon(1,1, 1,4, 4,4, 4,1), geom_col)
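
A minimal sketch of this normalization in Python, not Impala's Java
implementation. Expressions are modeled as nested tuples, and the names
is_const, normalize_relation and SYMMETRIC are hypothetical. The sketch
assumes that only symmetric relations may have their arguments swapped:

```python
# Hypothetical model: a call is (name, arg1, arg2, ...); a column is a plain string.
SYMMETRIC = {"st_intersects"}  # assumption: only symmetric relations are swapped


def is_const(expr):
    """A call is constant if all its arguments are; plain strings are columns."""
    if isinstance(expr, tuple):
        return all(is_const(arg) for arg in expr[1:])
    return not isinstance(expr, str)


def normalize_relation(expr):
    """Move a constant second argument to the first position."""
    name, first, second = expr
    if name in SYMMETRIC and is_const(second) and not is_const(first):
        return (name, second, first)
    return expr


polygon = ("st_polygon", 1, 1, 1, 4, 4, 4, 4, 1)
rewritten = normalize_relation(("st_intersects", "geom_col", polygon))
```

Applying normalize_relation to its own output leaves the expression
unchanged, so this kind of rule is safe to run repeatedly.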

2. AddEnvIntersectsRule adds st_envintersects() before
relations that can only be true when the bounding rectangles
intersect. This is useful because st_envintersects() has a native
implementation (IMPALA-14573):

st_intersects(geom1, geom2)
  ->
st_envintersects(geom1, geom2) AND
st_intersects(geom1, geom2)
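
To illustrate why the bounding rectangle check is a safe pre-filter, here
is a small Python sketch. It is not the native implementation; envelope and
env_intersects are hypothetical helpers that treat a geometry as a list of
vertices and assume inclusive bounds:

```python
def envelope(points):
    """Axis-aligned bounding rectangle (minx, miny, maxx, maxy) of a vertex list."""
    xs = [x for x, _ in points]
    ys = [y for _, y in points]
    return min(xs), min(ys), max(xs), max(ys)


def env_intersects(a, b):
    """True if the bounding rectangles of a and b overlap or touch."""
    aminx, aminy, amaxx, amaxy = envelope(a)
    bminx, bminy, bmaxx, bmaxy = envelope(b)
    return (aminx <= bmaxx and bminx <= amaxx and
            aminy <= bmaxy and bminy <= amaxy)


square = [(1, 1), (1, 4), (4, 4), (4, 1)]
far_triangle = [(10, 10), (12, 10), (11, 12)]
near_triangle = [(3, 3), (6, 3), (3, 6)]

# Disjoint envelopes: the expensive relation can be skipped entirely.
skip_full_check = not env_intersects(square, far_triangle)
# Overlapping envelopes: the full relation still has to be evaluated.
needs_full_check = env_intersects(square, near_triangle)
```

The cheap check can only rule rows out. When the envelopes do intersect,
the full relation still decides the result.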

3. PointEnvIntersectsRule replaces the bounding rectangle (envelope)
intersection on geometries created by st_point with predicates directly
on the coordinates:

st_envintersects(CONST_GEOM, st_point(x, y))
  ->
x >= st_minx(CONST_GEOM) AND y >= st_miny(CONST_GEOM) AND
x <= st_maxx(CONST_GEOM) AND y <= st_maxy(CONST_GEOM)
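
The equivalence behind this rewrite can be checked with a short Python
sketch. The envelope is modeled as a (minx, miny, maxx, maxy) tuple and a
point as a degenerate envelope; env_intersects and point_predicates are
hypothetical names, not Impala functions:

```python
def env_intersects(a, b):
    """Inclusive overlap test on two (minx, miny, maxx, maxy) rectangles."""
    return a[0] <= b[2] and b[0] <= a[2] and a[1] <= b[3] and b[1] <= a[3]


def point_predicates(env, x, y):
    """The rewritten form: plain comparisons on the point coordinates."""
    minx, miny, maxx, maxy = env
    return x >= minx and y >= miny and x <= maxx and y <= maxy


env = (1, 1, 4, 4)
grid = [(x, y) for x in range(0, 6) for y in range(0, 6)]

# A point's envelope is the degenerate rectangle (x, y, x, y).
agree = all(
    env_intersects(env, (x, y, x, y)) == point_predicates(env, x, y)
    for x, y in grid
)
```

Both forms agree on every grid point, and the rewritten form needs no
geometry object for the point at all.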

Note that AddEnvIntersectsRule is only valid in planar geometry (the
relation functions in HIVE_ESRI are all planar).

Rules 2 and 3 are not applied if the cost of the child expressions is
above a threshold.

AddEnvIntersectsRule needed a new type of expression rewrite
("non-idempotent rules") that runs rules only once to avoid
triggering the rules multiple times on the same input predicate.
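
The problem can be sketched in Python as follows. This is an illustration
under simplified assumptions, not the frontend code: expressions are nested
tuples, and add_env_intersects, rewrite_once and rewrite_to_fixpoint are
hypothetical names.

```python
def add_env_intersects(expr):
    """Wrap st_intersects in an AND with the envelope check."""
    if expr[0] == "st_intersects":
        return ("and", ("st_envintersects",) + expr[1:], expr)
    return expr


def rewrite_once(expr, rule):
    """Apply rule to every node of the input tree once, bottom-up.

    The output of the rule is never traversed again.
    """
    if not isinstance(expr, tuple):
        return expr
    children = tuple(rewrite_once(child, rule) for child in expr[1:])
    return rule((expr[0],) + children)


def rewrite_to_fixpoint(expr, rule, max_passes=5):
    """Repeat full passes until nothing changes or max_passes is reached."""
    passes = 0
    while passes < max_passes:
        new_expr = rewrite_once(expr, rule)
        if new_expr == expr:
            break
        expr = new_expr
        passes += 1
    return expr, passes


pred = ("st_intersects", "geom1", "geom2")
once = rewrite_once(pred, add_env_intersects)
_, passes = rewrite_to_fixpoint(pred, add_env_intersects)
```

Because the original st_intersects call survives inside the AND, the
fixpoint loop never converges and only stops at max_passes, adding another
st_envintersects on every pass. A single pass yields exactly one envelope
check.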

Other changes:
- Changed handling of malformed geometries in C++ functions from
  error to warning. This is consistent with the handling in Java.

Change-Id: Id65f646db6f1c89a74253e9ff755c39c400328be
Reviewed-on: http://gerrit.cloudera.org:8080/23719
Reviewed-by: Impala Public Jenkins <[email protected]>
Tested-by: Impala Public Jenkins <[email protected]>


> Add st_EnvIntersects  before other st_ relations
> ------------------------------------------------
>
>                 Key: IMPALA-14576
>                 URL: https://issues.apache.org/jira/browse/IMPALA-14576
>             Project: IMPALA
>          Issue Type: Sub-task
>          Components: Frontend
>            Reporter: Csaba Ringhofer
>            Assignee: Csaba Ringhofer
>            Priority: Major
>
> Example rewrite:
> {code}
> st_intersects(<geom1>, <geom2>) ->
> st_envintersects(<geom1>, <geom2>) AND st_intersects(<geom1>, <geom2>) 
> {code}
> This can't be implemented as a simple expression rewrite rule because the 
> original st_intersects() is unchanged and the same rule could be applied 
> again. A new "inferred predicate" logic is needed that applies a rule only 
> once to the expression tree.
> This may be a temporary but significant improvement - st_envintersects() is 
> much easier to port to c++ than the "full" relation functions like 
> st_intersects.
> st_envIntersects is rewritten here: 
> https://gerrit.cloudera.org/#/c/23700/6/be/src/exprs/geo/geospatial-functions-ir.cc
> other st_ relations are rewritten here using boost::geometry:
> https://gerrit.cloudera.org/#/c/20602/6/be/src/exprs/geo/relation-wrapper.h
> The first change is much simpler and safer and could be merged in the near 
> future.
> Note that IMPALA-14571 ensures that native functions are evaluated before 
> Java ones in predicates, so st_envIntersects would be evaluated before the 
> "full" relation (if c++ rewrite for st_envIntersects gets merged while other 
> relations stay in Java).


