Ahmed Eldawy created PIG-3344:
---------------------------------
Summary: Add a spatial datatype to Pig
Key: PIG-3344
URL: https://issues.apache.org/jira/browse/PIG-3344
Project: Pig
Issue Type: New Feature
Components: parser
Reporter: Ahmed Eldawy
This issue proposes adding a new datatype to Pig that abstracts a spatial
attribute. Following the OGC standards [http://www.opengeospatial.org/], we
will add a new datatype called 'Geometry' that abstracts all standard shapes
(e.g., Point, Polygon and LineString). Values of this datatype are parsed
automatically from either Well-Known Text (WKT) or Well-Known Binary (WKB)
encoded as a hex string. These two formats are the standard export formats for
OGC shapes and are supported by many existing tools, including PostGIS
[http://postgis.net/].
Exporting through PigStorage should default to WKB encoded as a hex string,
and additional functions will be provided to convert to WKT.
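
To make the "WKB as a hex string" representation concrete, here is a minimal
sketch that encodes a 2D point as little-endian WKB, prints it as hex, and
decodes it again. It uses only the Java standard library. The class and method
names (WkbHexSketch, pointToWkbHex, wkbHexToPoint, pointToWkt) are hypothetical
and are not part of Pig or of the proposed datatype; the actual implementation
is expected to delegate to OGCGeometry instead.

```java
import java.nio.ByteBuffer;
import java.nio.ByteOrder;

// Hypothetical illustration only: not Pig code and not the proposed API.
public class WkbHexSketch {

    // Encode a 2D point as little-endian WKB and return it as uppercase hex.
    public static String pointToWkbHex(double x, double y) {
        ByteBuffer buf = ByteBuffer.allocate(21).order(ByteOrder.LITTLE_ENDIAN);
        buf.put((byte) 1);   // byte-order flag: 1 = little endian
        buf.putInt(1);       // WKB geometry type: 1 = Point
        buf.putDouble(x);
        buf.putDouble(y);
        StringBuilder hex = new StringBuilder();
        for (byte b : buf.array()) {
            hex.append(String.format("%02X", b));
        }
        return hex.toString();
    }

    // Decode a hex-encoded WKB Point back into its {x, y} coordinates.
    public static double[] wkbHexToPoint(String hex) {
        byte[] bytes = new byte[hex.length() / 2];
        for (int i = 0; i < bytes.length; i++) {
            bytes[i] = (byte) Integer.parseInt(hex.substring(2 * i, 2 * i + 2), 16);
        }
        ByteBuffer buf = ByteBuffer.wrap(bytes);
        // The first byte states the byte order of everything that follows.
        buf.order(buf.get() == 1 ? ByteOrder.LITTLE_ENDIAN : ByteOrder.BIG_ENDIAN);
        if (buf.getInt() != 1) {
            throw new IllegalArgumentException("not a WKB Point");
        }
        return new double[] { buf.getDouble(), buf.getDouble() };
    }

    // The same point written as WKT.
    public static String pointToWkt(double x, double y) {
        return "POINT (" + x + " " + y + ")";
    }

    public static void main(String[] args) {
        String hex = pointToWkbHex(1.0, 2.0);
        System.out.println(hex);  // 0101000000000000000000F03F0000000000000040
        double[] p = wkbHexToPoint(hex);
        System.out.println(pointToWkt(p[0], p[1]));  // POINT (1.0 2.0)
    }
}
```

The hex form is twice the size of the raw bytes, but it survives text-based
storage such as PigStorage without escaping problems.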
Internally, the new datatype maps to the class OGCGeometry
[https://github.com/Esri/geometry-api-java/blob/master/src/com/esri/core/geometry/ogc/OGCGeometry.java],
which is licensed under the Apache License. This class can import from and
export to the WKT and WKB formats.
All data manipulation on the new datatype will be done through UDFs.
There is already a spatial extension to Pig, called Pigeon
[https://github.com/aseldawy/pigeon], that provides basic spatial functionality
via UDFs powered by the aforementioned library. At present, it automatically
converts WKB and WKT fields to OGCGeometry objects, performs the spatial
operation, and writes the result back as WKB. Once the Geometry datatype is
added, Pigeon will use it natively and avoid the conversion.
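
As an illustration of what accepting either WKT or hex-encoded WKB in the same
field involves, the sketch below guesses the format of a text field before it
would be handed to a parser. This is an assumed heuristic, not Pigeon's actual
logic: the class GeometryFieldFormat, its Format enum and the detect method are
hypothetical names introduced only for this example.

```java
import java.util.Locale;

// Hypothetical illustration only: not taken from Pigeon or Pig.
public class GeometryFieldFormat {

    public enum Format { WKT, WKB_HEX, UNKNOWN }

    // Standard WKT geometry keywords; every WKT value starts with one of them.
    private static final String[] WKT_KEYWORDS = {
        "POINT", "LINESTRING", "POLYGON", "MULTIPOINT",
        "MULTILINESTRING", "MULTIPOLYGON", "GEOMETRYCOLLECTION"
    };

    // Guess whether a text field holds WKT or hex-encoded WKB.
    public static Format detect(String field) {
        String s = field.trim();
        if (s.isEmpty()) {
            return Format.UNKNOWN;
        }
        // WKB starts with a byte-order byte (00 or 01), so its hex form
        // starts with "00" or "01" and has an even number of hex digits.
        boolean evenHex = s.length() % 2 == 0 && s.matches("[0-9A-Fa-f]+");
        if (evenHex && (s.startsWith("00") || s.startsWith("01"))) {
            return Format.WKB_HEX;
        }
        String upper = s.toUpperCase(Locale.ROOT);
        for (String keyword : WKT_KEYWORDS) {
            if (upper.startsWith(keyword)) {
                return Format.WKT;
            }
        }
        return Format.UNKNOWN;
    }

    public static void main(String[] args) {
        System.out.println(detect("POINT (1 2)"));  // WKT
        System.out.println(detect("0101000000000000000000F03F0000000000000040"));  // WKB_HEX
        System.out.println(detect("hello"));  // UNKNOWN
    }
}
```

A native Geometry datatype would make this per-call detection unnecessary,
because the value would be parsed once when it is loaded.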