Hi Team,
I was wondering if someone has encountered this problem before.
While trying to work with PXF on hawq 2.0 I am encountering the following error:
gagan=# CREATE EXTERNAL TABLE ext_get_foo (i int, bar text) LOCATION
('pxf://my-hadoop-cluster:50070/foo_bar?profile=HdfsTextSimple')
FORMAT 'text' (delimiter '|' null 'null');
gagan=# SELECT * FROM ext_get_foo ;
ERROR: remote component error (404): PXF service could not be
reached. PXF is not running in the tomcat container (libchurl.c:878)
The same happens when I try to write to an external table using PXF.
I believe the above error signifies that PXF service isn't running or
unavailable. But PXF is running on port 51200.
Curl response works fine as well:
# curl -s http://localhost:51200/pxf/v0
Wrong version v0, supported version is v14
PXF is build using gradlew and installed as RPM files. I also have
tomcat 7.0.62 installed with the PXF packages.
The following is how PXF is running on the instance:
pxf 21405 0.3 2.8 825224 115164 ? Sl 02:07 0:10
/usr/java/latest/bin/java
-Djava.util.logging.config.file=/var/pxf/pxf-service/conf/logging.properties
-Djava.util.logging.manager=org.apache.juli.ClassLoaderLogManager
-Xmx512M -Xss256K -Djava.endorsed.dirs=/var/pxf/pxf-service/endorsed
-classpath
/var/pxf/pxf-service/bin/bootstrap.jar:/var/pxf/pxf-service/bin/tomcat-juli.jar
-Dcatalina.base=/var/pxf/pxf-service
-Dcatalina.home=/var/pxf/pxf-service
-Djava.io.tmpdir=/var/pxf/pxf-service/temp
org.apache.catalina.startup.Bootstrap start
I do not have apache-tomcat running. Not sure how are the two
interrelated. But the RPM file created by gradlew requires tomcat for
pxf-service.
I would appreciate any inputs into this problem.
Regards,
Gagan Brahmi