GitHub user selvaganesang opened a pull request:
https://github.com/apache/trafodion/pull/1417
[TRAFODION-2917] Refactor Trafodion implementation of hdfs scan for tâ¦
â¦ext formatted hive tables
Part-1 changes.
Created a new class org.trafodion.sql.HDFSClient. Any direct HDFS access
will be routed to this class via JNI instead of using libhdfs.
Modified the existing code expect for the following to route the HDFS
request via this class
1. LOB access
2. Direct HDFS scan of the table
3. Sample data creation during update stats
Added a new class org.trafodio.sql.HdfsScan for scanning one or many ranges
of a Hive
text formatted table. This class will be used for Direct HDFS scan in near
future.
You can merge this pull request into a Git repository by running:
$ git pull https://github.com/selvaganesang/trafodion hdfs_scan_improvements
Alternatively you can review and apply these changes as the patch at:
https://github.com/apache/trafodion/pull/1417.patch
To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:
This closes #1417
----
commit 60db153329d1ee7088f6805ef3c5eb9eb8b600de
Author: selvaganesang <selva.govindarajan@...>
Date: 2018-01-26T16:40:37Z
[TRAFODION-2917] Refactor Trafodion implementation of hdfs scan for text
formatted hive tables
Part-1 changes.
Created a new class org.trafodion.sql.HDFSClient. Any direct HDFS access
will be routed to this class via JNI instead of using libhdfs.
Modified the existing code expect for the following to route the HDFS
request via this class
1. LOB access
2. Direct HDFS scan of the table
3. Sample data creation during update stats
Added a new class org.trafodio.sql.HdfsScan for scanning one or many ranges
of a Hive
text formatted table. This class will be used for Direct HDFS scan in near
future.
----
---