GitHub user AnatoliShein opened a pull request:
https://github.com/apache/orc/pull/134
Orc 17
In this pull request I added **LIBHDFS++** library for reading files from
HDFS to **ORC** project.
Libhdfs++ is located in orc/c++/lib/libhdfspp and by default builds as a
light-weight library without examples, tests, and tools (and by this avoids
dependencies on JDK, valgrind and gmock). However, if the flag
**-DHDFSPP_LIBRARY_ONLY=FALSE** is passed to cmake, then it will build the
examples, tests, and tools as well.
Libhdfs++ depends on protobuf libraries in orc/c++/libs/protobuf-2.6.0 and
is searching the system for packages Doxygen, OpenSSL, CyrusSASL, GSasl, and
Threads dynamically (however only OpenSSL and Threads are required).
The folder libhdfspp also includes a script pull_hdfs.sh which pulls the
latest changes from Libhdfs++ Hadoop branch to ORC, and generates file
'imported_timestamp' with the timestamp and the information about the latest
commit.
I also updated all the ORC tools to automatically use Libhdfs++ to read ORC
files on HDFS if their path begins with 'hdfs://'.
Please review.
You can merge this pull request into a Git repository by running:
$ git pull https://github.com/AnatoliShein/orc ORC-17
Alternatively you can review and apply these changes as the patch at:
https://github.com/apache/orc/pull/134.patch
To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:
This closes #134
----
----
---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at [email protected] or file a JIRA ticket
with INFRA.
---