This is an automated email from the ASF dual-hosted git repository. william pushed a commit to branch news180 in repository https://gitbox.apache.org/repos/asf/orc.git
commit fbe4694cf01798959fa30807cee186f18e8506eb Author: William Hyun <[email protected]> AuthorDate: Sat Sep 3 18:59:42 2022 -0700 Add news for 1.8.0 --- site/_data/releases.yml | 12 +++++- site/_includes/orc_1.8.md | 15 +++++++ site/_posts/2022-09-03-ORC-1.8.0.md | 79 +++++++++++++++++++++++++++++++++++++ site/releases/index.md | 2 +- site/releases/releases.md | 2 +- 5 files changed, 106 insertions(+), 4 deletions(-) diff --git a/site/_data/releases.yml b/site/_data/releases.yml index e3e9b7cd8..5a6e77f26 100644 --- a/site/_data/releases.yml +++ b/site/_data/releases.yml @@ -1,8 +1,16 @@ # List the releases in reverse logical order # Only one release should be tagged latest +1.8.0: + date: 2022-09-03 + state: latest + tar: orc-1.8.0.tar.gz + signed-by: William Hyun (DECDFA29) + sha256: 859d78bfded98405c32ccb2847b565a57bcc53f473a74087c1f750aeb5932e62 + known-issues: + 1.7.6: date: 2022-08-17 - state: latest + state: stable tar: orc-1.7.6.tar.gz signed-by: William Hyun (DECDFA29) sha256: a75e0cccaaf5e03f1699bb804f640246bec4d134cb57957125707b0b2822160d @@ -60,7 +68,7 @@ 1.6.14: date: 2022-04-14 - state: stable + state: archived tar: orc-1.6.14.tar.gz signed-by: Dongjoon Hyun (34F0FC5C) sha256: f2701d27d197a0b483727790414c1c3b50cbe4506dae947d3e6dc3d951f1ee09 diff --git a/site/_includes/orc_1.8.md b/site/_includes/orc_1.8.md new file mode 100644 index 000000000..74eeb0574 --- /dev/null +++ b/site/_includes/orc_1.8.md @@ -0,0 +1,15 @@ +The New Features and Notable Changes of ORC 1.8: + +- [ORC-450]({{site.jira}}/ORC-450) Support selecting list indices without materializing list items +- [ORC-824]({{site.jira}}/ORC-824) Add column statistics for List and Map +- [ORC-1004]({{site.jira}}/ORC-1004) Java ORC writer supports the selection vector +- [ORC-1075]({{site.jira}}/ORC-1075) Support reading ORC files with no column statistics +- [ORC-1125]({{site.jira}}/ORC-1125) Support decoding decimals in RLE +- [ORC-1136]({{site.jira}}/ORC-1136) Optimize reads by combining multiple reads without significant separation into a single read +- [ORC-1138]({{site.jira}}/ORC-1138) Seek vs Read Optimization +- [ORC-1172]({{site.jira}}/ORC-1172) Add row count limit config for one stripe +- [ORC-1212]({{site.jira}}/ORC-1212) Upgrade protobuf-java to 3.17.3 +- [ORC-1220]({{site.jira}}/ORC-1220) Set min.hadoop.version to 2.7.3 +- [ORC-1248]({{site.jira}}/ORC-1248) Redefine Hadoop dependency for Apache ORC 1.8.0 +- [ORC-1256]({{site.jira}}/ORC-1256) Publish test-jar to maven central +- [ORC-1260]({{site.jira}}/ORC-1260) Publish shaded-protobuf classifier artifacts \ No newline at end of file diff --git a/site/_posts/2022-09-03-ORC-1.8.0.md b/site/_posts/2022-09-03-ORC-1.8.0.md new file mode 100644 index 000000000..b4bdbbef2 --- /dev/null +++ b/site/_posts/2022-09-03-ORC-1.8.0.md @@ -0,0 +1,79 @@ +--- +layout: news_item +title: "ORC 1.8.0 Released" +date: "2022-09-03 03:00:00 -0800" +author: william +version: 1.8.0 +categories: [release] +--- + +The ORC team is excited to announce the release of ORC v1.8.0. + +{% assign releaseName = "1.8.0" %} +{% include release_description.md %} + +New Feature and Notable Changes: +- [ORC-450]({{site.jira}}/ORC-450) Support selecting list indices without materializing list items +- [ORC-824]({{site.jira}}/ORC-824) Add column statistics for List and Map +- [ORC-1004]({{site.jira}}/ORC-1004) Java ORC writer supports the selection vector +- [ORC-1075]({{site.jira}}/ORC-1075) Support reading ORC files with no column statistics +- [ORC-1125]({{site.jira}}/ORC-1125) Support decoding decimals in RLE +- [ORC-1136]({{site.jira}}/ORC-1136) Optimize reads by combining multiple reads without significant separation into a single read +- [ORC-1138]({{site.jira}}/ORC-1138) Seek vs Read Optimization +- [ORC-1172]({{site.jira}}/ORC-1172) Add row count limit config for one stripe +- [ORC-1212]({{site.jira}}/ORC-1212) Upgrade protobuf-java to 3.17.3 +- [ORC-1220]({{site.jira}}/ORC-1220) Set min.hadoop.version to 2.7.3 +- [ORC-1248]({{site.jira}}/ORC-1248) Redefine Hadoop dependency for Apache ORC 1.8.0 +- [ORC-1256]({{site.jira}}/ORC-1256) Publish test-jar to maven central +- [ORC-1260]({{site.jira}}/ORC-1260) Publish shaded-protobuf classifier artifacts + +Improvements: +- [ORC-825]({{site.jira}}/ORC-825) Use Empty Array For Collections toArray +- [ORC-826]({{site.jira}}/ORC-826) Do Not Use Collection Contains/Get +- [ORC-828]({{site.jira}}/ORC-828) Improve Fetch Data Set Process +- [ORC-829]({{site.jira}}/ORC-829) Optimize Serialization percentileBits +- [ORC-831]({{site.jira}}/ORC-831) Do Not Copy String When Flushing Dictionary +- [ORC-833]({{site.jira}}/ORC-833) RunLengthIntegerReaderV2 Calculate Batch Size Once +- [ORC-834]({{site.jira}}/ORC-834) Do Not Convert to String in DecimalFromTimestampTreeReader +- [ORC-835]({{site.jira}}/ORC-835) Cache TRUE/FALSE Bytes in StringGroupFromBooleanTreeReader +- [ORC-836]({{site.jira}}/ORC-836) StringGroupFromDoubleTreeReader Use Double toString +- [ORC-837]({{site.jira}}/ORC-837) Reuse HiveDecimalWritable in ConvertTreeReaderFactory +- [ORC-838]({{site.jira}}/ORC-838) Simplify compareTo/equals/putBuffer of ByteBufferAllocatorPool +- [ORC-840]({{site.jira}}/ORC-840) Remove Superfluous Array Fill in RecordReaderImpl +- [ORC-841]({{site.jira}}/ORC-841) Remove Superfluous Array Fill in StringHashTableDictionary +- [ORC-842]({{site.jira}}/ORC-842) Remove newKey from StringHashTableDictionary +- [ORC-844]({{site.jira}}/ORC-844) Improve hashCode Methods +- [ORC-847]({{site.jira}}/ORC-847) Do Not Create Empty Array in StringGroupFromBinaryTreeReader +- [ORC-852]({{site.jira}}/ORC-852) Allow DynamicByteArray to Return a ByteBuffer +- [ORC-853]({{site.jira}}/ORC-853) Optimize writeDouble Implementation +- [ORC-855]({{site.jira}}/ORC-855) Remove Unused isRepeating from RunLengthIntegerReaderV2 +- [ORC-865]({{site.jira}}/ORC-865) Bump opencsv from 3.9 to 5.5.1 +- [ORC-883]({{site.jira}}/ORC-883) Dependency Audit and QA +- [ORC-897]({{site.jira}}/ORC-897) optimization loop termination condition in readerIsCompatible method +- [ORC-935]({{site.jira}}/ORC-935) Bump commons-csv from 1.8 to 1.9.0 +- [ORC-937]({{site.jira}}/ORC-937) Replace deprecated method +- [ORC-958]({{site.jira}}/ORC-958) Convert command support overwrite option +- [ORC-969]({{site.jira}}/ORC-969) Evaluate SearchArguments using file and stripe level stats +- [ORC-975]({{site.jira}}/ORC-975) Avoid double counting closestFixedBits in percentileBits method +- [ORC-982]({{site.jira}}/ORC-982) Extract checkstyle to a single file, help newcomers check code style +- [ORC-988]({{site.jira}}/ORC-988) Bump opencsv from 5.5.1 to 5.5.2 +- [ORC-992]({{site.jira}}/ORC-992) Reached max repeat length, we can directly decide to use DELTA encoding +- [ORC-1005]({{site.jira}}/ORC-1005) Make that the java and C++ implementations of determineEncoding in RunLengthIntegerWriterV2 are consistent. +- [ORC-1007]({{site.jira}}/ORC-1007) Fix a warning from the shade plugin +- [ORC-1013]({{site.jira}}/ORC-1013) Renaming a parameter in constructors of TreeWriter's derived classes +- [ORC-1014]({{site.jira}}/ORC-1014) Add details when we get IOExceptions from file system +- [ORC-1020]({{site.jira}}/ORC-1020) Improve orc::RleDecoderV2::nextDirect +- [ORC-1027]({{site.jira}}/ORC-1027) Filter processing to allow filter injections that cannot be represented via SArgs +- [ORC-1047]({{site.jira}}/ORC-1047) Handle quoted field names during string schema parsing +- [ORC-1077]({{site.jira}}/ORC-1077) Remove commons-codec dependency and use java.util.Base64 +- [ORC-1099]({{site.jira}}/ORC-1099) Extend ReadIntent to support MAP and UNION type +- [ORC-1101]({{site.jira}}/ORC-1101) Improve malformed STRUCT handling +- [ORC-1122]({{site.jira}}/ORC-1122) Add buffer to decode the whole run in RleDecoderV2 +- [ORC-1137]({{site.jira}}/ORC-1137) Improve float/double conversion in DoubleColumnReader::next() +- [ORC-1149]({{site.jira}}/ORC-1149) Bump slf4j.version to 1.7.36 +- [ORC-1150]({{site.jira}}/ORC-1150) Improve RowReaderImpl::computeBatchSize() +- [ORC-1152]({{site.jira}}/ORC-1152) Support encoding short decimals in RLEv2 +- [ORC-1156]({{site.jira}}/ORC-1156) Update opencsv to 5.6 +- [ORC-1163]({{site.jira}}/ORC-1163) Bump zookeeper from 3.7.0 to 3.8.0 +- [ORC-1169]({{site.jira}}/ORC-1169) Use Hadoop 3.3.2 on Java 17+ +- [ORC-1178]({{site.jira}}/ORC-1178) Use hadoop 3.3.3 on Java 17+ \ No newline at end of file diff --git a/site/releases/index.md b/site/releases/index.md index c44149ca2..4b01530c3 100644 --- a/site/releases/index.md +++ b/site/releases/index.md @@ -16,7 +16,7 @@ for ORC files. It also contains tools for working with ORC files and looking at their contents and metadata. {% include release_description.md %} -{% include orc_1.7.md %} +{% include orc_1.8.md %} {% include known_issues.md %} ## Checking signatures diff --git a/site/releases/releases.md b/site/releases/releases.md index d3b16dc31..406db602f 100644 --- a/site/releases/releases.md +++ b/site/releases/releases.md @@ -17,7 +17,7 @@ for ORC files. It also contains tools for working with ORC files and looking at their contents and metadata. {% include release_description.md %} -{% include orc_1.7.md %} +{% include orc_1.8.md %} {% include known_issues.md %} ## Checking signatures
