[
https://issues.apache.org/jira/browse/BEAM-7268?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Gleb Kanterov updated BEAM-7268:
--------------------------------
Description:
Right now the Java sorter extension depends on Hadoop SequenceFile for external
sort. It'll be nice to re-implement it without the dependency to avoid
conflicts.
Pull request adds a new option to switch to native sorting, the default
behavior remains Hadoop sorting.
{code}
final BufferedExternalSorter.Options sorterOpts =
BufferedExternalSorter
.options()
.withExternalSorterType(ExternalSorter.Options.SorterType.HADOOP);
...
.apply(SortValues.create(sorterOpts));
{code}
was:Right now the Java sorter extension depends on Hadoop SequenceFile for
external sort. It'll be nice to re-implement it without the dependency to avoid
conflicts.
> Make external sorter Hadoop free
> --------------------------------
>
> Key: BEAM-7268
> URL: https://issues.apache.org/jira/browse/BEAM-7268
> Project: Beam
> Issue Type: Improvement
> Components: extensions-java-sorter
> Affects Versions: 2.13.0
> Reporter: Neville Li
> Assignee: Neville Li
> Priority: Minor
> Time Spent: 2h 20m
> Remaining Estimate: 0h
>
> Right now the Java sorter extension depends on Hadoop SequenceFile for
> external sort. It'll be nice to re-implement it without the dependency to
> avoid conflicts.
> Pull request adds a new option to switch to native sorting, the default
> behavior remains Hadoop sorting.
> {code}
> final BufferedExternalSorter.Options sorterOpts =
> BufferedExternalSorter
> .options()
> .withExternalSorterType(ExternalSorter.Options.SorterType.HADOOP);
> ...
> .apply(SortValues.create(sorterOpts));
> {code}
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)