commits
Thread
Date
Earlier messages
Later messages
Messages by Thread
git commit: CRUNCH-320: Fix PObjectImpl materialization logic.
jwills
git commit: CRUNCH-320: Fix PObjectImpl materialization logic.
jwills
svn commit: r894175 - /websites/production/crunch/content/
greid
svn commit: r1558207 - /crunch/site/trunk/content/user-guide.mdtext
greid
svn commit: r894173 - in /websites/staging/crunch/trunk/content: ./ user-guide.html
buildbot
svn commit: r894140 - /websites/production/crunch/content/
jwills
svn commit: r894139 - in /websites/staging/crunch/trunk/content: ./ getting-started.html
buildbot
svn commit: r1558156 - /crunch/site/trunk/content/getting-started.mdtext
jwills
svn commit: r894137 - in /websites/staging/crunch/trunk/content: ./ getting-started.html user-guide.html
buildbot
svn commit: r1558155 - in /crunch/site/trunk/content: getting-started.mdtext user-guide.mdtext
jwills
svn commit: r894138 - /websites/production/crunch/content/
jwills
svn commit: r1558133 - /crunch/site/trunk/content/user-guide.mdtext
jwills
svn commit: r894128 - in /websites/staging/crunch/trunk/content: ./ user-guide.html
buildbot
svn commit: r1558135 - /crunch/site/trunk/content/user-guide.mdtext
jwills
svn commit: r894129 - in /websites/staging/crunch/trunk/content: ./ user-guide.html
buildbot
svn commit: r1558134 - /crunch/site/trunk/content/user-guide.mdtext
jwills
svn commit: r894131 - /websites/production/crunch/content/
jwills
svn commit: r894127 - /websites/production/crunch/content/
jwills
svn commit: r894126 - in /websites/staging/crunch/trunk/content: ./ user-guide.html
buildbot
svn commit: r894123 - /websites/production/crunch/content/
jwills
svn commit: r1558124 - /crunch/site/trunk/content/user-guide.mdtext
jwills
svn commit: r894121 - in /websites/staging/crunch/trunk/content: ./ user-guide.html
buildbot
svn commit: r894014 - /websites/production/crunch/content/
jwills
svn commit: r894013 - in /websites/staging/crunch/trunk/content: ./ about.html bylaws.html download.html future-work.html getting-started.html index.html mailing-lists.html pipelines.html scrunch.html source-repository.html user-guide.html
buildbot
svn commit: r1557897 - in /crunch/site/trunk: content/download.mdtext lib/path.pm
jwills
svn commit: r894012 - /websites/production/crunch/content/
jwills
svn commit: r893999 - in /websites/staging/crunch/trunk/content: ./ apidocs/0.8.2/ apidocs/0.8.2/org/ apidocs/0.8.2/org/apache/ apidocs/0.8.2/org/apache/crunch/ apidocs/0.8.2/org/apache/crunch/class-use/ apidocs/0.8.2/org/apache/crunch/contrib/ apidocs...
buildbot
svn commit: r1557874 - in /crunch/site/trunk: content/ content/apidocs/0.8.2/ content/apidocs/0.8.2/org/ content/apidocs/0.8.2/org/apache/ content/apidocs/0.8.2/org/apache/crunch/ content/apidocs/0.8.2/org/apache/crunch/class-use/ content/apidocs/0.8.2...
jwills
git commit: CRUNCH-239: Add a Union PType.
jwills
git commit: CRUNCH-239: Add a Union PType.
jwills
svn commit: r893360 - /websites/production/crunch/content/
greid
svn commit: r893311 - in /websites/staging/crunch/trunk/content: ./ intro.html
buildbot
svn commit: r1556538 - /crunch/site/trunk/content/intro.mdtext
greid
[1/5] git commit: CRUNCH-312: Determine the right datum reader/writer for derived Avro types
jwills
[5/5] git commit: CRUNCH-318: Add sleep to fix CheckpointIT.
jwills
[3/5] git commit: CRUNCH-314: Separate shuffle and bundle AvroMode configuration.
jwills
[4/5] git commit: CRUNCH-315: Add support for Empty PCollections/PTables.
jwills
[2/5] git commit: CRUNCH-313: Copy the Configuration object used by CrunchInputSplit so it doesn't conflict with settings from the base Configuration.
jwills
git commit: CRUNCH-316: Integration test for SafeAvroSerialization and ArrayIndexOutOfBoundsException
mkwhit
git commit: CRUNCH-316: Integration test for SafeAvroSerialization and ArrayIndexOutOfBoundsException
mkwhit
git commit: CRUNCH-318: Add sleep to fix CheckpointIT.
jwills
git commit: CRUNCH-316: Converted SafeAvroSerialization to use DirectBinaryEncoder instead of BufferedBinaryEncoder.
mkwhit
git commit: CRUNCH-316: Converted SafeAvroSerialization to use DirectBinaryEncoder instead of BufferedBinaryEncoder.
mkwhit
git commit: CRUNCH-315: Add support for Empty PCollections/PTables.
jwills
git commit: CRUNCH-314: Separate shuffle and bundle AvroMode configuration.
jwills
git commit: CRUNCH-313: Copy the Configuration object used by CrunchInputSplit so it doesn't conflict with settings from the base Configuration.
jwills
git commit: CRUNCH-312: Determine the right datum reader/writer for derived Avro types
jwills
svn commit: r3936 - in /release/crunch: crunch-0.7.0/ crunch-0.8.1/ crunch-0.8.2/ crunch-0.9.0/
jwills
[1/2] git commit: [maven-release-plugin] prepare release apache-crunch-0.9.0
jwills
[2/2] git commit: [maven-release-plugin] prepare for next development iteration
jwills
[1/2] git commit: [maven-release-plugin] prepare release apache-crunch-0.9.0
jwills
[2/2] git commit: [maven-release-plugin] prepare for next development iteration
jwills
[1/2] CRUNCH-308: A working version of Crunch against the HBase 0.96 APIs and Hadoop 2.2.0.
jwills
[1/7] git commit: Spark to 0.8.2-SNAPSHOT on this branch
jwills
[6/7] git commit: [maven-release-plugin] prepare release apache-crunch-0.8.2
jwills
[5/7] git commit: [maven-release-plugin] prepare release apache-crunch-0.8.1-hadoop2
jwills
[7/7] git commit: [maven-release-plugin] prepare for next development iteration
jwills
[3/7] git commit: [maven-release-plugin] prepare release apache-crunch-0.8.0
jwills
[2/7] git commit: Prepare for next development iteration
jwills
[4/7] git commit: [maven-release-plugin] prepare for next development iteration
jwills
git commit: CRUNCH-311: Add support for file renaming to AvroPathPerKeyTarget.
jwills
[1/2] git commit: Trivial Scrunch test fix for Hadoop2
jwills
[2/2] git commit: Get spark distributed cache working on hadoop2
jwills
[1/9] Crunch on Spark
jwills
[4/9] Crunch on Spark
jwills
[7/9] Generalizing Crunch's Collection APIs to support more execution frameworks
jwills
git commit: CRUNCH-306: One output path per key, Avro edition.
jwills
git commit: CRUNCH-310: AvroParquetFileSource with builder interface for selecting fields and specifying a filter class.
jwills
git commit: CRUNCH-309: Fixed Cogroup4 ArrayIndexOutOfBoundsException and corresponding test. Contributed by Nathan Langlois.
jwills
[1/2] git commit: CRUNCH-307: Limit the number of concurrently running jobs
chaoshi
[2/2] git commit: Merge branch 'crunch-307'
chaoshi
git commit: CRUNCH-304: Exposed a cleanup method for consumers
mkwhit
git commit: CRUNCH-303: Disable "combine input" in HFileSource
chaoshi
git commit: Second cut at rewriting custom Writable types to a more compact format.
jwills
svn commit: r888091 - /websites/production/crunch/content/
jwills
svn commit: r888090 - in /websites/staging/crunch/trunk/content: ./ intro.html
buildbot
svn commit: r1545498 - /crunch/site/trunk/content/intro.mdtext
jwills
svn commit: r887983 - in /websites/staging/crunch/trunk/content: ./ intro.html
buildbot
svn commit: r887984 - /websites/production/crunch/content/
jwills
svn commit: r1545156 - in /crunch/site/trunk/content: apidocs/current intro.mdtext
jwills
svn commit: r887979 - /websites/production/crunch/content/
jwills
svn commit: r887976 - in /websites/staging/crunch/trunk/content: ./ intro.html
buildbot
svn commit: r1545153 - in /crunch/site/trunk/content: apidocs/current intro.mdtext
jwills
git commit: CRUNCH-302: Initialize PType input/output functions when writing data in the MemPipeline.
jwills
git commit: CRUNCH-300: Allow MemPipeline to write Avro files by reflection and add more tests for writes done from MemPipeline.
jwills
svn commit: r887613 - in /websites/staging/crunch/trunk/content: ./ download.html
buildbot
svn commit: r1544354 - /crunch/site/trunk/content/download.mdtext
jwills
git commit: CRUNCH-301: Clever deep copies in Scrunch code
jwills
git commit: CRUNCH-294: Cost-based planning with materialize as breakpoint.
jwills
git commit: CRUNCH-293: Add AvroMode to inject avro readers.
mkwhit
svn commit: r3620 - in /release/crunch: crunch-0.8.0/ crunch-0.8.1/
jwills
git commit: CRUNCH-298: Slim down FormatBundle serialization
jwills
git commit: CRUNCH-297: Add parallelism options for joins/cogroups to the Scrunch API
jwills
git commit: CRUNCH-225: Added support for building using Scala 2.10 and 2.9. Also removed unused build.sbt file
mkwhit
svn commit: r886197 - /websites/production/crunch/content/
jwills
svn commit: r886196 - in /websites/staging/crunch/trunk/content: ./ apidocs/0.8.0/ apidocs/0.8.0/org/ apidocs/0.8.0/org/apache/ apidocs/0.8.0/org/apache/crunch/ apidocs/0.8.0/org/apache/crunch/class-use/ apidocs/0.8.0/org/apache/crunch/contrib/ apidocs...
buildbot
svn commit: r1540782 - in /crunch/site/trunk: content/ content/apidocs/0.8.0/ content/apidocs/0.8.0/org/ content/apidocs/0.8.0/org/apache/ content/apidocs/0.8.0/org/apache/crunch/ content/apidocs/0.8.0/org/apache/crunch/class-use/ content/apidocs/0.8.0...
jwills
svn commit: r3462 - /release/crunch/crunch-0.8.0/
jwills
git commit: Correct next release version: 0.9.0-SNAPSHOT
jwills
[1/2] git commit: [maven-release-plugin] prepare branch apache-crunch-0.8
jwills
[2/2] git commit: [maven-release-plugin] prepare for next development iteration
jwills
git commit: License header fix.
jwills
git commit: CRUNCH-292: Hack around job counter limits in Hadoop-2 for in-memory pipelines
jwills
git commit: CRUNCH-286 Allow distinct Combiner to be supplied
greid
git commit: CRUNCH-289 Allow materializing Avro SpecificRecords
greid
git commit: CRUNCH-291 Add toString method on CrunchInputSplit
greid
git commit: CRUNCH-288: Make it easier to identify the ID of the failing job in a long pipeline
jwills
git commit: CRUNCH-270: Removed packaged log4j.properties files
mkwhit
git commit: CRUNCH-287: Switch internal APIs and integration tests to use ReadableData.
jwills
[1/2] CRUNCH-278: Refactor MapsideJoin logic and introduce ReadableData abstraction for working with in-memory datasets in Crunch jobs.
jwills
git commit: CRUNCH-283: Additional diagnostics for the planner dotfile.
jwills
git commit: CRUNCH-281: A proper fix for the issue originally handled by CRUNCH-237.
jwills
git commit: CRUNCH-282: Add a parameter to control the maximum number of reducers for a job
jwills
[1/2] CRUNCH-276: Various static checks and FindBugs fixes. Contributed by Sean Owen.
jwills
git commit: CRUNCH-277: Add licensing info for parquet.
mafr
git commit: CRUNCH-277. Support Parquet.
tomwhite
git commit: Remove import statement for sun internal class
greid
[1/2] git commit: CRUNCH-274: Add extra configuration arguments for ParallelDoOptions
jwills
[2/2] git commit: CRUNCH-275: Support extra config args on Source, Target, and SourceTarget
jwills
git commit: CRUNCH-273: Make PipelineExecution implement ListenableFuture
jwills
git commit: CRUNCH-264: Add map-side outputs to the dot.plan file
jwills
git commit: CRUNCH-271: Cache Counters immediately upon Job completion
jwills
git commit: CRUNCH-268: Use stable names for Crunch's internal Avro schemas for tuple types.
jwills
git commit: CRUNCH-269: Add option for disabling deep copies on intermediate outputs from DoFns.
jwills
git commit: CRUNCH-267: Fix several HFileUtils#scanHFiles related problems
chaoshi
git commit: CRUNCH-266: AvroSpecificDeepCopier needs to use constructor on SpecificDatumReader that takes a class. Contributed by Brian Dougan.
jwills
git commit: CRUNCH-265: Allow clients to specify the number of reducers to use for default and one-to-many joins
jwills
git commit: CRUNCH-263: Provide sensible defaults for the max split size of CrunchCombineFileInputFormat
jwills
git commit: CRUNCH-262: Shorten job name if it is too long (putting "..." to the end)
chaoshi
git commit: CRUNCH-261: Make HBase sources readable.
jwills
git commit: CRUNCH-260: Switched Cloudera Copyright for Apache
mkwhit
git commit: CRUNCH-258: Support multiple output channels from a DoFn. Contributed by Brandon Inman.
jwills
git commit: CRUNCH-165: The latest attempt at using CombineFileInputFormat wherever possible.
jwills
git commit: Fix compilation failure on jdk 1.6 of my previous commit (because of using java.util.Objects)
chaoshi
git commit: CRUNCH-246: HFileSource and related utilities (Thanks Ryan Brush for contributing HFileInputFormat)
chaoshi
git commit: One-liner: com.sun.o.a.commons.logging -> o.a.commons.logging
jwills
git commit: CRUNCH-256: Cache intermediate file IDs for the sequential naming scheme, which is now a singleton.
jwills
git commit: CRUNCH-255: HFileOutputFormatForCrunch should use configuration from table for compression, block encoding, block size...
chaoshi
git commit: Minor change in README about the HBase version
chaoshi
git commit: CRUNCH-251: Fix HFileUtils#sortAndPartition does not work when two instances exist in the same pipeline
chaoshi
git commit: Crunch-254: Added access to the underlying JobId for MRPipeline jobs
mkwhit
git commit: CRUNCH-226: Switch to newer scala-maven-plugin.
mkwhit
git commit: CRUNCH-253: Make the sourceTarget API calls on ParallelDoOptions and GroupingOptions consistent
jwills
git commit: CRUNCH-250: Remove unnecessary bzip2 support; Hadoop handles this natively
jwills
git commit: CRUNCH-249: Fix HFileTargetIT failure under hadoop2
chaoshi
git commit: CRUNCH-248: Fix exception masking issue in CrunchReducer caused by SingleUseIterable
jwills
git commit: CRUNCH-247: Enable the planner to take advantage of to-be-materialized outputs during job planning.
jwills
[1/3] CRUNCH-212: Target wrapper for HFileOuptutFormat
chaoshi
[3/3] git commit: CRUNCH-212: Target wrapper for HFileOuptutFormat
chaoshi
git commit: CRUNCH-245: Fix hbase.zookeeper.quorum is overriden by hbase-default.xml
chaoshi
svn commit: r2664 - /release/crunch/crunch-0.6.0/
jwills
svn commit: r872399 - /websites/production/crunch/content/
jwills
svn commit: r872398 - in /websites/staging/crunch/trunk/content: ./ apidocs/0.7.0/ apidocs/0.7.0/org/ apidocs/0.7.0/org/apache/ apidocs/0.7.0/org/apache/crunch/ apidocs/0.7.0/org/apache/crunch/class-use/ apidocs/0.7.0/org/apache/crunch/contrib/ apidocs...
buildbot
svn commit: r1509548 - in /crunch/site/trunk: content/ content/apidocs/0.7.0/ content/apidocs/0.7.0/org/ content/apidocs/0.7.0/org/apache/ content/apidocs/0.7.0/org/apache/crunch/ content/apidocs/0.7.0/org/apache/crunch/class-use/ content/apidocs/0.7.0...
jwills
svn commit: r2596 - /release/crunch/crunch-0.7.0/
jwills
[1/2] git commit: [maven-release-plugin] prepare branch apache-crunch-0.7
jwills
[2/2] git commit: [maven-release-plugin] prepare for next development iteration
jwills
git commit: CRUNCH-242: Control the input/output conversion via the Source and Target interfaces
jwills
git commit: CRUNCH-243: Support easily extensibility for custom reading of Avro Datum
mkwhit
git commit: CRUNCH-231: Support legacy Mappers and Reducers in Crunch.
jwills
git commit: CRUNCH-241: Write side outputs from the map phase of a MapReduce job
jwills
git commit: CRUNCH-205: Delete superfluous build directory during integration tests
mkwhit
git commit: CRUNCH-240: Make DefaultJoinStrategy.join(PTable, PTable, JoinFn) public
jwills
git commit: CRUNCH-174: Add support for cogrouping 3, 4, or N inputs.
jwills
git commit: CRUNCH-238: Add numReducers options to the SecondarySort lib
jwills
git commit: CRUNCH-237: Improper job dependencies for certain types of long pipelines
jwills
svn commit: r1503358 - /crunch/site/trunk/content/about.mdtext
jwills
svn commit: r869594 - in /websites/staging/crunch/trunk/content: ./ about.html
buildbot
git commit: CRUNCH-236 Set context on wrapped MapFn in OneToManyJoin
greid
git commit: CRUNCH-235. Avoid exposing incompatible Hadoop classes in Crunch API.
tomwhite
git commit: CRUNCH-234: Fix non-zero size on empty intermediate outputs in hadoop2
jwills
git commit: CRUNCH-233: Fix InterruptedException error for hadoop1 caused by my last patch.
jwills
git commit: CRUNCH-233: Handled InterruptedException thrown in hadoop2. Contributed by Micah Whitacre.
jwills
git commit: CRUNCH-232: Ensure that all nodes are cleaned up during joins (or any Crunch job involving unions)
jwills
git commit: CRUNCH-228: FileTargetImpl cuts off extensions of output files
dbeech
git commit: CRUNCH-218: Add a WriteMode for checkpoint outputs, and make invalid checkpoint targets throw a CrunchRuntimeException.
jwills
git commit: CRUNCH-229: Better error handling for incompatible Target/PType combinations.
jwills
git commit: CRUNCH-219: Allow FileSourceImpl to take in multiple paths
jwills
svn commit: r866598 - /websites/production/crunch/content/
mafr
svn commit: r866597 - in /websites/staging/crunch/trunk/content: ./ apidocs/0.3.0/index.html apidocs/0.4.0/index.html apidocs/0.5.0/index.html apidocs/0.6.0/index.html
buildbot
svn commit: r1494921 - in /crunch/site/trunk/content/apidocs: 0.3.0/index.html 0.4.0/index.html 0.5.0/index.html 0.6.0/index.html
mafr
git commit: CRUNCH-224 Support SequenceFiles in MemPipeline
greid
git commit: CRUNCH-221: Ignore hidden files during materialization.
jwills
git commit: CRUNCH-220: Use the FileSystem implied by the target path in FileTargetImpl
jwills
git commit: CRUNCH-223: Fix WordCountHBaseIT failure
chaoshi
git commit: CRUNCH-220: Ensure existing target checking works on all filesystems
jwills
git commit: CRUNCH-214: Fix compilation error on jdk6
chaoshi
git commit: CRUNCH-215 Add BloomFilterJoinStrategy
greid
git commit: CRUNCH-217: Ensure PipelineResult captures pipeline failures. Contributed by Joe Adler.
jwills
git commit: CRUNCH-211 Add one-to-many join functionality
greid
[1/2] CRUNCH-213 Add sharded join
greid
git commit: CRUNCH-162: Add a Shard library for rebalancing the contents of PCollections
jwills
git commit: CRUNCH-210: Remove deprecated MapValuesFn references from cogroup and add support for user-specified parallelism for cogroup jobs
jwills
git commit: CRUNCH-209: Fix InputSplit bug that occurs with very large input directories
jwills
git commit: CRUNCH-208: Add mapValues convenience functions for PTable and PGroupedTable as well as a mapKeys function for PTable. Deprecate the MapKeysFn and MapValuesFn in favor of these new methods.
jwills
svn commit: r861978 - /websites/production/crunch/content/
jwills
svn commit: r1995 - /release/crunch/crunch-0.5.0-incubating/
jwills
svn commit: r861975 - in /websites/staging/crunch/trunk/content: ./ download.html
buildbot
svn commit: r1482397 - /crunch/site/trunk/content/download.mdtext
jwills
git commit: CRUNCH-206: Upgrade base Hadoop version to 1.1.2 and HBase to 0.94.3
jwills
Earlier messages
Later messages