[jira] Subscription: PIG patch available
Issue Subscription Filter: PIG patch available (36 issues) Subscriber: pigdaily Key Summary PIG-5369Add llap-client dependency https://issues.apache.org/jira/browse/PIG-5369 PIG-5360Pig sets working directory of input file systems causes exception thrown https://issues.apache.org/jira/browse/PIG-5360 PIG-5338Prevent deep copy of DataBag into Jython List https://issues.apache.org/jira/browse/PIG-5338 PIG-5323Implement LastInputStreamingOptimizer in Tez https://issues.apache.org/jira/browse/PIG-5323 PIG-5273_SUCCESS file should be created at the end of the job https://issues.apache.org/jira/browse/PIG-5273 PIG-5267Review of org.apache.pig.impl.io.BufferedPositionedInputStream https://issues.apache.org/jira/browse/PIG-5267 PIG-5256Bytecode generation for POFilter and POForeach https://issues.apache.org/jira/browse/PIG-5256 PIG-5160SchemaTupleFrontend.java is not thread safe, cause PigServer thrown NPE in multithread env https://issues.apache.org/jira/browse/PIG-5160 PIG-5115Builtin AvroStorage generates incorrect avro schema when the same pig field name appears in the alias https://issues.apache.org/jira/browse/PIG-5115 PIG-5106Optimize when mapreduce.input.fileinputformat.input.dir.recursive set to true https://issues.apache.org/jira/browse/PIG-5106 PIG-5081Can not run pig on spark source code distribution https://issues.apache.org/jira/browse/PIG-5081 PIG-5080Support store alias as spark table https://issues.apache.org/jira/browse/PIG-5080 PIG-5057IndexOutOfBoundsException when pig reducer processOnePackageOutput https://issues.apache.org/jira/browse/PIG-5057 PIG-5029Optimize sort case when data is skewed https://issues.apache.org/jira/browse/PIG-5029 PIG-4926Modify the content of start.xml for spark mode https://issues.apache.org/jira/browse/PIG-4926 PIG-4913Reduce jython function initiation during compilation https://issues.apache.org/jira/browse/PIG-4913 PIG-4849pig on tez will cause tez-ui to crash,because the content from timeline server is too long. https://issues.apache.org/jira/browse/PIG-4849 PIG-4750REPLACE_MULTI should compile Pattern once and reuse it https://issues.apache.org/jira/browse/PIG-4750 PIG-4684Exception should be changed to warning when job diagnostics cannot be fetched https://issues.apache.org/jira/browse/PIG-4684 PIG-4656Improve String serialization and comparator performance in BinInterSedes https://issues.apache.org/jira/browse/PIG-4656 PIG-4598Allow user defined plan optimizer rules https://issues.apache.org/jira/browse/PIG-4598 PIG-4551Partition filter is not pushed down in case of SPLIT https://issues.apache.org/jira/browse/PIG-4551 PIG-4539New PigUnit https://issues.apache.org/jira/browse/PIG-4539 PIG-4515org.apache.pig.builtin.Distinct throws ClassCastException https://issues.apache.org/jira/browse/PIG-4515 PIG-4373Implement PIG-3861 in Tez https://issues.apache.org/jira/browse/PIG-4373 PIG-4323PackageConverter hanging in Spark https://issues.apache.org/jira/browse/PIG-4323 PIG-4313StackOverflowError in LIMIT operation on Spark https://issues.apache.org/jira/browse/PIG-4313 PIG-4251Pig on Storm https://issues.apache.org/jira/browse/PIG-4251 PIG-4002Disable combiner when map-side aggregation is used https://issues.apache.org/jira/browse/PIG-4002 PIG-3952PigStorage accepts '-tagSplit' to return full split information https://issues.apache.org/jira/browse/PIG-3952 PIG-3911Define unique fields with @OutputSchema https://issues.apache.org/jira/browse/PIG-3911 PIG-3877Getting Geo Latitude/Longitude from Address Lines https://issues.apache.org/jira/browse/PIG-3877 PIG-3873Geo distance calculation using Haversine https://issues.apache.org/jira/browse/PIG-3873 PIG-3668COR built-in function when atleast one of the coefficient values is NaN https://issues.apache.org/jira/browse/PIG-3668 PIG-3587add functionality for rolling over dates https://issues.apache.org/jira/browse/PIG-3587 PIG-1804Alow Jython function to implement Algebraic and/or Accumulator interfaces https://issues.apache.org/jira/browse/PIG-1804 You may edit this subscription at: https://issues.apache.org/jira/secure/EditSubscription!default.jspa?subId=16328=12322384
[jira] [Commented] (PIG-5371) Hdfs bytes written assertions fail in TestPigRunner
[ https://issues.apache.org/jira/browse/PIG-5371?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16721419#comment-16721419 ] Adam Szita commented on PIG-5371: - Hi [~abstractdog], can you please elaborate on {quote}TestPigRunner - work, only on an internal maintenance line {quote} I am able to run TestPigRunner checked out from trunk as per: {code:java} ant clean jar ant test -Dtest=TestPigRunner{code} ..and it succeeds: {code:java} BUILD SUCCESSFUL Total time: 8 minutes 47 seconds{code} > Hdfs bytes written assertions fail in TestPigRunner > --- > > Key: PIG-5371 > URL: https://issues.apache.org/jira/browse/PIG-5371 > Project: Pig > Issue Type: Bug >Reporter: Laszlo Bodor >Assignee: Laszlo Bodor >Priority: Major > Attachments: PIG-5371.01.patch, simpleTest.out > > > Attached [^simpleTest.out]. It seems like HDFS counter 'HDFS_BYTES_WRITTEN' > returns the byte count not only for the result of pig store operator, but it > includes the size of the jar files as well. The problem is this could change > very easily, so in my opinion the best would be to remove these assertions > from TestPigRunner as this is just causing intermittent and/or persistent > failures. > The test class is for basic testing of PigRunner, and this is achieved well > enough without the asserts. > {code} > 2018-11-23 10:14:52,661 [IPC Server handler 5 on 54929] INFO > org.apache.hadoop.hdfs.StateChange - BLOCK* allocate blk_1073741827_1003, > replicas=127.0.0.1:54934, 127.0.0.1:54930, 127.0.0.1:54943 for > /tmp/temp-157262781/tmp-1057655772/automaton-1.11-8.jar > ... > 2018-11-23 10:14:52,735 [PacketResponder: > BP-26001448-10.200.50.195-1542964474138:blk_1073741827_1003, > type=HAS_DOWNSTREAM_IN_PIPELINE, downstreams=2:[127.0.0.1:54930, > 127.0.0.1:54943]] INFO > org.apache.hadoop.hdfs.server.datanode.DataNode.clienttrace - src: > /127.0.0.1:54978, dest: /127.0.0.1:54934, bytes: 176285, op: HDFS_WRITE, > cliID: DFSClient_NONMAPREDUCE_-1959727442_1, offset: 0, srvID: > 108c4000-1ae0-402e-82cf-bf403629c0f7, blockid: > BP-26001448-10.200.50.195-1542964474138:blk_1073741827_1003, duration(ns): > 57162859 > {code} -- This message was sent by Atlassian JIRA (v7.6.3#76005)