Re: Review Request 46701: DATAFU-117 - New UDF - CountDistinctUpTo

2016-06-05 Thread Eyal Allweil
/ Testing --- Thanks, Eyal Allweil

Review Request 49248: New UDF - TupleDiff

2016-06-27 Thread Eyal Allweil
--- New UDF - TupleDiff Diffs - datafu-pig/src/main/java/datafu/pig/util/TupleDiff.java PRE-CREATION datafu-pig/src/test/java/datafu/test/pig/util/TupleDiffTest.java PRE-CREATION Diff: https://reviews.apache.org/r/49248/diff/ Testing --- Thanks, Eyal Allweil

Making the rest of bag and set UDF's implement Accumulator

2016-02-21 Thread Eyal Allweil
to optimize plans including these UDF's much better. If no one objects (and it is likely to be accepted) I can prepare such implementations for review - I don't think it is a lot of work. What do you think? Regards,Eyal Allweil

Review Request 46701: DATAFU-117 - New UDF - CountDistinctUpTo

2016-04-27 Thread Eyal Allweil
--- Thanks, Eyal Allweil

Re: Review Request 46701: DATAFU-117 - New UDF - CountDistinctUpTo

2016-05-19 Thread Eyal Allweil
utomatically generated e-mail. To reply, visit: https://reviews.apache.org/r/46701/#review133803 ------- On April 27, 2016, 7:44 a.m., Eyal Allweil wrote: > >

Re: Graduation?

2017-02-11 Thread Eyal Allweil
Count me in too. On Friday, February 10, 2017 10:04 AM, Roman Shaposhnik wrote: Cool. Another question: besides Russell and Matthew who can also volunteer time commitment and interest to be on the PMC? Thanks, Roman. On Thu, Feb 9, 2017 at 10:30 PM, Russell

Re: Review Request 55110: DATAFU-106 Test files are currently created in the subdirectory folder (e.g. datafu-pig/input*). For better organization, they should be created in a subdirectory.

2017-01-16 Thread Eyal Allweil
> On Jan. 2, 2017, 11:29 a.m., Eyal Allweil wrote: > > Hi Piyush, > > > > Thank you for your patch! It looks to me that it works fine - I ran our > > tests on Ubuntu, both from Eclipse and from the command line. > > > > I have two comments, one "

Re: Discussion of 'Becoming a Committer' Page

2017-04-08 Thread Eyal Allweil
It looks good to me, too. But maybe we can make the inactivity-for-committer-removal period longer than what is described in the Fineract model? (six months + four weeks notice) That seems too short to me. On Thursday, April 6, 2017 8:38 PM, Mitul Tiwari wrote:

Re: Podling Report Reminder - July 2017

2017-07-10 Thread Eyal Allweil
Hi John, I can try to put it together in the next two days or so  ... is it too late for July? Regards,Eyal On Friday, July 7, 2017 6:09 AM, John D. Ament wrote: All, Please note that DataFu's report is past due.  Is anyone available to put one together? On

Re: What's the status of DataFu?

2017-07-10 Thread Eyal Allweil
Hi John (and everyone else), There isn't a lot of traffic, but over the past few months the community has been working on preparing for graduation - I think we're almost there. There was one JIRA issue opened this past month by a new contributor, which I've already reviewed and need to commit.

Re: [RESULT] [VOTE] Apache DataFu graduation proposal

2017-07-31 Thread Eyal Allweil
Oh, I see now that you've already updated the report. Looks good! Regards,Eyal On Tuesday, August 1, 2017 8:37 AM, Eyal Allweil <eyal_allw...@yahoo.com> wrote: Great news! Matthew, will you write (our last) podling report? If you don't have time, is there anything else that needs

Re: About DATAFU-83

2017-05-31 Thread Eyal Allweil
It's true that the native Pig IN operator has made InUdf kind of unnecessary, but as long as it's included in DataFu this bug is worth fixing, in my opinion. On Tuesday, May 30, 2017 8:21 AM, Prafulla wrote: Hello Team, I see that following jira item is

Re: Shepherd Report

2017-09-11 Thread Eyal Allweil
Hi Dave, everyone - Is there any update with this? Are we graduating? Regards,Eyal On Thursday, August 3, 2017 12:24 AM, Matthew Hayes wrote: Dave the website has been updated based on your feedback:

Re: Podling Report Reminder - October 2017

2017-10-02 Thread Eyal Allweil
Matthew, are you preparing this? Do you want me to? On Sunday, October 1, 2017 4:06 PM, "johndam...@apache.org" wrote: Dear podling, This email was sent by an automated system on behalf of the Apache Incubator PMC. It is an initial reminder to give you plenty

Re: Podling Report Reminder - October 2017

2017-10-02 Thread Eyal Allweil
thew Hayes <matthew.terence.ha...@gmail.com> wrote: Eyal, I haven't started writing it yet.  Want to write this one? -Matt On Sun, Oct 1, 2017 at 11:12 PM, Eyal Allweil < eyal_allw...@yahoo.com.invalid> wrote: > Matthew, are you preparing this? Do you want me to? > > > 

Re: Task #b90969e5: Add left outer join macro

2017-11-06 Thread Eyal Allweil
Hi Varun, Thank you for your interest! Do you have any experience with Pig? For setting up your environment, you should follow this guide: http://datafu.incubator.apache.org/community/contributing.html We haven't published instructions for how to contribute Pig macros. I'll try to write a rough

[VOTE] Apache DataFu 1.3.3 release RC0

2018-01-15 Thread Eyal Allweil
My vote: +1 I downloaded the sources, checked hashes, signature, built and ran tests. I didn't have a chance to check it out on a cluster though.

[VOTE] Apache DataFu 1.3.3 release RC0

2018-01-15 Thread Eyal Allweil
My vote: +1 I downloaded the sources, checked hashes, signature, built and ran tests. I didn't have a chance to check it out on a cluster though.

Re: [VOTE] Apache DataFu 1.3.3 release RC1

2018-01-21 Thread Eyal Allweil
I checked md5, asc, ran tests from source and ran a sample script on one of the new macros on a cluster with the jar. +1 On Friday, January 19, 2018, 3:16:05 AM GMT+2, Mitul Tiwari wrote: +1 On Thu, Jan 18, 2018 at 4:17 PM, Matthew Hayes <

Re: [VOTE] Apache DataFu 1.3.3 release RC1

2018-01-21 Thread Eyal Allweil
Hi Justin, Don't worry, we don't move that fast. I know the incubator vote is ongoing (I saw your +1 :-) ) - but I've actually started preparing the post and didn't want someone to duplicate my efforts. If we need another release candidate the blog post will probably still stay more or less

Re: [VOTE] Apache DataFu 1.3.3 release RC1

2018-01-21 Thread Eyal Allweil
<matthew.terence.ha...@gmail.com> wrote: The vote has passed with three +1 binding votes, and no -1s or 0s. Binding +1s: Matt, Mitul, Eyal -Matt On Sun, Jan 21, 2018 at 3:18 AM, Eyal Allweil < eyal_allw...@yahoo.com.invalid> wrote: > I checked md5, asc, ran tests from source a

Re: [VOTE] Apache DataFu graduation proposal

2018-02-02 Thread Eyal Allweil
+1 On Friday, February 2, 2018, 4:02:31 AM GMT+2, Jakob Homan wrote: +1 On 1 February 2018 at 15:38, Mitul Tiwari wrote: > +1 > > On Thu, Feb 1, 2018 at 3:18 PM, Matthew Hayes wrote: > >> Hi all, >> >> I would like to

[jira] [Updated] (DATAFU-117) New UDF - CountDistinctUpTo

2016-06-08 Thread Eyal Allweil (JIRA)
[ https://issues.apache.org/jira/browse/DATAFU-117?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Eyal Allweil updated DATAFU-117: Attachment: DATAFU-117-4.patch This patch incorporates the last remaining comment from the review

[jira] [Created] (DATAFU-114) Make FirstTupleFromBag implement Accumulator

2016-01-14 Thread Eyal Allweil (JIRA)
Eyal Allweil created DATAFU-114: --- Summary: Make FirstTupleFromBag implement Accumulator Key: DATAFU-114 URL: https://issues.apache.org/jira/browse/DATAFU-114 Project: DataFu Issue Type

[jira] [Updated] (DATAFU-114) Make FirstTupleFromBag implement Accumulator

2016-01-14 Thread Eyal Allweil (JIRA)
[ https://issues.apache.org/jira/browse/DATAFU-114?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Eyal Allweil updated DATAFU-114: Attachment: FirstTupleFromBag.java I wasn't able to test this patch because I can't get the build

[jira] [Commented] (DATAFU-95) Improve wrong JDK error message

2016-01-14 Thread Eyal Allweil (JIRA)
[ https://issues.apache.org/jira/browse/DATAFU-95?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15097858#comment-15097858 ] Eyal Allweil commented on DATAFU-95: As an immediate, easy-to-do improvement, writing what Java version

[jira] [Commented] (DATAFU-119) New UDF - TupleDiff

2016-06-27 Thread Eyal Allweil (JIRA)
[ https://issues.apache.org/jira/browse/DATAFU-119?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15350489#comment-15350489 ] Eyal Allweil commented on DATAFU-119: - I put up a [reviewboard|https://reviews.apache.org/r/49248

[jira] [Commented] (DATAFU-114) Make FirstTupleFromBag implement Accumulator

2016-02-04 Thread Eyal Allweil (JIRA)
[ https://issues.apache.org/jira/browse/DATAFU-114?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15131991#comment-15131991 ] Eyal Allweil commented on DATAFU-114: - Anyone? > Make FirstTupleFromBag implement Accumula

[jira] [Commented] (DATAFU-114) Make FirstTupleFromBag implement Accumulator

2016-01-25 Thread Eyal Allweil (JIRA)
[ https://issues.apache.org/jira/browse/DATAFU-114?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15114990#comment-15114990 ] Eyal Allweil commented on DATAFU-114: - Any comments? Can this patch be pulled? > M

[jira] [Commented] (DATAFU-114) Make FirstTupleFromBag implement Accumulator

2016-02-17 Thread Eyal Allweil (JIRA)
[ https://issues.apache.org/jira/browse/DATAFU-114?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15150312#comment-15150312 ] Eyal Allweil commented on DATAFU-114: - Thanks! After I imported the projects individually, like you

[jira] [Created] (DATAFU-117) New UDF - CountDistinctUpTo

2016-03-24 Thread Eyal Allweil (JIRA)
Eyal Allweil created DATAFU-117: --- Summary: New UDF - CountDistinctUpTo Key: DATAFU-117 URL: https://issues.apache.org/jira/browse/DATAFU-117 Project: DataFu Issue Type: New Feature

[jira] [Updated] (DATAFU-117) New UDF - CountDistinctUpTo

2016-03-24 Thread Eyal Allweil (JIRA)
[ https://issues.apache.org/jira/browse/DATAFU-117?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Eyal Allweil updated DATAFU-117: Attachment: DATAFU-117.patch Patch including new UDF and test (in BagTests) > New

[jira] [Commented] (DATAFU-115) Make TupleFromBag implement Accumulator

2016-03-27 Thread Eyal Allweil (JIRA)
[ https://issues.apache.org/jira/browse/DATAFU-115?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15213559#comment-15213559 ] Eyal Allweil commented on DATAFU-115: - I'm not sure why, but I can't see this patch in the master

[jira] [Created] (DATAFU-116) Make SetIntersect and SetDifference implement Accumulator

2016-03-08 Thread Eyal Allweil (JIRA)
Eyal Allweil created DATAFU-116: --- Summary: Make SetIntersect and SetDifference implement Accumulator Key: DATAFU-116 URL: https://issues.apache.org/jira/browse/DATAFU-116 Project: DataFu Issue

[jira] [Commented] (DATAFU-116) Make SetIntersect and SetDifference implement Accumulator

2016-03-08 Thread Eyal Allweil (JIRA)
[ https://issues.apache.org/jira/browse/DATAFU-116?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15185409#comment-15185409 ] Eyal Allweil commented on DATAFU-116: - As far as I can tell, when the accumulator is used, Pig passes

[jira] [Created] (DATAFU-115) Make TupleFromBag implement Accumulator

2016-03-03 Thread Eyal Allweil (JIRA)
Eyal Allweil created DATAFU-115: --- Summary: Make TupleFromBag implement Accumulator Key: DATAFU-115 URL: https://issues.apache.org/jira/browse/DATAFU-115 Project: DataFu Issue Type: Improvement

[jira] [Updated] (DATAFU-115) Make TupleFromBag implement Accumulator

2016-03-03 Thread Eyal Allweil (JIRA)
[ https://issues.apache.org/jira/browse/DATAFU-115?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Eyal Allweil updated DATAFU-115: Attachment: DATAFU-115.patch Relatively straightforward patch ... there's one difference from

[jira] [Updated] (DATAFU-115) Make TupleFromBag implement Accumulator

2016-03-03 Thread Eyal Allweil (JIRA)
[ https://issues.apache.org/jira/browse/DATAFU-115?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Eyal Allweil updated DATAFU-115: Flags: Patch > Make TupleFromBag implement Accumula

[jira] [Commented] (DATAFU-115) Make TupleFromBag implement Accumulator

2016-03-29 Thread Eyal Allweil (JIRA)
[ https://issues.apache.org/jira/browse/DATAFU-115?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15215634#comment-15215634 ] Eyal Allweil commented on DATAFU-115: - Thanks! > Make TupleFromBag implement Accumula

[jira] [Updated] (DATAFU-117) New UDF - CountDistinctUpTo

2016-04-26 Thread Eyal Allweil (JIRA)
[ https://issues.apache.org/jira/browse/DATAFU-117?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Eyal Allweil updated DATAFU-117: Attachment: DATAFU-117-2.patch This replaces the previous patch file, addresses (most of) Matthew's

[jira] [Updated] (DATAFU-117) New UDF - CountDistinctUpTo

2016-05-19 Thread Eyal Allweil (JIRA)
[ https://issues.apache.org/jira/browse/DATAFU-117?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Eyal Allweil updated DATAFU-117: Attachment: DATAFU-117-3.patch Incorporates changes from [review |https://reviews.apache.org/r

[jira] [Comment Edited] (DATAFU-117) New UDF - CountDistinctUpTo

2016-05-09 Thread Eyal Allweil (JIRA)
[ https://issues.apache.org/jira/browse/DATAFU-117?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15258239#comment-15258239 ] Eyal Allweil edited comment on DATAFU-117 at 5/9/16 8:50 AM: - Ok, I opened

[jira] [Commented] (DATAFU-119) New UDF - TupleDiff

2016-09-07 Thread Eyal Allweil (JIRA)
[ https://issues.apache.org/jira/browse/DATAFU-119?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15471164#comment-15471164 ] Eyal Allweil commented on DATAFU-119: - Any feedback about this? > New UDF - TupleD

[jira] [Commented] (DATAFU-119) New UDF - TupleDiff

2016-09-18 Thread Eyal Allweil (JIRA)
[ https://issues.apache.org/jira/browse/DATAFU-119?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15500764#comment-15500764 ] Eyal Allweil commented on DATAFU-119: - I've run it on results that were in the tens of millions. I

[jira] [Updated] (DATAFU-65) Aho-Corasick Pig UDF

2016-10-18 Thread Eyal Allweil (JIRA)
[ https://issues.apache.org/jira/browse/DATAFU-65?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Eyal Allweil updated DATAFU-65: --- Issue Type: New Feature (was: Bug) > Aho-Corasick Pig

[jira] [Commented] (DATAFU-45) RFE: CartesianProduct

2016-10-18 Thread Eyal Allweil (JIRA)
[ https://issues.apache.org/jira/browse/DATAFU-45?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15584898#comment-15584898 ] Eyal Allweil commented on DATAFU-45: Hi Sam, Did you ever solve this? I agree with Matthew

[jira] [Commented] (DATAFU-16) weighted reservoir sampling with exponential jumps UDF

2016-10-18 Thread Eyal Allweil (JIRA)
[ https://issues.apache.org/jira/browse/DATAFU-16?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15586085#comment-15586085 ] Eyal Allweil commented on DATAFU-16: It looks like this got added - can this issue be closed

[jira] [Commented] (DATAFU-98) New UDF for Histogram / Frequency counting

2016-10-25 Thread Eyal Allweil (JIRA)
[ https://issues.apache.org/jira/browse/DATAFU-98?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15605952#comment-15605952 ] Eyal Allweil commented on DATAFU-98: Hi Russell. First of all, I want to apologize for the time it's

[jira] [Commented] (DATAFU-87) Edit distance

2016-10-25 Thread Eyal Allweil (JIRA)
[ https://issues.apache.org/jira/browse/DATAFU-87?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15606106#comment-15606106 ] Eyal Allweil commented on DATAFU-87: Hi Joydeep, I want to begin by apologizing for the time it's

[jira] [Updated] (DATAFU-25) AliasableEvalFunc should use getInputSchema

2016-10-19 Thread Eyal Allweil (JIRA)
[ https://issues.apache.org/jira/browse/DATAFU-25?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Eyal Allweil updated DATAFU-25: --- Attachment: DATAFU-25.patch This is a minimal fix that uses getInputSchema() instead of the udf

[jira] [Comment Edited] (DATAFU-25) AliasableEvalFunc should use getInputSchema

2016-10-19 Thread Eyal Allweil (JIRA)
[ https://issues.apache.org/jira/browse/DATAFU-25?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15589407#comment-15589407 ] Eyal Allweil edited comment on DATAFU-25 at 10/19/16 6:01 PM

[jira] [Commented] (DATAFU-28) Tests are too slow

2016-10-13 Thread Eyal Allweil (JIRA)
[ https://issues.apache.org/jira/browse/DATAFU-28?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15571961#comment-15571961 ] Eyal Allweil commented on DATAFU-28: On my machine the datafu-pig tests run in 18 minutes (I ran them

[jira] [Commented] (DATAFU-85) Add SPRINTF to provide this functionality to Pig < 0.14.0

2016-10-13 Thread Eyal Allweil (JIRA)
[ https://issues.apache.org/jira/browse/DATAFU-85?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15571787#comment-15571787 ] Eyal Allweil commented on DATAFU-85: Given the time that has passed, and that it can't be backported

[jira] [Updated] (DATAFU-122) Documentation error/typo on tips and tricks involving Coalesce

2016-10-12 Thread Eyal Allweil (JIRA)
[ https://issues.apache.org/jira/browse/DATAFU-122?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Eyal Allweil updated DATAFU-122: Assignee: Eyal Allweil Labels: documentation typo (was: docuentation typo) Fix

[jira] [Commented] (DATAFU-106) Test files should be created in a subfolder of projects

2017-01-11 Thread Eyal Allweil (JIRA)
[ https://issues.apache.org/jira/browse/DATAFU-106?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15817823#comment-15817823 ] Eyal Allweil commented on DATAFU-106: - [~takias], I will try to sort our Jira issues out and mark

[jira] [Commented] (DATAFU-119) New UDF - TupleDiff

2017-01-02 Thread Eyal Allweil (JIRA)
[ https://issues.apache.org/jira/browse/DATAFU-119?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15793097#comment-15793097 ] Eyal Allweil commented on DATAFU-119: - If we add DATAFU-123, we can include the macro I put

[jira] [Updated] (DATAFU-123) Allow DataFu to include macros

2017-03-08 Thread Eyal Allweil (JIRA)
[ https://issues.apache.org/jira/browse/DATAFU-123?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Eyal Allweil updated DATAFU-123: Attachment: DATAFU-123.patch The change ended up being smaller than what I originally described

[jira] [Commented] (DATAFU-12) Implement Lead UDF based on version from SQL

2017-04-18 Thread Eyal Allweil (JIRA)
[ https://issues.apache.org/jira/browse/DATAFU-12?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15972397#comment-15972397 ] Eyal Allweil commented on DATAFU-12: It looks like this functionality is implemented in HIve - see

[jira] [Commented] (DATAFU-124) sessionize() ought to support millisecond periods

2017-06-29 Thread Eyal Allweil (JIRA)
[ https://issues.apache.org/jira/browse/DATAFU-124?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16067884#comment-16067884 ] Eyal Allweil commented on DATAFU-124: - I reviewed it - looks fine, a nice improvement. I'll try to get

[jira] [Commented] (DATAFU-119) New UDF - TupleDiff

2017-08-06 Thread Eyal Allweil (JIRA)
[ https://issues.apache.org/jira/browse/DATAFU-119?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16115744#comment-16115744 ] Eyal Allweil commented on DATAFU-119: - [~matterhayes] - We want the Apache license header on our macro

[jira] [Updated] (DATAFU-61) Add TF-IDF Macro to DataFu

2017-08-06 Thread Eyal Allweil (JIRA)
[ https://issues.apache.org/jira/browse/DATAFU-61?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Eyal Allweil updated DATAFU-61: --- Attachment: DATAFU-61-2.patch Now that macros are supported (and can be tested), I updated this patch

[jira] [Commented] (DATAFU-119) New UDF - TupleDiff

2017-09-14 Thread Eyal Allweil (JIRA)
[ https://issues.apache.org/jira/browse/DATAFU-119?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16165925#comment-16165925 ] Eyal Allweil commented on DATAFU-119: - The documentation can be part of [DATAFU-128|https

[jira] [Commented] (DATAFU-61) Add TF-IDF Macro to DataFu

2017-09-14 Thread Eyal Allweil (JIRA)
[ https://issues.apache.org/jira/browse/DATAFU-61?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16165991#comment-16165991 ] Eyal Allweil commented on DATAFU-61: Yes, I'll merge it. I did respond to an open issue in the review

[jira] [Updated] (DATAFU-119) New UDF - TupleDiff

2017-09-14 Thread Eyal Allweil (JIRA)
[ https://issues.apache.org/jira/browse/DATAFU-119?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Eyal Allweil updated DATAFU-119: Attachment: DATAFU-119-2.patch > New UDF - TupleDiff > --- > >

[jira] [Commented] (DATAFU-130) Add left outer join macro described in the DataFu guide

2017-09-17 Thread Eyal Allweil (JIRA)
[ https://issues.apache.org/jira/browse/DATAFU-130?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16169225#comment-16169225 ] Eyal Allweil commented on DATAFU-130: - I think this is a good Jira issue to put in the [Apache Help

[jira] [Updated] (DATAFU-130) Add left outer join macro described in the DataFu guide

2017-09-19 Thread Eyal Allweil (JIRA)
[ https://issues.apache.org/jira/browse/DATAFU-130?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Eyal Allweil updated DATAFU-130: Description: In our [guide|http://datafu.incubator.apache.org/blog/2013/09/04/datafu-1-0.html

[jira] [Resolved] (DATAFU-61) Add TF-IDF Macro to DataFu

2017-09-14 Thread Eyal Allweil (JIRA)
[ https://issues.apache.org/jira/browse/DATAFU-61?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Eyal Allweil resolved DATAFU-61. Resolution: Fixed Assignee: Eyal Allweil Merged. > Add TF-IDF Macro to Dat

[jira] [Commented] (DATAFU-12) Implement Lead UDF based on version from SQL

2017-10-08 Thread Eyal Allweil (JIRA)
[ https://issues.apache.org/jira/browse/DATAFU-12?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16196058#comment-16196058 ] Eyal Allweil commented on DATAFU-12: [~matterhayes], anyone, what do you think? I wouldn't "waste

[jira] [Updated] (DATAFU-48) Upgrade Guava to 17.0

2017-10-08 Thread Eyal Allweil (JIRA)
[ https://issues.apache.org/jira/browse/DATAFU-48?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Eyal Allweil updated DATAFU-48: --- Attachment: DATAFU-48-update-gradle-to-20.0.patch I checked, and Guava 20.0 is the last version

[jira] [Commented] (DATAFU-87) Edit distance

2017-10-09 Thread Eyal Allweil (JIRA)
[ https://issues.apache.org/jira/browse/DATAFU-87?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16197120#comment-16197120 ] Eyal Allweil commented on DATAFU-87: On second thought, since this UDF is now available in Hive

[jira] [Created] (DATAFU-131) Update DataFu site to meet graduation requirements

2017-10-10 Thread Eyal Allweil (JIRA)
Eyal Allweil created DATAFU-131: --- Summary: Update DataFu site to meet graduation requirements Key: DATAFU-131 URL: https://issues.apache.org/jira/browse/DATAFU-131 Project: DataFu Issue Type

[jira] [Commented] (DATAFU-131) Update DataFu site to meet graduation requirements

2017-10-10 Thread Eyal Allweil (JIRA)
[ https://issues.apache.org/jira/browse/DATAFU-131?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16199209#comment-16199209 ] Eyal Allweil commented on DATAFU-131: - Here's a link to the Apache site guidelines: https

[jira] [Commented] (DATAFU-126) There is a typo in document

2017-09-11 Thread Eyal Allweil (JIRA)
[ https://issues.apache.org/jira/browse/DATAFU-126?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16161252#comment-16161252 ] Eyal Allweil commented on DATAFU-126: - Thanks Kane! I've fixed this in our sources, and it will show

[jira] [Commented] (DATAFU-83) InUDF does not validate that types are compatible

2017-09-11 Thread Eyal Allweil (JIRA)
[ https://issues.apache.org/jira/browse/DATAFU-83?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16161211#comment-16161211 ] Eyal Allweil commented on DATAFU-83: By the way, [~ItsAUsernameRight?], if you're already looking

[jira] [Resolved] (DATAFU-126) There is a typo in document

2017-09-11 Thread Eyal Allweil (JIRA)
[ https://issues.apache.org/jira/browse/DATAFU-126?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Eyal Allweil resolved DATAFU-126. - Resolution: Fixed > There is a typo in docum

[jira] [Commented] (DATAFU-61) Add TF-IDF Macro to DataFu

2017-09-11 Thread Eyal Allweil (JIRA)
[ https://issues.apache.org/jira/browse/DATAFU-61?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16161118#comment-16161118 ] Eyal Allweil commented on DATAFU-61: Came back to this today and tried a little experiment - I verified

[jira] [Assigned] (DATAFU-126) There is a typo in document

2017-09-11 Thread Eyal Allweil (JIRA)
[ https://issues.apache.org/jira/browse/DATAFU-126?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Eyal Allweil reassigned DATAFU-126: --- Assignee: Eyal Allweil > There is a typo in docum

[jira] [Commented] (DATAFU-61) Add TF-IDF Macro to DataFu

2017-09-13 Thread Eyal Allweil (JIRA)
[ https://issues.apache.org/jira/browse/DATAFU-61?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16164373#comment-16164373 ] Eyal Allweil commented on DATAFU-61: One last thing - I noticed after I uploaded my patch that it has

[jira] [Updated] (DATAFU-129) New macro - dedup

2017-09-12 Thread Eyal Allweil (JIRA)
[ https://issues.apache.org/jira/browse/DATAFU-129?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Eyal Allweil updated DATAFU-129: Attachment: DATAFU-129.patch Macro and test > New macro - de

[jira] [Updated] (DATAFU-127) New macro - samply by keys

2017-09-12 Thread Eyal Allweil (JIRA)
[ https://issues.apache.org/jira/browse/DATAFU-127?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Eyal Allweil updated DATAFU-127: Attachment: DATAFU-127.patch Patch including new macros and tests > New macro - samply by k

[jira] [Created] (DATAFU-128) Add documentation for macros

2017-09-12 Thread Eyal Allweil (JIRA)
Eyal Allweil created DATAFU-128: --- Summary: Add documentation for macros Key: DATAFU-128 URL: https://issues.apache.org/jira/browse/DATAFU-128 Project: DataFu Issue Type: Improvement

[jira] [Created] (DATAFU-127) New macro - samply by keys

2017-09-12 Thread Eyal Allweil (JIRA)
Eyal Allweil created DATAFU-127: --- Summary: New macro - samply by keys Key: DATAFU-127 URL: https://issues.apache.org/jira/browse/DATAFU-127 Project: DataFu Issue Type: New Feature

[jira] [Created] (DATAFU-130) Add left outer join macro described in the DataFu guide

2017-09-12 Thread Eyal Allweil (JIRA)
Eyal Allweil created DATAFU-130: --- Summary: Add left outer join macro described in the DataFu guide Key: DATAFU-130 URL: https://issues.apache.org/jira/browse/DATAFU-130 Project: DataFu Issue

[jira] [Created] (DATAFU-129) New macro - dedup

2017-09-12 Thread Eyal Allweil (JIRA)
Eyal Allweil created DATAFU-129: --- Summary: New macro - dedup Key: DATAFU-129 URL: https://issues.apache.org/jira/browse/DATAFU-129 Project: DataFu Issue Type: New Feature Reporter

[jira] [Commented] (DATAFU-128) Add documentation for macros

2017-09-12 Thread Eyal Allweil (JIRA)
[ https://issues.apache.org/jira/browse/DATAFU-128?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16162936#comment-16162936 ] Eyal Allweil commented on DATAFU-128: - Is the documentation for updating the website accurate

[jira] [Commented] (DATAFU-48) Upgrade Guava to 17.0

2017-10-19 Thread Eyal Allweil (JIRA)
[ https://issues.apache.org/jira/browse/DATAFU-48?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16210769#comment-16210769 ] Eyal Allweil commented on DATAFU-48: None, actually. Hadoop 1 and 2 are using 11.0.2, like us. Hadoop 3

[jira] [Commented] (DATAFU-32) Hourglass concrete jobs should have getters and setters for output name and namespace

2017-10-19 Thread Eyal Allweil (JIRA)
[ https://issues.apache.org/jira/browse/DATAFU-32?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16210941#comment-16210941 ] Eyal Allweil commented on DATAFU-32: Is this still relevant? If so, I'll open a [Help Wanted task

[jira] [Commented] (DATAFU-118) Automatically run rat task when running assemble

2017-10-19 Thread Eyal Allweil (JIRA)
[ https://issues.apache.org/jira/browse/DATAFU-118?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16211505#comment-16211505 ] Eyal Allweil commented on DATAFU-118: - (because we have a patch that seems to work on a newer Gradle

[jira] [Commented] (DATAFU-17) Improve testing of randomized functions

2017-10-22 Thread Eyal Allweil (JIRA)
[ https://issues.apache.org/jira/browse/DATAFU-17?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16214398#comment-16214398 ] Eyal Allweil commented on DATAFU-17: I think we can close this, just as we closed [DATAFU-28|https

[jira] [Resolved] (DATAFU-125) Upgrade Gradle to v4 or later

2017-11-28 Thread Eyal Allweil (JIRA)
[ https://issues.apache.org/jira/browse/DATAFU-125?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Eyal Allweil resolved DATAFU-125. - Resolution: Fixed Merged - everything looks fine to me. I repeated these tests, and tried out

[jira] [Commented] (DATAFU-30) Website crawl errors for class use links

2017-11-30 Thread Eyal Allweil (JIRA)
[ https://issues.apache.org/jira/browse/DATAFU-30?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16273148#comment-16273148 ] Eyal Allweil commented on DATAFU-30: I think newer Javadoc versions don't have a "Use"

[jira] [Assigned] (DATAFU-118) Automatically run rat task when running assemble

2017-11-30 Thread Eyal Allweil (JIRA)
[ https://issues.apache.org/jira/browse/DATAFU-118?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Eyal Allweil reassigned DATAFU-118: --- Assignee: Eyal Allweil > Automatically run rat task when running assem

[jira] [Updated] (DATAFU-47) UDF for Murmur3 (and other) Hash functions

2017-12-05 Thread Eyal Allweil (JIRA)
[ https://issues.apache.org/jira/browse/DATAFU-47?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Eyal Allweil updated DATAFU-47: --- Attachment: DATAFU-47-new.patch I looked at the review board for this issue, and fixed the merge

[jira] [Closed] (DATAFU-116) Make SetIntersect and SetDifference implement Accumulator

2017-12-14 Thread Eyal Allweil (JIRA)
[ https://issues.apache.org/jira/browse/DATAFU-116?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Eyal Allweil closed DATAFU-116. --- Resolution: Won't Fix Since it seems like Pig doesn't use the Accumulator interface when

[jira] [Commented] (DATAFU-63) SimpleRandomSample by a fixed number

2017-11-13 Thread Eyal Allweil (JIRA)
[ https://issues.apache.org/jira/browse/DATAFU-63?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16249793#comment-16249793 ] Eyal Allweil commented on DATAFU-63: Hi [~cur4so], I'll quickly answer your last comment - I'll get

[jira] [Commented] (DATAFU-63) SimpleRandomSample by a fixed number

2017-11-19 Thread Eyal Allweil (JIRA)
[ https://issues.apache.org/jira/browse/DATAFU-63?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16258480#comment-16258480 ] Eyal Allweil commented on DATAFU-63: I wonder if the gradlew script is there because it can be "

[jira] [Commented] (DATAFU-130) Add left outer join macro described in the DataFu guide

2017-11-15 Thread Eyal Allweil (JIRA)
[ https://issues.apache.org/jira/browse/DATAFU-130?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16253564#comment-16253564 ] Eyal Allweil commented on DATAFU-130: - Hi [~varunu28], Thank you for your interest! Do you have any

[jira] [Commented] (DATAFU-60) Support NDCG calculation within a UDF

2017-12-09 Thread Eyal Allweil (JIRA)
[ https://issues.apache.org/jira/browse/DATAFU-60?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16284984#comment-16284984 ] Eyal Allweil commented on DATAFU-60: Hi [~jhartman], I know it's been years, but do you think you'll

[jira] [Resolved] (DATAFU-48) Upgrade Guava to 20.0

2017-10-29 Thread Eyal Allweil (JIRA)
[ https://issues.apache.org/jira/browse/DATAFU-48?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Eyal Allweil resolved DATAFU-48. Resolution: Fixed Assignee: Eyal Allweil (was: Philip (flip) Kromer) Fix Version/s

[jira] [Updated] (DATAFU-48) Upgrade Guava to 20.0

2017-10-29 Thread Eyal Allweil (JIRA)
[ https://issues.apache.org/jira/browse/DATAFU-48?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Eyal Allweil updated DATAFU-48: --- Summary: Upgrade Guava to 20.0 (was: Upgrade Guava to 17.0) > Upgrade Guava to 2

  1   2   >