[jira] [Closed] (DATAFU-51) Add DataFu MR project, a lightweight for implementing Java/Scala MapReduce jobs

2020-01-14 Thread Matthew Hayes (Jira)
[ https://issues.apache.org/jira/browse/DATAFU-51?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matthew Hayes closed DATAFU-51. --- Resolution: Won't Do Closing this as it is quite old and there have been no updates. > Add DataFu MR

[jira] [Closed] (DATAFU-3) Bootstrap sum UDF

2020-01-14 Thread Matthew Hayes (Jira)
[ https://issues.apache.org/jira/browse/DATAFU-3?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matthew Hayes closed DATAFU-3. -- Resolution: Won't Do Closing this as it is quite old and there have been no updates. > Bootstrap sum UDF

[jira] [Closed] (DATAFU-9) Add datafu.text.ToJson UDF to serialize any relation/field as a JSON String

2020-01-14 Thread Matthew Hayes (Jira)
[ https://issues.apache.org/jira/browse/DATAFU-9?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matthew Hayes closed DATAFU-9. -- Resolution: Won't Do Closing this as it is quite old and there have been no updates. > Add datafu.text.

[jira] [Closed] (DATAFU-60) Support NDCG calculation within a UDF

2020-01-14 Thread Matthew Hayes (Jira)
[ https://issues.apache.org/jira/browse/DATAFU-60?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matthew Hayes closed DATAFU-60. --- Resolution: Won't Do Closing this as it is quite old and there have been no updates. > Support NDCG c

[jira] [Closed] (DATAFU-16) weighted reservoir sampling with exponential jumps UDF

2020-01-14 Thread Matthew Hayes (Jira)
[ https://issues.apache.org/jira/browse/DATAFU-16?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matthew Hayes closed DATAFU-16. --- Resolution: Won't Do Closing this as it is quite old and there have been no updates. > weighted reser

[jira] [Closed] (DATAFU-71) Create IncrementalAvroStorage UDF for incrementally processing date partitioned data

2020-01-14 Thread Matthew Hayes (Jira)
[ https://issues.apache.org/jira/browse/DATAFU-71?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matthew Hayes closed DATAFU-71. --- Resolution: Won't Do Closing this as it is quite old and there have been no updates. > Create Increme

[jira] [Closed] (DATAFU-32) Hourglass concrete jobs should have getters and setters for output name and namespace

2020-01-14 Thread Matthew Hayes (Jira)
[ https://issues.apache.org/jira/browse/DATAFU-32?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matthew Hayes closed DATAFU-32. --- Resolution: Won't Do Closing this as it is quite old and there have been no updates. > Hourglass conc

[jira] [Closed] (DATAFU-14) Add NGram Tokenizer to datafu.pig.text.lucene

2020-01-14 Thread Matthew Hayes (Jira)
[ https://issues.apache.org/jira/browse/DATAFU-14?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matthew Hayes closed DATAFU-14. --- Resolution: Won't Do Closing this as it is quite old and there have been no updates. > Add NGram Toke

[jira] [Closed] (DATAFU-41) BagGroup does not name bag field in some cases

2020-01-14 Thread Matthew Hayes (Jira)
[ https://issues.apache.org/jira/browse/DATAFU-41?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matthew Hayes closed DATAFU-41. --- Resolution: Won't Fix Closing this as it is quite old and there have been no updates. > BagGroup does

[jira] [Closed] (DATAFU-13) Hourglass fixed-length windows should be robust to reappearing data

2020-01-14 Thread Matthew Hayes (Jira)
[ https://issues.apache.org/jira/browse/DATAFU-13?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matthew Hayes closed DATAFU-13. --- Resolution: Won't Fix Closing this as it is quite old and there have been no updates. > Hourglass fix

[jira] [Closed] (DATAFU-80) Enahnce InUDF to support tuple collection from String and Add Java Compatibility for DataFu-Pig

2020-01-14 Thread Matthew Hayes (Jira)
[ https://issues.apache.org/jira/browse/DATAFU-80?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matthew Hayes closed DATAFU-80. --- Resolution: Won't Do Closing this as it is quite old and there have been no updates. > Enahnce InUDF

[jira] [Closed] (DATAFU-34) Add some UDFS to handle map type

2020-01-14 Thread Matthew Hayes (Jira)
[ https://issues.apache.org/jira/browse/DATAFU-34?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matthew Hayes closed DATAFU-34. --- Resolution: Won't Do Closing this as it is quite old and there have been no updates. > Add some UDFS

[jira] [Closed] (DATAFU-98) New UDF for Histogram / Frequency counting

2020-01-14 Thread Matthew Hayes (Jira)
[ https://issues.apache.org/jira/browse/DATAFU-98?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matthew Hayes closed DATAFU-98. --- Resolution: Won't Do Closing this as it is quite old and there have been no updates. > New UDF for Hi

[jira] [Closed] (DATAFU-40) Using BagGroup may yield error: Problem while reconciling output schema of ForEach

2020-01-14 Thread Matthew Hayes (Jira)
[ https://issues.apache.org/jira/browse/DATAFU-40?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matthew Hayes closed DATAFU-40. --- Resolution: Won't Fix Closing this as it is quite old and there have been no updates and there is a w

[jira] [Closed] (DATAFU-83) InUDF does not validate that types are compatible

2020-01-14 Thread Matthew Hayes (Jira)
[ https://issues.apache.org/jira/browse/DATAFU-83?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matthew Hayes closed DATAFU-83. --- Resolution: Won't Fix > InUDF does not validate that types are compatible > --

[jira] [Closed] (DATAFU-64) Investigate the possibility of creating a probability weighted sampling with replacement

2020-01-14 Thread Matthew Hayes (Jira)
[ https://issues.apache.org/jira/browse/DATAFU-64?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matthew Hayes closed DATAFU-64. --- Resolution: Won't Do Closing this as it is quite old and there have been no updates. > Investigate th

[jira] [Closed] (DATAFU-21) Probability weighted sampling without reservoir

2020-01-14 Thread Matthew Hayes (Jira)
[ https://issues.apache.org/jira/browse/DATAFU-21?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matthew Hayes closed DATAFU-21. --- Resolution: Won't Do Closing this as it is quite old and there have been no updates. > Probability we

[jira] [Closed] (DATAFU-63) SimpleRandomSample by a fixed number

2020-01-14 Thread Matthew Hayes (Jira)
[ https://issues.apache.org/jira/browse/DATAFU-63?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matthew Hayes closed DATAFU-63. --- Resolution: Won't Do Closing this as it is quite old and there have been no updates. > SimpleRandomSa

[jira] [Closed] (DATAFU-30) Website crawl errors for class use links

2020-01-14 Thread Matthew Hayes (Jira)
[ https://issues.apache.org/jira/browse/DATAFU-30?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matthew Hayes closed DATAFU-30. --- Resolution: Won't Do > Website crawl errors for class use links >

[jira] [Commented] (DATAFU-150) Add MultiLabelOneHotEncoder

2020-01-14 Thread Matthew Hayes (Jira)
[ https://issues.apache.org/jira/browse/DATAFU-150?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17015342#comment-17015342 ] Matthew Hayes commented on DATAFU-150: -- It could make sense to add a library of Pyth

[jira] [Closed] (DATAFU-67) Adding Simple SimHash for near duplicate detection

2020-01-14 Thread Matthew Hayes (Jira)
[ https://issues.apache.org/jira/browse/DATAFU-67?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matthew Hayes closed DATAFU-67. --- Resolution: Won't Do Closing this as it is quite old and there have been no updates. > Adding Simple

[jira] [Closed] (DATAFU-89) Hourglass documentation is out of date

2020-01-14 Thread Matthew Hayes (Jira)
[ https://issues.apache.org/jira/browse/DATAFU-89?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matthew Hayes closed DATAFU-89. --- Resolution: Not A Problem Already fixed > Hourglass documentation is out of date > --