Igor,

Not 100% sure what you mean by "filter" in this case - but - it sounds like you need to think about how to express Apriori as a sequence of operations that explicitly includes counted aggregation followed by normal filtering on the count field?  At least to get something working?  Limit doesn't feel like the right approach here; it's more of an important operator when you are trying to stop the flow of unwanted "lower down" data.  Think about how you might express Apriori in the extended relational algebra...?

Cheers,

Mike

On 8/30/19 8:28 AM, Ígor Chagas wrote:
Hi, devs

I started to work with Hyracks a few months ago. I tested several
variations on WordCount code to learn about its paradigm. Now, I'm trying
to implement Apriori Algorithm on it to do some studies.

In some step of this algorithm, I need to filter tuples that have a number
of occurrences higher than N. However, I did not find a proper filter
operator. I tried to use the LimitOperator¹ as a template since it is
simple and deals directly with tuples. However, I didn't figure out how to
compare some tuple field values to a specific Apriori parameter, as support.

Do you guys have some advice or a starting point to implement a filter
operator in this situation? Is there something I'd missed?

Thanks and Regards.

Ígor Chagas Marques

[1] -> dataflow/std/misc/LimitOperatorDescriptor.java

Reply via email to