[ 
https://issues.apache.org/jira/browse/MAHOUT-625?focusedWorklogId=991688&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-991688
 ]

ASF GitHub Bot logged work on MAHOUT-625:
-----------------------------------------

                Author: ASF GitHub Bot
            Created on: 14/Nov/25 09:58
            Start Date: 14/Nov/25 09:58
    Worklog Time Spent: 10m 
      Work Description: guan404ming commented on PR #626:
URL: https://github.com/apache/mahout/pull/626#issuecomment-3531890189

   Sure, let's wait for the fix. Or do you want to help with that?




Issue Time Tracking
-------------------

    Worklog Id:     (was: 991688)
    Time Spent: 40m  (was: 0.5h)

> Some of generated patterns have support higher than in reality
> --------------------------------------------------------------
>
>                 Key: MAHOUT-625
>                 URL: https://issues.apache.org/jira/browse/MAHOUT-625
>             Project: Mahout
>          Issue Type: Bug
>    Affects Versions: 0.4
>            Reporter: Jaroslaw Odzga
>            Assignee: Robin Anil
>            Priority: Major
>             Fix For: 0.5
>
>         Attachments: FPGrowth.java, MAHOUT-625-patch.txt, bugfix-patch.txt, 
> dataset_ok.txt, final_patch_with_bug_fix_test_and_the_dataset.txt, 
> mahout-test.zip
>
>          Time Spent: 40m
>  Remaining Estimate: 0h
>
> It turns out that some of the generated patterns have incorrect support. The 
> returned support is slightly higher than the true one.
> I attached a test which shows that FPGrowth has a bug. The test uses the 
> retail data set found here: http://fimi.ua.ac.be/data/
> The pattern (36, 39, 41) occurs in the transactions 572 times (this is also 
> calculated in the test), but FPGrowth returns pattern (36, 39, 41) with 
> support 573.
> Please note that the mentioned pattern is not the only one with incorrect 
> support; the test only points out one example to have something to focus on. 
> There are plenty more patterns with support higher than the real one. The 
> biggest difference I noticed was a support 8 higher than the real one for one 
> of the patterns.
> Please find the attached failing unit test. It is actually a Maven project 
> which contains the test data and is ready to run.
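
For reference, the "true" support quoted in the description can be checked independently of FPGrowth by a brute-force scan over the transactions. The following is only a minimal sketch and not the attached test: the class name SupportCheck and the method countSupport are hypothetical, and it assumes each transaction is one line of whitespace-separated item IDs.

```java
import java.util.Arrays;
import java.util.HashSet;
import java.util.List;
import java.util.Set;

// Hypothetical helper, not part of Mahout or of the attached test.
class SupportCheck {

    /**
     * Counts the transactions that contain every item of the pattern.
     * Assumes one transaction per line, item IDs separated by whitespace.
     */
    static long countSupport(List<String> transactions, Set<String> pattern) {
        long support = 0;
        for (String line : transactions) {
            // Turn the line into a set of item IDs so duplicates do not matter.
            Set<String> items = new HashSet<>(Arrays.asList(line.trim().split("\\s+")));
            if (items.containsAll(pattern)) {
                support++;
            }
        }
        return support;
    }
}
```

Under these assumptions, reading the retail file into a list of lines (for example with java.nio.file.Files.readAllLines) and calling countSupport with the pattern {"36", "39", "41"} gives a count to compare against the support that FPGrowth reports for the same pattern.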



--
This message was sent by Atlassian Jira
(v8.20.10#820010)
