Oh good catch! Thanks for digging, Adrien. We had had reports of our
JP indexes taking longer to build (not anything like 6x, but
noticeable - I guess analysis is only part of the time).

On Mon, May 25, 2020 at 3:54 AM Adrien Grand (Jira) <j...@apache.org> wrote:
>
>
>     [ 
> https://issues.apache.org/jira/browse/LUCENE-9286?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17115802#comment-17115802
>  ]
>
> Adrien Grand commented on LUCENE-9286:
> --------------------------------------
>
> FYI I was just digging a Kuromoji regression introduced in 8.4 that made 
> analysis run about 6x slower. Interestingly the slowdown was on both 
> branch_8_4 and branch 8_5 but not on branch_8x and git bisect pointed out to 
> this commit as the fix of the regression.
>
> > FST arc.copyOf clones BitTables and this can lead to excessive memory use
> > -------------------------------------------------------------------------
> >
> >                 Key: LUCENE-9286
> >                 URL: https://issues.apache.org/jira/browse/LUCENE-9286
> >             Project: Lucene - Core
> >          Issue Type: Bug
> >    Affects Versions: 8.5
> >            Reporter: Dawid Weiss
> >            Assignee: Bruno Roustant
> >            Priority: Major
> >             Fix For: 8.6
> >
> >         Attachments: screen-[1].png
> >
> >          Time Spent: 1h 50m
> >  Remaining Estimate: 0h
> >
> > I see a dramatic increase in the amount of memory required for construction 
> > of (arguably large) automata. It currently OOMs with 8GB of memory consumed 
> > for bit tables. I am pretty sure this didn't require so much memory before 
> > (the automaton is ~50MB after construction).
> > Something bad happened in between. Thoughts, [~broustant], [~sokolov]?
>
>
>
> --
> This message was sent by Atlassian Jira
> (v8.3.4#803005)
>
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
> For additional commands, e-mail: issues-h...@lucene.apache.org
>

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org

Reply via email to