On Fri, 2024-07-26 at 15:02 +0200, Marta Rybczynska wrote:
> On Thu, Jul 25, 2024 at 5:27 PM Richard Purdie 
> <[email protected]> wrote:
> > On Thu, 2024-07-25 at 16:48 +0200, [email protected] wrote:
> > > On 25.07.2024 16:29, Richard Purdie wrote:
> > > > Hi Marta,
> > > > 
> > > > 
> > > > With the v3 series applied we did just see this on the autobuilder
> > > > unfortunately so I'm not sure that problem is addressed:
> > > > 
> > > > https://autobuilder.yoctoproject.org/typhoon/#/builders/87/builds/7004/steps/14/logs/stdio
> > > > 
> > > 
> > > Hello Richard,
> > > Thanks, this is unfortunate. Is it possible to have a copy of the
> > > corrupted database somewhere?
> > 
> > I think it is transient as we never clean it up and not all tasks fail.
> > That seems to imply it is a race of some kind.
> 
> I have a few ideas of what it might be, but I do not have a reproducer
> right now. With the vex changes, the duration of the cve_check operation
> changed slightly. On the other hand, the database download is slower
> these days (I have had standalone runs that lasted for 5+ hours).
> Also, I noticed that there were cancellations of some of the builds, so
> the cancellation of the download may be in play too.
> 
> A question: does the autobuilder configuration share DL_DIR among
> multiple builds?

DL_DIR is shared between all the workers over NFS.

> My possibility list right now:
> - the "download" job timeout too short
> - download failure/timeout
> - job cancellation during the download

While a download is in progress, the exclusive lock should be held. If
the database were damaged, I'd then expect all subsequent cve_check
tasks to fail the same way.

In the failures, 2 or 3 tasks fail and the rest all continue to work, so
it doesn't really fit.
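
To spell out the locking assumption, here is a minimal sketch of the
pattern I mean. It uses plain fcntl.flock rather than whatever the
cve_check code really does, and exclusive_lock and the lock path are
hypothetical names for illustration only:

```python
import fcntl
import os
from contextlib import contextmanager


@contextmanager
def exclusive_lock(lock_path):
    """Hold an exclusive advisory lock on lock_path for the 'with' body."""
    # Hypothetical helper: create the lock file if needed, then block
    # until no other holder has the lock.
    fd = os.open(lock_path, os.O_RDWR | os.O_CREAT, 0o644)
    try:
        fcntl.flock(fd, fcntl.LOCK_EX)
        yield fd
    finally:
        # Release the lock and close the descriptor even on error.
        fcntl.flock(fd, fcntl.LOCK_UN)
        os.close(fd)
```

A download wrapped in "with exclusive_lock(path):" would keep any other
cooperating task out until it finishes, which is why a damaged database
should show up for everyone afterwards and not just for 2 or 3 tasks.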

> What do you think?

I'm wondering if we should make the do_fetch of the database recipe
copy the database to somewhere in TMPDIR when it finishes, then have
the main cve_check class use the copy there. This would move NFS issues
out of the equation?

That would be more in keeping with how other recipes work, just using
DL_DIR as an accelerator. 
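
Roughly what I have in mind for the copy step, as a sketch only. The
function name and the paths are made up for illustration and this is
not the existing recipe code; it assumes the copy is made while the
download lock is still held so the source file is complete:

```python
import os
import shutil
import tempfile


def copy_db_to_local(shared_db, local_dir):
    """Copy shared_db into local_dir and return the path of the copy."""
    os.makedirs(local_dir, exist_ok=True)
    final_path = os.path.join(local_dir, os.path.basename(shared_db))
    # Copy to a temporary file in the same directory first, so readers
    # never see a partially written database.
    fd, tmp_path = tempfile.mkstemp(dir=local_dir, suffix=".tmp")
    os.close(fd)
    try:
        shutil.copyfile(shared_db, tmp_path)
        # Rename within one directory; this replaces any older copy.
        os.replace(tmp_path, final_path)
    except BaseException:
        if os.path.exists(tmp_path):
            os.unlink(tmp_path)
        raise
    return final_path
```

The cve_check tasks would then only ever open the file under the local
directory, and the shared DL_DIR copy is read once per build.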

Cheers,

Richard
