Hi,

any progress on this?  There is anything I can do to help?


Kind regards
Jose M Calhariz



On Fri, Sep 04, 2020 at 06:12:30PM +0100, Jose M Calhariz wrote:
> Hi,
> 
> I have made an update to my private backport. It is better but I still
> see the same errors on the logs.  This machine is a VM and no other VM
> or host is reporting IO errors of any kind, that I know off.
> 
> It was my first time using gerrit, so can you please check if the
> I have downloaded the correct patches?
> 
> ee578e9.diff
> 179a418.diff
> 
> There is a way to decode this errors and try to understand better what
> is happening and find a fix?
> 
> 
> [    9.760892] openafs: loading out-of-tree module taints kernel.
> [    9.760898] openafs: module license 
> 'http://www.openafs.org/dl/license10.html' taints kernel.
> [    9.762091] openafs: module verification failed: signature and/or required 
> key missing - tainting kernel
> [    9.778441] Key type afs_pag registered
> [ 8245.094223] afs: disk cache read error in CacheItems slot 211006 off 
> 16880500/19660820 code -4/80
> [ 8245.094254] afs: disk cache read error in CacheItems slot 211006 off 
> 16880500/19660820 code -4/80
> [ 8245.094277] afs: disk cache read error in CacheItems slot 211006 off 
> 16880500/19660820 code -4/80
> [ 8245.094299] afs: disk cache read error in CacheItems slot 211006 off 
> 16880500/19660820 code -4/80
> [10181.679636] afs: disk cache read error in CacheItems slot 156531 off 
> 12522500/19660820 code -4/80
> [10181.679638] afs: Error while alloc'ing cache slot for file 
> 204:536874423.516.5309; failing with an i/o error
> [11438.241843] afs_UFSGetVolSlot: error -4 reading volumeinfo
> [11438.242213] afs_UFSGetVolSlot: error -4 reading volumeinfo
> 
> 
> Kind regards
> Jose M Calhariz
> 
> 
> 
> 
> 
> 
> On Wed, Sep 02, 2020 at 07:28:50PM +0100, Jose M Calhariz wrote:
> > Hi,
> > 
> > I will then update my private backport and see if the things improve.
> > I will report here the results of your sugestion.  Thank you.
> > 
> > Kind regards
> > Jose M Calhariz
> > 
> > On Tue, Sep 01, 2020 at 04:07:55PM -0700, Benjamin Kaduk wrote:
> > > On Tue, Sep 01, 2020 at 03:43:37PM +0100, Jose M Calhariz wrote:
> > > > Package: openafs-client
> > > > Version: 1.8.6-1~dsi10+1
> > > > Severity: normal
> > > > 
> > > > I am using a private backport of openafs from testing.  On this server I
> > > > am getting multiples strange errors about openafs cache.  This server
> > > > is different in that it runs apache to serve personal web pages and 
> > > > every
> > > > web page runs under a different openafs user.  So is normal for this
> > > > server to be simultaneuous running code under 100 or 200 different 
> > > > openafs 
> > > > users.
> > > > 
> > > > The an example of errors on the logs are:
> > > > 
> > > > afs: disk cache read error in CacheItems slot 350195 off 
> > > > 28015620/35000020 code -4/80
> > > > afs: Error while alloc'ing cache slot for file 204:536874423.964.4794; 
> > > > failing with an i/o error
> > > > 
> > > > I am not certain this types of errors are to be ignored and there have
> > > > been reports of problems accessing openafs files.  I am using this bug
> > > > report to collect more information about this cache errors and the
> > > > possibility of being an indication of important errors with the openafs
> > > > cache code.
> > > 
> > > This error message is supposed to indicate that a read from the cache
> > > filesystem got EIO, which in turn is supposed to indicate a physical
> > > problem with the drive.  That said, I'm not going to jump to conclusions
> > > and try to blame your drive, as there are several other things that could
> > > be coming into play.
> > > 
> > > While the log message itself is pretty old, there's been a lot of work
> > > recently to more accurately report EIO in error conditions (mostly instead
> > > of ENOENT, since returning ENOENT can cause that to get cached at the VFS
> > > layer and produce strange user-visible behavior).
> > > 
> > > Having a lot of users present makes me suspect that the credentials used 
> > > by
> > > the kernel to read/write the cache file are not being saved/restored
> > > properly, and indeed we recently merged to 1.8.x (not in a release yet)
> > > https://gerrit.openafs.org/14082 and https://gerrit.openafs.org/14099 
> > > which
> > > improve such credentials management.
> > > 
> > > My recommendation would be to try pulling in those two patches to your
> > > build before proceeding to try to trace the source of the EIO.
> > > 
> > > Thanks for the report!
> > > 
> > > -Ben
> > > 
> > 
> 
> 
> 



-- 
--

Nós trabalhamos com o propósito de tornar nossos produtos obsoletos, antes que 
outros o façam

--Bill Gates

Attachment: signature.asc
Description: PGP signature

Reply via email to