On Fri, 8 Jul 2022 at 17:17, Robert Newson <[email protected]> wrote: > > Hi, > > There's a bug in 3.1.0 that affects you. Namely that the default 5 second > gen_server timeout is used for some requests if ioq bypass is enabled. Please > check if your config has a [ioq.bypass] section and try again without > bypasses for a time.
Thanks for taking the time to answer me. I set all the settings of the [ioq.bypass] section to false, set the cluster in maintenance mode, waited a couple minutes, than set maintenance to false... but no joy. > If you could explain your migration process in more detail perhaps we can > find other explanations. I note that such migrations are better done online > using replication, moving the files around is a bit more challenging. I tried replication, but it failed, hence the shard files copy. The procedure I followed (a tad simplified): - set the source cluster in maintenance mode; - copied the shard files to a shared disk; - created a database with the same name on the target cluster; - changed the database id on the copied shard files to match the newly-created one on the target cluster; - set the target cluster to maintenance mode; - copied the shard files from the shared disk to the target cluster data directories, making sure to get the shard directories right; - unset the maintenance mode on the target cluster. The procedure above worked for a few databases (including one that -with replicas- was 6GB) but failed with the 200GB database. Cheers, Luca Morandini
