On Fri, 8 Jul 2022 at 17:17, Robert Newson <[email protected]> wrote:
>
> Hi,
>
> There's a bug in 3.1.0 that affects you. Namely that the default 5 second 
> gen_server timeout is used for some requests if ioq bypass is enabled. Please 
> check if your config has a [ioq.bypass] section and try again without 
> bypasses for a time.

Thanks for taking the time to answer me.

I set all the settings of the [ioq.bypass] section to false, set the
cluster in maintenance mode, waited a couple minutes, than set
maintenance to false... but no joy.


> If you could explain your migration process in more detail perhaps we can 
> find other explanations. I note that such migrations are better done online 
> using replication, moving the files around is a bit more challenging.

I tried replication, but it failed, hence the shard files copy.

The procedure I followed (a tad simplified):
- set the source cluster in maintenance mode;
- copied the shard files to a shared disk;
- created a database with the same name on the target cluster;
- changed the database id on the copied shard files to match the
newly-created one on the target cluster;
- set the target cluster to maintenance mode;
- copied the shard files from the shared disk to the target cluster
data directories, making sure to get the shard directories right;
- unset the maintenance mode on the target cluster.

The procedure above worked for a few databases (including one that
-with replicas- was 6GB) but failed with the 200GB database.

Cheers,

Luca Morandini

Reply via email to