Hi Shawn,

I had an issue with async backup on solr 6.5.1 reporting that the backup
was complete when clearly it was not. I was using 12 shards across 6 nodes.
I only noticed this issue when one shard was much larger than the others.
There were no answers here
http://lucene.472066.n3.nabble.com/async-backup-td4342776.html

I was focusing on the STATUS returned from the REQUESTSTATUS command, but
looking again now I can see a response from only 6 shards, and each shard
is from a different node. So this fits with what you're seeing. I assume
your shards 1, 7, 9 are all on different nodes.

HTH,
Damien.


On Sat, 13 Oct 2018 at 02:28, Shawn Heisey <apa...@elyograg.org> wrote:

> I'm working on reproducing a problem reported via the IRC channel.
>
> Started a test cloud with 7.5.0. Initially with two nodes, then again
> with 3 nodes.  Did this on Windows 10.
>
> Command to create a collection:
>
> bin\solr create -c test2 -shards 30 -replicationFactor 2
>
> For these URLs, I dropped them into a browser, so URL encoding was
> handled automatically.  I'm sure the URL to start the backup wouldn't
> work as-is with curl because it includes characters that need encoding.
>
> Backup URL:
>
>
> http://localhost:8983/solr/admin/collections?action=BACKUP&name=test2.3&collection=test2&location=C
> :\Users\elyograg\Downloads\solrbackups&async=sometag
>
> Request status URL:
>
>
> http://localhost:8983/solr/admin/collections?action=REQUESTSTATUS&requestid=sometag
>
> Here's the raw JSON response from the status URL:
> {
>    "responseHeader":{
>      "status":0,
>      "QTime":3},
>    "success":{
>      "192.168.56.1:7574_solr":{
>        "responseHeader":{
>          "status":0,
>          "QTime":0}},
>      "192.168.56.1:8983_solr":{
>        "responseHeader":{
>          "status":0,
>          "QTime":2}},
>      "192.168.56.1:8983_solr":{
>        "responseHeader":{
>          "status":0,
>          "QTime":0}},
>      "192.168.56.1:8983_solr":{
>        "responseHeader":{
>          "status":0,
>          "QTime":2}},
>      "192.168.56.1:8983_solr":{
>        "responseHeader":{
>          "status":0,
>          "QTime":0}},
>      "192.168.56.1:7574_solr":{
>        "responseHeader":{
>          "status":0,
>          "QTime":0}},
>      "192.168.56.1:7574_solr":{
>        "responseHeader":{
>          "status":0,
>          "QTime":1}},
>      "192.168.56.1:8983_solr":{
>        "responseHeader":{
>          "status":0,
>          "QTime":35}},
>      "192.168.56.1:8983_solr":{
>        "responseHeader":{
>          "status":0,
>          "QTime":0}},
>      "192.168.56.1:8983_solr":{
>        "responseHeader":{
>          "status":0,
>          "QTime":1}},
>      "192.168.56.1:8983_solr":{
>        "responseHeader":{
>          "status":0,
>          "QTime":1}},
>      "192.168.56.1:8983_solr":{
>        "responseHeader":{
>          "status":0,
>          "QTime":0}},
>      "192.168.56.1:8983_solr":{
>        "responseHeader":{
>          "status":0,
>          "QTime":33}},
>      "192.168.56.1:8983_solr":{
>        "responseHeader":{
>          "status":0,
>          "QTime":34}},
>      "192.168.56.1:8983_solr":{
>        "responseHeader":{
>          "status":0,
>          "QTime":0}},
>      "192.168.56.1:8983_solr":{
>        "responseHeader":{
>          "status":0,
>          "QTime":40}},
>      "192.168.56.1:8984_solr":{
>        "responseHeader":{
>          "status":0,
>          "QTime":2}},
>      "192.168.56.1:8984_solr":{
>        "responseHeader":{
>          "status":0,
>          "QTime":2}},
>      "192.168.56.1:7574_solr":{
>        "responseHeader":{
>          "status":0,
>          "QTime":0}},
>      "192.168.56.1:8983_solr":{
>        "responseHeader":{
>          "status":0,
>          "QTime":0}},
>      "192.168.56.1:7574_solr":{
>        "responseHeader":{
>          "status":0,
>          "QTime":0}},
>      "192.168.56.1:7574_solr":{
>        "responseHeader":{
>          "status":0,
>          "QTime":0}},
>      "192.168.56.1:8983_solr":{
>        "responseHeader":{
>          "status":0,
>          "QTime":0}},
>      "192.168.56.1:7574_solr":{
>        "responseHeader":{
>          "status":0,
>          "QTime":0}},
>      "192.168.56.1:8984_solr":{
>        "responseHeader":{
>          "status":0,
>          "QTime":0}},
>      "192.168.56.1:8984_solr":{
>        "responseHeader":{
>          "status":0,
>          "QTime":0}},
>      "192.168.56.1:7574_solr":{
>        "responseHeader":{
>          "status":0,
>          "QTime":0}},
>      "192.168.56.1:8983_solr":{
>        "responseHeader":{
>          "status":0,
>          "QTime":0}},
>      "192.168.56.1:8983_solr":{
>        "responseHeader":{
>          "status":0,
>          "QTime":0}},
>      "192.168.56.1:8983_solr":{
>        "responseHeader":{
>          "status":0,
>          "QTime":1}}},
>    "sometag135341573915254":{
>      "responseHeader":{
>        "status":0,
>        "QTime":0},
>      "STATUS":"completed",
>      "Response":"TaskId: sometag135341573915254 webapp=null
> path=/admin/cores
> params={core=test2_shard9_replica_n34&async=sometag135341573915254&qt=/admin/cores&name=shard9&action=BACKUPCORE&location=file:///C:/Users/elyograg/Downloads/solrbackups/test2.3&wt=javabin&version=2}
>
> status=0 QTime=0"},
>    "sometag135341570605052":{
>      "responseHeader":{
>        "status":0,
>        "QTime":0},
>      "STATUS":"completed",
>      "Response":"TaskId: sometag135341570605052 webapp=null
> path=/admin/cores
> params={core=test2_shard1_replica_n1&async=sometag135341570605052&qt=/admin/cores&name=shard1&action=BACKUPCORE&location=file:///C:/Users/elyograg/Downloads/solrbackups/test2.3&wt=javabin&version=2}
>
> status=0 QTime=0"},
>    "sometag135341570647962":{
>      "responseHeader":{
>        "status":0,
>        "QTime":0},
>      "STATUS":"completed",
>      "Response":"TaskId: sometag135341570647962 webapp=null
> path=/admin/cores
> params={core=test2_shard7_replica_n26&async=sometag135341570647962&qt=/admin/cores&name=shard7&action=BACKUPCORE&location=file:///C:/Users/elyograg/Downloads/solrbackups/test2.3&wt=javabin&version=2}
>
> status=0 QTime=0"},
>    "status":{
>      "state":"completed",
>      "msg":"found [sometag] in completed tasks"}}
>
>
> As you can see, only 3 (out of 30) shards are mentioned in the response.
> When I did the same test on a 2-node cloud example, there were only 2
> shards in the response.
>
> Should all 30 shards have been in the response? Is there a bug here?
>
> If I make the request without the async parameter, the response doesn't
> contain ANY shard information at all. Because this is an empty
> collection, the backup is fast. I expected detailed information to be in
> the response.  Is that worth an issue in Jira?
>
> Side note: In the status response, the individual shard info that IS
> present doesn't indicate what node handled the CoreAdmin call.  That
> would be useful information to include.
>
> Thanks,
> Shawn
>
>

Reply via email to