Hello,

My question is about the interaction between shard allocation awareness
and shard allocation filtering.

We have two kinds of nodes: those with ssds (used for indexing and searching
recent data) and those with large spinning disks (used for archiving old
indices).

I'd like to set up a mechanism to move old indices from ssds to spinning
disks.

The first solution uses the reroute command in the cluster API. However, it feels
unnatural since you have to move shards one by one and choose the target node yourself.
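
For reference, such a move looks roughly like the following (index name, shard
number and node names below are placeholders, not our real values):

# Hypothetical example: explicitly move one shard from an ssd node to an iodisk node.
curl -XPOST 'localhost:9200/_cluster/reroute' -d '{
  "commands": [
    { "move": { "index": "2014-10-16.01", "shard": 3,
                "from_node": "ssd-node-01", "to_node": "iodisk-node-01" } }
  ]
}'

Doing this for every shard of every aging index is what feels unmanageable.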

What I want to achieve is the following:
1. Stick recent indices (the current one being written) to ssds. They have
2 copies (1 primary + 1 replica).
2. At some point (when disk usage on the ssds goes above 65%), one copy is moved
to the larger boxes (1 copy stays on ssd to help search, 1 copy on a large box).
3. When disk space becomes scarce on the ssd boxes (90%), we simply drop the copy
on ssd. Since we don't care that much about old data, having only one copy is
not an issue.
(See the sketch after this list for how the thresholds could be checked.)
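
As a side note, this is how I imagine checking those thresholds from an external
script, assuming the _cat allocation API; the 65%/90% values and the cron-driven
approach are my own choices, nothing built into Elasticsearch:

# Hypothetical external check (e.g. run from cron); _cat/allocation reports
# per-node shard count and disk usage.
curl -XGET 'localhost:9200/_cat/allocation?v'
# Parse the disk columns for the ssd-flavored nodes and, once usage crosses
# 65% or 90%, apply the index settings changes described further below.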

I have tried to implement this with shard allocation awareness and
allocation filtering, but it does not seem to work as expected.

Nodes have a *flavor* attribute depending on their hardware (*ssd* or *iodisk*).
The cluster uses shard allocation awareness based on the *flavor* attribute
(*cluster.routing.allocation.awareness.attributes: flavor*).
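
For completeness, the relevant configuration is roughly the following (a sketch,
assuming the attribute is declared in elasticsearch.yml):

# elasticsearch.yml on an ssd node (iodisk nodes use: node.flavor: iodisk)
node.flavor: ssd
# on every node (or via the cluster settings API):
cluster.routing.allocation.awareness.attributes: flavor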

1. My index template has *routing.allocation.require: ssd* to impose
having all copies on ssds first.
2. At some point, I drop the requirement (effectively clearing
*routing.allocation.require*). I expect flavor awareness to move one copy to the
large (iodisk) boxes.
3. At a later point, I'll set *number_of_replicas* to 0 and change
*routing.allocation.require* to *iodisk* to drop the shard copy on ssds
(see the curl sketch below for all three steps).
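
A minimal curl sketch of those three steps, assuming the full setting name is
*index.routing.allocation.require.flavor* and that setting it to an empty string
clears the requirement (template name, index pattern and index name are placeholders):

# 1. Template pinning new indices to ssd nodes (2 copies = 1 primary + 1 replica):
curl -XPUT 'localhost:9200/_template/recent' -d '{
  "template": "2014-*",
  "settings": {
    "index.routing.allocation.require.flavor": "ssd",
    "index.number_of_replicas": 1
  }
}'

# 2. Later, drop the requirement so awareness can move one copy to iodisk nodes:
curl -XPUT 'localhost:9200/2014-10-16.01/_settings' -d '{
  "index.routing.allocation.require.flavor": ""
}'

# 3. Finally, keep a single copy on iodisk nodes only:
curl -XPUT 'localhost:9200/2014-10-16.01/_settings' -d '{
  "index.number_of_replicas": 0,
  "index.routing.allocation.require.flavor": "iodisk"
}'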

Sadly, allocation filtering and shard awareness do not seem to cooperate well:
when a new index is created, one copy goes to the ssds and the other is not
allocated anywhere (the index stays in yellow state).

Using *curl -XPUT localhost:9200/_cluster/settings -d
'{"transient":{"logger.cluster.routing.allocation":"trace"}}'*,
I observed what happens when a new index is created.

> [2014-10-16 06:53:19,462][TRACE][cluster.routing.allocation.decider] [bungeearchive01-par.storage.criteo.preprod] Can not allocate [[2014-10-16.01][3], node[null], [R], s[UNASSIGNED]] on node [qK34VLdhTferCQs2oNJOyg] due to [SameShardAllocationDecider]
> [2014-10-16 06:53:19,463][TRACE][cluster.routing.allocation.decider] [bungeearchive01-par.storage.criteo.preprod] Can not allocate [[2014-10-16.01][3], node[null], [R], s[UNASSIGNED]] on node [gE7OTgevSUuoj44RozxK0Q] due to [AwarenessAllocationDecider]
> [2014-10-16 06:53:19,463][TRACE][cluster.routing.allocation.decider] [bungeearchive01-par.storage.criteo.preprod] Can not allocate [[2014-10-16.01][3], node[null], [R], s[UNASSIGNED]] on node [Y2k9qXfsTx6X2iQTxg9RBQ] due to [AwarenessAllocationDecider]
> [2014-10-16 06:53:19,463][TRACE][cluster.routing.allocation.decider] [bungeearchive01-par.storage.criteo.preprod] Can not allocate [[2014-10-16.01][3], node[null], [R], s[UNASSIGNED]] on node [FwWc2XPPRWuje2KH6AlDEQ] due to [FilterAllocationDecider]
> [2014-10-16 06:53:19,492][TRACE][cluster.routing.allocation.allocator] [bungeearchive01-par.storage.criteo.preprod] No Node found to assign shard [[2014-10-16.01][3], node[null], [R], s[UNASSIGNED]]


This transcript shows that:
- shard 3's primary is on node qK34VLdhTferCQs2oNJOyg (flavor: ssd), which
prevents the replica from being placed there (SameShardAllocationDecider)
- the replica cannot be placed on gE7OTgevSUuoj44RozxK0Q (ssd as well) because
awareness tries to maximize dispersion across flavors (AwarenessAllocationDecider)
- it cannot be placed on Y2k9qXfsTx6X2iQTxg9RBQ for the same reason
- it cannot be placed on FwWc2XPPRWuje2KH6AlDEQ (flavor: iodisk) because of
the allocation filter (FilterAllocationDecider)

Questions:
- Am I doing something wrong?
- Should I stick with a set of reroute commands?
- Are awareness and filtering supposed to cooperate?

Any help will be appreciated.

-- 
Grégoire Seux
