Re: Speedup initial index creation

2021-04-01 Thread Kadir Ozdemir
Also, there are config params to reduce the impact of IndexTool on your cluster, such as reducing the number of mappers and the amount of work done in each RPC call a mapper makes to the server, so that you can run IndexTool outside the maintenance window. On Thu, Apr 1, 2021 at

Re: Speedup initial index creation

2021-04-01 Thread Kadir Ozdemir
If you have only one version for the rows, then you should not see much difference between old and new. Can you provide some info on your performance testing? For example, number of rows in your data table, number of indexes, number of data table regions, number of MapReduce mappers, IndexTool

Re: Speedup initial index creation

2021-04-01 Thread Alexander Batyrshin
> On 2 Apr 2021, at 03:55, Kadir Ozdemir wrote: > > 1) I was thinking about the bulk load tool (https://phoenix.apache.org/bulk_dataload.html). However, in this case, you are not interested in bulk loading into the data table and its index but just the index table. Now, I see

Re: Speedup initial index creation

2021-04-01 Thread Kadir Ozdemir
1) I was thinking about the bulk load tool (https://phoenix.apache.org/bulk_dataload.html). However, in this case, you are not interested in bulk loading into the data table and its index but just the index table. Now, I see that it would not work for you. You are supposed to build a strongly

Re: Speedup initial index creation

2021-04-01 Thread Alexander Batyrshin
1) How can I create an index the old way, via intermediate HFiles? I see a “direct” option for IndexTool, but its description says it's disabled: private static final Option DIRECT_API_OPTION = new Option("direct", "direct", false, "This parameter is deprecated. Direct mode will be used whether it is set or