[ https://issues.apache.org/jira/browse/PHOENIX-1609?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14650027#comment-14650027 ]
Thomas D'Silva commented on PHOENIX-1609: ----------------------------------------- [~maghamraviki...@gmail.com] I was testing out the MR index build on a table with 1 billion rows. The map reduce job completed but I got the following error at the end when it tries to load the HFiles. Any idea what might be happening? 2015-07-31 23:39:39,746 ERROR [main] index.IndexTool - An exception occured while performing the indexing job : java.io.IOException: Trying to load more than 32 hfiles to one family of one region at org.apache.hadoop.hbase.mapreduce.LoadIncrementalHFiles.doBulkLoad(LoadIncrementalHFiles.java:371) at org.apache.phoenix.mapreduce.index.IndexTool.run(IndexTool.java:226) at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:70) at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:84) at org.apache.phoenix.mapreduce.index.IndexTool.main(IndexTool.java:299) > MR job to populate index tables > -------------------------------- > > Key: PHOENIX-1609 > URL: https://issues.apache.org/jira/browse/PHOENIX-1609 > Project: Phoenix > Issue Type: New Feature > Reporter: maghamravikiran > Assignee: maghamravikiran > Fix For: 5.0.0, 4.4.0 > > Attachments: 0001-PHOENIX-1609-4.0.patch, > 0001-PHOENIX-1609-4.0.patch, 0001-PHOENIX-1609-wip.patch, > 0001-PHOENIX_1609.patch, PHOENIX-1609-master.patch > > > Often, we need to create new indexes on master tables way after the data > exists on the master tables. It would be good to have a simple MR job given > by the phoenix code that users can call to have indexes in sync with the > master table. > Users can invoke the MR job using the following command > hadoop jar org.apache.phoenix.mapreduce.Index -st MASTER_TABLE -tt > INDEX_TABLE -columns a,b,c > Is this ideal? -- This message was sent by Atlassian JIRA (v6.3.4#6332)