soumilshah1995 opened a new issue, #10815:
URL: https://github.com/apache/hudi/issues/10815

   
   
   Greetings,
   
   I hope this message finds you well. There was a recent discussion in the 
group regarding the possibility of changing the index type from Bloom to RLI on 
an older table. My understanding was that RLI needed to be created on a fresh 
table. However, there have been discussions on Slack suggesting that we can 
utilize the Hudi indexer for this purpose.
   
   Before proceeding, I would like to kindly request clarification and 
verification on whether such a transition is feasible or not. Your insights on 
this matter would be greatly appreciated.
   
   ```
   spark-submit \
       --class org.apache.hudi.utilities.HoodieIndexer \
       --packages 'org.apache.hudi:hudi-spark3.4-bundle_2.12:0.14.0' \
       --master 'local[*]' \
       --executor-memory 1g \
       
/Users/soumilshah/IdeaProjects/SparkProject/DeltaStreamer/jar/hudi-utilities-slim-bundle_2.12-0.14.0.jar
 \
        --mode scheduleAndExecute \
       --base-path 
file:///Users/soumilshah/IdeaProjects/SparkProject/DeltaStreamer/hudi/bronze_orders
 \
       --table-name bronze_orders \
       --index-types RECORD_INDEX \
       --hoodie-conf "hoodie.metadata.enable=true" \
       --hoodie-conf "hoodie.metadata.index.async=true" \
       --hoodie-conf 
"hoodie.write.concurrency.mode=optimistic_concurrency_control" \
       --hoodie-conf 
"hoodie.write.lock.provider=org.apache.hudi.client.transaction.lock.InProcessLockProvider"
 \
       --parallelism 2 \
       --spark-memory 2g
   ```
   # Error 
   ```
   
   /03/04 18:05:57 ERROR UtilHelpers: Indexer failed
   org.apache.hudi.exception.HoodieMetadataException: Failed to index partition 
[record_index]
        at 
org.apache.hudi.table.action.index.RunIndexActionExecutor.execute(RunIndexActionExecutor.java:181)
        at 
org.apache.hudi.table.HoodieSparkCopyOnWriteTable.index(HoodieSparkCopyOnWriteTable.java:308)
        at 
org.apache.hudi.client.BaseHoodieWriteClient.index(BaseHoodieWriteClient.java:1009)
        at 
org.apache.hudi.utilities.HoodieIndexer.scheduleAndRunIndexing(HoodieIndexer.java:294)
        at 
org.apache.hudi.utilities.HoodieIndexer.lambda$start$1(HoodieIndexer.java:199)
        at org.apache.hudi.utilities.UtilHelpers.retry(UtilHelpers.java:602)
        at org.apache.hudi.utilities.HoodieIndexer.start(HoodieIndexer.java:186)
        at org.apache.hudi.utilities.HoodieIndexer.main(HoodieIndexer.java:155)
        at 
java.base/jdk.internal.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
        at 
java.base/jdk.internal.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:62)
        at 
java.base/jdk.internal.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
        at java.base/java.lang.reflect.Method.invoke(Method.java:566)
        at 
org.apache.spark.deploy.JavaMainApplication.start(SparkApplication.scala:52)
        at 
org.apache.spark.deploy.SparkSubmit.org$apache$spark$deploy$SparkSubmit$$runMain(SparkSubmit.scala:1020)
        at 
org.apache.spark.deploy.SparkSubmit.doRunMain$1(SparkSubmit.scala:192)
        at org.apache.spark.deploy.SparkSubmit.submit(SparkSubmit.scala:215)
        at org.apache.spark.deploy.SparkSubmit.doSubmit(SparkSubmit.scala:91)
        at 
org.apache.spark.deploy.SparkSubmit$$anon$2.doSubmit(SparkSubmit.scala:1111)
        at org.apache.spark.deploy.SparkSubmit$.main(SparkSubmit.scala:1120)
        at org.apache.spark.deploy.SparkSubmit.main(SparkSubmit.scala)
   24/03/04 18:05:57 ERROR HoodieIndexer: Indexing with basePath: 
file:///Users/soumilshah/IdeaProjects/SparkProject/DeltaStreamer/hudi/bronze_orders,
 tableName: bronze_orders, runningMode: scheduleAndExecute failed
   24/03/04 18:05:57 INFO SparkContext: SparkContext is stopping with exitCode 
0.
   24/03/04 18:05:57 INFO SparkUI: Stopped Spark web UI at 
http://soumils-mbp:8090
   24/03/04 18:05:57 INFO MapOutputTrackerMasterEndpoint: 
MapOutputTrackerMasterEndpoint stopped!
   24/03/04 18:05:57 INFO MemoryStore: MemoryStore cleared
   24/03/04 18:05:57 INFO BlockManager: BlockManager stopped
   24/03/04 18:05:57 INFO BlockManagerMaster: BlockManagerMaster stopped
   24/03/04 18:05:57 INFO 
OutputCommitCoordinator$OutputCommitCoordinatorEndpoint: 
OutputCommitCoordinator stopped!
   24/03/04 18:05:57 INFO SparkContext: Successfully stopped SparkContext
   24/03/04 18:06:01 INFO ShutdownHookManager: Shutdown hook called
   24/03/04 18:06:01 INFO ShutdownHookManager: Deleting directory 
/private/var/folders/qq/s_1bjv516pn_mck29cwdwxnm0000gp/T/spark-b008e82d-b028-4eb0-b703-830d213bb5b4
   24/03/04 18:06:01 INFO ShutdownHookManager: Deleting directory 
/private/var/folders/qq/s_1bjv516pn_mck29cwdwxnm0000gp/T/spark-197d2ead-2361-4fdc-988e-bd785a89a1fb
   soumilshah@Soumils-MBP DeltaStreamer % 
   
   ```
   
   Thank you for your attention to this inquiry.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

Reply via email to