szehon-ho commented on PR #53572:
URL: https://github.com/apache/spark/pull/53572#issuecomment-3688263826

   Actually, I did some analysis and limited the fallback "UsesCachedData" to 
only Show commands.
   
   All the Describe commands I tested are actually idempotent: in V2 they hold 
an in-memory reference to the DSV2 Table object, and in V1 they are idempotent 
because of RelationCache. So a user triggering a second run by calling 
describeDf.cache() should not see any difference.
   
   Also, Show output is more likely to be cached than Describe output, as the 
result is typically a list of entities and is bigger (though it's not that 
likely overall).


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

