作者:没有变成王子的青蛙 | 来源:互联网 | 2023-08-07 18:27
Spark sql catalyst解析过程及 RBO:http://www.jasongj.com/spark/rbo/
CBO详解:http://www.jasongj.com/spark/cbo/
CBO源码间析:https://my.oschina.net/KasuganoShin/blog/4427738
catalyst 介绍 、SparkPlanner 及 UI 分析:https://www.jianshu.com/p/410c23efb565
分析表操作:https://www.qubole.com/blog/automatic-statistics-collection-better-query-performance/
If you run the Hive statement ANALYZE TABLE COMPUTE STATISTICS FOR COLUMNS
, Impala can only use the resulting column statistics if the table is unpartitioned. Impala cannot use Hive-generated column statistics for a partitioned table.
https://docs.cloudera.com/runtime/7.2.7/impala-reference/topics/impala-perf-stats.html