MySQL Execution Plans (explain) and a Walkthrough of Index Data Structures

Author: 萤之光 | Source: Internet | 2020-12-07 13:06

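To tune a SQL statement, you first need to know how it is executed. Looking at the actual execution process is what lets you speed the statement up. Prefixing a statement with explain shows how the MySQL optimizer plans to execute the query, and so how MySQL handles the statement.

This MySQL tutorial column covers the explain execution plan and index data structures.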

Preparation

First create the database tables. These are the MySQL tables used for the demonstrations; the create statements are:

CREATE TABLE `emp` (
  `id` int(11) NOT NULL AUTO_INCREMENT COMMENT 'primary key',
  `empno` int(11) DEFAULT NULL COMMENT 'employee number',
  `ename` varchar(255) DEFAULT NULL COMMENT 'employee name',
  `job` varchar(255) DEFAULT NULL COMMENT 'job',
  `mgr` varchar(255) DEFAULT NULL COMMENT 'employee number of the manager',
  `hiredate` date DEFAULT NULL COMMENT 'hire date',
  `sal` double DEFAULT NULL COMMENT 'salary',
  `comm` double DEFAULT NULL COMMENT 'allowance',
  `deptno` int(11) DEFAULT NULL COMMENT 'department number',
  PRIMARY KEY (`id`) USING BTREE
) ENGINE=InnoDB DEFAULT CHARSET=utf8 COMMENT='employee table';

CREATE TABLE `dept` (
  `id` int(11) NOT NULL AUTO_INCREMENT COMMENT 'primary key',
  `deptno` int(11) DEFAULT NULL COMMENT 'department number',
  `dname` varchar(255) DEFAULT NULL COMMENT 'department name',
  `loc` varchar(255) DEFAULT NULL COMMENT 'location',
  PRIMARY KEY (`id`) USING BTREE
) ENGINE=InnoDB DEFAULT CHARSET=utf8 COMMENT='department table';

CREATE TABLE `salgrade` (
  `id` int(11) NOT NULL COMMENT 'primary key',
  `grade` varchar(255) DEFAULT NULL COMMENT 'grade',
  `lowsal` varchar(255) DEFAULT NULL COMMENT 'lowest salary',
  `hisal` varchar(255) DEFAULT NULL COMMENT 'highest salary',
  PRIMARY KEY (`id`)
) ENGINE=InnoDB DEFAULT CHARSET=utf8 COMMENT='salary grade table';

CREATE TABLE `bonus` (
  `id` int(11) NOT NULL COMMENT 'primary key',
  `ename` varchar(255) DEFAULT NULL COMMENT 'employee name',
  `job` varchar(255) DEFAULT NULL COMMENT 'job',
  `sal` double DEFAULT NULL COMMENT 'salary',
  `comm` double DEFAULT NULL COMMENT 'allowance',
  PRIMARY KEY (`id`)
) ENGINE=InnoDB DEFAULT CHARSET=utf8 COMMENT='bonus table';

The later exercises on execution plans, query optimization, and index optimization are all based on these tables.

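MySQL execution plan
To tune a SQL statement, you need to know how it is executed; examining the actual execution process is the basis for making it faster.
You can run `explain + SQL` to have the optimizer report how it would execute the query, which shows how MySQL processes the statement.
For details on `explain`, see the official MySQL documentation.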

explain output format

mysql> explain select * from emp;
+----+-------------+-------+------------+------+---------------+------+---------+------+------+----------+-------+
| id | select_type | table | partitions | type | possible_keys | key  | key_len | ref  | rows | filtered | Extra |
+----+-------------+-------+------------+------+---------------+------+---------+------+------+----------+-------+
|  1 | SIMPLE      | emp   | NULL       | ALL  | NULL          | NULL | NULL    | NULL |    1 |   100.00 | NULL  |
+----+-------------+-------+------------+------+---------------+------+---------+------+------+----------+-------+

Meaning of id, select_type, and the other columns:

Column	Meaning
id	The `SELECT` identifier
select_type	The `SELECT` type
table	The table for the output row
partitions	The matching partitions
type	The join type
possible_keys	The possible indexes to choose
key	The index actually chosen
key_len	The length of the chosen key
ref	The columns compared to the index
rows	Estimate of rows to be examined
filtered	Percentage of rows filtered by table condition
Extra	Additional information
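
The same plan can also be requested as JSON, which is where the "JSON Name" values in the official documentation come from. A minimal sketch, assuming the emp table from the preparation section already exists:

```sql
-- Default tabular plan, with the columns listed above.
EXPLAIN SELECT * FROM emp;

-- The same plan as a JSON document.
EXPLAIN FORMAT=JSON SELECT * FROM emp;
```

The tabular form is easier to scan; the JSON form is handy when the plan has to be read by a script.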

id

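The sequence number of a select within the query. It is a set of numbers indicating the order in which select clauses or table operations are executed.

id values fall into three cases:

If the ids are the same, execution proceeds from top to bottom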

mysql> explain select * from emp e join dept d on e.deptno = d.deptno join salgrade sg on e.sal between sg.lowsal and sg.hisal;
+----+-------------+-------+------------+------+---------------+------+---------+------+------+----------+----------------------------------------------------+
| id | select_type | table | partitions | type | possible_keys | key  | key_len | ref  | rows | filtered | Extra                                              |
+----+-------------+-------+------------+------+---------------+------+---------+------+------+----------+----------------------------------------------------+
|  1 | SIMPLE      | e     | NULL       | ALL  | NULL          | NULL | NULL    | NULL |    1 |   100.00 | NULL                                               |
|  1 | SIMPLE      | d     | NULL       | ALL  | NULL          | NULL | NULL    | NULL |    1 |   100.00 | Using where; Using join buffer (Block Nested Loop) |
|  1 | SIMPLE      | sg    | NULL       | ALL  | NULL          | NULL | NULL    | NULL |    1 |   100.00 | Using where; Using join buffer (Block Nested Loop) |
+----+-------------+-------+------------+------+---------------+------+---------+------+------+----------+----------------------------------------------------+

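Running explain on this query gives id 1 for every row, so MySQL processes the tables from top to bottom.

If the ids differ (a subquery gets an incremented id), the larger the id, the higher the priority and the earlier it is executed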

mysql> explain select * from emp e where e.deptno in (select d.deptno from dept d where d.dname = 'SALEDept');
+----+--------------+-------------+------------+------+---------------+------+---------+------+------+----------+----------------------------------------------------+
| id | select_type  | table       | partitions | type | possible_keys | key  | key_len | ref  | rows | filtered | Extra                                              |
+----+--------------+-------------+------------+------+---------------+------+---------+------+------+----------+----------------------------------------------------+
|  1 | SIMPLE       | <subquery2> | NULL       | ALL  | NULL          | NULL | NULL    | NULL | NULL |   100.00 | NULL                                               |
|  1 | SIMPLE       | e           | NULL       | ALL  | NULL          | NULL | NULL    | NULL |    2 |    50.00 | Using where; Using join buffer (Block Nested Loop) |
|  2 | MATERIALIZED | d           | NULL       | ALL  | NULL          | NULL | NULL    | NULL |    1 |   100.00 | Using where                                        |
+----+--------------+-------------+------------+------+---------------+------+---------+------+------+----------+----------------------------------------------------+

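In this example, the row with id 2 is executed first, then the rows with id 1.

If equal and different ids appear together: rows with the same id form a group and run from top to bottom; across groups, the larger the id, the higher the priority and the earlier it is executed

Taking the example above again: id 2 is executed first, then the rows with id 1 are executed in order from top to bottom.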

select_type

Mainly used to distinguish the type of query: an ordinary query, a union query, or a subquery.

`select_type` Value	JSON Name	Meaning
SIMPLE	None	Simple SELECT (not using UNION or subqueries)
PRIMARY	None	Outermost SELECT
UNION	None	Second or later SELECT statement in a UNION
DEPENDENT UNION	dependent (true)	Second or later SELECT statement in a UNION, dependent on outer query
UNION RESULT	union_result	Result of a UNION.
SUBQUERY	None	First SELECT in subquery
DEPENDENT SUBQUERY	dependent (true)	First SELECT in subquery, dependent on outer query
DERIVED	None	Derived table
MATERIALIZED	materialized_from_subquery	Materialized subquery
UNCACHEABLE SUBQUERY	cacheable (false)	A subquery for which the result cannot be cached and must be re-evaluated for each row of the outer query
UNCACHEABLE UNION	cacheable (false)	The second or later select in a UNION that belongs to an uncacheable subquery (see UNCACHEABLE SUBQUERY)

SIMPLE: a simple query that contains no subquery or union

mysql> explain select * from emp;
+----+-------------+-------+------------+------+---------------+------+---------+------+------+----------+-------+
| id | select_type | table | partitions | type | possible_keys | key  | key_len | ref  | rows | filtered | Extra |
+----+-------------+-------+------------+------+---------------+------+---------+------+------+----------+-------+
|  1 | SIMPLE      | emp   | NULL       | ALL  | NULL          | NULL | NULL    | NULL |    3 |   100.00 | NULL  |
+----+-------------+-------+------------+------+---------------+------+---------+------+------+----------+-------+

primary: if the query contains a subquery or a union, the outermost query is marked PRIMARY
union: a second or later select that appears after union is marked UNION

mysql> explain select * from emp where deptno = 1001 union select * from emp where sal <5000;
+----+--------------+------------+------------+------+---------------+------+---------+------+------+----------+-----------------+
| id | select_type  | table      | partitions | type | possible_keys | key  | key_len | ref  | rows | filtered | Extra           |
+----+--------------+------------+------------+------+---------------+------+---------+------+------+----------+-----------------+
|  1 | PRIMARY      | emp        | NULL       | ALL  | NULL          | NULL | NULL    | NULL |    4 |    25.00 | Using where     |
|  2 | UNION        | emp        | NULL       | ALL  | NULL          | NULL | NULL    | NULL |    4 |    33.33 | Using where     |
| NULL | UNION RESULT | <union1,2> | NULL       | ALL  | NULL          | NULL | NULL    | NULL | NULL |     NULL | Using temporary |
+----+--------------+------------+------------+------+---------------+------+---------+------+------+----------+-----------------+

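The select_type column of this statement contains both PRIMARY and UNION

dependent union: similar to union; here dependent means that the result combined by union or union all is affected by the outer table
union result: the select that retrieves the result from the union table
dependent subquery: a subquery that is affected by the outer query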

mysql> explain select * from emp e where e.empno  in ( select empno from emp where deptno = 1001 union select empno from emp where sal <5000);
+----+--------------------+------------+------------+------+---------------+------+---------+------+------+----------+-----------------+
| id | select_type        | table      | partitions | type | possible_keys | key  | key_len | ref  | rows | filtered | Extra           |
+----+--------------------+------------+------------+------+---------------+------+---------+------+------+----------+-----------------+
|  1 | PRIMARY            | e          | NULL       | ALL  | NULL          | NULL | NULL    | NULL |    4 |   100.00 | Using where     |
|  2 | DEPENDENT SUBQUERY | emp        | NULL       | ALL  | NULL          | NULL | NULL    | NULL |    4 |    25.00 | Using where     |
|  3 | DEPENDENT UNION    | emp        | NULL       | ALL  | NULL          | NULL | NULL    | NULL |    4 |    25.00 | Using where     |
| NULL | UNION RESULT       | <union2,3> | NULL       | ALL  | NULL          | NULL | NULL    | NULL | NULL |     NULL | Using temporary |
+----+--------------------+------------+------------+------+---------------+------+---------+------+------+----------+-----------------+

The plan for this statement contains PRIMARY, DEPENDENT SUBQUERY, DEPENDENT UNION, and UNION RESULT

subquery: a subquery in the select list or the where clause

Example:

mysql> explain select * from emp where sal > (select avg(sal) from emp) ;
+----+-------------+-------+------------+------+---------------+------+---------+------+------+----------+-------------+
| id | select_type | table | partitions | type | possible_keys | key  | key_len | ref  | rows | filtered | Extra       |
+----+-------------+-------+------------+------+---------------+------+---------+------+------+----------+-------------+
|  1 | PRIMARY     | emp   | NULL       | ALL  | NULL          | NULL | NULL    | NULL |    4 |    33.33 | Using where |
|  2 | SUBQUERY    | emp   | NULL       | ALL  | NULL          | NULL | NULL    | NULL |    4 |   100.00 | NULL        |
+----+-------------+-------+------------+------+---------------+------+---------+------+------+----------+-------------+

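DERIVED: a subquery that appears in the from clause, also called a derived table
MATERIALIZED: a materialized subquery, that is, a subquery whose result is stored in a temporary table before it is joined (as in the `in` example in the id section)
UNCACHEABLE SUBQUERY: a subquery whose result cannot be cached

For example: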

mysql> explain select * from emp where empno = (select empno from emp where deptno=@@sort_buffer_size);
+----+----------------------+-------+------------+------+---------------+------+---------+------+------+----------+-------------+
| id | select_type          | table | partitions | type | possible_keys | key  | key_len | ref  | rows | filtered | Extra       |
+----+----------------------+-------+------------+------+---------------+------+---------+------+------+----------+-------------+
|  1 | PRIMARY              | emp   | NULL       | ALL  | NULL          | NULL | NULL    | NULL |    4 |   100.00 | Using where |
|  2 | UNCACHEABLE SUBQUERY | emp   | NULL       | ALL  | NULL          | NULL | NULL    | NULL |    4 |    25.00 | Using where |
+----+----------------------+-------+------------+------+---------------+------+---------+------+------+----------+-------------+

uncacheable union: a union whose query result cannot be cached

table

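The table that the row is accessing: a table name or alias, possibly a temporary table or a union result set.
If it is a concrete table name, data is read from the actual physical table; it may also be the table's alias
A name of the form <derivedN> means the row reads the derived table produced by the query whose id is N
For a union result, the name has the form <unionM,N>, where M and N are the ids of the selects taking part in the union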
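
To see the <derivedN> naming in practice, the sketch below puts a subquery in the from clause, using the emp table from the preparation section. The alias t and the column alias emp_count are made up for this example, and the exact plan depends on the MySQL version and optimizer settings. Because the subquery aggregates with group by, the optimizer generally cannot merge it into the outer query, so a DERIVED row is usually reported.

```sql
-- The inner SELECT is a derived table. In the plan, the outer row typically
-- reads from <derived2>, where 2 is the id of the inner SELECT.
EXPLAIN
SELECT t.deptno, t.emp_count
FROM (
    SELECT deptno, COUNT(*) AS emp_count
    FROM emp
    GROUP BY deptno
) AS t;
```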

type

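type shows the access type, that is, how MySQL accesses the data. The most obvious method is a full table scan, which brute-forces through the whole table to find the needed rows and is very inefficient.
There are many access types. From best to worst they are:
system > const > eq_ref > ref > fulltext > ref_or_null > index_merge > unique_subquery > index_subquery > range > index > ALL
In general, a query should reach at least the range level, and preferably ref

all: full table scan. If this shows up for a statement on a fairly large table, the statement generally needs optimizing

Usually, ALL can be avoided by adding an index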
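
A minimal sketch of that fix on the emp table from the preparation section. The index name idx_emp_deptno is hypothetical, and on a table with only a handful of rows the optimizer may still make a different choice.

```sql
-- idx_emp_deptno is a hypothetical index name chosen for this sketch.
CREATE INDEX idx_emp_deptno ON emp (deptno);

-- With the index in place the optimizer can look rows up by deptno,
-- so type is expected to change from ALL to ref.
EXPLAIN SELECT * FROM emp WHERE deptno = 1001;

-- Drop the index again so the other examples still match the article's output.
DROP INDEX idx_emp_deptno ON emp;
```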

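index: full index scan. This is more efficient than all, and arises mainly in two situations:
- the query uses a covering index, so all the needed data can be read from the index
- the index is used for sorting, which avoids sorting the data again
range: the index is used with a bounded range, so only the given range is searched and the full index scan of index is avoided. Applicable operators: =, <>, >, >=, <, <=, IS NULL, BETWEEN, LIKE, or IN()

Examples from the official documentation: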

SELECT * FROM tbl_name WHERE key_column = 10;
SELECT * FROM tbl_name WHERE key_column BETWEEN 10 and 20;
SELECT * FROM tbl_name WHERE key_column IN (10,20,30);
SELECT * FROM tbl_name WHERE key_part1 = 10 AND key_part2 IN (10,20,30);
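
The same patterns can be tried against the emp table from the preparation section, whose only index is the primary key on id. This is a sketch; the literal values are arbitrary, and with very few rows the optimizer may pick another access type.

```sql
-- Range conditions on the primary key of emp; type is expected to be range.
EXPLAIN SELECT * FROM emp WHERE id BETWEEN 1 AND 3;
EXPLAIN SELECT * FROM emp WHERE id IN (1, 2, 3);
EXPLAIN SELECT * FROM emp WHERE id > 2;
```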

index_subquery: uses an index to resolve the subquery, so the whole table is no longer scanned

value IN (SELECT key_column FROM single_table WHERE some_expr)

unique_subquery: similar to index_subquery, but uses a unique index

value IN (SELECT primary_key FROM single_table WHERE some_expr)

index_merge: the query combines several indexes
ref_or_null: when a column needs both the join condition and rows where it is null, the optimizer chooses this access type

SELECT * FROM ref_table
  WHERE key_column=expr OR key_column IS NULL;
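
For index_merge, a typical trigger is an or between conditions on two separately indexed columns. The sketch below uses the emp table from the preparation section; both index names and the literal values are hypothetical, and on a small table the optimizer may prefer a full scan instead.

```sql
-- Hypothetical single-column indexes on emp.
CREATE INDEX idx_emp_empno  ON emp (empno);
CREATE INDEX idx_emp_deptno ON emp (deptno);

-- Each side of the OR can use a different index. When the optimizer combines
-- them, type is index_merge and the key column lists both indexes.
EXPLAIN SELECT * FROM emp WHERE empno = 7369 OR deptno = 1001;

-- Clean up so the other examples still match the article's output.
DROP INDEX idx_emp_empno  ON emp;
DROP INDEX idx_emp_deptno ON emp;
```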

fulltext: the join is performed using a FULLTEXT index
ref: a non-unique index is used to look up the data

SELECT * FROM ref_table WHERE key_column=expr;
SELECT * FROM ref_table,other_table WHERE ref_table.key_column=other_table.column;
SELECT * FROM ref_table,other_table WHERE ref_table.key_column_part1=other_table.column AND ref_table.key_column_part2=1;

eq_ref: a unique index is used to look up the data

SELECT * FROM ref_table,other_table WHERE ref_table.key_column=other_table.column;
SELECT * FROM ref_table,other_table WHERE ref_table.key_column_part1=other_table.column AND ref_table.key_column_part2=1;

const: the table has at most one matching row

SELECT * FROM tbl_name WHERE primary_key=1;
SELECT * FROM tbl_name WHERE primary_key_part1=1 AND primary_key_part2=2;

For example:

mysql> explain select * from emp where id = 1;
+----+-------------+-------+------------+-------+---------------+---------+---------+-------+------+----------+-------+
| id | select_type | table | partitions | type  | possible_keys | key     | key_len | ref   | rows | filtered | Extra |
+----+-------------+-------+------------+-------+---------------+---------+---------+-------+------+----------+-------+
|  1 | SIMPLE      | emp   | NULL       | const | PRIMARY       | PRIMARY | 4       | const |    1 |   100.00 | NULL  |
+----+-------------+-------+------------+-------+---------------+---------+---------+-------+------+----------+-------+

system: the table has only one row (equivalent to a system table). This is a special case of const and does not normally appear

possible_keys

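Shows the indexes that could be applied to this table, one or more. Any index on a column involved in the query is listed, but it is not necessarily the one actually used

Note:

A B+Tree has two head pointers: one points to the root node and the other to the leaf node with the smallest key, and all leaf nodes (that is, the data nodes) are linked to one another in a chained ring structure.

So two kinds of lookup are possible on a B+Tree: range and paging lookups on the primary key, which follow the leaf chain, and random lookups, which start from the root node.

Because the leaf nodes of a B+ tree hold the data while the non-leaf nodes hold only keys, a quick calculation shows that even a B+ tree with only 3 levels can support data on the order of tens of millions of rows.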
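
That calculation can be sketched as follows. The sizes are assumptions for a rough estimate, not figures from the article: a 16 KB page (the InnoDB default), an 8-byte bigint key plus a 6-byte child pointer per non-leaf entry, and rows of about 1 KB each.

```sql
-- Assumed sizes: 16384-byte page, 8-byte key + 6-byte pointer, 1024-byte rows.
SELECT
    FLOOR(16384 / (8 + 6)) AS keys_per_nonleaf_page,  -- 1170
    FLOOR(16384 / 1024)    AS rows_per_leaf_page,     -- 16
    -- root fan-out * second-level fan-out * rows per leaf page
    FLOOR(16384 / (8 + 6)) * FLOOR(16384 / (8 + 6)) * FLOOR(16384 / 1024)
                           AS rows_in_three_levels;   -- 21902400
```

Under these assumptions a 3-level tree holds roughly 21.9 million rows; wider keys or wider rows bring the number down.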

Technical (show-off) terms you need to know

Suppose there is a table like the following, where id is the primary key:

mysql> select * from stu;
+------+---------+------+
| id   | name    | age  |
+------+---------+------+
|    1 | Jack Ma |   18 |
|    2 | Pony    |   19 |
+------+---------+------+

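Back-to-table lookup

We build an ordinary index on the ordinary column name, and then run this query:

select * from stu where name='Pony';

Because name is indexed, the query first searches the B+ tree of name, finds the primary key id, and then searches the B+ tree of the primary key id to find the whole row.

Going back to the primary key's B+ tree at the end is what is called a back-to-table lookup.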

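Covering index

If the query is this instead:

mysql> select id from stu where name='Pony';

there is no back-to-table lookup, because the primary key id is found directly in the name index and can be returned right away; nothing else needs to be looked up.

When the index alone answers the query and no back-to-table lookup is needed, it is called a covering index.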
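
The difference between the two queries can be checked with explain. A minimal sketch: stu_demo and idx_name are hypothetical names chosen here (the article does not give the create statement for stu), the column sizes are assumptions, and the exact plan can vary with the MySQL version and table statistics.

```sql
-- Hypothetical copy of the stu table with a secondary index on name.
DROP TABLE IF EXISTS stu_demo;
CREATE TABLE stu_demo (
    id   INT NOT NULL,
    name VARCHAR(64) DEFAULT NULL,
    age  INT DEFAULT NULL,
    PRIMARY KEY (id),
    KEY idx_name (name)
) ENGINE=InnoDB;

INSERT INTO stu_demo (id, name, age) VALUES (1, 'Jack Ma', 18), (2, 'Pony', 19);

-- Needs age, which idx_name does not contain: after finding id in idx_name,
-- the full row has to be read from the primary key tree.
EXPLAIN SELECT * FROM stu_demo WHERE name = 'Pony';

-- Needs only id, which every InnoDB secondary index entry already carries:
-- Extra is expected to show "Using index".
EXPLAIN SELECT id FROM stu_demo WHERE name = 'Pony';
```

Using index in the Extra column is how explain reports a covering index; its absence in the first plan is the sign that rows are fetched from the primary key tree.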

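Leftmost prefix matching

Now build a composite index (name, age) on the name and age columns, and run this query:

select * from stu where name=? and age=?

This query uses the composite index (name, age), matching name first and then age. If the query becomes:

select * from stu where age=?

name is no longer part of the condition, so the index cannot be used to locate the rows. This is the leftmost prefix matching rule.

What if I do need to query by age and still want an index to speed it up? You can:

(Recommended) reverse the column order of the composite index (name, age) and build an (age, name) index
or build a separate index on the age column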
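
The rule and the second workaround can be tried out as below. This is a sketch under assumptions: stu_demo, idx_name_age, and idx_age are hypothetical names, and the remark column is an extra column added only so that the composite index does not happen to contain every column of the table (otherwise the optimizer may scan the whole index and blur the comparison).

```sql
-- Hypothetical table modelled on stu, plus an extra non-indexed column.
DROP TABLE IF EXISTS stu_demo;
CREATE TABLE stu_demo (
    id     INT NOT NULL,
    name   VARCHAR(64) DEFAULT NULL,
    age    INT DEFAULT NULL,
    remark VARCHAR(64) DEFAULT NULL,
    PRIMARY KEY (id),
    KEY idx_name_age (name, age)
) ENGINE=InnoDB;

INSERT INTO stu_demo (id, name, age) VALUES (1, 'Jack Ma', 18), (2, 'Pony', 19);

-- Leading column name is present: idx_name_age can locate the rows.
EXPLAIN SELECT * FROM stu_demo WHERE name = 'Pony' AND age = 19;

-- Leading column is missing: possible_keys is expected to be NULL.
EXPLAIN SELECT * FROM stu_demo WHERE age = 19;

-- Second option from the text: a separate index on age.
CREATE INDEX idx_age ON stu_demo (age);
EXPLAIN SELECT * FROM stu_demo WHERE age = 19;
```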

Index condition pushdown

It may also be called predicate pushdown...

select t1.name,t2.name from t1 join t2 on t1.id=t2.id

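t1 has 10 rows and t2 has 20 rows.

Let us guess how this might be executed. Either:

first join t1 and t2 on id (20 rows after the join), then read t1.name and t2.name

or:

first pick out t1.name and t2.name, then join on id

Without index condition pushdown, MySQL can only use the index to fetch all the joined rows of t1 and t2, and then check them one by one against all the conditions.

With index condition pushdown, the data stored in the index is used to decide whether the row behind the current index entry satisfies the conditions, and the full row is read only for entries that do.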
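
On a single table the effect is easier to observe. The sketch below is not the article's join example: it uses a hypothetical table stu_demo with a composite index (name, age) and an extra remark column (added so the index is not covering), and toggles MySQL's index_condition_pushdown optimizer switch to compare plans. With only two rows the optimizer may choose a different plan than on real data.

```sql
-- Hypothetical table; names and column sizes are assumptions for this sketch.
DROP TABLE IF EXISTS stu_demo;
CREATE TABLE stu_demo (
    id     INT NOT NULL,
    name   VARCHAR(64) DEFAULT NULL,
    age    INT DEFAULT NULL,
    remark VARCHAR(64) DEFAULT NULL,
    PRIMARY KEY (id),
    KEY idx_name_age (name, age)
) ENGINE=InnoDB;

INSERT INTO stu_demo (id, name, age) VALUES (1, 'Jack Ma', 18), (2, 'Pony', 19);

-- The prefix match on name bounds the index range. The age condition cannot
-- narrow that range, but it can be checked against each index entry before
-- the full row is read. When this happens, Extra shows "Using index condition".
EXPLAIN SELECT * FROM stu_demo WHERE name LIKE 'P%' AND age = 19;

-- Switch the optimization off for this session and compare the plan.
SET optimizer_switch = 'index_condition_pushdown=off';
EXPLAIN SELECT * FROM stu_demo WHERE name LIKE 'P%' AND age = 19;

-- Restore the default.
SET optimizer_switch = 'index_condition_pushdown=on';
```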

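Summary
`Explain`: to optimize a SQL statement you need to see how it is actually executed, so that you can make it run faster.
The advantages and uses of indexes.
The data structure used by indexes is the B+ tree.
Back-to-table lookups, covering indexes, leftmost prefix matching, and index condition pushdown.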