当前位置: 开发笔记 > 数据库 > 正文

关于分页查询和columnisnull能否走索引的分析补充

作者：超级a9厑厑 | 来源：互联网 | 2018-06-12 02:20

群里有朋友在谈到关于分页查询的问题，类似下面的sql想让其走索引select*from(select*fromtaorderbyobject_iddesc)whererownum这位朋友在排序列上建立了索引，但是执行计划并不走索引来避免排序，而是全表扫描然后排序后取了前几条数据，这个

群里有朋友在谈到关于分页查询的问题，类似下面的sql想让其走索引 select * from (select * from ta order by object_id desc) where rownum 这位朋友在排序列上建立了索引，但是执行计划并不走索引来避免排序，而是全表扫描然后排序后取了前几条数据，这个

群里有朋友在谈到关于分页查询的问题，类似下面的sql想让其走索引 select * from (select * from ta order by object_id desc) where rownum<10; 这位朋友在排序列上建立了索引，但是执行计划并不走索引来避免排序，而是全表扫描然后排序后取了前几条数据，这个消耗成本是很高的，我们来看看如何让这类分页查询走索引（这里的索引我们都理解为b tree索引，而不是bitmap索引）


SQL> select * from v$version;
BANNER

--------------------------------------------------------------------------------

Oracle Database 11g Enterprise Edition Release 11.2.0.1.0 - 64bit Production

PL/SQL Release 11.2.0.1.0 - Production

CORE    11.2.0.1.0      Production

TNS for Linux: Version 11.2.0.1.0 - Production

NLSRTL Version 11.2.0.1.0 – Production
SQL> create table ta as select * from dba_objects;
Table created.
SQL> create index ind_id_null on ta(object_id);
Index created.
SQL> execute dbms_stats.gather_table_stats(ownname=>'SYS',tabname=>'TA');
PL/SQL procedure successfully completed.
SQL> select * from ta where object_id is null;
no rows selected
Execution Plan

----------------------------------------------------------

Plan hash value: 824468716
--------------------------------------------------------------------------

| Id  | Operation         | Name | Rows  | Bytes | Cost (%CPU)| Time     |

--------------------------------------------------------------------------

|   0 | SELECT STATEMENT  |      |     1 |   101 |   292   (1)| 00:00:04 |

|*  1 |  TABLE ACCESS FULL| TA   |     1 |   101 |   292   (1)| 00:00:04 |

--------------------------------------------------------------------------
Predicate Information (identified by operation id):

---------------------------------------------------
   1 - filter("OBJECT_ID" IS NULL)
Statistics

----------------------------------------------------------

         42  recursive calls

          0  db block gets

       1078  consistent gets

          0  physical reads

          0  redo size

       1343  bytes sent via SQL*Net to client

        509  bytes received via SQL*Net from client

          1  SQL*Net roundtrips to/from client

          1  sorts (memory)

          0  sorts (disk)

0	rows processed
这里看出cbo是不会走object_id列上的索引来避免排序和全表扫描。

SQL> select * from (select * from ta order by object_id desc) where rownum<10;
9 rows selected.
Execution Plan

----------------------------------------------------------

Plan hash value: 2218702745
----------------------------------------------------------------------------------------

| Id  | Operation               | Name | Rows  | Bytes |TempSpc| Cost (%CPU)| Time     |

----------------------------------------------------------------------------------------

|   0 | SELECT STATEMENT        |      |     9 |  1863 |       |  2025   (1)| 00:00:25 |

|*  1 |  COUNT STOPKEY          |      |       |       |       |            |          |

|   2 |   VIEW                  |      | 74906 |    14M|       |  2025   (1)| 00:00:25 |

|*  3 |    SORT ORDER BY STOPKEY|      | 74906 |  7388K|     9M|  2025   (1)| 00:00:25 |

|   4 |     TABLE ACCESS FULL   | TA   | 74906 |  7388K|       |   293   (1)| 00:00:04 |

----------------------------------------------------------------------------------------
Predicate Information (identified by operation id):

---------------------------------------------------
   1 - filter(ROWNUM<10)

   3 - filter(ROWNUM<10)
Statistics

----------------------------------------------------------

        164  recursive calls

          0  db block gets

       1101  consistent gets

          0  physical reads

          0  redo size

       2306  bytes sent via SQL*Net to client

        520  bytes received via SQL*Net from client

          2  SQL*Net roundtrips to/from client

          5  sorts (memory)

          0  sorts (disk)

          9  rows processed
那么这里有什么问题导致cbo不去考虑索引了，其实b tree索引存储的key是不能全部为null的，由于object_id列上没有not null的约束，而cbo的执行计划不能影响sql的执行结果，索引这里cbo没办法去认为通过索引回表，然后count stopkey取前几条来完成查询
而如果我们添加not null约束，或者在内部的查询结果中添加一个object_id is not null约束的过滤条件，那么此时cbo就知道了能够通过现在有的b tree索引回表的方式来完成查询

SQL> select * from (select * from ta where object_id is not null order by object_id desc) where rownum<10;
9 rows selected.
Execution Plan

----------------------------------------------------------

Plan hash value: 679434780
---------------------------------------------------------------------------------------------

| Id  | Operation                     | Name        | Rows  | Bytes | Cost (%CPU)| Time     |

---------------------------------------------------------------------------------------------

|   0 | SELECT STATEMENT              |             |     9 |  1863 |     3   (0)| 00:00:01 |

|*  1 |  COUNT STOPKEY                |             |       |       |            |          |

|   2 |   VIEW                        |             |     9 |  1863 |     3   (0)| 00:00:01 |

|   3 |    TABLE ACCESS BY INDEX ROWID| TA          | 74906 |  7388K|     3   (0)| 00:00:01 |

|*  4 |     INDEX FULL SCAN DESCENDING| IND_ID_NULL |     9 |       |     2   (0)| 00:00:01 |

---------------------------------------------------------------------------------------------
Predicate Information (identified by operation id):

---------------------------------------------------
   1 - filter(ROWNUM<10)

   4 - filter("OBJECT_ID" IS NOT NULL)
Statistics

----------------------------------------------------------

          1  recursive calls

          0  db block gets

          7  consistent gets

          0  physical reads

          0  redo size

       2306  bytes sent via SQL*Net to client

        520  bytes received via SQL*Net from client

          2  SQL*Net roundtrips to/from client

          0  sorts (memory)

          0  sorts (disk)

          9  rows processed
那么如果业务中有object_id等于null的值，那么这个查询可能会影响结果，而且oracle对于null值的排序正是认为null是最大值的。
那么这个分页查询如果没有not null约束或者过滤条件，就不能走索引了吗，其实不然，小鱼之前处理过下面的类似的case，是对单个的列进行is null的谓词过滤
SQL> create index ind_id_multi_null on ta(1,object_id);
Index created.
SQL> select /*+index(ta,ind_id_multi_null)*/* from ta where object_id is null;
no rows selected
Execution Plan

----------------------------------------------------------

Plan hash value: 849692407
-------------------------------------------------------------------------------------------------

| Id  | Operation                   | Name              | Rows  | Bytes | Cost (%CPU)| Time     |

-------------------------------------------------------------------------------------------------

|   0 | SELECT STATEMENT            |                   |     1 |   101 |   199   (1)| 00:00:03 |

|   1 |  TABLE ACCESS BY INDEX ROWID| TA                |     1 |   101 |   199   (1)| 00:00:03 |

|*  2 |   INDEX FULL SCAN           | IND_ID_MULTI_NULL |     1 |       |   199   (1)| 00:00:03 |

-------------------------------------------------------------------------------------------------
Predicate Information (identified by operation id):

---------------------------------------------------
   2 - access("OBJECT_ID" IS NULL)

       filter("OBJECT_ID" IS NULL)
Statistics

----------------------------------------------------------

          1  recursive calls

          0  db block gets

        198  consistent gets

        197  physical reads

          0  redo size

       1343  bytes sent via SQL*Net to client

        509  bytes received via SQL*Net from client

          1  SQL*Net roundtrips to/from client

          0  sorts (memory)

          0  sorts (disk)

0	rows processed
这个上面走的全索引扫描然后回表的方式来过滤的object_id is null的，这个是因为把索引的前导列弄错了导致的，如果我们建立下面的索引，把过滤列放在索引的前导列上

SQL> create index ind_id_nulti_null_bak on ta(object_id,1);
Index created.
SQL> select * from ta where object_id is null;
no rows selected
Execution Plan

----------------------------------------------------------

Plan hash value: 2610853831
-----------------------------------------------------------------------------------------------------

| Id  | Operation                   | Name                  | Rows  | Bytes | Cost (%CPU)| Time     |

-----------------------------------------------------------------------------------------------------

|   0 | SELECT STATEMENT            |                       |     1 |   101 |     1   (0)| 00:00:01 |

|   1 |  TABLE ACCESS BY INDEX ROWID| TA                    |     1 |   101 |     1   (0)| 00:00:01 |

|*  2 |   INDEX RANGE SCAN          | IND_ID_NULTI_NULL_BAK |     1 |       |     1   (0)| 00:00:01 |

-----------------------------------------------------------------------------------------------------
Predicate Information (identified by operation id):

---------------------------------------------------
   2 - access("OBJECT_ID" IS NULL)
Statistics

----------------------------------------------------------

          1  recursive calls

          0  db block gets

          2  consistent gets

          0  physical reads

          0  redo size

       1343  bytes sent via SQL*Net to client

        509  bytes received via SQL*Net from client

          1  SQL*Net roundtrips to/from client

          0  sorts (memory)

          0  sorts (disk)

0	rows processed
这个已经可以走这个复合索引的索引范围扫描了，那么最开始那个分页查询同样可以走全索引扫描，这个扫描只会扫描rownum分页数目的key然后回表，这个绝对比大表的全表扫描然后排序的成本要低很多。

SQL> select * from (select * from ta order by object_id desc) where rownum<10;
9 rows selected.
Execution Plan

----------------------------------------------------------

Plan hash value: 2361786208
-------------------------------------------------------------------------------------------------------

| Id  | Operation                     | Name                  | Rows  | Bytes | Cost (%CPU)| Time     |

-------------------------------------------------------------------------------------------------------

|   0 | SELECT STATEMENT              |                       |     9 |  1863 |     3   (0)| 00:00:01 |

|*  1 |  COUNT STOPKEY                |                       |       |       |            |          |

|   2 |   VIEW                        |                       |     9 |  1863 |     3   (0)| 00:00:01 |

|   3 |    TABLE ACCESS BY INDEX ROWID| TA                    | 74906 |  7388K|     3   (0)| 00:00:01 |

|   4 |     INDEX FULL SCAN DESCENDING| IND_ID_NULTI_NULL_BAK |     9 |       |     2   (0)| 00:00:01 |

-------------------------------------------------------------------------------------------------------
Predicate Information (identified by operation id):

---------------------------------------------------
   1 - filter(ROWNUM<10)
Statistics

----------------------------------------------------------

          1  recursive calls

          0  db block gets

          7  consistent gets

          0  physical reads

          0  redo size

       2306  bytes sent via SQL*Net to client

        520  bytes received via SQL*Net from client

          2  SQL*Net roundtrips to/from client

          0  sorts (memory)

          0  sorts (disk)

          9  rows processed
至此最开始那个分页查询我们已经优化完毕了。

这里有两点需要注意的地方： 1对于object_id is null这类过滤条件并不是不能走索引范围扫描的，我们只需要建立该列为前导列的复合索引就有可能让cbo考虑该索引 2还有就是分页查询要利用索引完成索引全扫描rownum分页数据的key然后回表的方式，一定要考虑该列是否有not null的约束或者过滤条件，这个可能造成部分分页查询无法通过索引完成。

原文地址：关于分页查询和column is null能否走索引的分析补充, 感谢原作者分享。

sql
linux

推荐阅读

mysql
新浪笔试题

1:有如下一段程序：packagea.b.c;publicclassTest{privatestaticinti0;publicintgetNext(){return ... [详细]

蜡笔小新 2024-12-27 19:32:17
json
Python配置文件读写指南

本文详细介绍如何使用Python进行配置文件的读写操作，涵盖常见的配置文件格式（如INI、JSON、TOML和YAML），并提供具体的代码示例。 ... [详细]

蜡笔小新 2024-12-28 08:39:55
sql
在 Linux 系统中部署 PostgreSQL 数据库

本文详细介绍了如何在 Linux 平台上安装和配置 PostgreSQL 数据库。通过访问官方资源并遵循特定的操作步骤，用户可以在不同发行版（如 Ubuntu 和 Red Hat）上顺利完成 PostgreSQL 的安装。 ... [详细]

蜡笔小新 2024-12-27 03:46:27
sql
使用arm-eabi-gdb调试Android C/C++应用程序的详细指南

本文详细介绍如何使用arm-eabi-gdb调试Android平台上的C/C++程序。通过具体步骤和实用技巧，帮助开发者更高效地进行调试工作。 ... [详细]

蜡笔小新 2024-12-28 10:25:18
sql
PyCharm下载与安装指南

本文详细介绍如何从官方渠道下载并安装PyCharm集成开发环境（IDE），涵盖Windows、macOS和Linux系统，同时提供详细的安装步骤及配置建议。 ... [详细]

蜡笔小新 2024-12-28 09:42:41
sql
深入解析 HDFS Federation：多命名空间架构详解

HDFS Federation 是一种扩展 HDFS 架构的方式，通过引入多个独立的 NameNode 来解决单点故障和性能瓶颈问题。本文将详细探讨 HDFS Federation 的工作原理、优势以及潜在挑战。 ... [详细]

蜡笔小新 2024-12-28 08:22:22
数据库
四载相伴，与51CTO学院共成长

在计算机技术的学习道路上，51CTO学院以其专业性和专注度给我留下了深刻印象。从2012年接触计算机到2014年开始系统学习网络技术和安全领域，51CTO学院始终是我信赖的学习平台。 ... [详细]

蜡笔小新 2024-12-28 08:20:07
数据库
信息安全小组第一周工作总结

本周信息安全小组主要进行了CTF竞赛相关技能的学习，包括HTML和CSS的基础知识、逆向工程的初步探索以及整数溢出漏洞的学习。此外，还掌握了Linux命令行操作及互联网工作原理的基本概念。 ... [详细]

蜡笔小新 2024-12-28 05:52:22
数据库
Linux 系统启动故障排除指南：MBR 和 GRUB 问题

本文详细介绍了 Linux 系统启动过程中常见的 MBR 扇区和 GRUB 引导程序故障及其解决方案，涵盖从备份、模拟故障到恢复的具体步骤。 ... [详细]

蜡笔小新 2024-12-27 20:40:29
数据库
配置并访问BackTrack 5的SSH服务

本文详细介绍了如何在BackTrack 5中配置和启动SSH服务，确保其正常运行，并通过Windows系统成功连接。涵盖了必要的密钥生成步骤及常见问题解决方法。 ... [详细]

蜡笔小新 2024-12-27 20:13:35
database
网络链路质量监控：Smokeping部署与配置

本文详细介绍了如何在Linux系统上安装和配置Smokeping，以实现对网络链路质量的实时监控。通过详细的步骤和必要的依赖包安装，确保用户能够顺利完成部署并优化其网络性能监控。 ... [详细]

蜡笔小新 2024-12-27 19:31:05
redis
Python 的 10 个开发技巧！太实用了

1.如何在运行状态查看源代码？查看函数的源代码，我们通常会使用IDE来完成。比如在PyCharm中，你可以Ctrl+鼠标点击进入函数的源代码。那如果没有IDE呢？当我们想使用一个函 ... [详细]

蜡笔小新 2024-12-27 18:36:54
mysql
CentOS7源码编译安装MySQL5.6

2019独角兽企业重金招聘Python工程师标准一、先在cmake官网下个最新的cmake源码包cmake官网：https:www.cmake.org如此时最新 ... [详细]

蜡笔小新 2024-12-27 17:49:56
redis
Dockerfile 编写与 Docker 网络配置详解

本文详细介绍了 Dockerfile 的编写方法及其在网络配置中的应用，涵盖基础指令、镜像构建与发布流程，并深入探讨了 Docker 的默认网络、容器互联及自定义网络的实现。 ... [详细]

蜡笔小新 2024-12-27 17:31:41
redis
掌握Linux：基础命令入门

本章节深入浅出地介绍了Linux系统中的基本命令操作，帮助读者快速上手并理解其核心功能。 ... [详细]

蜡笔小新 2024-12-27 17:15:39

超级a9厑厑

这个家伙很懒，什么也没留下！

Tags | 热门标签

RankList | 热门文章