数据库索引如何使搜索更快-Howdodatabaseindicesmakesearchfaster

作者：魔帝君 | 来源：互联网 | 2023-08-01 16:54

Iwasreadingthroughrailstutorial(http:ruby.railstutorial.orgbookruby-on-rails-tutorial#side

I was reading through rails tutorial (http://ruby.railstutorial.org/book/ruby-on-rails-tutorial#sidebar-database_indices) but confused about the explanation of database indicies, basically the author proposes that rather then searching O(n) time through the a list of emails (for login) its much faster to create an index, giving the following example:

我正在阅读rails教程(http://ruby.railstutorial.org/book/ruby-on-rails-tutorial#sidebar-database_indices),但对数据库指标的解释感到困惑,基本上作者提出的不是搜索O( n)通过电子邮件列表(登录)的时间,创建索引要快得多,给出以下示例:

To understand a database index, it’s helpful to consider the analogy of a book index. In a book, to find all the occurrences of a given string, say “foobar”, you would have to scan each page for “foobar”. With a book index, on the other hand, you can just look up “foobar” in the index to see all the pages containing “foobar”. source: http://ruby.railstutorial.org/chapters/modeling-users#sidebar:database_indices**

要理解数据库索引,考虑书籍索引的类比是有帮助的。在一本书中,为了找到给定字符串的所有出现,比如说“foobar”,你必须扫描每一页的“foobar”。另一方面,使用书籍索引,您只需在索引中查找“foobar”即可查看包含“foobar”的所有页面。来源:http://ruby.railstutorial.org/chapters/modeling-users#sidebar:database_indices**

So what I understand from that example is that words can be repeated in text, so the "index page" consists of unique entries. However, in the railstutorial site, the login is set such that each email address is unique to an account, so how does having an index make it faster when we can have at most one occurrence of each email?

所以我从这个例子中理解的是,单词可以在文本中重复,因此“索引页面”由唯一条目组成。但是,在railstutorial网站中,登录设置为每个电子邮件地址对于一个帐户是唯一的,那么当我们每个电子邮件最多只出现一次时,如何使索引更快?

Thanks

3 个解决方案

#1

Indexing isn't (much) about duplicates. It's about order.

索引不是(很多)关于重复。这是关于订单。

When you do a search, you want to have some kind of order that lets you (for example) do a binary search to find the data in logarithmic time instead of searching through every record to find the one(s) you care about (that's not the only type of index, but it's probably the most common).

当你进行搜索时,你希望有某种顺序让你(例如)进行二进制搜索,以对数时间查找数据,而不是搜索每条记录以找到你关心的那些(这是不是唯一的索引类型,但它可能是最常见的)。

Unfortunately, you can only arrange the records themselves in a single order.

不幸的是,您只能在一个订单中自行安排记录。

An index contains just the data (or a subset of it) that you're going to use to search on, and pointers (or some sort) to the records containing the actual data. This allows you to (for example) do searches based on as many different fields as you care about, and still be able to do binary searching on all of them, because each index is arranged in order by that field.

索引仅包含您要用于搜索的数据(或其子集),以及包含实际数据的记录的指针(或某种类型)。这允许您(例如)基于您关心的多个不同字段进行搜索,并且仍然能够对所有字段进行二进制搜索,因为每个索引按该字段按顺序排列。

#2

Because the index in the DB and in the given example is sorted alphabetically. The raw table / book is not. Then think: How do you search an index knowing it is sorted? I guess you don't start reading at "A" up to the point of your interest. Instead you skip roughly to the POI and start searching from there. Basically a DB can to the same with an index.

因为DB和给定示例中的索引按字母顺序排序。原始表/书不是。然后想一想:你如何搜索已知排序的索引?我想你不会开始阅读“A”,直到你感兴趣的程度。相反,你大致跳过POI并从那里开始搜索。基本上DB可以与索引相同。

#3

It is faster because the index contains only values from the column in question, so it is spread across a smaller number of pages than the full table. Also, indexes usually include additional optimizations such as hash tables to limit the number of reads required.

它更快,因为索引仅包含来自相关列的值,因此它分布在比完整表少的页面上。此外,索引通常还包括其他优化,例如哈希表,以限制所需的读取次数。

推荐阅读

byte
delphi控件大全

本文章已收录于：delphi控件查询：http:www.torry.nethttp:www.jrsoftware.orgTb97最有名的工具条(ToolBar) ... [详细]

蜡笔小新 2024-09-30 11:49:36
java
为什么不能用datatables来添加在数据库中查到的数据

尝试在数据库中查询数据并在datatables中异步显示时总是报错。有人帮我看下吗，好像是这个json的格式出问题，我看了firebug，应该是servlet返回的json数据格式问题，但因为新 ... [详细]

蜡笔小新 2024-09-29 18:34:31
java
【Zabbix4.2学习笔记】1、CentOS7.5安装zabbix4.2

1、关闭防火墙和selinux#systemctlstopfirewalld#vimetcselinuxconfigSELINUXpermissive#setenforce02、添加zabbix存储库rpm-Uvhh ... [详细]

蜡笔小新 2024-09-29 14:19:49
java
oracle text db2,从Oracle 到DB2（一）

在实际的软件项目的开发过程中，特别是在企业的应用系统集成(EAI)项目中广大开发人员经常遇到不同关系型数据库之间的数据移植问题。笔者根据自己在工作中的不同数据库数据移 ... [详细]

蜡笔小新 2024-09-28 10:56:59
java
Java多线程编程实战精要(1)

在Java程序中使用多线程要比在C或C++中容易得多，这是因为Java编程语言提供了语言级的支持。为什么会排队等待?下面的这个简单的Java程序完成四项不相关的任 ... [详细]

蜡笔小新 2024-09-26 19:44:06
input
POJ2253(floyd)

FroggerTimeLimit:1000MSMemoryLimit:65536KTotalSubmissions:32257Accepted:10396DescriptionFr ... [详细]

蜡笔小新 2024-09-30 20:13:09
input
ETC 纹理压缩和 Alpha 通道处理

转自：http:malideveloper.arm.comcndevelop-for-malisample-codeetcv1-texture-compression-and-alpha- ... [详细]

蜡笔小新 2024-09-30 20:00:46
join
Java如何快速定位无效字符,mybatis的报错…ORA00911: 无效字符,该怎么解决

mybatis的报错……ORA-00911:无效字符xml里的配置resultTypejava.lang.Stringselectt.sfzhfromt_ldrktandt. ... [详细]

蜡笔小新 2024-09-30 14:45:30
go
开发笔记:sql盲注之报错注入(附自动化脚本)

篇首语：本文由编程笔记#小编为大家整理，主要介绍了sql盲注之报错注入(附自动化脚本)相关的知识，希望对你有一定的参考价值。 ... [详细]

蜡笔小新 2024-09-30 12:32:17
byte
C#学习教程：使用RSACryptoServiceProvider进行公钥加密分享

使用RSACryptoServiceProvider进行公钥加密我已经在CodeProject上发表了一篇文章，解释了如何使用RSA提供程序进行加密和解密：RSA私钥加密虽然200 ... [详细]

蜡笔小新 2024-09-29 18:06:38
go
记一次ssh免密登录踩坑and Debug之路

突然觉得服务器ssh密码登录总是浪费一定量的时间，就想试试用sshKey进行登录。生成服务器sshkey和本地sshkey$ssh-keygen在服务器上生成一个authorize ... [详细]

蜡笔小新 2024-09-28 16:45:48
go
在Windows应用程序中模拟会话 - Simulating session in a Windows app

Iamworkingonawindowsapplication.IneedtosimulateSession(thatwehaveinawebapp)inthe ... [详细]

蜡笔小新 2024-09-28 08:17:27
byte
java – 将带有二进制数据的byte []转换为String

我有二进制格式的数据(十六进制：803bc8870a89),我需要将其转换为字符串,以便通过Jackcess在MSAccess数据库中保存二进制数据.我知道,我不认为在Java中使用 ... [详细]

蜡笔小新 2024-09-27 18:50:34
instance
javax.swing.Action.addPropertyChangeListener()方法的使用及代码示例

本文整理了Java中javax.swing.Action.addPropertyChangeListener()方法的一些代码示例，展示了Action.ad ... [详细]

蜡笔小新 2024-09-26 16:30:30
instance
Mysql安装和初步使用

2019独角兽企业重金招聘Python工程师标准一、安装1、下载及安装：官网：https:downloads.mysql.comarchivesc ... [详细]

蜡笔小新 2024-09-26 15:56:42

魔帝君

这个家伙很懒，什么也没留下！

Tags | 热门标签

RankList | 热门文章