Elasticsearch实践一：初识

快速搭建测试环境

window下部署ELK&＃xff08;6.2.2系列&＃xff09;

基础知识

基本认知

索引(index)&＃xff1a;类似于传统关系数据库中的一个数据库; 复数词为 indices 或 indexes 类型(type):类似于传统关系数据库中的一个表文档(docuemnt):类似于传统关系数据库中的表中的一行数据倒排索引&＃xff1a; 关系型数据库通过增加一个索引比如一个 B树&＃xff08;B-tree&＃xff09;索引到指定的列上&＃xff0c;以便提升数据检索速度。Elasticsearch 和 Lucene 使用了一个叫做倒排索引的结构来达到相同的目的。默认的&＃xff0c;一个文档中的每一个属性都是被索引的&＃xff08;有一个倒排索引&＃xff09;和可搜索的。一个没有倒排索引的属性是不能被搜索到的**Elasticsearch 开放的restful api接口通信&＃xff0c;所以理论任何语言都可以接入**

分析和过滤器认知

# 字符过滤器首先&＃xff0c;字符串按顺序通过每个字符过滤器。他们的任务是在分词前整理字符串。一个字符过滤器可以用来去掉HTML&＃xff0c;或者将 & 转化成 &＃96;and&＃96;。 # 分词器(分析器) 其次&＃xff0c;字符串被分词器分为单个的词条或词语。一个简单的分词器遇到空格和标点的时候&＃xff0c;可能会将文本拆分成词条。 # Token 过滤器最后&＃xff0c;词条按顺序通过每个 token 过滤器。这个过程可能会改变词条&＃xff08;例如&＃xff0c;小写化 Quick &＃xff09;&＃xff0c;删除词条&＃xff08;例如&＃xff0c; 像 a&＃96;&＃xff0c; &＃96;and&＃96;&＃xff0c; &＃96;the 等无用词&＃xff09;&＃xff0c;或者增加词条&＃xff08;例如&＃xff0c;像 jump 和 leap 这种同义词&＃xff09;。 Elasticsearch提供了开箱即用的字符过滤器、分词器和token过滤器

Curl请求

curl -i -X &＃39;://:/?&＃39; -d &＃39;&＃39; -i: 打印返回的头部信息 VERB: HTTP方法 : &＃96;GET&＃96;、 &＃96;POST&＃96;、 &＃96;PUT&＃96;、 &＃96;HEAD 或者 &＃96;DELETE&＃96; -d: 请求提内容

学习笔记

创建索引

# 创建megacorp索引&＃xff0c;分配3封主分配&＃xff0c;每个主分片分配1个副本&＃xff0c;这里总的是6个分片&＃xff08;3主3副&＃xff09;, 后面章节我们会简单介绍es集群知识 PUT /megacorp {"settings" : {"number_of_shards" : 3,"number_of_replicas" : 1} }

直接提供id方式索引一个文档

# PUT 索引/类型/id PUT /megacorp/employee/1 {"first_name" : "John","last_name" : "Smith","age" : 25,"about" : "I love to go rock climbing","interests": [ "sports", "music" ] }PUT /megacorp/employee/2 {"first_name" : "Jane","last_name" : "Smith","age" : 32,"about" : "I like to collect rock albums","interests": [ "music" ] }PUT /megacorp/employee/3 {"first_name" : "Douglas","last_name" : "Fir","age" : 35,"about": "I like to build cabinets","interests": [ "forestry" ] }

基本的查询

# GET 索引/类型/id GET /megacorp/employee/1# GET 索引/类型/_search GET /megacorp/employee/_search# 增加搜索参数q GET /megacorp/employee/_search?q&＃61;last_name:Smith# _all表示所有索引里查询 # 查询在 tweet 类型中 tweet 字段包含 elasticsearch 单词的所有文档 GET /_all/employee/_search?q&＃61;tweet:elasticsearch# name 字段中包含 john 并且在 tweet 字段中包含 mary 的文档 GET /_all/employee/_search?q&＃61;&＃43;name:john &＃43;tweet:mary # /_search?q&＃61;%2Bname%3Ajohn&＃43;%2Btweet%3Amary# &＃xff08;name 字段中包含 mary 或者 john &＃xff09;&＃xff08;date 值大于 2014-09-10&＃xff09; &＃xff08;_all_ 字段包含 aggregations 或者 geo&＃xff09; GET /_all/employee/_search?q&＃61;&＃43;name:(mary john) &＃43;date:>2014-09-10 &＃43;(aggregations geo)# 搜索返回包含 mary 的所有文档 GET /_search?q&＃61;mary# 查询表达式查询&＃xff08;查询体&＃xff09; GET /megacorp/employee/_search {"query" : {"match" : {"about" : "rock albums"}},"highlight": {"fields" : {"about" : {}}} }# 查询索引下的类型所有和细节 GET /megacorp/_mapping?pretty&＃61;true