
Building a Highly Available Docker Cluster with Docker Swarm

Author: mobiledu2502902687 | Source: Internet | 2023-08-20 16:21


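Project: a highly available Docker cluster built on Docker Swarm
- 1. Network topology diagram
- Network data-flow diagram
- 2. Project environment
- 3. Project description
- 4. Project steps
  - 1. Plan the architecture and network topology of the cluster, install seven CentOS 7.6 machines, configure each machine's IP according to the plan, prepare the Docker environment, and build the swarm cluster
  - 2. Serve the web content from a volume (a directory mounted from the NFS server) so that all containers use the same data
  - 3. Compile and install Nginx with an install script, and use Nginx for load balancing
  - 4. Use Keepalived with two VIPs for high availability
  - 5. Install Prometheus and install exporters on the monitored machines to provide monitoring
  - 6. Add Grafana, a polished and powerful tool for visualizing monitoring metrics
- 5. Lessons learned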

Project: a highly available Docker cluster built on Docker Swarm

1. Network topology diagram

(Image not available in the source.)

Network data-flow diagram

(Image not available in the source.)

2. Project environment

Docker 20.10.8, CentOS 7.6 (7 machines, 1 core and 1 GB of RAM each), Nginx 1.19.7, Prometheus 2.29.1, Grafana 8.1.2, Keepalived, NFS.

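3. Project description

Build a highly available, load-balanced web server cluster. The back end is a Docker cluster managed by Swarm that serves the web content, relying on containers for scalability and high availability. Prometheus monitors the whole cluster so that problems affecting the service are noticed early.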

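4. Project steps

1. Plan the architecture and network topology of the cluster, install seven CentOS 7.6 machines, configure each machine's IP according to the plan, prepare the Docker environment, and build the swarm cluster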

1. Create the swarm cluster

    [root@centos-7 ~]# docker swarm init --advertise-addr 192.168.0.101
    Swarm initialized: current node (wxfmm8k75qxwey2fufk204ivv) is now a manager.

    To add a worker to this swarm, run the following command:

        docker swarm join --token SWMTKN-1-3sqv9hho99m8z686tspko5c5dn3pmk6h02p5zscduh3eq2nkm5-1h1g2xndxeit74aa2vy5304jo 192.168.0.101:2377

    To add a manager to this swarm, run 'docker swarm join-token manager' and follow the instructions.

The docker swarm join command printed here is how nodes are added. Save the token produced by the initialization: a node presents it as the shared secret when it joins.

2. Add node hosts to the swarm cluster (further nodes are added in exactly the same way)

    [root@work_3 ~]# docker swarm join --token SWMTKN-1-3sqv9hho99m8z686tspko5c5dn3pmk6h02p5zscduh3eq2nkm5-1h1g2xndxeit74aa2vy5304jo 192.168.0.101:2377
    This node joined a swarm as a worker.
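The steps above initialize the swarm with a single manager (192.168.0.101), so the manager itself remains a single point of failure. Swarm managers keep their state with Raft, which needs a majority of managers to stay reachable. The sketch below only does that arithmetic; the function name `manager_fault_tolerance` is hypothetical and not part of Docker.

```shell
#!/bin/bash
# Number of manager failures a swarm can survive.
# Raft needs a majority of managers, so N managers tolerate (N - 1) / 2 losses.
manager_fault_tolerance() {
    local managers=$1
    echo $(( (managers - 1) / 2 ))
}

# Compare a few common manager counts.
for n in 1 3 5; do
    echo "$n manager(s): tolerates $(manager_fault_tolerance "$n") failure(s)"
done
```

With one manager no failure is tolerated, while three managers survive the loss of one. If that matters for the cluster, `docker node ls` on the manager lists the nodes and `docker node promote <node>` turns a worker into an additional manager.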

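2. Serve the web content from a volume (a directory mounted from the NFS server) so that all containers use the same data

1. Share the directory: edit /etc/exports and list the directory to share together with its permissions

    [root@u-nfs ~]# vim /etc/exports
    /web 192.168.0.0/24(rw,all_squash,sync)

2. Create the /web directory named in that file and put the web content in it
3. Refresh the list of exported directories:

    exportfs -rv

4. Create the service on the manager machine (note that the machines in the swarm cluster also need NFS installed):

    docker service create -d --name nfs-web --mount 'type=volume,source=nfsvolume,target=/usr/share/nginx/html,volume-driver=local,volume-opt=type=nfs,volume-opt=device=:/web,"volume-opt=o=addr=192.168.0.100,rw,nfsvers=4,async"' --replicas 10 -p 8089:80 nginx:latest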
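The `--mount` value above packs several settings into one comma-separated string, which is easy to mistype. As a rough sketch, a small helper can assemble it from the three values that actually vary; the function name `nfs_mount_spec` and its argument order are assumptions made for illustration, and the volume name `nfsvolume` is taken from the command above.

```shell
#!/bin/bash
# Build the --mount argument for an NFS-backed named volume.
# Arguments: NFS server address, exported path, mount point inside the container.
nfs_mount_spec() {
    local server=$1 export_path=$2 target=$3
    # The inner double quotes keep the comma-separated NFS options
    # together as a single field of the mount specification.
    printf 'type=volume,source=nfsvolume,target=%s,volume-driver=local,volume-opt=type=nfs,volume-opt=device=:%s,"volume-opt=o=addr=%s,rw,nfsvers=4,async"\n' \
        "$target" "$export_path" "$server"
}

# Print the specification used in this project.
nfs_mount_spec 192.168.0.100 /web /usr/share/nginx/html
```

The result can then be passed as `--mount "$(nfs_mount_spec 192.168.0.100 /web /usr/share/nginx/html)"` in the `docker service create` command.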

3. Compile and install Nginx with an install script, and use Nginx for load balancing

Load balancing: spreading user requests evenly across the back-end machines that actually provide the service.
Load balancer: a machine that performs load balancing.
1. Write the script

    #!/bin/bash

    # Install the packages the build depends on
    yum -y install zlib zlib-devel openssl openssl-devel pcre pcre-devel gcc gcc-c++ autoconf automake make psmisc net-tools lsof vim wget

    # Create the sanchuang user and group if they do not exist yet
    id sanchuang || useradd sanchuang -s /sbin/nologin

    # Download the nginx source
    mkdir /sanchuang99 -p
    cd /sanchuang99
    wget http://nginx.org/download/nginx-1.21.1.tar.gz

    # Unpack the source and enter the extracted directory
    tar xf nginx-1.21.1.tar.gz
    cd nginx-1.21.1

    # Configure the build
    ./configure --prefix=/usr/local/scsanchuang99 --user=sanchuang --group=sanchuang --with-http_ssl_module --with-threads --with-http_v2_module --with-http_stub_status_module --with-stream

    # Stop the script with an error status if configure failed
    if (( $? != 0 )); then
        exit 1
    fi

    # Compile
    make -j 2
    # Install
    make install

    # Add the nginx sbin directory to PATH (single quotes keep $PATH unexpanded in .bashrc)
    echo 'PATH=$PATH:/usr/local/scsanchuang99/sbin' >>/root/.bashrc
    # Load the modified environment
    source /root/.bashrc

    # Stop firewalld now and keep it from starting at the next boot
    service firewalld stop
    systemctl disable firewalld

    # Disable SELinux for the running system and permanently
    setenforce 0
    sed -i '/^SELINUX=/ s/enforcing/disabled/' /etc/selinux/config

    # Start nginx at boot
    chmod +x /etc/rc.d/rc.local
    echo "/usr/local/scsanchuang99/sbin/nginx" >>/etc/rc.local

2. Run the install script

    [root@load-balancer ~]# bash onekey_install_shediao_nginx_v10.sh

Start a new login shell so that the modified PATH is loaded

    [root@load-balancer ~]# su - root

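3. Configure Nginx load balancing
cd /usr/local/scsanchuang99/ changes to the directory given as the install prefix when nginx was compiled

    [root@load-balancer scsanchuang99]# ls
    client_body_temp  conf  fastcgi_temp  html  logs  proxy_temp  sbin  scgi_temp  uwsgi_temp

cd conf/ changes to the configuration directory

    [root@load-balancer conf]# ls
    fastcgi.conf          fastcgi_params.default  mime.types          nginx.conf.default   uwsgi_params
    fastcgi.conf.default  koi-utf                 mime.types.default  scgi_params          uwsgi_params.default
    fastcgi_params        koi-win                 nginx.conf          scgi_params.default  win-utf

nginx.conf is the nginx configuration file.
Edit the configuration file:

    [root@load-balancer conf]# vim nginx.conf

    http {
        upstream xuweb {                  # define a load-balancing group named xuweb
            server 192.168.0.101:8089;
            server 192.168.0.102:8089;
            server 192.168.0.97:8089;
        }
        server {
            listen 80;                    # listen on port 80
            server_name www.sc.com;       # serve the www.sc.com domain
            location / {
                proxy_pass http://xuweb;  # hand requests to the group; the name must match the upstream block
            }
            ..... (many settings omitted)
        }
    }

nginx -s reload reloads the configuration file. The effect is like restarting the nginx service, except that nginx starts new worker processes with the new configuration and shuts the old ones down gracefully.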
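Before reloading, it can help to check the edited file for syntax errors. The sketch below wraps the two commands in one function; the name `safe_reload` and the `NGINX_BIN` variable are assumptions of this example, with the default path following the `--prefix` used by the install script.

```shell
#!/bin/bash
# Reload nginx only if the edited configuration passes the syntax check.
# NGINX_BIN is an assumption: it follows the --prefix from the install script.
NGINX_BIN="${NGINX_BIN:-/usr/local/scsanchuang99/sbin/nginx}"

safe_reload() {
    # "nginx -t" parses the configuration and reports errors
    # without affecting the running server.
    if "$NGINX_BIN" -t; then
        "$NGINX_BIN" -s reload
    else
        echo "configuration test failed; nginx was not reloaded" >&2
        return 1
    fi
}
```

Calling `safe_reload` after editing nginx.conf then either reloads the server or leaves it running on the previous configuration and returns a non-zero status.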

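4. Use Keepalived with two VIPs for high availability

Single point: a place in the architecture where there is only one server.
Single point of failure: a server whose failure makes the whole cluster misbehave.
The way to remove or guard against a single point of failure is high availability.
High availability: when one machine has a problem, another takes over and the cluster as a whole keeps running normally.
keepalived is software that implements high availability.

1. Install and configure
Install Keepalived on the two load balancers that already run Nginx
yum install keepalived -y

2. Edit keepalived.conf and add the VIPs and related settings

    cd /etc/keepalived/
    vim keepalived.conf

The configuration file explained in detail:

    vrrp_instance VI_1 {          # start a VRRP instance; VI_1 is the instance name and can be chosen freely
        state MASTER              # this node's role is master
        interface ens33           # interface on which VRRP is spoken and to which the VIP is bound
        virtual_router_id 105     # virtual router ID (the "group"), 1 to 255; both nodes of a pair use the same value
        priority 120              # priority, 1 to 255; the highest priority wins
        advert_int 1              # advertisement interval: 1 second
        authentication {          # authentication
            auth_type PASS        # the authentication type is a password
            auth_pass 11112222    # the password itself; change it as needed
        }
        virtual_ipaddress {       # the VIP configuration; several IPs may be listed
            192.168.200.16
            192.168.200.17
            192.168.200.18
        }
    }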

Each load balancer is MASTER for one VIP and BACKUP for the other, so both carry traffic in normal operation and either one can take over both VIPs.

Full configuration on cent-nginx-bl:

    ! Configuration File for keepalived

    global_defs {
       notification_email {
         acassen@firewall.loc
         failover@firewall.loc
         sysadmin@firewall.loc
       }
       notification_email_from Alexandre.Cassen@firewall.loc
       smtp_server 192.168.200.1
       smtp_connect_timeout 30
       router_id LVS_DEVEL
       vrrp_skip_check_adv_addr
       #vrrp_strict
       vrrp_garp_interval 0
       vrrp_gna_interval 0
    }

    vrrp_instance VI_1 {
        state MASTER
        interface ens33
        virtual_router_id 108
        priority 200
        advert_int 1
        authentication {
            auth_type PASS
            auth_pass 1111
        }
        virtual_ipaddress {
            192.168.0.108
        }
    }

    vrrp_instance VI_2 {
        state BACKUP
        interface ens33
        virtual_router_id 109
        priority 100
        advert_int 1
        authentication {
            auth_type PASS
            auth_pass 1111
        }
        virtual_ipaddress {
            192.168.0.109
        }
    }

Full configuration on cent-keepalived-bl:

    ! Configuration File for keepalived

    global_defs {
       notification_email {
         acassen@firewall.loc
         failover@firewall.loc
         sysadmin@firewall.loc
       }
       notification_email_from Alexandre.Cassen@firewall.loc
       smtp_server 192.168.200.1
       smtp_connect_timeout 30
       router_id LVS_DEVEL
       vrrp_skip_check_adv_addr
       #vrrp_strict
       vrrp_garp_interval 0
       vrrp_gna_interval 0
    }

    vrrp_instance VI_1 {
        state BACKUP
        interface ens33
        virtual_router_id 108
        priority 100
        advert_int 1
        authentication {
            auth_type PASS
            auth_pass 1111
        }
        virtual_ipaddress {
            192.168.0.108
        }
    }

    vrrp_instance VI_2 {
        state MASTER
        interface ens33
        virtual_router_id 109
        priority 200
        advert_int 1
        authentication {
            auth_type PASS
            auth_pass 1111
        }
        virtual_ipaddress {
            192.168.0.109
        }
    }
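With the two configurations above, a VIP moves only when keepalived itself or the whole host stops. If nginx dies while keepalived keeps running, the VIP stays on a node that can no longer serve requests. One common remedy is a health-check script that keepalived runs periodically. The following is a minimal sketch, assuming nginx writes its pid to `logs/nginx.pid` under the install prefix (nginx's compiled-in default); the function name `nginx_alive` and the `PID_FILE` variable are hypothetical.

```shell
#!/bin/bash
# Health check for keepalived: succeed only while the nginx master process exists.
# PID_FILE is an assumption: logs/nginx.pid under the --prefix is nginx's default.
PID_FILE="${PID_FILE:-/usr/local/scsanchuang99/logs/nginx.pid}"

nginx_alive() {
    local pid
    # No readable pid file means nginx is not running (or was never started).
    [ -r "$PID_FILE" ] || return 1
    pid=$(cat "$PID_FILE")
    # kill -0 delivers no signal; it only tests whether the process exists.
    [ -n "$pid" ] && kill -0 "$pid" 2>/dev/null
}
```

Saved as a script whose last line calls `nginx_alive`, this could be declared in keepalived.conf with a `vrrp_script` block (its `script` and `interval` settings) and attached to each `vrrp_instance` through `track_script`, so that a failing check lets the other load balancer take over the VIP. The script path and the check interval would have to be chosen for the actual environment.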

5. Install Prometheus and install exporters on the monitored machines to provide monitoring

1. Install Prometheus

    [root@prometheus ~]# rz
    [root@prometheus ~]# ls
    prometheus-2.29.1.linux-amd64.tar.gz
    [root@prometheus ~]# mkdir /prometheus
    [root@prometheus ~]# mv prometheus-2.29.1.linux-amd64.tar.gz /prometheus/
    [root@prometheus ~]# cd /prometheus
    [root@prometheus prometheus]# tar xf prometheus-2.29.1.linux-amd64.tar.gz
    # add the directory to PATH for the current shell only
    [root@prometheus prometheus]# PATH=$PATH:/prometheus/prometheus-2.29.1.linux-amd64
    [root@prometheus prometheus]# which prometheus
    /prometheus/prometheus-2.29.1.linux-amd64/prometheus

Add the install path to the PATH environment variable permanently

    [root@prometheus ~]# vim /root/.bashrc
    PATH=$PATH:/prometheus/prometheus-2.29.1.linux-amd64

prometheus is the server binary.
prometheus.yml is the configuration file.
Start prometheus

    [root@prometheus prometheus-2.29.1.linux-amd64]# ./prometheus --config.file=prometheus.yml
    level=info ts=2021-08-25T09:23:53.236Z caller=main.go:390 msg="No time or size retention was set so using the default time retention" duration=15d
    level=info ts=2021-08-25T09:23:53.237Z caller=main.go:428 msg="Starting Prometheus" version="(version=2.29.1, branch=HEAD, revision=dcb07e8eac34b5ea37cd229545000b857f1c1637)"
    level=info ts=2021-08-25T09:23:53.237Z caller=main.go:433 build_context="(go=go1.16.7, user=root@364730518a4e, date=20210811-14:48:27)"

Start prometheus in the background

    [root@prometheus prometheus-2.29.1.linux-amd64]# nohup ./prometheus --config.file=/prometheus/prometheus-2.29.1.linux-amd64/prometheus.yml &

2. Install an exporter on the monitored servers

An exporter is the client-side program of Prometheus and is installed on each monitored server. An exporter can be written for a specific need, but the Prometheus ecosystem already provides many general-purpose and specialized exporters.
The exporter collects the chosen metrics on the monitored server, for example CPU usage, memory usage, disk usage and network bandwidth usage.

Upload the downloaded node_exporter-1.2.2.linux-amd64.tar.gz to the monitored server

    [root@cent7-manage ~]# rz
    [root@cent7-manage ~]# ls
    anaconda-ks.cfg  getting-started-master  echo.sh  getting-started-master.zip  node_exporter-1.2.2.linux-amd64.tar.gz  sc-ubuntu2.tar
    [root@cent7-manage ~]# mkdir /exporter
    [root@cent7-manage ~]# mv node_exporter-1.2.2.linux-amd64.tar.gz /exporter/
    [root@cent7-manage ~]# cd /exporter/
    [root@cent7-manage exporter]#

Unpack the software

    [root@cent7-manage exporter]# tar xf node_exporter-1.2.2.linux-amd64.tar.gz
    [root@cent7-manage exporter]# ls
    node_exporter-1.2.2.linux-amd64  node_exporter-1.2.2.linux-amd64.tar.gz
    [root@cent7-manage exporter]# cd node_exporter-1.2.2.linux-amd64
    [root@cent7-manage node_exporter-1.2.2.linux-amd64]# ls
    LICENSE  node_exporter  NOTICE

Run the software

    [root@cent7-manage node_exporter-1.2.2.linux-amd64]# ./node_exporter --help
    [root@cent7-manage node_exporter-1.2.2.linux-amd64]# nohup ./node_exporter --web.listen-address="0.0.0.0:9100" &
    [1] 96546
    [root@cent7-manage node_exporter-1.2.2.linux-amd64]# nohup: ignoring input and appending output to 'nohup.out'

Check the process

    [root@cent7-manage node_exporter-1.2.2.linux-amd64]# ps aux|grep node
    root 96546 0.1 0.2 716440 10996 pts/1 Sl 10:38 0:00 ./node_exporter --web.listen-address=0.0.0.0:9100
    root 96551 0.0 0.0 12348 1144 pts/1 S+ 10:38 0:00 grep --color=auto node

Modify the PATH environment variable
# temporary change

    [root@cent7-manage node_exporter-1.2.2.linux-amd64]# PATH=/exporter/node_exporter-1.2.2.linux-amd64:$PATH
    [root@cent7-manage node_exporter-1.2.2.linux-amd64]# which node_exporter
    /exporter/node_exporter-1.2.2.linux-amd64/node_exporter

# permanent change: add the PATH line at the end of /root/.bashrc

    [root@cent7-manage node_exporter-1.2.2.linux-amd64]# vim /root/.bashrc
    PATH=/exporter/node_exporter-1.2.2.linux-amd64:$PATH

The Prometheus server fetches the node's metrics from this URL:

http://192.168.0.101:9100/metrics
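The metrics endpoint returns plain text, one `name value` pair per line, with `# HELP` and `# TYPE` comment lines in between. As a quick sanity check from the shell, a single metric can be picked out with awk. This is a small sketch: the function name `metric_value` is hypothetical, the sample values are made up, and the function only handles metrics without labels.

```shell
#!/bin/bash
# Print the value of an unlabelled metric from Prometheus text-format input on stdin.
# Comment lines start with '#', so their first field never equals the metric name.
metric_value() {
    awk -v name="$1" '$1 == name { print $2; exit }'
}

# Sample lines in the exposition format (values invented for illustration).
sample='# HELP node_load1 1m load average.
# TYPE node_load1 gauge
node_load1 0.21
node_load5 0.17'

printf '%s\n' "$sample" | metric_value node_load1   # prints 0.21
```

Against a live exporter the same function could be fed from `curl -s http://192.168.0.101:9100/metrics | metric_value node_load1`.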

3. Add the monitored server to the Prometheus server
Do this on the server

    [root@prometheus prometheus-2.29.1.linux-amd64]# cd /prometheus/prometheus-2.29.1.linux-amd64
    [root@prometheus prometheus-2.29.1.linux-amd64]# vim prometheus.yml

    scrape_configs:
      # The job name is added as a label `job=<job_name>` to any timeseries scraped from this config.
      - job_name: "prometheus"

        # metrics_path defaults to '/metrics'
        # scheme defaults to 'http'.

        static_configs:
          - targets: ["localhost:9090"]

      # add the servers to be monitored
      - job_name: "swarm-manager"
        static_configs:
          - targets: ["192.168.0.101:9100"]

Restart the Prometheus service. There is no dedicated restart script, so this is done by hand:
kill the old process first and then start a new one, which loads the configuration file again.

    [root@prometheus prometheus-2.29.1.linux-amd64]# ps aux|grep prome
    root 2160 0.1 6.3 912304 63172 pts/2 Sl 10:06 0:07 ./prometheus --config.file=/prometheus/prometheus-2.29.1.linux-amd64/prometheus.yml
    root 2265 0.0 0.0 112824 980 pts/2 S+ 11:14 0:00 grep --color=auto prome

kill -9 2160 kills the process

Start the program again

    [root@prometheus prometheus-2.29.1.linux-amd64]# nohup prometheus --config.file=/prometheus/prometheus-2.29.1.linux-amd64/prometheus.yml &
    [1] 2276
    [root@prometheus prometheus-2.29.1.linux-amd64]# nohup: ignoring input and appending output to "nohup.out"
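As an alternative to killing and restarting the process, Prometheus re-reads its configuration when it receives SIGHUP, and the `promtool` binary shipped in the same tarball can validate the file first. The sketch below combines the two; the function name `reload_prometheus` and the `PROMTOOL` variable are assumptions of this example, with the default path following the install directory used above.

```shell
#!/bin/bash
# Validate the configuration, then ask the running server to re-read it.
# PROMTOOL is an assumption: promtool sits next to prometheus in the tarball.
PROMTOOL="${PROMTOOL:-/prometheus/prometheus-2.29.1.linux-amd64/promtool}"

# Arguments: path to prometheus.yml, pid of the running prometheus process.
reload_prometheus() {
    local config=$1 pid=$2
    # Refuse to reload when the configuration file does not validate.
    "$PROMTOOL" check config "$config" || return 1
    # SIGHUP makes Prometheus reload its configuration without exiting.
    kill -HUP "$pid"
}
```

With the process shown in the `ps` output above, the call would be `reload_prometheus /prometheus/prometheus-2.29.1.linux-amd64/prometheus.yml 2160`. Scraping continues during the reload, and an invalid file never reaches the running server.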

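6. Add Grafana, a polished and powerful tool for visualizing monitoring metrics

Grafana is an open-source application written in Go, used mainly to visualize large volumes of metric data. It is one of the most popular tools for displaying time-series data in network architecture and application analysis, and it supports most of the commonly used time-series databases. The best reference is the official site (http://docs.grafana.org/).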

1. Install

    [root@u-nfs yum.repos.d]# vim grafana.repo
    [root@u-nfs yum.repos.d]# cat grafana.repo
    [grafana]
    name=grafana
    baseurl=https://packages.grafana.com/enterprise/rpm
    repo_gpgcheck=1
    enabled=1
    gpgcheck=1
    gpgkey=https://packages.grafana.com/gpg.key
    sslverify=1
    sslcacert=/etc/pki/tls/certs/ca-bundle.crt

    [root@u-nfs yum.repos.d]# yum install grafana -y

Start:

    [root@u-nfs yum.repos.d]# systemctl start grafana-server

Check the process

    [root@u-nfs yum.repos.d]# ps aux|grep grafana
    root 42897 0.0 0.0 169308 756 ? Ss 11:31 0:00 gpg-agent --homedir /var/cache/dnf/grafana-ee12c6ab2813e349/pubring --use-standard-socket --daemon
    grafana 43438 3.6 4.3 1229004 80164 ? Ssl 11:34 0:01 /usr/sbin/grafana-server --config=/etc/grafana/grafana.ini --pidfile=/var/run/grafana/grafana-server.pid --packaging=rpm cfg:default.paths.logs=/var/log/grafana cfg:default.paths.data=/var/lib/grafana cfg:default.paths.plugins=/var/lib/grafana/plugins cfg:default.paths.provisioning=/etc/grafana/provisioning
    root 43490 0.0 0.0 12324 1060 pts/1 S+ 11:34 0:00 grep --color=auto grafana

Check the port

    ss -anplut|grep grafana
    tcp LISTEN 0 128 *:3000 *:* users:(("grafana-server",pid=43438,fd=8))

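Open it in a web browser

http://192.168.0.100:3000
Web login:
the default username and password are both admin

Add panels (metrics queried with PromQL) and Grafana draws the graphs for us. Adding panels by hand runs into two problems:
1. We are not familiar with which PromQL expression corresponds to many of the metrics
2. When there are many metrics to monitor, the work becomes tedious

Grafana has templates that already contain many important panels, so we can simply import one. A Grafana template is essentially a JSON file.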
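To give a feel for the PromQL behind such panels, the sketch below builds one widely used expression, per-instance CPU usage in percent derived from node_exporter's `node_cpu_seconds_total` counter, and shows how it could be sent to the Prometheus HTTP API. The function names, the `PROM_URL` variable and the default server address are assumptions of this example.

```shell
#!/bin/bash
# PromQL for per-instance CPU usage in percent, averaged over a time window:
# 100 minus the share of time the CPUs spent idle.
cpu_usage_query() {
    local window=${1:-5m}
    printf '100 - (avg by (instance) (rate(node_cpu_seconds_total{mode="idle"}[%s])) * 100)\n' "$window"
}

# Run an instant query against the Prometheus HTTP API.
# PROM_URL is an assumption; point it at the Prometheus server.
query_prometheus() {
    local server=${PROM_URL:-http://localhost:9090}
    curl -sG "$server/api/v1/query" --data-urlencode "query=$1"
}

# Print the expression for a 5-minute window.
cpu_usage_query 5m
```

The printed expression can be pasted into a Grafana panel or the Prometheus web UI, and `query_prometheus "$(cpu_usage_query 5m)"` would return the same data as JSON.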

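5. Lessons learned

1. Planning the architecture of the whole cluster in advance makes the work more efficient and keeps the overall picture clear.
2. Adding several records for the cluster's domain name to the local hosts file, as a stand-in for DNS-based balancing, had little visible effect; putting a load balancer in front achieves proper round-robin distribution.
3. The project gave me a deeper understanding of Docker and related technology. A Docker-based cluster is quicker and more convenient to set up than a traditional cluster, and Docker's built-in high availability and load balancing work well.
4. The experiments trained my attention to detail and my troubleshooting skills.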
