1. Hardware requirements
1. Master host: 2-core CPU, 4 GB RAM, 20 GB disk
2. Node hosts: 4+ core CPU, 8+ GB RAM, 40+ GB disk
3. Full network connectivity between all machines in the cluster
4. No duplicate hostnames, MAC addresses, or product_uuid values among the nodes
5. The required ports open on every machine
6. Swap must be disabled for the kubelet to work properly
- Add the cluster hosts entries to /etc/hosts on every node:
```
10.2.xx.215 gz-xx-gw-c7    # master
10.2.xx.128 gz-xx-node1-c7 # node
10.2.xx.246 gz-xx-node2-c7 # node
```
2. Server environment configuration
2.1 Disable the firewall (all nodes)
Stop the firewall and disable it at boot:
```
systemctl stop firewalld
systemctl disable firewalld
```
2.2 Disable the swap partition (all nodes)
Reboot the server after making the change for it to take effect.
```
swapoff -a
vim /etc/fstab   # permanently disable swap: delete or comment out the swap device mount line in /etc/fstab
#/dev/mapper/centos-swap swap swap defaults 0 0
```
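If you would rather not open the file in vim, a sed one-liner can comment out the swap entry instead; a minimal sketch (double-check /etc/fstab afterwards):

```
swapoff -a
# comment out any uncommented line that mounts a swap device
sed -ri 's/^([^#].*\bswap\b.*)$/#\1/' /etc/fstab
```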
2.3 CentOS 7 kernel upgrade (all nodes)
CentOS 7.9 ships with kernel 3.10 by default, and 3.10 has many bugs; the most common one is the cgroup memory leak (run the upgrade on every host).
1) Download the required kernel version. I install via rpm here, so I download the rpm package directly:
```
[root@localhost ~]# wget https://cbs.centos.org/kojifiles/packages/kernel/4.9.220/37.el7/x86_64/kernel-4.9.220-37.el7.x86_64.rpm
```
2) Run the rpm upgrade:
```
[root@localhost ~]# rpm -ivh kernel-4.9.220-37.el7.x86_64.rpm

# list the kernels available on the system and pick the boot entry
[root@gz-bjrd-devops-gw-c7 dd]# sudo awk -F\' '$1=="menuentry " {print i++ " : " $2}' /etc/grub2.cfg
0 : CentOS Linux (4.9.220-37.el7.x86_64) 7 (Core)
1 : CentOS Linux (3.10.0-1160.88.1.el7.x86_64) 7 (Core)
2 : CentOS Linux (3.10.0-1160.76.1.el7.x86_64) 7 (Core)
3 : CentOS Linux (3.10.0-1160.el7.x86_64) 7 (Core)
4 : CentOS Linux (0-rescue-1caefa67ba0d4c758d6742dfc455d487) 7 (Core)

# set the kernel version to boot by default
grub2-set-default 0   # or: grub2-set-default 'CentOS Linux (4.9.220-37.el7.x86_64) 7 (Core)'

# regenerate the grub config file
grub2-mkconfig -o /boot/grub2/grub.cfg
```
3) Reboot after the upgrade, then check whether the kernel was upgraded successfully. ################ You must reboot!
```
[root@localhost ~]# reboot

# after the reboot, verify
[root@k8s-master ~]# uname -a
Linux gz-xxs-gw-c7 4.9.220-37.el7.x86_64 #1 SMP Tue Apr 28 10:14:25 UTC 2020 x86_64 x86_64 x86_64 GNU/Linux
```
2.4 Set the hostnames (all nodes)
```
[root@k8s-master ~]# cat /etc/hosts
10.2.xx.215 gz-xx-gw-c7    # master
10.2.xx.128 gz-xx-node1-c7 # node
10.2.xx.246 gz-xx-node2-c7 # node
```
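The hosts file assumes each machine's own hostname is already set to the matching name; if it is not, hostnamectl will set it (the names below are the ones from the hosts file above):

```
hostnamectl set-hostname gz-xx-gw-c7      # run on the master
hostnamectl set-hostname gz-xx-node1-c7   # run on node1
hostnamectl set-hostname gz-xx-node2-c7   # run on node2
```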
2.5 Time synchronization (all nodes)
```
ntpdate cn.pool.ntp.org
```
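ntpdate does a one-shot sync, so the clocks will drift again over time. One option is re-running it from cron, sketched below on the assumption that ntpdate is installed at /usr/sbin/ntpdate:

```
cat <<'EOF' > /etc/cron.d/ntpdate-sync
# re-sync against cn.pool.ntp.org at the top of every hour
0 * * * * root /usr/sbin/ntpdate cn.pool.ntp.org >/dev/null 2>&1
EOF
```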
2.6 Configure iptables rules
Watch out for any rules you already have (be careful here, friends):
```
iptables -F && iptables -X && iptables -F -t nat && iptables -X -t nat && iptables -P FORWARD ACCEPT
```
Set the system parameters:
```
cat <<EOF > /etc/sysctl.d/k8s.conf
net.bridge.bridge-nf-call-ip6tables = 1
net.bridge.bridge-nf-call-iptables = 1
EOF
sysctl --system
```
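These bridge sysctls only exist once the br_netfilter kernel module is loaded, so load it now and on every boot (standard practice from the Kubernetes install docs):

```
modprobe br_netfilter
# make the module load automatically after a reboot
cat <<EOF > /etc/modules-load.d/k8s.conf
br_netfilter
EOF
```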
3. Install Docker (all nodes)
```
[root@gz-xx-gw-c7 ~]# yum-config-manager --add-repo https://download.docker.com/linux/centos/docker-ce.repo
[root@gz-xx-gw-c7 ~]# yum install docker-ce-23.0.6-1.el7 -y
[root@gz-xx-gw-c7 ~]# mkdir -pv /opt/docker
[root@gz-xx-gw-c7 ~]# cat /etc/docker/daemon.json
{
  "registry-mirrors": ["https://zd6lf0p4.mirror.aliyuncs.com"],
  "exec-opts": ["native.cgroupdriver=systemd"],
  "data-root": "/opt/docker"
}
[root@gz-xx-gw-c7 ~]# systemctl start docker
[root@gz-xx-gw-c7 ~]# systemctl enable docker
```
Configure the registry mirror and the cgroup driver
Create the file yourself if it does not exist:
```
[root@k8s-master ~]# cat /etc/docker/daemon.json
{
  "registry-mirrors": ["https://zd6lf0p4.mirror.aliyuncs.com"],
  "exec-opts": ["native.cgroupdriver=systemd"]
}

# "exec-opts": ["native.cgroupdriver=systemd"] sets how Docker drives cgroups; k8s also uses systemd, and the two sides must match
# load the configuration
systemctl restart docker
systemctl enable docker
```
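To verify that Docker actually picked up the systemd cgroup driver after the restart:

```
docker info 2>/dev/null | grep -i 'cgroup driver'
# expected output: Cgroup Driver: systemd
```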
4. Install cri-dockerd (all nodes)
```
# download the latest rpm package from https://github.com/Mirantis/cri-dockerd/releases, then upload it to the server
rpm -ivh cri-dockerd-0.3.1-3.el7.x86_64.rpm

# edit the ExecStart setting in /usr/lib/systemd/system/cri-docker.service
vim /usr/lib/systemd/system/cri-docker.service
ExecStart=/usr/bin/cri-dockerd --network-plugin=cni --pod-infra-container-image=registry.aliyuncs.com/google_containers/pause:3.7

systemctl daemon-reload
systemctl enable --now cri-docker
```
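A quick sanity check that cri-dockerd is up and exposing the CRI socket referenced later in this guide:

```
systemctl is-active cri-docker    # expected: active
ls -l /var/run/cri-dockerd.sock   # the socket kubeadm will be pointed at
```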
5. Install kubeadm, kubelet, and kubectl via yum (all nodes)
Configure the yum repo:
```
vim /etc/yum.repos.d/kubernetes.repo
[kubernetes]
name=Kubernetes
baseurl=https://mirrors.aliyun.com/kubernetes/yum/repos/kubernetes-el7-x86_64/
enabled=1
gpgcheck=1
repo_gpgcheck=1
gpgkey=https://mirrors.aliyun.com/kubernetes/yum/doc/yum-key.gpg https://mirrors.aliyun.com/kubernetes/yum/doc/rpm-package-key.gpg

yum install -y kubelet kubeadm kubectl
systemctl enable kubelet --now

# pull the images on the master (master only)
kubeadm config images pull --kubernetes-version=v1.27.3 --image-repository registry.aliyuncs.com/google_containers --cri-socket unix:///var/run/cri-dockerd.sock
```
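Note that the plain `yum install -y kubelet kubeadm kubectl` pulls whatever is newest in the repo; to get exactly the v1.27.3 used throughout this guide, the versions can be pinned. A sketch:

```
yum install -y kubelet-1.27.3 kubeadm-1.27.3 kubectl-1.27.3
```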
6. Initialize the control plane on the master (master node)
```
kubeadm init \
  --apiserver-advertise-address=10.2.xx.215 \
  --image-repository registry.aliyuncs.com/google_containers \
  --kubernetes-version v1.27.3 \
  --service-cidr=172.18x.0.0/12 \
  --pod-network-cidr=172.17x.0.0/16 \
  --cri-socket unix:///var/run/cri-dockerd.sock \
  --ignore-preflight-errors=all
```
Output like the following will appear:
```
Your Kubernetes control-plane has initialized successfully!

To start using your cluster, you need to run the following as a regular user:

  mkdir -p $HOME/.kube
  sudo cp -i /etc/kubernetes/admin.conf $HOME/.kube/config
  sudo chown $(id -u):$(id -g) $HOME/.kube/config

Alternatively, if you are the root user, you can run:

  export KUBECONFIG=/etc/kubernetes/admin.conf

You should now deploy a pod network to the cluster.
Run "kubectl apply -f [podnetwork].yaml" with one of the options listed at:
  https://kubernetes.io/docs/concepts/cluster-administration/addons/

Then you can join any number of worker nodes by running the following on each as root:

kubeadm join 10.2.xx.215:6443 --token 4eozgp.xm7tfxxxxxxxxxz42y \
    --discovery-token-ca-cert-hash sha256:66bfxxxxxxxxxxxxxxxx8ac68cbc927e86789a9e4e8183365ded688a1
```
- Run on the master node:
```
mkdir -p $HOME/.kube
sudo cp -i /etc/kubernetes/admin.conf $HOME/.kube/config
sudo chown $(id -u):$(id -g) $HOME/.kube/config
```
- Run on the worker nodes:
```
kubeadm join 10.2.xx.215:6443 --token 4eozgp.xm7tfxxxxxxxxxz42y \
    --discovery-token-ca-cert-hash sha256:66bfxxxxxxxxxxxxxxxx8ac68cbc927e86789a9e4e8183365ded688a1
```
As the init output suggests, add nodes with this join command; run it on each node you want to join, not on the master.
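One caveat with cri-dockerd setups (my own note, not part of the original output): Docker's bundled containerd exposes a second CRI socket, so kubeadm join may abort and ask you to choose a runtime. If that happens, append the same --cri-socket flag used during init:

```
kubeadm join 10.2.xx.215:6443 --token 4eozgp.xm7tfxxxxxxxxxz42y \
    --discovery-token-ca-cert-hash sha256:66bfxxxxxxxxxxxxxxxx8ac68cbc927e86789a9e4e8183365ded688a1 \
    --cri-socket unix:///var/run/cri-dockerd.sock   # same socket as kubeadm init
```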
- Check on the master node:
```
[root@gz-bjrd-devops-gw-c7 ~]# kubectl get node
NAME             STATUS     ROLES           AGE   VERSION
gz-xx-gw-c7      NotReady   control-plane   22h   v1.27.3
gz-xx-node1-c7   NotReady   <none>          21h   v1.27.3
gz-xx-node2-c7   NotReady   <none>          21h   v1.27.3
```
Don't panic: the nodes will turn Ready once the network plugin is in place.
7. Install the network plugin (flannel)
https://github.com/flannel-io...
```
[root@gz-bjrd-devops-gw-c7 ~]# cat flannel.yml
apiVersion: v1
kind: Namespace
metadata:
  labels:
    k8s-app: flannel
    pod-security.kubernetes.io/enforce: privileged
  name: kube-flannel
---
apiVersion: v1
kind: ServiceAccount
metadata:
  labels:
    k8s-app: flannel
  name: flannel
  namespace: kube-flannel
---
apiVersion: rbac.authorization.k8s.io/v1
kind: ClusterRole
metadata:
  labels:
    k8s-app: flannel
  name: flannel
rules:
- apiGroups:
  - ""
  resources:
  - pods
  verbs:
  - get
- apiGroups:
  - ""
  resources:
  - nodes
  verbs:
  - get
  - list
  - watch
- apiGroups:
  - ""
  resources:
  - nodes/status
  verbs:
  - patch
- apiGroups:
  - networking.k8s.io
  resources:
  - clustercidrs
  verbs:
  - list
  - watch
---
apiVersion: rbac.authorization.k8s.io/v1
kind: ClusterRoleBinding
metadata:
  labels:
    k8s-app: flannel
  name: flannel
roleRef:
  apiGroup: rbac.authorization.k8s.io
  kind: ClusterRole
  name: flannel
subjects:
- kind: ServiceAccount
  name: flannel
  namespace: kube-flannel
---
apiVersion: v1
data:
  cni-conf.json: |
    {
      "name": "cbr0",
      "cniVersion": "0.3.1",
      "plugins": [
        {
          "type": "flannel",
          "delegate": {
            "hairpinMode": true,
            "isDefaultGateway": true
          }
        },
        {
          "type": "portmap",
          "capabilities": {
            "portMappings": true
          }
        }
      ]
    }
  net-conf.json: |
    {
      "Network": "172.17x.0.0/16",   # change this to your own --pod-network-cidr
      "Backend": {
        "Type": "vxlan"
      }
    }
kind: ConfigMap
metadata:
  labels:
    app: flannel
    k8s-app: flannel
    tier: node
  name: kube-flannel-cfg
  namespace: kube-flannel
---
apiVersion: apps/v1
kind: DaemonSet
metadata:
  labels:
    app: flannel
    k8s-app: flannel
    tier: node
  name: kube-flannel-ds
  namespace: kube-flannel
spec:
  selector:
    matchLabels:
      app: flannel
      k8s-app: flannel
  template:
    metadata:
      labels:
        app: flannel
        k8s-app: flannel
        tier: node
    spec:
      affinity:
        nodeAffinity:
          requiredDuringSchedulingIgnoredDuringExecution:
            nodeSelectorTerms:
            - matchExpressions:
              - key: kubernetes.io/os
                operator: In
                values:
                - linux
      containers:
      - args:
        - --ip-masq
        - --kube-subnet-mgr
        command:
        - /opt/bin/flanneld
        env:
        - name: POD_NAME
          valueFrom:
            fieldRef:
              fieldPath: metadata.name
        - name: POD_NAMESPACE
          valueFrom:
            fieldRef:
              fieldPath: metadata.namespace
        - name: EVENT_QUEUE_DEPTH
          value: "5000"
        image: docker.io/flannel/flannel:v0.21.5
        name: kube-flannel
        resources:
          requests:
            cpu: 100m
            memory: 50Mi
        securityContext:
          capabilities:
            add:
            - NET_ADMIN
            - NET_RAW
          privileged: false
        volumeMounts:
        - mountPath: /run/flannel
          name: run
        - mountPath: /etc/kube-flannel/
          name: flannel-cfg
        - mountPath: /run/xtables.lock
          name: xtables-lock
      hostNetwork: true
      initContainers:
      - args:
        - -f
        - /flannel
        - /opt/cni/bin/flannel
        command:
        - cp
        image: docker.io/flannel/flannel-cni-plugin:v1.1.2
        name: install-cni-plugin
        volumeMounts:
        - mountPath: /opt/cni/bin
          name: cni-plugin
      - args:
        - -f
        - /etc/kube-flannel/cni-conf.json
        - /etc/cni/net.d/10-flannel.conflist
        command:
        - cp
        image: docker.io/flannel/flannel:v0.21.5
        name: install-cni
        volumeMounts:
        - mountPath: /etc/cni/net.d
          name: cni
        - mountPath: /etc/kube-flannel/
          name: flannel-cfg
      priorityClassName: system-node-critical
      serviceAccountName: flannel
      tolerations:
      - effect: NoSchedule
        operator: Exists
      volumes:
      - hostPath:
          path: /run/flannel
        name: run
      - hostPath:
          path: /opt/cni/bin
        name: cni-plugin
      - hostPath:
          path: /etc/cni/net.d
        name: cni
      - configMap:
          name: kube-flannel-cfg
        name: flannel-cfg
      - hostPath:
          path: /run/xtables.lock
          type: FileOrCreate
        name: xtables-lock
```

Then apply it:

```
kubectl apply -f flannel.yml
```
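After applying the manifest, you can watch flannel come up and the nodes flip to Ready:

```
kubectl get pods -n kube-flannel -o wide   # expect one kube-flannel-ds pod per node, Running
kubectl get node                           # STATUS should move from NotReady to Ready
```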
Test
You can run a simple test:
```
[root@k8s-master /data/yaml]# cat nginx.yaml
apiVersion: apps/v1
kind: Deployment
metadata:
  name: nginx-deploy
  labels:
    app: ceshi
spec:
  replicas: 2
  selector:
    matchLabels:
      school: qinghua
  template:
    metadata:
      name: nginx-pod
      labels:
        school: qinghua
    spec:
      containers:
      - name: nginx-web
        image: nginx:1.20.2
        ports:
        - containerPort: 80
---
apiVersion: v1
kind: Service
metadata:
  name: nginx-service
  labels:
    role: leader
spec:
  type: NodePort
  selector:
    school: qinghua   # restored to match the pod labels; the Selector field in the describe output below confirms it
  ports:
  - port: 8888
    targetPort: 80
    nodePort: 30000
```
```
kubectl apply -f nginx.yaml
[root@k8s-master /data/yaml]# kubectl get pods
NAME                            READY   STATUS    RESTARTS   AGE
nginx-deploy-6659dbd7c4-kldxj   1/1     Running   0          79m
nginx-deploy-6659dbd7c4-qgr4v   1/1     Running   0          79m
```
The first image pull is slow; once the pods show Running after a while, it's done.
```
# check the Service status; if Endpoints has IPs, it should be fine
[root@k8s-master /data/yaml]# kubectl describe svc nginx-service
Name:                     nginx-service
Namespace:                default
Labels:                   role=leader
Annotations:              <none>
Selector:                 school=qinghua
Type:                     NodePort
IP Family Policy:         SingleStack
IP Families:              IPv4
IP:                       10.2.2xx.219
IPs:                      10.2.2xx.219
Port:                     <unset>  8888/TCP
TargetPort:               80/TCP
NodePort:                 <unset>  30000/TCP
Endpoints:                172.17.1.12:80,172.117.1.13:80
Session Affinity:         None
External Traffic Policy:  Cluster
Events:                   <none>
```
Note: since k8s 1.24, a NodePort (which uses a host port) no longer shows up as a listening port on the node, e.g. in ss -lntup; just access it from a browser via node IP plus port.
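A quick way to confirm the NodePort works even though nothing shows in ss (node IP masked here as elsewhere in this guide):

```
curl -s http://10.2.xx.128:30000 | head -n 5   # should print the start of the nginx welcome page
```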
Troubleshooting
Error registering network: failed to acquire lease: node "caasfaasslave1.XXXXXX.local" pod cidr not assigned
- Check on the master:
The node did not get a podCIDR. I hit this too: even though the manifest on the master showed the cluster CIDR was set, it still didn't work, and flannel kept running in CrashLoopBackOff.
```
[root@gz-xxs-gw-c7 ~]# sudo cat /etc/kubernetes/manifests/kube-controller-manager.yaml | grep -i cluster-cidr
    - --cluster-cidr=172.17x.0.0/16

kubectl patch node gz-xx-node2-c7 -p '{"spec":{"podCIDR":"172.17x.0.0/16"}}'
kubectl patch node gz-xx-node1-c7 -p '{"spec":{"podCIDR":"172.17x.0.0/16"}}'
```
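After patching, it is worth confirming that each node now carries a podCIDR; a small check using custom-columns:

```
kubectl get nodes -o custom-columns=NAME:.metadata.name,PODCIDR:.spec.podCIDR
```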
https://stackoverflow.com/que...