- 1. 迁移Pod
- 1.1. 设置节点是否可调度
- 1.2. 执行kubectl drain命令
- 1.3. 特别说明
- 2. kubectl drain 流程图
- 3. TroubleShooting
- 4. kubectl drain
1. 迁移Pod
1.1. 设置节点是否可调度
确定需要迁移和被迁移的节点,将不允许被迁移的节点设置为不可调度。
# 查看节点kubectl get nodes# 设置节点为不可调度kubectl cordon <NodeName># 设置节点为可调度kubectl uncordon <NodeName>
1.2. 执行kubectl drain命令
kubectl drain <NodeName> --force --ignore-daemonsets
示例:
$ kubectl drain bjzw-prek8sredis-99-40 --force --ignore-daemonsetsnode "bjzw-prek8sredis-99-40" already cordonedWARNING: Deleting pods not managed by ReplicationController, ReplicaSet, Job, DaemonSet or StatefulSet: kube-proxy-bjzw-prek8sredis-99-40; Ignoring DaemonSet-managed pods: calicoopsmonitor-mfpqs, arachnia-agent-j56n8pod "pre-test-pro2-r-0-redis-2-8-19-1" evictedpod "pre-test-hwh1-r-8-redis-2-8-19-2" evictedpod "pre-eos-hdfs-vector-eos-hdfs-redis-2-8-19-0" evicted
1.3. 特别说明
对于statefulset创建的Pod,kubectl drain的说明如下:
kubectl drain操作会将相应节点上的旧Pod删除,并在可调度节点上面起一个对应的Pod。当旧Pod没有被正常删除的情况下,新Pod不会起来。例如:旧Pod一直处于Terminating状态。
对应的解决方式是通过重启相应节点的kubelet,或者强制删除该Pod。
示例:
# 重启发生`Terminating`节点的kubeletsystemctl restart kubelet# 强制删除`Terminating`状态的Podkubectl delete pod <PodName> --namespace=<Namespace> --force --grace-period=0
2. kubectl drain 流程图

3. TroubleShooting
1、存在不是通过ReplicationController, ReplicaSet, Job, DaemonSet 或者StatefulSet创建的Pod(即静态pod,通过文件方式创建的),所以需要设置强制执行的参数--force。
$ kubectl drain bjzw-prek8sredis-99-40node "bjzw-prek8sredis-99-40" already cordonederror: unable to drain node "bjzw-prek8sredis-99-40", aborting command...There are pending nodes to be drained:bjzw-prek8sredis-99-40error: DaemonSet-managed pods (use --ignore-daemonsets to ignore): calicoopsmonitor-mfpqs, arachnia-agent-j56n8; pods not managed by ReplicationController, ReplicaSet, Job, DaemonSet or StatefulSet (use --force to override): kube-proxy-bjzw-prek8sredis-99-40
2、存在DaemonSet方式管理的Pod,需要设置--ignore-daemonsets参数忽略报错。
$ kubectl drain bjzw-prek8sredis-99-40 --forcenode "bjzw-prek8sredis-99-40" already cordonederror: unable to drain node "bjzw-prek8sredis-99-40", aborting command...There are pending nodes to be drained:bjzw-prek8sredis-99-40error: DaemonSet-managed pods (use --ignore-daemonsets to ignore): calicoopsmonitor-mfpqs, arachnia-agent-j56n8
4. kubectl drain
$ kubectl drain --helpDrain node in preparation for maintenance.The given node will be marked unschedulable to prevent new pods from arriving. 'drain' evicts the pods if the APIServersupports eviction (http://kubernetes.io/docs/admin/disruptions/). Otherwise, it will use normal DELETE to delete thepods. The 'drain' evicts or deletes all pods except mirror pods (which cannot be deleted through the API server). Ifthere are DaemonSet-managed pods, drain will not proceed without --ignore-daemonsets, and regardless it will not deleteany DaemonSet-managed pods, because those pods would be immediately replaced by the DaemonSet controller, which ignoresunschedulable markings. If there are any pods that are neither mirror pods nor managed by ReplicationController,ReplicaSet, DaemonSet, StatefulSet or Job, then drain will not delete any pods unless you use --force. --force willalso allow deletion to proceed if the managing resource of one or more pods is missing.'drain' waits for graceful termination. You should not operate on the machine until the command completes.When you are ready to put the node back into service, use kubectl uncordon, which will make the node schedulable again.! http://kubernetes.io/images/docs/kubectl_drain.svgExamples:# Drain node "foo", even if there are pods not managed by a ReplicationController, ReplicaSet, Job, DaemonSet orStatefulSet on it.$ kubectl drain foo --force# As above, but abort if there are pods not managed by a ReplicationController, ReplicaSet, Job, DaemonSet orStatefulSet, and use a grace period of 15 minutes.$ kubectl drain foo --grace-period=900Options:--delete-local-data=false: Continue even if there are pods using emptyDir (local data that will be deleted whenthe node is drained).--dry-run=false: If true, only print the object that would be sent, without sending it.--force=false: Continue even if there are pods not managed by a ReplicationController, ReplicaSet, Job, DaemonSetor StatefulSet.--grace-period=-1: Period of time in seconds given to each pod to terminate gracefully. If negative, the defaultvalue specified in the pod will be used.--ignore-daemonsets=false: Ignore DaemonSet-managed pods.-l, --selector='': Selector (label query) to filter on--timeout=0s: The length of time to wait before giving up, zero means infiniteUsage:kubectl drain NODE [options]Use "kubectl options" for a list of global command-line options (applies to all commands).
参考文档:
- https://kubernetes.io/docs/tasks/administer-cluster/safely-drain-node/
- https://kubernetes.io/docs/tasks/run-application/configure-pdb/
- https://kubernetes.io/docs/reference/generated/kubectl/kubectl-commands#drain
