From what I understand, the Job object is supposed to reap pods after a certain amount of time. But on my GKE cluster (Kubernetes 1.1.8), it seems that "kubectl get pods -a" can list pods from days ago.
All were created using the Jobs API.
I did notice that after deleting the job with kubectl delete jobs, the pods were deleted too.
My main concern here is that I am going to run thousands or tens of thousands of pods on the cluster in batch jobs, and I don't want to overload the internal backlog system.
It's true that you used to have to delete jobs manually. @puja's answer was correct at the time of writing.
Kubernetes 1.12.0 introduced a TTL feature (in alpha) that automatically cleans up jobs a specified number of seconds after completion, via the ttlSecondsAfterFinished field (changelog). You can set it to zero for immediate cleanup. See the Jobs docs.
Example from the doc:
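It looks roughly like the following minimal sketch (the job name, image, and the 100-second TTL are just illustrative values):

apiVersion: batch/v1
kind: Job
metadata:
  name: pi-with-ttl
spec:
  # Delete the Job (and its pods) 100 seconds after it finishes.
  ttlSecondsAfterFinished: 100
  template:
    spec:
      containers:
      - name: pi
        image: perl
        command: ["perl", "-Mbignum=bpi", "-wle", "print bpi(2000)"]
      restartPolicy: Never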
This is the intended behaviour of Jobs even in Kubernetes 1.3. Both the job and its pods stay in the system until you delete them manually. This is to give you a way to see the results of the pods (e.g. through logs) if they were not already shipped elsewhere by some mechanism, and to check for errors, warnings, or other diagnostic output.
The recommended/official way to get rid of the pods is to delete the job as you mentioned above. Using the garbage collector would only delete the pods, but the job itself would still be in the system.
If you don't want to delete the job manually, you could write a little script that runs in your cluster, checks for completed jobs, and deletes them. Sadly, Scheduled Jobs are only coming in 1.4, but you could run the script in a normal pod instead, as sketched below.
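A minimal sketch of that approach (the pod name, image, service account, and jsonpath filter are assumptions; the service account needs RBAC permission to list and delete jobs):

apiVersion: v1
kind: Pod
metadata:
  name: job-cleanup
spec:
  # Assumed service account with permission to list and delete jobs.
  serviceAccountName: job-cleaner
  containers:
  - name: cleanup
    image: bitnami/kubectl   # any image that ships kubectl will do
    command:
    - /bin/sh
    - -c
    # Every 10 minutes, delete jobs whose status reports success;
    # deleting a job also removes its pods.
    - |
      while true; do
        succeeded=$(kubectl get jobs -o jsonpath='{.items[?(@.status.succeeded==1)].metadata.name}')
        [ -n "$succeeded" ] && kubectl delete jobs $succeeded
        sleep 600
      done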
It looks like starting with Kubernetes 1.6 (and the v2alpha1 API version), if you're using CronJobs to create the jobs (which, in turn, create your pods), you'll be able to limit how many old jobs are kept by adding history limits to the CronJob spec.
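Presumably the fields in question are successfulJobsHistoryLimit and failedJobsHistoryLimit, set roughly like this (a fragment, not a complete manifest; X and Y are placeholders):

spec:
  # How many finished jobs to retain (X and Y are placeholders).
  successfulJobsHistoryLimit: X
  failedJobsHistoryLimit: Y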
Here X and Y are the limits on how many previously run jobs the system should keep around (by default it keeps jobs around indefinitely [at least on version 1.5]).
Edit 2018-09-29:
For newer K8S versions, updated links with documentation for this are here:
https://kubernetes.io/docs/tasks/job/automated-tasks-with-cron-jobs/ (bottom of page)
https://kubernetes.io/docs/reference/generated/kubernetes-api/v1.10/#cronjob-v1beta1-batch
I recently built a Kubernetes operator to do this task.
After deployment it will monitor the selected namespace and delete completed jobs/pods if they completed without errors or restarts.
https://github.com/lwolf/kube-cleanup-operator
In Kubernetes v1.2, there is a garbage collector for reaping terminated pods, with a global threshold:
--terminated-pod-gc-threshold=12500
(see the flags in the controller manager). I am not aware of any GC mechanism for terminated pods in v1.1.8. You may want to run a script/pod to periodically clean up the pods/jobs to prevent the master components from being overwhelmed. By the way, there is an open issue to automatically adjust the GC threshold.
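For reference, on a self-managed control plane the flag is passed to kube-controller-manager, roughly like this (a sketch of a static pod manifest fragment; on GKE the master is managed, so this flag isn't adjustable there):

spec:
  containers:
  - name: kube-controller-manager
    command:
    - kube-controller-manager
    # Number of terminated pods allowed to exist before the GC starts deleting them.
    - --terminated-pod-gc-threshold=12500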