
Use containerd cri plugin from init #79

Open · wants to merge 4 commits into master
Conversation

detiber commented May 4, 2018

This PR attempts to address #74

The kernel, init, and runc image updates are a bit extraneous for this changeset, but the containerd update is required.

kubelet changes:

  • update to latest stable k8s release
  • update to latest cri tools
  • onboot hack to create /var/lib/cni/{bin,conf}
    • removing the cri-containerd image meant these directories no longer exist, and containerd appears to process the runtime mounts before the mkdir entries run.
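The onboot hack boils down to pre-creating the CNI directories before containerd processes its runtime mounts. A minimal sketch of the script such an onboot container would run (the exact linuxkit image wrapping and paths used in the PR are assumptions here):

```shell
#!/bin/sh
# Ensure the CNI plugin and config directories exist before containerd
# sets up runtime mounts; cri-containerd previously created these itself.
mkdir -p /var/lib/cni/bin /var/lib/cni/conf
```

In a linuxkit yml this would typically be an `onboot` entry, so it runs to completion before the containerd service starts.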

I also introduced the files necessary to deploy calico networking rather than weave, since weave currently does not work with these changes. The issue appears to be with the way the weave cni plugin uses nsenter.

detiber added 4 commits May 4, 2018 09:48
- update kernel, init, runc, and containerd images

Signed-off-by: Jason DeTiberus <[email protected]>
- rev k8s version to v1.10.2
- rev critools version

Signed-off-by: Jason DeTiberus <[email protected]>
- remove cri-containerd package
- kubelet container mounting hack
  - previously cri-containerd created the /var/lib/cni/{bin,conf}
    directories and containerd appears to process runtime mounts
    before mkdir entries

Signed-off-by: Jason DeTiberus <[email protected]>
Signed-off-by: Jason DeTiberus <[email protected]>
detiber commented May 4, 2018

Currently all of the cri rtf tests are failing due to flakiness starting the static pod manifests. There appears to be some flakiness in the interaction between the kubelet and containerd for pods that fail to come up and should be restarted; I'm seeing errors similar to:

I0504 14:43:30.680271     638 kuberuntime_manager.go:513] Container {Name:kube-scheduler Image:k8s.gcr.io/kube-scheduler-amd64:v1.10.2 Command:[kube-scheduler --address=127.0.0.1 --leader-elect=true --kubeconfig=/etc/kubernetes/scheduler.conf] Args:[] WorkingDir: Ports:[] EnvFrom:[] Env:[] Resources:{Limits:map[] Requests:map[cpu:{i:{value:100 scale:-3} d:{Dec:<nil>} s:100m Format:DecimalSI}]} VolumeMounts:[{Name:kubeconfig ReadOnly:true MountPath:/etc/kubernetes/scheduler.conf SubPath: MountPropagation:<nil>}] VolumeDevices:[] LivenessProbe:&Probe{Handler:Handler{Exec:nil,HTTPGet:&HTTPGetAction{Path:/healthz,Port:10251,Host:127.0.0.1,Scheme:HTTP,HTTPHeaders:[],},TCPSocket:nil,},InitialDelaySeconds:15,TimeoutSeconds:15,PeriodSeconds:10,SuccessThreshold:1,FailureThreshold:8,} ReadinessProbe:nil Lifecycle:nil TerminationMessagePath:/dev/termination-log TerminationMessagePolicy:File ImagePullPolicy:IfNotPresent SecurityContext:nil Stdin:false StdinOnce:false TTY:false} is dead, but RestartPolicy says that we should restart it.
linuxkit-b2a001ae29c0:/# Connection to localhost closed by remote host.

I suspect this might be similar to the flakes mentioned in #64.

After a bit of digging, I suspect that containerd/cri#733 might be coming into play.
