Hey, so our testing Rancher instance just burned up… -_- I'm trying to figure out what happened.
Here are some details:
It's a single-node Rancher installation through Docker. We are using the self-signed certs that Rancher and Kubernetes create. We have the server on
We are getting these errors when I run docker logs on the Rancher Docker container:
E0326 17:24:15.341812 6 reflector.go:134] k8s.io/client-go/informers/factory.go:127: Failed to list *v1.StorageClass: Get https://localhost:6443/apis/storage.k8s.io/v1/storageclasses?limit=500&resourceVersion=0: x509: certificate has expired or is not yet valid
2020-03-26 17:24:15.341983 I | http: TLS handshake error from remote error: tls: bad certificate
E0326 17:24:15.342834 6 reflector.go:134] k8s.io/kubernetes/cmd/kube-scheduler/app/server.go:178: Failed to list *v1.Pod: Get https://localhost:6443/api/v1/pods?fieldSelector=status.phase!%3DFailed%2Cstatus.phase!%3DSucceeded&limit=500&resourceVersion=0: x509: certificate has expired or is not yet valid
2020-03-26 17:24:15.342887 I | http: TLS handshake error from remote error: tls: bad certificate
2020-03-26 17:24:15.343833 I | http: TLS handshake error from remote error: tls: bad certificate
E0326 17:24:15.343885 6 reflector.go:134] k8s.io/client-go/informers/factory.go:127: Failed to list *v1.PersistentVolume: Get https://localhost:6443/api/v1/persistentvolumes?limit=500&resourceVersion=0: x509: certificate has expired or is not yet valid
E0326 17:24:15.350151 6 reflector.go:134] k8s.io/client-go/informers/factory.go:127: Failed to list *v1.PersistentVolumeClaim: Get https://localhost:6443/api/v1/persistentvolumeclaims?limit=500&resourceVersion=0: x509: certificate has expired or is not yet valid
2020-03-26 17:24:15.350252 I | http: TLS handshake error from remote error: tls: bad certificate
2020-03-26 17:24:15.356600 I | http: TLS handshake error from remote error: tls: bad certificate
E0326 17:24:15.358639 6 reflector.go:134] k8s.io/client-go/informers/factory.go:127: Failed to list *v1.Service: Get https://localhost:6443/api/v1/services?limit=500&resourceVersion=0: x509: certificate has expired or is not yet valid
E0326 17:24:15.365366 6 reflector.go:134] k8s.io/client-go/informers/factory.go:127: Failed to list *v1.StatefulSet: Get https://localhost:6443/apis/apps/v1/statefulsets?limit=500&resourceVersion=0: x509: certificate has expired or is not yet valid
2020-03-26 17:24:15.367842 I | http: TLS handshake error from remote error: tls: bad certificate
2020-03-26 17:24:15.368033 I | http: TLS handshake error from remote error: tls: bad certificate
E0326 17:24:15.368080 6 reflector.go:134] k8s.io/client-go/informers/factory.go:127: Failed to list *v1.ReplicaSet: Get https://localhost:6443/apis/apps/v1/replicasets?limit=500&resourceVersion=0: x509: certificate has expired or is not yet valid
2020-03-26 17:24:15.374517 I | http: TLS handshake error from remote error: tls: bad certificate
E0326 17:24:15.374564 6 reflector.go:134] k8s.io/client-go/informers/factory.go:127: Failed to list *v1beta1.PodDisruptionBudget: Get https://localhost:6443/apis/policy/v1beta1/poddisruptionbudgets?limit=500&resourceVersion=0: x509: certificate has expired or is not yet valid
2020-03-26 17:24:15.376002 I | http: TLS handshake error from remote error: tls: bad certificate
E0326 17:24:15.376042 6 reflector.go:134] k8s.io/client-go/informers/factory.go:127: Failed to list *v1.ReplicationController: Get https://localhost:6443/api/v1/replicationcontrollers?limit=500&resourceVersion=0: x509: certificate has expired or is not yet valid
2020-03-26 17:24:15.376960 I | http: TLS handshake error from remote error: tls: bad certificate
E0326 17:24:15.377020 6 reflector.go:134] k8s.io/client-go/informers/factory.go:127: Failed to list *v1.Node: Get https://localhost:6443/api/v1/nodes?limit=500&resourceVersion=0: x509: certificate has expired or is not yet valid
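The x509 message above covers two different cases: the certificate's notAfter date has passed, or the machine's clock is earlier than its notBefore date. A minimal sketch for telling them apart, assuming the dates are copied by hand from the output of `openssl x509 -noout -dates` for whichever certificate the API server on port 6443 presents. The function name and the example dates are made up for illustration and are not taken from this installation:

```python
import ssl
import time


def classify_validity(not_before, not_after, now=None):
    """Classify a certificate validity window against a point in time.

    not_before / not_after use the format printed by
    `openssl x509 -noout -dates`, e.g. "Mar 26 17:00:00 2020 GMT".
    now is seconds since the epoch; defaults to the current system time.
    """
    if now is None:
        now = time.time()
    start = ssl.cert_time_to_seconds(not_before)
    end = ssl.cert_time_to_seconds(not_after)
    if now < start:
        return "not yet valid"  # clock is behind, or cert was issued for the future
    if now > end:
        return "expired"        # certificate needs to be renewed
    return "valid"


# Hypothetical dates, for illustration only. The reference time is the log
# timestamp from the errors above, assumed here to be GMT.
log_time = ssl.cert_time_to_seconds("Mar 26 17:24:15 2020 GMT")
print(classify_validity("Mar 25 10:00:00 2019 GMT",
                        "Mar 24 10:00:00 2020 GMT",
                        now=log_time))
```

If this reports "not yet valid", the host clock is worth checking before touching the certificates.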
When looking at the Rancher agents, I get this:
ERROR: is not accessible (Failed to connect to port 7443: Connection refused)
INFO: Arguments: --server --token REDACTED --ca-checksum 6a2f0c412cdd4499b5ace7ac81407d616a197b7b1a6ee0ad5e8412140d8ccc62 --no-register --only-write-certs
INFO: Using resolv.conf: nameserver
ERROR: is not accessible (Failed to connect to port 7443: Connection refused)
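"Connection refused" on port 7443 means nothing accepted the TCP connection at all, which is a different failure from a TLS error. A small sketch for re-checking reachability from the agent host, assuming Python is available there. The function name is made up for illustration, and the hostname in the usage comment is a placeholder, since the real server address is stripped from the output above:

```python
import socket


def port_is_open(host, port, timeout=3.0):
    """Return True if a TCP connection to host:port succeeds within timeout."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers connection refused, timeouts, and name-resolution failures.
        return False


# Usage (placeholder hostname, not the real server):
# port_is_open("rancher.example.internal", 7443)
```

This only shows whether the port accepts connections. It says nothing about whether the certificate served there is valid.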
Any ideas? I don't want to have to reinstall everything.
Note: Yes, I know a single-node installation is a bad thing. I am currently setting up a multi-node HA Rancher cluster on one of our public servers. This was just to get us familiar with the system, but we are currently relying on it for testing and dev. So I need to get it back up so others can continue to work on their projects (or until I configure everything to use the public servers).