Rancher-nfs painfully slow

huggla · December 7, 2016, 1:42pm

I’m running Rancher 1.2 on RancherOS 0.7 iso. I’m using rancher-nfs with joebiellik/nfs4 as nfs-server. Everything starts fine, there are no healthcheck issues and I’m able to create and add volumes to containers. The problem is the speed. All diskoperations on the rancher-nfs volumes takes forever. It takes about 15 seconds just to get a simple directory listing. The nfs-driver logs are full of errors, but they doesn’t say me anything. Here’s an example:

2016-12-07 10:00:00time=“2016-12-07T09:00:00Z” level=info msg=mount.request name=qgisserverprojects
2016-12-07 10:00:00time=“2016-12-07T09:00:00Z” level=info msg=“qgisserverprojects already mounted on /var/lib/rancher/volumes/rancher-nfs/qgisserverprojects”
2016-12-07 10:00:00time=“2016-12-07T09:00:00Z” level=info msg=mount.response mountpoint=“/var/lib/rancher/volumes/rancher-nfs/qgisserverprojects”
2016-12-07 10:00:00time=“2016-12-07T09:00:00Z” level=info msg=mount.request name=qgisservervarnish
2016-12-07 10:00:00time=“2016-12-07T09:00:00Z” level=info msg=“qgisservervarnish already mounted on /var/lib/rancher/volumes/rancher-nfs/qgisservervarnish”
2016-12-07 10:00:00time=“2016-12-07T09:00:00Z” level=info msg=mount.response mountpoint=“/var/lib/rancher/volumes/rancher-nfs/qgisservervarnish”
2016-12-07 10:00:00time=“2016-12-07T09:00:00Z” level=info msg=unmount.request name=qgisserverprojects
2016-12-07 10:00:00time=“2016-12-07T09:00:00Z” level=info msg=unmount.response
2016-12-07 10:00:00time=“2016-12-07T09:00:00Z” level=info msg=unmount.request name=qgisservervarnish
2016-12-07 10:00:00time=“2016-12-07T09:00:00Z” level=info msg=unmount.response
2016-12-07 10:00:00time=“2016-12-07T09:00:00Z” level=info msg=mount.request name=qgisservervarnish
2016-12-07 10:00:00time=“2016-12-07T09:00:00Z” level=info msg=“qgisservervarnish already mounted on /var/lib/rancher/volumes/rancher-nfs/qgisservervarnish”
2016-12-07 10:00:00time=“2016-12-07T09:00:00Z” level=info msg=mount.response mountpoint=“/var/lib/rancher/volumes/rancher-nfs/qgisservervarnish”
2016-12-07 10:00:00time=“2016-12-07T09:00:00Z” level=info msg=unmount.request name=qgisservervarnish
2016-12-07 10:00:00time=“2016-12-07T09:00:00Z” level=info msg=unmount.response
2016-12-07 10:00:00time=“2016-12-07T09:00:00Z” level=info msg=mount.request name=qgisserverprojects
2016-12-07 10:00:00time=“2016-12-07T09:00:00Z” level=info msg=“qgisserverprojects already mounted on /var/lib/rancher/volumes/rancher-nfs/qgisserverprojects”
2016-12-07 10:00:00time=“2016-12-07T09:00:00Z” level=info msg=mount.response mountpoint=“/var/lib/rancher/volumes/rancher-nfs/qgisserverprojects”
2016-12-07 10:00:00time=“2016-12-07T09:00:00Z” level=info msg=mount.request name=qgisservervarnish
2016-12-07 10:00:00time=“2016-12-07T09:00:00Z” level=info msg=“qgisservervarnish already mounted on /var/lib/rancher/volumes/rancher-nfs/qgisservervarnish”
2016-12-07 10:00:00time=“2016-12-07T09:00:00Z” level=info msg=mount.response mountpoint=“/var/lib/rancher/volumes/rancher-nfs/qgisservervarnish”
2016-12-07 10:00:01time=“2016-12-07T09:00:01Z” level=info msg=unmount.request name=qgisserverprojects
2016-12-07 10:00:01time=“2016-12-07T09:00:01Z” level=info msg=unmount.response
2016-12-07 10:00:01time=“2016-12-07T09:00:01Z” level=info msg=unmount.request name=qgisservervarnish
2016-12-07 10:00:01time=“2016-12-07T09:00:01Z” level=info msg=unmount.response
2016-12-07 10:00:01time=“2016-12-07T09:00:01Z” level=info msg=mount.request name=qgisservervarnish
2016-12-07 10:00:01time=“2016-12-07T09:00:01Z” level=info msg=“qgisservervarnish already mounted on /var/lib/rancher/volumes/rancher-nfs/qgisservervarnish”
2016-12-07 10:00:01time=“2016-12-07T09:00:01Z” level=info msg=mount.response mountpoint=“/var/lib/rancher/volumes/rancher-nfs/qgisservervarnish”
2016-12-07 10:00:01time=“2016-12-07T09:00:01Z” level=info msg=unmount.request name=qgisservervarnish
2016-12-07 10:00:01time=“2016-12-07T09:00:01Z” level=info msg=unmount.response

Those kinds of errors appears over and over again in stderror.

Any suggestions how to fix this?

kchebani · March 21, 2017, 2:51pm

Hi,

I’ve got the same issue :

NFS export config :
/mnt *(rw,fsid=0,no_root_squash,no_subtree_check,insecure)

Rancher-NFS config :
MOUNT_OPTS: proto=tcp,port=2049,rw,nfsvers=4
and i tried also
MOUNT_OPTS: rw,nfsvers=4

My response time on services is about 15-20 seconds the first time, then it works normally for 1 minutes. Then if my services stays Idle for more than 1 minute the 15-20 seconds delay reappears.

Thanks for your reply,

K.

huggla · March 23, 2017, 3:21pm

I did a fresh install of server and hosts and after that it worked fine.

NFS export config :
/nfs-share *(rw,async,no_subtree_check,no_auth_nlm,no_root_squash,crossmnt,no_ac,fsid=0)

Rancher-NFS config :
noacl,noatime,nodiratime,minorversion=1,nolock,nfsvers=4

I’ve set owner and group for the shared folder to nobody:nogroup. My settings makes the share totally insecure but I haven’t published any ports outside Rancher so it should be ok.

Topic		Replies	Views
Rancher-NFS - Version 1.5.3 Rancher 1.x	0	904	March 31, 2017
Rancher NFS Caching Convoy	1	2100	September 26, 2017
Mounted volumes extremely slow RancherOS	0	1159	January 5, 2017
Rancher is too slow(1.6 version takes 10min to load) Rancher 1.x	0	1100	August 22, 2018
How to debug NFS issues? RancherOS	2	1042	August 18, 2018

Rancher-nfs painfully slow

Related topics