I think you found the log that tells you the problem, and you confirmed it with the iSCSI decoder ring, but you're not looking in the right place to get the details. IMHO this isn't a network issue at all. I could be dead wrong!
VMware Knowledge Base
vSphere Documentation Center
So my thought is that there's a soft limit on LUN utilization configured somewhere. I'm not familiar with where or how that is set, but you're definitely hitting it. If your storage reports "OUT OF SPACE", the datastore/LUN goes offline and VMware has to pause all machines on the datastore to prevent data loss.
Also, the size of the ZFS file system/zvol does not necessarily equal the size of the datastore. You can easily increase the size on the ZFS side, but you can't use the space until you expand the datastore in VMware.
What do the datastore capacity and utilization look like on the host?
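If it helps, here's a rough sketch of the comparison I have in mind, assuming you can pull three numbers: the datastore size and free space as the host sees them (e.g. from `esxcli storage filesystem list`) and the size of the LUN on the ZFS side (e.g. `zfs get volsize` on the zvol). The function name and the sample numbers are made up for illustration; plug in your own values.

```python
def datastore_report(capacity_bytes, free_bytes, lun_bytes):
    """Summarize datastore utilization and any LUN space the datastore has not claimed."""
    used = capacity_bytes - free_bytes
    return {
        # Percentage of the datastore that is in use, as the host sees it
        "used_pct": round(100 * used / capacity_bytes, 1),
        # LUN space that exists on the storage side but is not part of the datastore yet
        "unclaimed_bytes": max(lun_bytes - capacity_bytes, 0),
    }


TIB = 1024 ** 4

# Hypothetical numbers: the LUN was grown to 2 TiB on the ZFS side,
# but the datastore is still 1 TiB with about 5% free.
report = datastore_report(capacity_bytes=1 * TIB, free_bytes=TIB // 20, lun_bytes=2 * TIB)
print(report)
```

If `unclaimed_bytes` comes back non-zero, the storage side is bigger than the datastore and the extra space is not usable until the datastore is expanded in VMware.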