Not sure if this is what is causing the inability to failover over VMs. They do drain and failover fine if I reboot a host. Was able to move them fine last night.
Storage is Dell Equallogic PS6210X
The first step I thought would be to validate the cluster and it has shown a failure on storage with below:
For the device-specific module (DSM) named Microsoft DSM, versions do not match between node Server1.domain.private and node Server2.domain.private.
For the device-specific module (DSM) named Microsoft DSM, versions do not match between node Server2.domain.private and node Server1.domain.private.
I now have a recently created VM while testing stuck as failed, checking the cluster events I suddenly have repeated as below:
Cluster Shared Volume 'Volume1' ('Cluster Disk 2') is no longer accessible from this cluster node because of error '(1460)'. Please troubleshoot this node's connectivity to the storage device and network connectivity.
and
Cluster Shared Volume 'Volume1' ('Cluster Disk 2') has entered a paused state because of '(80000011)'. All I/O will temporarily be queued until a path to the volume is reestablished.
All the disks are online, both in iSCSI initiator, disk management (as reserved or whatever) and in the Storage\Disks in Failover Cluster Manager.
Really unsure what is wrong here or why this has suddenly all occurred.