This morning, a (currently) unknown problem caused all the synchronization connections to fail between 2 Starwind SANs. These synchronization connections run on separate NICs from the iSCSI traffic.
In a situation like that, storage traffic should not be affected. However, from the moment the synchronization failure occurred, all VMs that are running on it, were getting I/O errors.
Initially, to deal with this, we manually disconnected the iscsi sessions of the secondary SAN, to make sure that all I/O would go to the primary SAN. But this didn't make any difference.
After this, we (manually) marked the target on the primary SAN as "synchronized". At this point, immediately all VMs were able to connect to the target again without a problem. And also the secondary SAN started synchronizing again, which also completed successfully.
So now my question is: why would the primary SAN not provide I/O to the VMs, when the SANs are not in sync? I mean, it makes sense to me that the secondary SAN (which is not in sync) wouldn't work in that kind of situation.. but the primary SAN should just continue to work as normal. Am I missing anything, or could this be a bug?
The Latest Gartner® Magic Quadrant™Hyperconverged Infrastructure Software