Starwind HA demo
Posted: Wed Sep 22, 2010 10:10 pm
Hi all,
I've been demoing Starwind HA in the "lab" in order to be better able to pitch it to my clients. I ran into a problem today while simulating an iSCSI network failure. Here's what I have:
2 dual quad core servers running W2K8 Enterprise R2
cluster node1 runs starwind ha (primary)
cluster node2 runs starwind ha partner (secondary)
test Hyper-V R2 VM running on Node2 with primary starwind storage on node1
Now, the cluster is setup well with CSV, etc. I created a test VM in Hyper-V R2 and had great fun using Live Migration to fail back and forth. I was very proud of myself. Next, I wanted to see what would happen if the iSCSI network had an issue. So, on node1 (primary ha) I disabled the NIC for iSCSI communication. The VM survived it seems. However when I went to syncronize the HA images, something went wrong. I did a full sync on the quorum img which took no time and came back up in cluster manager. The storage img is 600gb (really only 2gb of actual data) and looks like it will take a few hours to come back up as the sync is only 5% through after 20 minutes or so. In cluster manager the volume is marked as failed upon resync.
Questions:
1.) Should sync take 4-5 hours?
2.) Should sync (resync) kill the cluster storage availability?
3.) Did I do something wrong in my config or simulation?
I am using 4 Intel 1GB server NICs in each server. One for iSCSI only, one for VM-LAN, one for VM-DMZ, and one for server nodes. Advice needed. I'll freely admit I'm an iSCSI noob. I would think that the point of HA is to eliminate iSCSI downtime either on failure or rebuild. I assume I did something wrong.
Thanks for any pointers or advice,
Matt
I've been demoing Starwind HA in the "lab" in order to be better able to pitch it to my clients. I ran into a problem today while simulating an iSCSI network failure. Here's what I have:
2 dual quad core servers running W2K8 Enterprise R2
cluster node1 runs starwind ha (primary)
cluster node2 runs starwind ha partner (secondary)
test Hyper-V R2 VM running on Node2 with primary starwind storage on node1
Now, the cluster is setup well with CSV, etc. I created a test VM in Hyper-V R2 and had great fun using Live Migration to fail back and forth. I was very proud of myself. Next, I wanted to see what would happen if the iSCSI network had an issue. So, on node1 (primary ha) I disabled the NIC for iSCSI communication. The VM survived it seems. However when I went to syncronize the HA images, something went wrong. I did a full sync on the quorum img which took no time and came back up in cluster manager. The storage img is 600gb (really only 2gb of actual data) and looks like it will take a few hours to come back up as the sync is only 5% through after 20 minutes or so. In cluster manager the volume is marked as failed upon resync.
Questions:
1.) Should sync take 4-5 hours?
2.) Should sync (resync) kill the cluster storage availability?
3.) Did I do something wrong in my config or simulation?
I am using 4 Intel 1GB server NICs in each server. One for iSCSI only, one for VM-LAN, one for VM-DMZ, and one for server nodes. Advice needed. I'll freely admit I'm an iSCSI noob. I would think that the point of HA is to eliminate iSCSI downtime either on failure or rebuild. I assume I did something wrong.
Thanks for any pointers or advice,
Matt