The Latest Gartner® Magic Quadrant™Hyperconverged Infrastructure Software

jeffhamm wrote:So by default, if the heartbeat goes down for any reason (nic down, one node blue screens, etc ) the whole SAN goes offline?

anton (staff) wrote:NO! By default if all links between HA nodes will go down (ALL means multiple synchronization channels and multiple heartbeat channels as well) node holding "slave" token will turn itself OFF to avoid split brain issue. With properly configured cluster (heartbeat routed thru initiator side subnetwork) you have ZERO chances to see both nodes down.
P.S. The only way to avoid such an issue completely is going to multiple HA nodes. More then two. Then we'll have a voting quorum. And we'll represent such a solution quite soon. So stay tuned
jeffhamm wrote:So by default, if the heartbeat goes down for any reason (nic down, one node blue screens, etc ) the whole SAN goes offline?
jeffhamm wrote:Anton - I think I can live with the Split Brain scenario for now. How do you change the default setting to allow the Slave to stay online?
Thanks,
Jeff


jeffhamm wrote:OK - I get it that split brain is bad and will stop talking about that![]()
What I'm trying to simulate is a situation where one of the two nodes goes down hard. I thought I could do this by just disabling all the network connections on the primary node. I'm guessing what you are trying to tell me is that this is a bad test?
Would a better test be for me to just push the reset button on the primary node? And if I do reset the primary node, is the expected behavior that the slave node will continue to service requests from virtual machines, or that the slave node will stop servicing requests at that point to avoid split brain?
Sorry to be such a pain - we're close to going into production with our Hyper-V cluster, and we need to make sure we have all the fail over scenarios accounted for and procedures for dealing with them in place if (or when) they occur.
Thanks!
Jeff

I would like to echo this concern. For SQL database servers that are mirrored (functionally what the SAN servers are doing) I only need a witness server that doesn't have to be a fully kitted out production SQL server. I can see reasons why I would want a fully functional three legged HA SAN configuration (offsite replication/failover for instance), but I can see equal validity in just having a witness server in place to act as a quorum voter. Would it be possible to have both options available? I haven't got the budgets to absorb another fully provisioned SAN server.rchisholm wrote:Will the additional member of the voting quorum have to be a storage node? It would be great if it could just provide a quorum. In my situation, if it has to be a 3rd storage node, it increases my costs greatly. With 100's of TB's, the cost of the drives, controllers, servers, rack space, power, and cooling makes a big difference for a 3rd server.
anton (staff) wrote:P.S. The only way to avoid such an issue completely is going to multiple HA nodes. More then two. Then we'll have a voting quorum. And we'll represent such a solution quite soon. So stay tuned
hixont wrote:I would like to echo this concern. For SQL database servers that are mirrored (functionally what the SAN servers are doing) I only need a witness server that doesn't have to be a fully kitted out production SQL server. I can see reasons why I would want a fully functional three legged HA SAN configuration (offsite replication/failover for instance), but I can see equal validity in just having a witness server in place to act as a quorum voter. Would it be possible to have both options available? I haven't got the budgets to absorb another fully provisioned SAN server.rchisholm wrote:Will the additional member of the voting quorum have to be a storage node? It would be great if it could just provide a quorum. In my situation, if it has to be a 3rd storage node, it increases my costs greatly. With 100's of TB's, the cost of the drives, controllers, servers, rack space, power, and cooling makes a big difference for a 3rd server.
anton (staff) wrote:P.S. The only way to avoid such an issue completely is going to multiple HA nodes. More then two. Then we'll have a voting quorum. And we'll represent such a solution quite soon. So stay tuned

Thanks.anton (staff) wrote:1) We'll do support all of the listed scenarios. So you're not going to be forced to pick up working way.
I am a Hyper-V (Windows 2008 R2) shop and bounce between Hyper-V Manager, Failover Cluster Manager and SCVMM 2008 R2 as my management consoles.anton (staff) wrote:2) What hypervisor do you run at this moment?