We are currently due for a refresh of our SAN/Hyper-V host infrastructure, and I am looking for comments and advice regarding a potential design.
We currently run 5 x DL380 G5 servers as hyper-v cluster hosts. These servers are connected to a 2Gb FC storage fabric to an IBM DS4300 with 15k FC disks and an EXP810 with 2TB Sata disks.
We have around 30 virtual servers running a fairly typical SME workload.
Rather than go with the typical option of replacing the SAN with a newer faster version of the IBM, or Equalogic, I've been doing some research and thinking into using Starwind Native SAN for Hyper-V and local storage instead.
My current idea of specification for this setup would be:
3 x Dell 720xd Servers
2 x Xeon E5-2680 CPU (total 16 cores per physical box)
128GB Memory
16 x 1TB near line-SAS 7200RPM drives in Raid 0
2 x 500GB near line-SAS drives in Raid 1
1 x 400GB Intel DC S3700 Enterprise SSD for read-only Cachecade
Thoughts behind this configuration:
16 x near line SAS drives in Raid 0 really makes me cringe, but gives good throughput and capacity.
Memory configuration gives us the ability to run entire 30 guest VM environment on a single node if necessary, giving n+2 redundancy.
3 Node HA configuration with Starwind would require us to have 3 separate drive failures on all three nodes within the space of the sync process not completing to have a total failure.
As I understand it, Starwind can use physical memory for read/write cache to increase performance to local disk arrays, giving very fast response and IO for in-cache reads.
Cachecade addition on reads should mean that any data not in Starwinds cache will be read from the 400GB SSD, and still provide very high IOPS and low read latency. Documentation on Cachecade shows that it is very competent at retrieving and keeping hot files in cache.
Physical size footprint would reduce from our current 16RU to 6RU.
As this seems to be a fairly uncommon Server/SAN configuration, I’d really like to get some advice and feedback on any major flaws or aspects that I have failed to think about. From where I’m sitting, it looks like we could get a significant increase in speed, reliability (redundancy) and footprint, all for less than half the cost of an equivalent SAN provider product. Has anybody else used a similar configuration in production? Be interested to hear.
The Latest Gartner® Magic Quadrant™Hyperconverged Infrastructure Software