SAN Backup cluster

Software-based VM-centric and flash-friendly VM storage + free version

Moderators: anton (staff), art (staff), Max (staff), Anatoly (staff)

Post Reply
nuhu
Posts: 4
Joined: Fri Jun 04, 2010 11:29 am

Thu Sep 02, 2010 8:04 am

Hello All,

I have a potential client that is interested in the starwind solution because of cost (HP too expensive) but they have 2 sites connected via a satellite link (512kb/512kb). They want to have a SAN setup at both ends of the satellite link and be able to incremental backup of the data on site A to site B (via satellite) and vice versa (incremental backup of the data from site B to site A via the same satellite link). The whole initial data at both sides will be manually backed up to the other side and then we only expect incremental byte level changes to the data to be backed up.

Even though i have been told that this will not work unless i get a 100Mbps link (this is Africa, we do not get those kind of luxuries here), i have taken a look at other solutions line storage mirroring from HP and they seem to be able to do incremental backups via LANs and WANs (even over slow links).

Has anyone tried this before? and/or will this work?

how does Starwind's CDP (Continuous Data Protection) help me?

Cheers!!

NM
User avatar
anton (staff)
Site Admin
Posts: 4010
Joined: Fri Jun 18, 2004 12:03 am
Location: British Virgin Islands
Contact:

Thu Sep 02, 2010 7:30 pm

Unfortunately for our HA configuration you need to have fast link between both nodes. Or link speed is going to be a performance limiting factor... For "one-way-ticket" (background replication) you can use slow link and we do use some techniques to take care of slow connection (cached and overlapped I/O, delayed writes, pseudo-de-duplication etc). But in this case you just have second backup site, if first one will go AWOL you'll have non-zero time to recover. Upcoming StarWind will have fast link requirement eliminated but it's going to take time before you'll see one... (true GEO cluster).
Regards,
Anton Kolomyeytsev

Chief Technology Officer & Chief Architect, StarWind Software

Image
nuhu
Posts: 4
Joined: Fri Jun 04, 2010 11:29 am

Fri Sep 03, 2010 12:41 pm

Hello Anton,

Thank you for the response.

The solution we are proposing is not the typical HA solution or a cluster solution. The client is not looking for automatic failover, they basically just want to have a copy of their data from site A at site B and vice versa in the event that Site A or Site B burns down. This is basically a disaster recovery solution.

You said “For "one-way-ticket" (background replication) you can use slow link and we do use some techniques to take care of slow connection (cached and overlapped I/O, delayed writes, pseudo-de-duplication etc). this may work for me, please can you explain on this further or point me to a document that will help.

Thanks

Nuhu M
User avatar
Aitor_Ibarra
Posts: 163
Joined: Wed Nov 05, 2008 1:22 pm
Location: London

Fri Sep 03, 2010 1:35 pm

Hi Nuhu,

Even with replication rather than HA, I don't think that the satellite link will be good enough, unless you expect very/b] low i/o.

Apart from the low bandwidth, 512Kbit/s, if used 24/7 and with no contention (other users of the same bandwidth) and no tcp/ip overhead, you would only be able to transfer 5.2GB a day maximum. (512Kbit = 64Kbyte - i.e. you can transfer the RAM of a Commodore 64 every second, or 3.75MB per minute, 225MB per hour, 5400MB per day)

Realistically you're going to see much less than 5GB a day over 512Kbit/s, and if your data changes more than that, then your recovery site would never be in sync with your primary site.

On top of low bandwidth, satellite means higher latency because of the larger distances (your traffic has to go up into space and back, not along a cable) - nothing goes faster than the speed of light! This will be down to the straight line distance between your two sites on the ground and the satellite in space. According to this http://www.dslreports.com/faq/2001 , your ping time (the time it takes for a packet to go out from one location, be received at the other location, and a return packet come back) is going to be about 850 milliseconds. I don't know how well iSCSI can cope with such high latency; the initiators may see the high latency as a broken link. Anton will know!

Maybe you can prioritise some data to go over the link (stuff that doesn't change much every day, but absolutely has to be in sync at both sites) and other larger but lower priority data through a lower-tech method like physically transporting a backup disk every week?

Another option is to look at file level replication, e.g. something like DFS with RDC on windows, which is smart enough to only send the changed part of each changed file, uses compression and can cope with high latency. The main problem with it (unlike block level replication like Starwind) is that a file has to be closed for it to replicate. No good for stuff which keeps locks on files (E.g. databases).

hope this helps,

Aitor
User avatar
anton (staff)
Site Admin
Posts: 4010
Joined: Fri Jun 18, 2004 12:03 am
Location: British Virgin Islands
Contact:

Fri Sep 03, 2010 5:33 pm

In such a case you need "Mirror and Replication" edition and you really need start looking at mirror and replication PDFs.
nuhu wrote:Hello Anton,

Thank you for the response.

The solution we are proposing is not the typical HA solution or a cluster solution. The client is not looking for automatic failover, they basically just want to have a copy of their data from site A at site B and vice versa in the event that Site A or Site B burns down. This is basically a disaster recovery solution.

You said “For "one-way-ticket" (background replication) you can use slow link and we do use some techniques to take care of slow connection (cached and overlapped I/O, delayed writes, pseudo-de-duplication etc). this may work for me, please can you explain on this further or point me to a document that will help.

Thanks

Nuhu M
Regards,
Anton Kolomyeytsev

Chief Technology Officer & Chief Architect, StarWind Software

Image
User avatar
anton (staff)
Site Admin
Posts: 4010
Joined: Fri Jun 18, 2004 12:03 am
Location: British Virgin Islands
Contact:

Fri Sep 03, 2010 5:36 pm

Indeed, only de-duplication based solution with HUGE lookup database and very small transfer block is going to work with such a slow connection. But with tons of generated data and very poor de-duplication coefficients (say we're talking about IP video surveillance or voice recording or whatever generating non-overlapping content) nothing is going to help...
Aitor_Ibarra wrote:Hi Nuhu,

Even with replication rather than HA, I don't think that the satellite link will be good enough, unless you expect very/b] low i/o.

Apart from the low bandwidth, 512Kbit/s, if used 24/7 and with no contention (other users of the same bandwidth) and no tcp/ip overhead, you would only be able to transfer 5.2GB a day maximum. (512Kbit = 64Kbyte - i.e. you can transfer the RAM of a Commodore 64 every second, or 3.75MB per minute, 225MB per hour, 5400MB per day)

Realistically you're going to see much less than 5GB a day over 512Kbit/s, and if your data changes more than that, then your recovery site would never be in sync with your primary site.

On top of low bandwidth, satellite means higher latency because of the larger distances (your traffic has to go up into space and back, not along a cable) - nothing goes faster than the speed of light! This will be down to the straight line distance between your two sites on the ground and the satellite in space. According to this http://www.dslreports.com/faq/2001 , your ping time (the time it takes for a packet to go out from one location, be received at the other location, and a return packet come back) is going to be about 850 milliseconds. I don't know how well iSCSI can cope with such high latency; the initiators may see the high latency as a broken link. Anton will know!

Maybe you can prioritise some data to go over the link (stuff that doesn't change much every day, but absolutely has to be in sync at both sites) and other larger but lower priority data through a lower-tech method like physically transporting a backup disk every week?

Another option is to look at file level replication, e.g. something like DFS with RDC on windows, which is smart enough to only send the changed part of each changed file, uses compression and can cope with high latency. The main problem with it (unlike block level replication like Starwind) is that a file has to be closed for it to replicate. No good for stuff which keeps locks on files (E.g. databases).

hope this helps,

Aitor
Regards,
Anton Kolomyeytsev

Chief Technology Officer & Chief Architect, StarWind Software

Image
nuhu
Posts: 4
Joined: Fri Jun 04, 2010 11:29 am

Sat Sep 04, 2010 4:35 am

Hello All,

Thanks for the responses and the insights.

The client will not be generating more than 100MB a day. in anyway, we intend to do a POC just to be sure that the proposed solution will work before we hopefully go live in production.

Thanks again.

Nuhu M
User avatar
anton (staff)
Site Admin
Posts: 4010
Joined: Fri Jun 18, 2004 12:03 am
Location: British Virgin Islands
Contact:

Sat Sep 04, 2010 7:58 am

Sure. If you want to have on-line assistance from pre-sales engineers please drop us an e-mail to support@starwindsoftware.com and we'll be happy to help you with the configuration. Thanks!
nuhu wrote:Hello All,

Thanks for the responses and the insights.

The client will not be generating more than 100MB a day. in anyway, we intend to do a POC just to be sure that the proposed solution will work before we hopefully go live in production.

Thanks again.

Nuhu M
Regards,
Anton Kolomyeytsev

Chief Technology Officer & Chief Architect, StarWind Software

Image
Jdemuynck
Posts: 3
Joined: Thu Feb 04, 2016 4:39 pm

Thu Feb 04, 2016 4:54 pm

Is it possible to setup a geo cluster using a 10GB fibre connection between 2 datacenters ?

We have 4 Windows 2012R2 servers setup 2 by 2 in two separate clusters, each using Starwind for the high availability internal SAN solution.
On these servers some 40 virtual servers are active.
We would like to move the 2th server of each cluster to a second datacenter and setup a synchronous replication between the two datacenters.

We have a 10GB fibre connection between the two datacenters (ring structure with 10GB backup line)
The distance is about 50km and the delay on the interconnect is 1ms
In both datacenters we have a 1Mb internet connection.

Is this possible ?
Will this result in performance loss ?

Has anyone setup such a system before ?

Jan
User avatar
darklight
Posts: 185
Joined: Tue Jun 02, 2015 2:04 pm

Wed Feb 10, 2016 4:33 pm

Hi Jdemuynck,

If you can do Ethernet and iSCSI over this 10GB fibre channel, it might be possible for you to implement the idea. Will need at least 2 VLANs or different subnets routing to avoid mixing iSCSI and Sync traffic in same subnet. Delay 1ms should be fine for both, iSCSI and synchronization. Of course you have to avoid total connection down between sites at all cost, so you may need to get a heartbeat running over WAN then.
User avatar
Tarass (Staff)
Staff
Posts: 113
Joined: Mon Oct 06, 2014 10:40 am

Fri Feb 12, 2016 3:13 pm

Thank you very much for your time and help, darklight.

Jdemuynck, should you have any further questions, do not hesitate to contact us.
Senior Technical Support Engineer
StarWind Software Inc.
User avatar
Tarass (Staff)
Staff
Posts: 113
Joined: Mon Oct 06, 2014 10:40 am

Fri Feb 12, 2016 3:13 pm

Thank you very much for your time and help, darklight.

Jdemuynck, should you have any further questions, do not hesitate to contact us.
Senior Technical Support Engineer
StarWind Software Inc.
Jdemuynck
Posts: 3
Joined: Thu Feb 04, 2016 4:39 pm

Fri Feb 12, 2016 3:21 pm

Thanks for your answer.
We will now setup the second datacenter infrastructure first and setup a test environment
Vladislav (Staff)
Staff
Posts: 180
Joined: Fri Feb 27, 2015 4:31 pm

Wed Feb 17, 2016 8:23 am

Hi there!

Have fun with your StarWind setup and do not hesitate to contact us should you have any questions :)
Post Reply