Okay so after reading a little bit it seems ATTO is not really an appropriate benchmark (only a queue of 4)
So I've tried SQLIO with two threads and a queue of 8 per thread
Sizes of:
1k
4k
64k
Command Line: (once for size for local and once for each size for remote)
sqlio.exe -kR -s60 -fsequential -o8 -b1 -LS -FC:\param timeout /T 10
sqlio.exe -kR -s60 -frandom -o8 -b1 -LS -FC:\param timeout /T 10
sqlio.exe -kR -s60 -fsequential -o8 -b1 -LS -FC:\param timeout /T 10
sqlio.exe -kR -s60 -frandom -o8 -b1 -LS -FC:\param timeout /T 10
I've used a simple Image File, and two iSCSI Connection over the two X520T Ports with enabled MPIO
64k full speed for random and sequential
4k half speed at sequential, full speed at random
1k quarter at sequential, full speed at random
so anton, do you think there is a way to also get full speed for the 1k and 4k sequentials test? Or is this due to overhead?
And through the various tests I've noticed one thing that really bugs me:

- MS Initiator Access Times
- HDTUNE_Access_Times_6_remote_dual_con_nocache.PNG (46.08 KiB) Viewed 24975 times

- Starwind StarPort Access Times
- Starwind_Initator_HDTUNE_Access_Times_1_remote.PNG (48.55 KiB) Viewed 24973 times
As you see the MS Initiator takes quite a time on the max. Access Times, with your (the Starwind StarPort Initiator) the maximum access times are much better, but not as good as on the real device (add network lat. etc)
I've attached the outputfiles as a zip archive