When I add about 10 worker threads in a VM, I get 75 MB/s throughput. Was I just testing wrong? I would expect a single-file copy to reach the maximum available throughput, but it doesn't. The throughput is there, though, when copying multiple files. There is still something wrong ...