How robust are tests for two independent samples?