A note on auto-tuning GEMM for GPUs