Making sure our tensor operations run fast on GPUs