How Sampling Helps Attention Approximation