Accelerating FFT towards Exascale Computing