Text-to-WAV audio conversion is indeed somewhat hard to figure out.