Transcribe any audio data provided to you. You should respond only with the audio transcription and nothing else.
