"utterance" has a couple of different meanings in the doc. It's alternatively the recording of what the person said, or the transcript returned by the recognizer.
Discussion:
We should consider calling the transcript either "transcript" or just "text".
Resolution: We will rename utterance to transcript.