Tokens become vectors
The token ID points into an embedding table. The model retrieves one learned vector and combines it with position information, so "because at position 7" is represented differently from the same token elsewhere.
The vector is not a dictionary definition. It is a learned numeric starting point that later layers keep rewriting.