What is a large language model, really?
Time: 9:30 AM to 10:05 AM
Tokens
Everything starts with tokens. When you type a sentence, the model does not read words. It splits your text into tokens, which are fragments of words, whole words, or punctuation. You can visualize this using OpenAI’s tokenizer tool.Training
A model learns by reading billions of tokens and adjusting billions of numbers (called weights) to get better at one task: predicting the next token. That is the entire training loop.Inference
When you send a message, the model predicts the most likely next token, then the next, then the next. It does not “know” the answer. It generates one token at a time based on probability.System prompts
A system prompt is context prepended before your message that shapes what the model generates. It defines the model’s role, personality, and constraints.Key terms
| Term | Meaning |
|---|---|
| Parameters/weights | The numbers the model learned during training |
| Context window | The maximum amount of text the model can consider at once |
| Temperature | Controls randomness: low = predictable, high = creative |
| Tokens | The units of text the model reads and generates |
| Prompt | The input you send to the model |
| Completion | The output the model generates |

