Tokenization is the process of breaking language into smaller units called tokens, often fragments of words. Large language models use tokens as the primary units for processing input and generating output. Model memory limits and processing costs are typically measured in tokens rather than words.