A large language model (LLM) is a type of artificial intelligence that can generate human language and perform related tasks. These models are trained on huge text datasets, often containing hundreds of billions of words or more. During training, the model typically learns to predict the next word (or word fragment, called a token) in a passage, and in doing so it picks up the statistical patterns and rules of language. This is loosely similar to how a human learns to communicate through exposure to language. LLMs can perform a variety of language tasks, such as answering questions, summarizing text, translating between languages, and writing content.
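The idea of learning language patterns from data can be illustrated with a deliberately tiny sketch. The Python example below is not how a real LLM is built (real models use neural networks with billions of parameters); it is only a word-counting toy, and the function names `train_bigram_model` and `predict_next`, as well as the sample corpus, are made up for illustration. It counts which word tends to follow which, then uses those counts to guess the next word.

```python
from collections import Counter, defaultdict


def train_bigram_model(text):
    """Count, for each word, which words follow it in the training text."""
    words = text.lower().split()
    follow_counts = defaultdict(Counter)
    for current_word, next_word in zip(words, words[1:]):
        follow_counts[current_word][next_word] += 1
    return follow_counts


def predict_next(model, word):
    """Return the most frequent follower of `word`, or None if it was never seen."""
    followers = model.get(word.lower())
    if not followers:
        return None
    return followers.most_common(1)[0][0]


# A toy "dataset"; a real LLM would see vastly more text.
corpus = "the cat sat on the mat . the cat ate the fish . the cat slept ."
model = train_bigram_model(corpus)

print(predict_next(model, "the"))  # "cat" follows "the" most often in this corpus
print(predict_next(model, "dog"))  # None: "dog" never appeared in the training text
```

Even this toy shows the basic principle: the model knows nothing about cats, only that "cat" frequently follows "the" in the text it was given. An LLM applies the same principle at a far larger scale and with a much more flexible model.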
Well-known assistants built on LLMs include ChatGPT, Claude, Microsoft Copilot, Gemini, and Meta AI.
As language models become multimodal (working with media types beyond text, such as images and audio), they are increasingly also called “foundation models.” The term refers to models that are trained on vast amounts of data and can be adapted to a wide range of tasks, not just language tasks.