Key concepts
-
1 A language model is a machine learning model that aims to predict and generate plausible language.
-
2 Large language models (LLMs) are a category of foundation models trained on immense amounts of data making them capable of understanding and generating natural language and other types of content to perform a wide range of tasks.
-
3 LLMs are designed to understand and generate text like a human, in addition to other forms of content. They have the ability to infer from context, generate coherent and contextually-relevant responses, translate to languages other than English, summarize text, answer questions, and even assist in creative writing or code-generation tasks.
Learning objectives
Now that you’ve completed this module, you should be able to:
- Define large language models (LLMs)
- Identify key LLM concepts such as transformers
- Describe some common applications of large language models