Scientists Say: Large language model

These programs use machine learning to power reading and writing AI technology

Digital generated image of abstract smartphone with pop up message chat icons against blue background. Artificial intelligence chatbot communication concept.

Large language models are behind some of the most powerful reading and writing AI systems today.

Andriy Onufriyenko/Moment/Getty Images

Large Language Model (noun, “LARJ LANG-wuhj MAH-del”)

A large language model — or LLM — is an artificial intelligence computer program that can read text and write responses. LLMs power many modern artificial intelligence, or AI technologies, such as ChatGPT.

LLMs generate human-sounding text using computer programs that “learn.” These algorithms learn by analyzing huge amounts of data. The data might be the text on webpages, online books or social media posts. Scientists “train” algorithms by providing them these data.

An LLM first breaks the training data into smaller pieces, such as words or parts of words. These are called tokens. The system then divides the tokens into categories. It also maps out relationships and patterns between them. Computers don’t use words to solve problems. They use math. So, an LLM must assign numbers to text tokens. That allows the LLM to calculate responses based on probabilities.

Once trained, an LLM can read a new text prompt — such as a question — and break it into tokens. Then, the LLM puts together a response one token at a time. It calculates which tokens are most likely to come next based on patterns learned in training.

Humans are very good at detecting these patterns. For example, imagine a friend telling you to “turn right.” The word “right” has several meanings. “Right” could mean accurate. It could also mean morally good. But these alternate definitions wouldn’t confuse you. In this context, coming after the word “turn,” the word “right” clearly refers to a direction. You’ve learned that from experience.

LLMs can also use their experience from training to figure out meanings. LLMs do this with another program called a transformer. A transformer focuses on a bit of text, such as a sentence. It considers multiple meanings that tokens in sentences might have. Then, it decides which meaning is most likely intended.

LLMs are powerful AI tools. However, they also come with problems. The language data used to train LLMs sometimes bakes biases into these systems. They also tend to oversimplify some topics. And they have limited ability to verify their information. That’s because they do not understand the concept of truth or accuracy as a person does.

Despite how humanlike LLMs may sound, it’s crucial to remember their limits. LLMs do not truly comprehend the text they generate. While they mimic some patterns of human reasoning, they do not have feelings and cannot understand emotions like a person.

In a sentence

Large language models (LLMs) can power AI job-screening technology that comes with cooked-in biases, favoring applicant names that sound male.

Check out the full list of Scientists Say.

Katie Grace Carpenter is a science writer and curriculum developer, with degrees in biology and biogeochemistry. She also writes science fiction and creates science videos. Katie lives in the U.S. but also spends time in Sweden with her husband, who’s a chef.

Use up and down arrow keys to explore.Use right arrow key to move into the list.Use left arrow key to move back to the parent list.Use tab key to enter the current list item.Use escape to exit the menu.Use the Shift key with the Tab key to tab back to the search input.