The Math You Need To Start Understanding LLMs

Hackaday·Maya Posch·28 days ago

Once you peel back the hype and mysticism, large language models (LLMs) are a fascinating application of statistical models, effectively what you get when you dial a basic auto-complete model up to eleven. Analyzing a mind-boggling amount of text and producing meaningful auto-completion results involves quite a bit of math, and a recent three-part article series by [Giles] goes through the basics of inference, the prediction step that uses a trained model. The text is encoded in the LLM’s vector space as token IDs, each token being a text fragment that has some probability of following another, such as when cats may be found on desks, as in the above photo by [Giles]. During inference, multiple such IDs are retrieved in a vector, from which a sentence can be pieced together in successive steps.…
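To make the prediction step a little more concrete, here is a minimal sketch of how a vector of raw scores could be turned into next-token probabilities and a chosen token ID. It is not taken from the series by [Giles]: the five-word vocabulary, the score values, and the names `VOCAB`, `softmax` and `next_token` are all made up for illustration, and picking the single most probable token (greedy decoding) is only one of several possible strategies.

```python
import math

# Hypothetical toy vocabulary; a token's ID is simply its index in this list.
VOCAB = ["the", "cat", "sat", "on", "desk"]


def softmax(scores):
    """Convert raw scores into probabilities that sum to 1."""
    peak = max(scores)  # subtracting the maximum keeps exp() from overflowing
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def next_token(scores):
    """Greedy decoding: pick the token ID with the highest probability."""
    probs = softmax(scores)
    token_id = max(range(len(probs)), key=probs.__getitem__)
    return token_id, VOCAB[token_id], probs


# Made-up scores standing in for what a trained model might output
# after reading the prompt "the cat sat on the".
token_id, text, probs = next_token([0.1, 0.3, 0.2, 0.0, 2.5])
print(token_id, text)
```

Repeating this step, with the chosen token appended to the input each time, is how a sentence would be pieced together one token at a time.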
