“…at their core, LLMs are nothing more than fancy probability engines.
Probability is a tool for quantifying randomness and uncertainty. Having a probabilistic nature is the source of an LLM’s power… and its unpredictability. It’s what makes it possible to generate novel, creative, actionable responses, and also makes LLMs very difficult to train or debug.
Whether you’re a layperson, practitioner, or researcher, the entire process of working with LLMs is an exercise in forming and manipulating their latent probability distributions into giving you the outputs you want.”
The blog gives a quick refresher on probability distributions, something that most of us would have studied in high school math, before going on to explain how LLM’s use probability of the occurrence of a word to generate text which is likely to make sense and give us the outputs we seek. Given this context, unsurprisingly, the most common application of LLM so far has been Chatbots. He gives an intuitive sense of how LLMs drive chat:
“Chat interfaces have become the dominant way for most people to interact with LLMs, capturing the public imagination and showcasing these models’ capabilities. But how do we get from generating individual words to engaging in full-fledged conversations? The answer lies in cleverly applying the principles we’ve discussed so far.
Here’s how it works:
B. The model generates a response based on this context, just as we described earlier.
C. For your next message, the model doesn’t just look at what you’ve just said. Instead, it considers everything that’s been said so far – your initial message, its first response, and your new message.
D. This process repeats for each turn of the conversation. The context grows longer, incorporating each new message and response.
However, this method also introduces some challenges:
Whether you are a user of AI or an analyst trying to assess the disruptive potential of this technology, the blog is a compelling and easy read to understand what’s tipped to be the next big thing, as seen with the tens of billions of dollars already being invested by the world’s biggest technology companies.
If you want to read our other published material, please visit https://marcellus.in/blog/
Note: The above material is neither investment research, nor financial advice. Marcellus does not seek payment for or business from this publication in any shape or form. The information provided is intended for educational purposes only. Marcellus Investment Managers is regulated by the Securities and Exchange Board of India (SEBI) and is also an FME (Non-Retail) with the International Financial Services Centres Authority (IFSCA) as a provider of Portfolio Management Services. Additionally, Marcellus is also registered with US Securities and Exchange Commission (“US SEC”) as an Investment Advisor.