With the rate at which AI develops, it’s easy to forget where it came from. Most obviously, generative AI has gone from comically bad to professionally viable, with video experiencing the largest quality jump in the last year. However, Large Language Models (LLMs) remain the most impactful across industries – and it’s why each release is so eagerly anticipated.
In this article, I explore the evolution of LLMs and what the next big innovation is set to be.
Our AI journey to date
In 2017, the breakthrough with Transformers in AI changed everything for LLMs. This introduced a model using self-attention mechanisms, enabling more efficient, scalable, and versatile processing of data – all the ingredients that make today’s AI models so good.
In 2019, OpenAI's GPT-2 introduced preschool-level intelligence, with the model able to perform basic counting and write simple stories, including one about a four-horned unicorn. The release of GPT-3 elevated the model’s abilities to that of an elementary student, with the model able to generate simple text that could be applied in real-world use cases, such as for meta and product descriptions.
GPT-4 made a significant leap, comparable to a high schooler in writing, math, and coding. From my point of view as a copywriter, its ability to write coherent – and sometimes decent – copy was a clear indication of the trajectory we’re moving towards. Since then, we’ve seen a wave of LLMs introduced and there’s no sign that development is slowing.