A no-code way to learn AI
Learn how AI works from a real LLM implemented entirely in Excel
AS SEEN ON
Skip the mountains of prep work and learn how AI works in just a few hours with my live cohort-based course How LLMs Work: A Technical Crash Course for Busy STEM Professionals.
Here’s what past students have said:
“I could’ve read 20 papers on this and not gotten nearly the intuitive understanding as working hands on with the course materials as you explained the concepts.”
July ’24 Course Student
“I would recommend this course to anyone wanting to build a foundation in LLMs. Very interesting material that is well-presented. Additional value is the direct interaction with Ishan.”
July ’24 Course Student
This is the class I wish I had a long time ago. I’ll cover not only how LLMs work but the powerful ideas behind why they work while making it as intuitive and easy to understand as possible.
Hope to see you there!
Details and registration at https://maven.com/spreadsheets-are-all-you-need-ai/ai-for-everyone-master-ai-with-spreadsheets
“Probably the best 10 minutes you can invest to understand LLMs”
Guillaume Decugis (Entrepreneur & VC)
“I have seen nothing which could come close in traceability and accessibility to understand transformers and LLMs”
Maximilian Hentschel (AI Principal Product Manager)
Sophisticated yet simple
Spreadsheets-are-all-you-need is a low-code introduction to the details behind today’s Large Language Models (LLMs) that’s ideal for:
- Technical executives, marketers, and product managers
- Developers and scientists transitioning into machine learning
- AI policy makers and ethicists
If you can understand a spreadsheet, then you can understand AI!
Learn from a real LLM
Spreadsheets-are-all-you-need implements the forward pass of GPT2 (an ancestor of ChatGPT that was state of the art only a few years ago) entirely in Excel using standard spreadsheet functions.
This same Transformer architecture is the foundation for OpenAI’s ChatGPT, Anthropic’s Claude, Google’s Bard/Gemini, Meta’s Llama, and many other LLMs.
More lessons to come! Get notified!
Future videos will walk through more details on the internals of modern AI. Subscribe below to get notified about new tutorials and updates.
“Absolutely amazing for getting a deeper understanding of large language models.”
ShiSh S. (Microsoft)
“Demonstrates step-by-step how transformers work in generative AI language models using a tool familiar to us all: Excel.”
Lucy Tancredi (SVP, Technology)
“Should be required coursework.”
Scott Arnold
“A force of nature at helping people understand LLMs…the Gutenberg Press of AI.”
Rafael Martins
“This is the AI equivalent to getting Doom running on a microwave oven.”
Brian Roemmele
“Finally understood LLM tokenization well enough to ask smart questions about it because of these videos. Great resource!”
Pranav Deshpande
Lesson 1: Demystifying GPT with Excel
In this 10-minute video we kick things off by walking through the high-level architecture of GPT-2 and witnessing each phase of the Transformer come to life in an Excel spreadsheet.
Lesson 2: Byte Pair Encoding & Tokenization
In this lesson we dive into the first phase of GPT, the tokenization phase and the Byte Pair Encoding (BPE) algorithm used in models like ChatGPT. We cover,
- Detailed walkthrough of the BPE algorithm, including its learning phase and application in language data tokenization.
- Spreadsheet Simulation: A hands-on demonstration of the GPT-2’s tokenization process via a spreadsheet model.
- Limitations and Alternatives: Discussion on the challenges of BPE and a look at other tokenization methods.
Try it yourself
Downloading
The sheet is available as an xlsb (Excel binary) file in the Releases section of the github repo. You should be able to download and run this file in Excel for Mac or PC.
Using
If you’re quickly trying to orient yourself to the spreadsheet this walk through video may be helpful though it is not oriented to beginners. For beginners, it’s recommended to start with the lesson videos.
Please realize the implementation is just enough to run very small workloads:
- Full GPT2 small (124M parameters) model including byte pair encoding, embeddings, multi-headed attention, and multi-layer perceptron stages
- Inference/forward pass only (no training)
- Context is limited to 10 tokens in length
- 10 characters per word limit
- Zero temperature output only
This sheet is very big. Unfortunately, it is not unusual for Excel to lock up (but only on a Mac) while using this spreadsheet. It is highly recommended to use the manual calculation mode in Excel and the Windows version of Excel (either on a Windows directory or via Parallels on a Mac).
Issues
Bugs are not out of the question. Please file issues on Github
Contact
@ianand on Twitter
ianand/spreadsheets-are-all-you-need on Github
FAQ
What about Google Sheets?
This project actually started on Google Sheets but the full 124M model was too big and switched to Excel. I’m still exploring ways to make this work in Google Sheets but it is unlikely to fit into a single file as it can with Excel.
Why can’t I chat with it like ChatGPT? It doesn’t match the output of ChatGPT?
Aside from the minuscule context length, it also lacks the instruction tuning and reinforcement learning from human feedback (RLHF) that turn a large language model into a chatbot.
Why is it called Spreadsheets-are-all-you-need
The name is a play on the title of the famous Attention Is All You Need paper which first described the Transformer machine learning architecture that underlies ChatGPT, Claude, Bard, and many of the latest Generative AI tools.