A no-code way to learn AI

Learn how AI works from a real LLM implemented entirely in Excel

Skip the mountains of prep work and learn how AI works in just a few hours with my live cohort-based course How LLMs Work: A Technical Crash Course for Busy STEM Professionals.

Here’s what past students have said:

“I could’ve read 20 papers on this and not gotten nearly the intuitive understanding as working hands on with the course materials as you explained the concepts.”

July ’24 Course Student

“I would recommend this course to anyone wanting to build a foundation in LLMs. Very interesting material that is well-presented. Additional value is the direct interaction with Ishan.”

July ’24 Course Student

This is the class I wish I had a long time ago. I’ll cover not only how LLMs work but the powerful ideas behind why they work while making it as intuitive and easy to understand as possible.

The course is available in multiple schedules so you can pick what work best for you:

  • Sept 17th & 19th (Thu/Thu)
  • Sept 23th – 27th (Mon – Fri)
  • Sept 28th & 29th (Sat/Sun)

Hope to see you there!


Details and registration at https://maven.com/spreadsheets-are-all-you-need-ai/ai-for-everyone-master-ai-with-spreadsheets

“Probably the best 10 minutes you can invest to understand LLMs”

Guillaume Decugis (Entrepreneur & VC)

“I have seen nothing which could come close in traceability and accessibility to understand transformers and LLMs”

Maximilian Hentschel (AI Principal Product Manager)

Sophisticated yet simple

Spreadsheets-are-all-you-need is a low-code introduction to the details behind today’s Large Language Models (LLMs) that’s ideal for:

  • Technical executives, marketers, and product managers
  • Developers and scientists transitioning into machine learning
  • AI policy makers and ethicists

If you can understand a spreadsheet, then you can understand AI!

Learn from a real LLM

Spreadsheets-are-all-you-need implements the forward pass of GPT2 (an ancestor of ChatGPT that was state of the art only a few years ago) entirely in Excel using standard spreadsheet functions.

This same Transformer architecture is the foundation for OpenAI’s ChatGPT, Anthropic’s Claude, Google’s Bard/Gemini, Meta’s Llama, and many other LLMs.

More lessons to come! Get notified!

Future videos will walk through more details on the internals of modern AI. Subscribe below to get notified about new tutorials and updates.

Intuit Mailchimp

“Absolutely amazing for getting a deeper understanding of large language models.”

ShiSh S. (Microsoft)

“Demonstrates step-by-step how transformers work in generative AI language models using a tool familiar to us all: Excel.”

Lucy Tancredi (SVP, Technology)

“Should be required coursework.”

Scott Arnold

“A force of nature at helping people understand LLMs…the Gutenberg Press of AI.”

Rafael Martins

“This is the AI equivalent to getting Doom running on a microwave oven.”

Brian Roemmele

“Finally understood LLM tokenization well enough to ask smart questions about it because of these videos. Great resource!”

Pranav Deshpande


Lesson 1: Demystifying GPT with Excel

In this 10-minute video we kick things off by walking through the high-level architecture of GPT-2 and witnessing each phase of the Transformer come to life in an Excel spreadsheet.

Lesson 2: Byte Pair Encoding & Tokenization

In this lesson we dive into the first phase of GPT, the tokenization phase and the Byte Pair Encoding (BPE) algorithm used in models like ChatGPT. We cover,

  • Detailed walkthrough of the BPE algorithm, including its learning phase and application in language data tokenization.
  • Spreadsheet Simulation: A hands-on demonstration of the GPT-2’s tokenization process via a spreadsheet model.
  • Limitations and Alternatives: Discussion on the challenges of BPE and a look at other tokenization methods.


Try it yourself

Downloading

The sheet is available as an xlsb (Excel binary) file in the Releases section of the github repo. You should be able to download and run this file in Excel for Mac or PC.

Using

If you’re quickly trying to orient yourself to the spreadsheet this walk through video may be helpful though it is not oriented to beginners. For beginners, it’s recommended to start with the lesson videos.

Please realize the implementation is just enough to run very small workloads:

  • Full GPT2 small (124M parameters) model including byte pair encoding, embeddings, multi-headed attention, and multi-layer perceptron stages
  • Inference/forward pass only (no training)
  • Context is limited to 10 tokens in length
  • 10 characters per word limit
  • Zero temperature output only

This sheet is very big. Unfortunately, it is not unusual for Excel to lock up (but only on a Mac) while using this spreadsheet. It is highly recommended to use the manual calculation mode in Excel and the Windows version of Excel (either on a Windows directory or via Parallels on a Mac).

Issues

Bugs are not out of the question. Please file issues on Github

Contact

@ianand on Twitter

ianand/spreadsheets-are-all-you-need on Github

Discuss on Hacker News

FAQ

What about Google Sheets?

This project actually started on Google Sheets but the full 124M model was too big and switched to Excel. I’m still exploring ways to make this work in Google Sheets but it is unlikely to fit into a single file as it can with Excel.

Why can’t I chat with it like ChatGPT? It doesn’t match the output of ChatGPT?

Aside from the minuscule context length, it also lacks the instruction tuning and reinforcement learning from human feedback (RLHF) that turn a large language model into a chatbot.

Why is it called Spreadsheets-are-all-you-need

The name is a play on the title of the famous Attention Is All You Need paper which first described the Transformer machine learning architecture that underlies ChatGPT, Claude, Bard, and many of the latest Generative AI tools.