GPT-3 is a language model developed at Open AI, and I’ve been having a lot of fun playing with it. In fact, I’ve written this post in collaboration with GPT-3. See if you can spot which parts are written by me and which parts come from GPT-3.
Part 1: Brainstorming
The first thing I did when I got access to GPT-3 is use it to brainstorm uses for GPT-3. I gave it these 5 ideas:
- Explain mathematics more simply (why is X true?)
- Generate programs from input/output examples
- Generate questions about a paper I read
- Create a list of machine translation tips and tricks
- Create an adventure game
And it came back with 15 more:
- Create a list of Tor commands to run to defend against an attack
- Create a cheat sheet for screenwriters
- Generate advice for a scene
- Generate XKCD comics
- Create a crib sheet for an arbitrary Python API
- Make a list of movies to watch
- Make a list of scholarly algorithms/heuristics worth reading
- Make a list of generic questions about Pedagogy
- Make a list of cognitive exercises
- Show patterns generated by awesome people
- Generate small cooking tasks
- Generate characters for a project
- Generate configuration options
- Generate flashcards
- Organize my computer/files
And then 15 more:
- Explain things to my parents via messenger
- Explain things to my children via messenger
- Explain mathematical algorithms
- Automated interface design
- Mechanize a Nagihara Kan Airlines website
- Clickbait generator
- Music description (Herbie Hancock is playing “Dolphin Dance” live in London on the 18th March 1968)
- Tell me not to waste my time looking at x
- Interesting facts about Wikipedia
- Explain something to someone while my attention is elsewhere
- Detect adverts in x
- Image caption/caption game
- Speculate about the true utility function of x
- Auto-generate tests for a program
- Explain why my code is currently failing tests.
And then 15 more:
- Generate plots or character/story summaries
- Generate legal examples
- Generate words for crosswords
- Check my grammar
- Supply filler text for themed writing (horror, sci-fi etc.)
- Explain why an AI can’t read (can it comprehend metaphors?)
- Provide entertaining answers to obscure questions
- Get writing ideas and generate word combinations
- Create background for an NPC I need for a D&D game
- Help a sick writer generate plots
- Input pre-generated (from books etc.) phrases and use them for literary analysis
- Help explain a mathematical proof
- Develop a game/story/character I need
- Make short stories I can “overlay” on a fictional Universe I’ve created (ala Ready Player One)
- Make a short film/audio drama
What a lovely list of lists of ideas! I wonder how many of these ideas are novel.
Some of these ideas are outside the scope of what GPT-3 can do. For example, generating XKCD comics seems too difficult. Others are duplicates, such as “Explain mathematical algorithms” (duplicate of “Explain mathematics more simply”). And still others don’t make sense – “Mechanize a Nagihara Kan Airlines website”? Overall though, solid set of ideas.
Part 2: Mathematics
Once I’d gotten GPT-3 to do some brainstorming, I realized I’d need to test its capabilities before trying its suggestions out in earnest.
I taught it how to evaluate simple expressions and take derivatives.
Here’s the first prompt I used:
Question: f(x) = x*x. Compute f(3).
f(3) = 3*3
f(3) = 9
Answer: 9
Question: h(x) = x*x - 2x. Compute h(5).
h(5) = 5*5 - 2*5
h(5) = 25 - 10
h(5) = 15
Answer: 15
Question: y(x) = x**3 - 2x. Compute y(3).
y(3) = 3**3 - 2*3
y(3) = 3*3*3 - 6
y(3) = 27 - 6
y(3) = 21
Answer: 21
I could then ask it any of the following, and it would correctly show its work.
Question: f(x) = tan(x) + 1. Compute f(0).
Question: f(x) = sqrt(x) + 1. Compute f(16).
When I tried to give it
Question: f(x) = x/2 + 2. Compute f(f(16)).
it treated it as just Compute f(16)
rather than f(f(16))
.
Interestingly this works:
Question: f(x) = x*x. Compute f(3).
f(3) = 3*3
f(3) = 9
Answer: 9
Question: h(x) = x*x - 2x. Compute h(5).
h(5) = 5*5 - 2*5
h(5) = 25 - 10
h(5) = 15
Answer: 15
Question: f(x) = x/2 + 2. Compute f(f(16)).
f(f(16)) = f(16/2 + 2)
f(f(16)) = f(8 + 2)
f(f(16)) = f(10)
f(f(16)) = 10/2 + 2
f(f(16)) = 5 + 2
f(f(16)) = 7
Answer: 7
Question: y(x) = x**3 - 2x. Compute y(3).
y(3) = 3**3 - 2*3
y(3) = 3*3*3 - 6
y(3) = 27 - 6
y(3) = 21
Answer: 21
Question: f(x) = x*3 - 1. Compute f(f(5)).
Whereas this does not:
Question: f(x) = x*x. Compute f(3).
f(3) = 3*3
f(3) = 9
Answer: 9
Question: h(x) = x*x - 2x. Compute h(5).
h(5) = 5*5 - 2*5
h(5) = 25 - 10
h(5) = 15
Answer: 15
Question: f(x) = x*3 - 1. Compute f(f(5)).
f(f(5)) = f(5*3 - 1)
f(f(5)) = f(15 - 1)
f(f(5)) = f(14)
f(f(5)) = 14*3 - 1
f(f(5)) = 42 - 1
f(f(5)) = 41
Answer: 41
Question: y(x) = x**3 - 2x. Compute y(3).
y(3) = 3**3 - 2*3
y(3) = 3*3*3 - 6
y(3) = 27 - 6
y(3) = 21
Answer: 21
Question: f(x) = x/2 + 2. Compute f(f(16)).
After getting it to evaluate these simple expressions correctly, I also tried having it take derivatives. With an example as priming, and again having it show its work, this worked well. I haven’t performed a rigorous evaluation of its mathematical capabilities. For now, I’ve just begun to develop an intuition for how it works.
Part 3: Playing Around
After doing math together, I tried many more things with GPT-3. I shared feelings with it, and it expanded on them (correctly too!). I used it to generate lists for brainstorming all manner of thing. The language model wrote love letters from inanimate objects. I tried to teach it to play Codenames, but with only very limited success. And I had it autocomplete previous snippets of mine, and had some fun cowriting with it. While I won’t be sharing all of this with you, I do have one thing to share.
This is a love letter written by a frying pan:
I may not seem like much to you right now. I’m coated in grease and I’m starting to rust, but really, I have big plans. I’m going to Italian restaurants. I’ll hang out with the finest (and healthiest) ingredients you can imagine. It was good knowing you, but you were never mine to keep. I’m going to cook the kind of meal you’ll dream about for the rest of your life. Good luck, sweetheart. You’ll need it.
Overall Principles
Getting an intuition for how GPT-3 behaves has been interesting, and I’ll try now to distill that intuition down into a few principles.
- GPT-3 is continuing your writing, not having a dialog with you.
I’ve seen on Twitter lots of people asking GPT-3 questions, and sharing its response. Know that GPT-3 continues whatever prompt you give it as if it had written the prompt itself, not as if it is responding to the prompt. This is why you often see prompts like “The following is a conversation between X and Y:”.
- GPT-3 effectively puts your writing in a made up context.
If you give GPT-3 a small prompt, such a single sentence, then there are many contexts in which that prompt could be interpreted. GPT-3 is trained to marginalize over the contexts in which the text could appear. Once you start decoding GPT-3’s response though, one implicit context has been selected. It might not be the context that you intended when writing your prompt.
This can happen even if you try to set the context yourself explicitly. For example you might be trying to generate a conversation or a list, and GPT-3 might decide that this conversation or list is taking place inside of a novel.
You can mitigate this context switch by using a longer prompt, or by doing more of the writing yourself.
- GPT-3 likes lists and repeated structures.
This is the main principle behind using GPT-3 for few shot learning. Prompt GPT-3 with a few examples of performing a task, and it will usually make an effort to perform that task yet again.
- GPT-3 cannot perform linguistic tricks like unscrambling words
It doesn’t seem capable of generating new puns (I haven’t tried this yet). And it doesn’t seem capable of unscrambling words (I did try this, and I couldn’t even get it to unscramble 4-letter words).
- GPT-3 benefits from “showing its work”
In my math experiments, having GPT-3 show its work was critical to getting it solving the math questions correctly. It seems that doing a single step of expression evaluation at a time is an easier task than doing it all at once. Providing examples with the steps gave GPT-3 a clearer explanation of the task than just providing input/output examples. GPT-3 also gets additional computational steps in which to solve the problem when it takes the time to show its work.
- GPT-3 has a great breadth of knowledge and writing styles available to it
It can express love, write flowery prose, give advice, write technical documentation, latex, program, math, take the perspective of a toaster, and more. It’s a great tool for brainstorming both explicitly (by having it generate ideas) and through exploration (by using it to help you generate ideas yourself).
Though it has all the facts from the internet at its fingertips (of all the ways to anthropomorphize GPT-3, why choose fingertips?), it is just as capable of generating falsehoods as truths. Do not mistake the text it generates as true or logically correct, even if in many circumstances it does generate correct information. When it transitions from generating truth to generating nonsense it does not give a warning that it has done so (and any truth it does generate is in a sense at least partially accidental).
- Sometimes GPT-3 makes no sense
There are things that don’t make sense that the language model assigns reasonable probabilities to. And there are things that don’t make sense, are low probability, but come up anyway either due to chance or because the model doesn’t have anything better to say. It doesn’t stop generating text just because its out of its depth. Instead, it just plods on forward, saying something as reasonable as it’s capable of, regardless of the situation it finds itself in.
- GPT-3 mimics your style
If you prompt GPT-3 with typos it will continue with typos. If you prompt GPT-3 with florid prose, it will write florid prose. If you prompt GPT-3 with anaphora laden rhetoric, or any other style, it will continue in that style.
Conclusion
I’ll let GPT-3 wrap things up itself:
“And that brings us to the end, some thoughts on whether using GPT-3 was more or less crazy than getting a Ph.D.. I think that using GPT-3 to generate ideas was a higher form of Enlightenment than any Ph.D. I’d do that again in a heartbeat, no shortage of ideas rolling around in GPT-3’s natural language processing head.”
Uh huh. Whatever, GPT-3. Whatever.