Plain Language Gauntlet

One dense methods paragraph, three very different audiences.

Take a jargon-dense pain-neuroimaging methods paragraph and make an AI translate it for a 10th grader, a patient advocate, and an NIH program officer — then ship the best one.

One paragraph of dense methods-speak goes in; three clear explanations come out — and you decide which one is good enough to actually send.

35 min Beginner-friendly teams of 3-5 Any chat AI Recommended core challenge

The goal

The Goal Take the dense methods paragraph below and produce a version your assigned audience would actually understand and trust — a 10th grader, a patient advocate, or a non-specialist NIH program officer. By the end you should have one explanation polished enough that you’d be comfortable pasting it into a grant aim, a study newsletter, or a recruitment flyer. We are judging the output for a real reader, not the cleverness of the method you used to get there.

Why it matters

Every translational pain scientist lives a triple life. You write Specific Aims that a program officer outside your subfield has to find compelling in 90 seconds. You sit on community advisory boards where a patient with fibromyalgia asks what your scanner actually measures. You do outreach, mentor undergrads, and explain your work at Thanksgiving. The skill of compressing “resting-state functional connectivity of the periaqueductal gray” into something a smart non-expert grasps — without misleading them — is one of the highest-leverage academic skills there is, and it is exactly the kind of task a chat AI is well suited to scaffold. Learn to drive it well here and you’ll reuse the move on every abstract, lay summary, and Plain Language Statement you write.

Run of show

0:00–0:05 · Challenge introduction (5 min)
0:05–0:20 · Work in your group (15 min)
0:20–0:22 · Post your best prompt (2 min)
0:22–0:32 · Share & debrief (10 min)
0:32–0:35 · Reset (3 min)

Bad prompt to better prompt

Weak prompt

Explain this paragraph in simple terms: [paste methods paragraph]

The AI will swap a few big words for slightly smaller ones and stop there. “Simple terms” has no audience, no length, no purpose, and no quality bar — so you get a flat, generic gloss that’s still too technical for a patient and too vague for a program officer. It also keeps the paper’s hedging (“we examined the association between…”) instead of telling the reader why any of it matters.

Strong prompt

You are explaining a pain-neuroimaging study to a patient advocate on our community advisory board — a smart, motivated adult with no science background who has lived with chronic low back pain for years.

Rewrite the methods paragraph below into about 120 words at a 9th-grade reading level. Rules: no acronyms without a plain-English gloss the first time; use one everyday analogy for “functional connectivity”; lead with what we were trying to find out and why it could matter to someone in pain; never overclaim — keep it honest about what a brain scan can and can’t show.

Then, in a separate section, list any sentence where you simplified something in a way a methods expert might object to, and rate your own confidence (1-5) that each sentence is still accurate.

Methods paragraph: [paste]

It works because it covers the four levers the weak prompt missed: a concrete audience (with a backstory that sets vocabulary and stakes), a format (length + reading level), explicit constraints (gloss acronyms, one analogy, no overclaiming), and a built-in self-check that surfaces exactly the spots a domain expert needs to verify. You get a usable draft and a map of where to fact-check it.

Prompting moves to try

Cast the reader, not just the topic. Give the AI a one-sentence persona with stakes: “a program officer who funds basic neuroscience but isn’t a pain specialist and skims 40 aims a day.” Specific readers produce specific prose.
Decompose the goal. Ask first for a glossary of the 5 hardest terms in plain English, then ask it to write the explanation using that glossary. Splitting “understand” from “rewrite” beats doing both in one shot.
Demand an analogy budget. “Use exactly one analogy for connectivity, and flag it as an analogy” stops the AI from drowning a patient in metaphors or none at all.
Adversarial self-evaluation. “Now critique your own version as a skeptical senior methods reviewer: what did you oversimplify, and where might a reader be misled? Score each risky sentence 1-5 for accuracy.” This catches the polite-but-wrong sentences.
Make it improve your prompt. Paste your prompt and ask: “Before answering, rewrite my instructions to get a clearer result for this audience, then proceed.” You’ll often learn what you forgot to specify.
Force a contrast. “Give me two versions — one cautious, one vivid — and tell me which you’d send to this audience and why.” Comparison makes the right voice obvious.

Starter materials

The dense methods paragraph (give this identical text to every group):

Methods excerpt — “Altered descending pain modulation in chronic low back pain” Forty-two patients with chronic low back pain (cLBP) and 40 age- and sex-matched pain-free controls underwent quantitative sensory testing (QST) and resting-state functional MRI on a 3T scanner. QST included pressure pain thresholds (PPT) over the lumbar paraspinals and a conditioned pain modulation (CPM) paradigm using a cold-pressor conditioning stimulus, indexed as the change in PPT from baseline. Resting-state BOLD data (TR = 2000 ms, 6 min, eyes open) were preprocessed with motion scrubbing (FD > 0.5 mm censored), nuisance regression (24 motion parameters, white matter and CSF signals, aCompCor), and band-pass filtering (0.01–0.1 Hz). We computed seed-based functional connectivity between the periaqueductal gray (PAG), rostral ventromedial medulla (RVM), and dorsolateral prefrontal cortex (dlPFC), and tested group differences using a general linear model with age and framewise displacement as covariates, thresholded at a voxelwise p < 0.001 with cluster-level FWE correction (p < 0.05). PAG–dlPFC connectivity was negatively correlated with CPM efficiency across the full sample (r = -0.34, p = 0.002).

Audience cards (assign ONE per group):

Card A · 10th grader

<p>A curious 15-year-old in a high-school biology class. Goal: they could explain back to a friend "what these scientists did and what they found." ~100 words, no jargon, one good analogy.</p>

Card B · Patient advocate

<p>An adult with chronic low back pain on your community advisory board. Goal: they trust that the study respects their experience and grasp why a brain scan is relevant to <em>their</em> pain. ~120 words, honest about limits.</p>

Card C · NIH program officer

<p>A non-specialist program officer who funds neuroscience broadly. Goal: the significance and rigor land in ~90 seconds of skimming. ~120 words, keep one quantitative result, emphasize the "so what."</p>

Quality rubric (use this to pick your one best version):

Criterion	Failing	Solid	Excellent
Audience fit	Wrong vocabulary / level	Mostly right register	Sounds written for that reader
Accuracy	Introduces an error	Technically defensible	Honest about what’s uncertain
Clarity	Jargon survives	Plain but flat	Plain and genuinely vivid
“So what”	Missing	Stated	Compelling and specific

Self-check the AI should run before you trust it:

Re-read your explanation as the target reader. List anything they still wouldn’t understand, anything that overstates what a brain scan or QST can prove, and the single weakest sentence. Then fix only those.

Debrief questions

Which audience was hardest to write for, and why? (Most rooms find the patient advocate hardest — honesty plus warmth is a real constraint.)
Where did the AI quietly overclaim? Did “correlated with” silently become “causes,” or did a scan become a “pain detector”?
What single line in your prompt produced the biggest jump in quality?
Compare two groups who had the same audience but different AIs or prompts — what explains the gap, the tool or the prompt?
Would you actually send your best version as-is? If not, what would you still need to fix by hand, and what does that tell you about where AI stops being trustworthy?

Level up

Round-trip test. Feed your plain-language version back to a fresh AI session and ask it to reconstruct the original methods. Compare to the real paragraph — what got lost or distorted is your accuracy leak.
One source, three outputs, one prompt. Write a single reusable template with an {{AUDIENCE}} variable that produces all three versions in one go, then have the AI grade each against the rubric above.
Ship it for real. Turn the program-officer version into the opening 3 sentences of a Significance section, and the patient version into a 50-word recruitment-flyer blurb — then critique both as if you were the IRB and the study section.

← Back to the Challenge menu · AI Toolkit →