Core Content
Part 1 — What generative AI actually is (and how it fails), in plain language
A generative AI tool (a "large language model," or LLM — like Claude, ChatGPT, or Microsoft Copilot) is, at heart, a very sophisticated prediction engine. It has read an enormous amount of text and learned the patterns of how words follow one another. When you ask it something, it predicts a fluent, plausible-sounding response — one word at a time. It is not looking anything up in a database of facts, and it is not "thinking" the way a person does.
That single idea explains all three of the failure modes every staff member must recognize:
HOW LLMs FAIL (the three you must know)
┌───────────────────┬───────────────────────────────────────────────────────────┐
│ Hallucination │ It invents facts, citations, statistics, or quotes that │
│ │ sound real but are false. It is predicting plausible text, │
│ │ not retrieving truth. │
├───────────────────┼───────────────────────────────────────────────────────────┤
│ Bias │ It reflects patterns in its training data — which can carry │
│ │ racial, gender, language, and socioeconomic bias against │
│ │ the very people you serve. │
├───────────────────┼───────────────────────────────────────────────────────────┤
│ Confident-wrong │ It states wrong answers in the same calm, polished, certain │
│ │ tone as right ones. Fluency is NOT accuracy. │
└───────────────────┴───────────────────────────────────────────────────────────┘
Non-profit example. A program coordinator asks an AI tool for "three peer-reviewed studies showing our after-school model reduces truancy." The tool returns three studies — titles, authors, journals, years — all formatted perfectly. Two of them do not exist. The tool hallucinated them, and presented them with total confidence. If that had gone into a grant report unchecked, it would have been a fabrication in a funder document.
The most dangerous failure is not the obvious gibberish — you would catch that. It is the confident, polished, plausible answer that is subtly wrong. Treat AI as a bright, fast, sometimes-mistaken intern: useful, but never the final authority.
| Do | Don't |
|---|---|
| Treat every factual claim, number, name, and citation as unverified until you check it | Assume that because it reads well, it is right |
| Use AI for first drafts, structure, summaries, brainstorming | Use AI as your source of truth for facts about your beneficiaries, finances, or compliance |
| Notice when a topic touches a community you serve and slow down for bias | Forward AI output to a funder, donor, or beneficiary without a human read |
Part 2 — The 4D fluency spine
This whole training kit — all eight modules — is anchored on the 4D AI Fluency framework from Anthropic + GivingTuesday's AI Fluency for Nonprofits (built on the AI Fluency framework by Profs. Rick Dakan and Joseph Feller). Four habits, in order, for every role:
DELEGATION ──► DESCRIPTION ──► DISCERNMENT ──► DILIGENCE
what to hand how to ask how to judge how to own
to AI vs. keep clearly, with the output for it: verify,
human context error & bias disclose, sign
└────────── Delegation–Diligence loop ───────────┘
└─ Description–Discernment loop ─┘
The two inner loops matter: you describe → discern → re-describe until the output is right, and you delegate → stay diligent so you never lose ownership of what AI helped produce.
Delegation — decide what to hand to AI, and what to keep human
The official framework breaks this into goal/task awareness, platform awareness, and task delegation. In plain terms: know your goal, know what this tool can and cannot do, and split the work wisely. In a non-profit, Delegation is first a safeguard: high-stakes decisions about real people — eligibility, benefits, crisis or safety messaging, mental-health support, case notes with sensitive data — stay human. (Module 7 covers the full do-not-use list.)
- Example: Delegate to AI — drafting a volunteer newsletter, summarizing meeting notes, turning bullet points into a tidy paragraph. Keep human — deciding which families get a limited number of housing slots.
Description — prompt clearly, with context and constraints
Officially: product, process, and performance description. Plain version: tell the tool who it's for, what you want, in what format, and what constraints apply. A vague prompt gets a vague answer; a specific prompt with context gets usable work. This is the single skill that most separates frustrated staff from productive staff. (See the before/after artifact below and Part 4's exercise.)
- Example: Not "write a thank-you letter." Instead: "Draft a warm, 150-word thank-you to a first-time $50 donor to our youth literacy program. Mention their gift funds books for one child for a year. Plain, sincere tone, no jargon, 6th-grade reading level."
Discernment — critically evaluate the output
Officially: product, process, and performance discernment. Plain version: read it like a skeptical editor. Is it accurate? Is it biased against anyone you serve? Is it actually relevant, or just fluent? This is where you catch hallucinations and confident-wrong answers before they cause harm.
- Example: AI drafts intake-form language. A discerning reader notices it assumes everyone has a permanent address and a smartphone — a bias that would exclude the unhoused clients the program exists to serve.
Diligence — verify, disclose, and take ownership
Officially: creation, transparency, and deployment diligence. Plain version: you are accountable for anything you send out, whether or not AI helped. Fact-check before release. Disclose AI assistance where your policy or a funder requires it. And follow the cardinal data rule.
The cardinal data rule: Never paste sensitive beneficiary or donor data — names, case details, health or immigration status, financial distress, contact info — into a public/consumer AI tool. Public tools may use what you type to train future models. Use only the organization's approved tools (Module 3), and de-identify first. When in doubt, leave it out.
Part 3 — The do-use / do-not-use rules (co-created, not handed down)
Every staff member must be able to state the organization's rules before doing real work with AI. The point is not a long policy document — it is a short, memorable list that people actually internalize. A starting template (your Champions and leadership will tailor it):
| ✅ Good to start with (low risk) | ⛔ Off-limits without explicit approval & human decision |
|---|---|
| Drafting/editing internal docs, emails, newsletters | Eligibility, benefit, or service-allocation decisions about beneficiaries |
| Summarizing meeting notes, articles, long reports | Crisis, safety, or mental-health communications |
| Brainstorming, outlining, reformatting | Anything using identifiable beneficiary/donor data in a public tool |
| Translating non-sensitive general content | Case notes, medical, immigration, or financial-distress records |
| Generating first-draft grant/proposal language (human verifies all facts) | Sending AI output to a funder, donor, or beneficiary unread |
"We need a 30-page AI policy before anyone can touch a tool." You don't. You need a one-page do/don't list people remember, plus the cardinal data rule. Depth comes later (Modules 2, 3, 7). Perfect-policy paralysis is itself a reason adoption stalls.
Part 4 — Free, sector-built places to learn this (so you don't build from scratch)
You can deliver the whole foundational tier on free, non-profit-specific courses. Pick one primary course and supplement.
| Resource | What it teaches | Length / cost |
|---|---|---|
| Anthropic + GivingTuesday — AI Fluency for Nonprofits (Anthropic Academy) | The 4D framework applied to non-profit tasks; Description–Discernment and Delegation–Diligence loops; privacy and data understanding | Self-paced, free, completion certificate + quiz |
| NetHope + Microsoft — Unlocking AI for Nonprofits | Four pathways (AI Basics; Applications of Generative AI; Advanced/Copilot; Responsible Use). Prompting, controlling tone/format, evaluating output for bias & hallucination | ~90-min modules, free, CPD-certified |
| Microsoft Learn — Introduction to AI Skills for Nonprofits | 3 modules: intro to Copilot + responsible AI, exploring Copilot across M365, crafting prompts to draft content | Self-paced, free, beginner |
| data.org — AI Skills for Nonprofits (with Microsoft) | 3 modules: demystifying AI, crafting prompts, ethical decision-making | Self-paced, free, digital badge |
| NTEN — AI for Nonprofits Professional Certificate & free Nonprofit Tech Readiness cohort | Role/decision-maker depth; the cohort is a ~6-month, US-based, free, sponsored program (limited seats) from concept to action plan | Certificate (paid courses) + cohort (free, competitive) |
Match the course to your stack. If you live in Microsoft 365, NetHope + Microsoft and Microsoft Learn map directly to Copilot. If you want the cleanest treatment of the 4D habits this kit is built on, use the Anthropic + GivingTuesday course. Don't make staff take all of them — one primary, then role cheat sheets.
Part 5 — The AI Champions Network (the A7 companion track)
A single trained "AI person" is a turnover risk, not a strategy. The durable model is a small network of champions — respected peers across roles — who spread adoption from demonstrated value. Grassroots evidence is strong: organizations that picked a handful of natural champions, gave them time and air cover, and ran monthly peer-sharing sessions saw voluntary adoption climb (one cited case reached ~68% in six months) as fence-sitters watched peers get real value (directional).
Why grassroots beats mandate. With AI specifically — where fear, skepticism, and real uncertainty drive resistance — a top-down mandate doesn't just fail, it backfires. Culture can't be commanded. It spreads when a colleague doing your job shows you something that saved them an hour.
THE CHAMPIONS MODEL
Recruit (invite, don't assign) 2–4 natural champions across roles
│ curious · willing to experiment · trusted
▼
Equip within guardrails approved tools · protected time · air cover
│
▼
Demonstrate value monthly "here's what I tried" peer shares
│ quick wins first, honest failures welcome
▼
Spread to fence-sitters social proof from peers; permission to fail
│
▼
Embed & preserve cheat sheets · prompt library · onboarding
│ recorded sessions survive turnover
▼
Learning loop ──────────────────────► staff issues → proposed policy/cheat updates
Selecting champions. Look for the ~10–15% who show three traits: curiosity about AI, willingness to experiment, and existing social capital (peers already trust them). Recruit by invitation and self-selection, never by assignment. Spread them across functions — a fundraiser, a program lead, an ops person — not just "the tech-comfortable" ones.
Overcoming fear and resistance. Create psychological safety: it's fine to admit you don't understand AI, fine to experiment and fail, and fine to thoughtfully not adopt a tool. Leaders should model the learning curve out loud ("here's where Copilot confused me this week"). And don't dismiss the skeptics — involve them: appoint resisters to draft the do-not-use list, set output-quality standards, or stress-test prompts. That turns resistance into ownership of the very safeguards they care about.
Frame AI honestly as removing administrative burden so staff spend more time with the people they serve — not as headcount reduction. The fastest way to kill grassroots adoption in a mission-driven team is to let people believe the goal is to replace them.
Part 6 — Low-cost formats and a realistic time-to-competency
Fluency is built through practice, not a single sitting — the 4D framework is explicit that it develops over repeated use, not overnight. For a lean, high-turnover team, sequence a stack of mostly-free formats rather than one big event:
| Format | What it does | Cadence / cost |
|---|---|---|
| 30-min live kickoff + cheat sheets | Sets tone, rules, and the cardinal data rule; signals leadership support | Once per cohort · free |
| ~90-min self-paced course | Core 4D habits + how AI fails (one of the free sector courses) | Once, pre-kickoff · free |
| Lunch-and-learns | Peer "here's what I tried" shares with role-specific scenarios; feels like sharing, not training | Bi-weekly · free |
| Sandbox practice | A safe space in an approved tool to practice prompts with non-sensitive data | Always available · free/low-cost |
| Shared prompt library | A living doc of strong prompts staff reuse and improve — the antidote to the ~4% repeatable-workflow gap | Continuous · free |
| Onboarding embed + recorded sessions | New hires get the same fluency on day one; knowledge survives churn | Per hire · free |
| Optional 6-month cohort | Deeper readiness/leadership (e.g., NTEN Nonprofit Tech Readiness, ~8–10 hrs/month) | Quarterly intake · free (competitive) |
A realistic time-to-competency. Set expectations honestly so people don't quit after one frustrating session:
Day 0 ~90 min self-paced course + 30-min kickoff
Week 1–2 First real-but-low-risk task (draft, summary, reformatting) with the cheat sheet at hand
Week 3–6 Two to three workflows feel natural; staff start contributing to the prompt library
Month 2–3 "Aware" → early "Practicing": at least one documented, repeatable workflow per team
"We did the 90-minute course, so we're trained." A course produces awareness, not competency. Competency shows up weeks later as documented, reused workflows — which is exactly what the Outcome Scorecard measures (L3), not course completion alone (L2).