Lara J. Martin

Here is a non-exhaustive list of various resources you might want if you're interested in automated story generation, interactive fiction (IF), or related research areas (such as story understanding and tabletop roleplaying games—TRPGs).

I also made a list of related researchers, and I try to keep up a list of upcoming conference and workshop deadlines. If you want me to add or update anything on any of these lists, please let me know! You can unscramble my email address here:

Note: This is not a list of papers in the field, but rather a list of corpora & code and their corresponding papers if they have it.
If you're looking for paper lists, you might be interested in @arnicas's list of text generation papers found on arXiv, Stephen Ware's Narrative Intelligence Lab reading list, or the Tsinghua Natural Language Processing Group's text generation list.
You might also be interested in Peter Jansen, Marc-Alexandre Côté, and Prithviraj Ammanabrolu's Interactive Virtual Environments list.

Story Datasets

Dataset	Year	Papers	Paper Code (Baselines)	Hugging Face Link
AmbiStory – lexically-ambiguous short stories	2025	AmbiStory: A Challenging Dataset of Lexically Ambiguous Short Stories
StoryPerceptions – crowd annotations of narratives in social media posts	2024	The Empirical Variability of Narrative Perceptions of Social Media Texts
Anthology	2024	Virtual Personas for Language Models via an Anthology of Backstories
Quest-GPT-2 – quests and descriptions from 6 RPGs	2024	Generating Role-Playing Game Quests With GPT Language Models
StorySeeker – finding stories in online communities	2024	Where Do People Tell Stories Online? Story Detection Across Online Communities
EmoBench – English and Chinese stories and LLM's judgments of emotional intelligence	2024	EmoBench: Evaluating the Emotional Intelligence of Large Language Models	https://github.com/Sahandfer/EmoBench
STORYSUMM – faithful abstractive summarization of stories	2024	STORYSUMM: Evaluating Faithfulness in Story Summarization	https://github.com/melaniesubbiah/storysumm
Choice-75 – branching scripts	2024	Choice-75: A Dataset on Decision Branching in Script Learning	https://github.com/JoeyHou/branching
HEART-felt Narratives (Human Empathy and Narrative Taxonomy) – narrative styles that lead to empathy	2024	HEART-felt Narratives: Tracing Empathy and Narrative Style in Personal Stories with LLMs
DnD Spells – structured information for Dungeons and Dragons spells	2024	Leveraging Large Language Models for Spell-Generation in Dungeons & Dragons	https://github.com/m-elio/spell_generation
STORiCo TTS – story TTS in Hindi	2024	STORiCo: Storytelling TTS for Hindi with Character Voice Modulation		https://huggingface.co/datasets/Pavankalyan/Hindi_story_telling
SAGA (Story Alternatives and Goal Applicability) – annotated goals in stories from the perspective of the participant	2024	SAGA: A Participant-specific Examination of Story Alternatives and Goal Applicability for a Deeper Understanding of Complex Events
StoryNory TTS – expressive text-to-speech for storytelling	2023	Narrator or Character: Voice Modulation in an Expressive Multi-speaker TTS	https://github.com/tpavankalyan/Storynory	https://huggingface.co/datasets/Pavankalyan/StoryNoryTTS
CONCOCT (CONCrete Outline ConTrol) – long-form story generation	2023	Improving Pacing in Long-Form Story Planning		https://huggingface.co/datasets/ZachW/GPT-BookSum
PASTA – participant states in stories	2023	PASTA: A Dataset for Modeling PArticipant STAtes in Narratives	https://github.com/StonyBrookNLP/pasta
NarrativeXL – long stories	2023	NarrativeXL: a Large-scale Dataset for Long-Term Memory Models	https://github.com/r-seny/NarrativeXL
r\AmITheAsshole stories	2023	Author as Character and Narrator: Deconstructing Personal Narratives from the r/AmITheAsshole Reddit Community
FIREBALL – Avrae Discord bot commands + natural language	2023	FIREBALL: A Dataset of Dungeons and Dragons Actual-Play with Structured Game State Information		https://huggingface.co/datasets/lara-martin/FIREBALL
NEAT (Narrative Elements AnnoTation) – text annotated with narrative elements	2022	Detecting Narrative Elements in Informational Text
POQue (Participant Outcome Questions) – participant outcomes from events in stories	2022	POQue: Asking Participant-specific Outcome Questions for a Deeper Understanding of Complex Events
NarraSum – data for narrative summarization	2022	NarraSum: an Abstractive Narrative Summarization Dataset
FairytaleQA	2022	Fantastic Questions and Where to Find Them: FairytaleQA – An Authentic Dataset for Narrative Comprehension		https://huggingface.co/datasets/WorkInTheDark/FairytaleQA
SYMON (SYnopses of MOvie Narratives) – movie synopses from video summaries	2022	Synopses of Movie Narratives: a Video-Language Dataset for Story Understanding
TVShowGuess	2022	TVShowGuess: Character Comprehension in Stories as Speaker Guessing	https://github.com/YisiSang/TVSHOWGUESS
TV Tropes – movie scripts annotated with tropes from TVTropes	2022	Computational Support for Trope Analysis of Textual Narratives	https://github.com/mandarsc/TropeAnalysis
Possible Stories – questions about stories	2022	Possible Stories: Evaluating Situated Commonsense Reasoning under Multiple Possible Scenarios	https://github.com/nii-cl/possible-stories
CoAuthor – collaborative writing dataset	2022	CoAuthor: Designing a Human-AI Collaborative Writing Dataset for Exploring Language Model Capabilities
SummScreen	2022	SummScreen: A Dataset for Abstractive Screenplay Summarization	https://github.com/mingdachen/SummScreen
LiSCU (Literature Summary and Character Understanding) – character descriptions, summaries, and names	2021	“Let Your Characters Tell Their Story”: A Dataset for Character-Centric Narrative Understanding	https://github.com/fabrahman/char-centric-story
InSentive – authored stories from BookCorpus	2021	Inspiration through Observation: Demonstrating the Influence of Automatically Generated Text on Creative Writing	https://github.com/roemmele/InSentive
ESTER (Event Semantic Relation Reasoning) – tagged events from news articles from the TempEval3(TE3) workshop	2021	ESTER: A Machine Reading Comprehension Dataset for Event Semantic Relation Reasoning	https://github.com/PlusLabNLP/ESTER
TellMeWhy – Q&A for stories	2021	TellMeWhy: A Dataset for Answering Why-Questions in Narratives		https://huggingface.co/datasets/StonyBrookNLP/tellmewhy
TVRecap – TV shows from Fandom and TVMegaSite (soap operas)	2021	TVRecap: A Dataset for Generating Stories with Character Descriptions
Scifi TV Plots – science fiction episode summaries from Fandom	2020	Story Realization: Expanding Plot Events into Sentences	https://github.com/rajammanabrolu/StoryRealization	https://huggingface.co/datasets/lara-martin/Scifi_TV_Shows
STORIUM – storium.com (gamified storytelling)	2020	STORIUM: A Dataset and Evaluation Platform for Machine-in-the-Loop Story Generation	https://github.com/dojoteef/storium-gpt2
ScriptWriter – from GraphMovie, which no longer exists (descriptions of movie plots)	2020	ScriptWriter: Narrative-Guided Script Generation	https://github.com/DaoD/ScriptWriter
PerSenT – author sentiment prediction (news articles)	2020	Author's Sentiment Prediction		https://huggingface.co/datasets/community-datasets/per_sent
WikiHow Goal-Step	2020	Reasoning about Goals, Steps, and Temporal Ordering with WikiHow	https://github.com/zharry29/wikihow-goal-step
Lit Bank – annotated Project Gutenberg	2019	An Annotated Dataset of Literary Entities and Literary Event Detection		https://huggingface.co/datasets/coref-data/litbank_raw
TimeTravel – stories and counterfactual continuations	2019	Counterfactual Story Reasoning and Generation	https://github.com/qkaren/Counterfactual-StoryRW	https://huggingface.co/datasets/wza/TimeTravel
HPAC (Harry Potter's Action prediction Corpus)	2019	Harry Potter and the Action Prediction Challenge from Natural Language	https://github.com/aghie/hpac
SQuAD 2.0 (Stanford Question Answering Dataset) – reading comprehension	2018	SQuAD: 100,000+ Questions for Machine Comprehension of Text and Know What You Don't Know: Unanswerable Questions for SQuAD		https://huggingface.co/datasets/rajpurkar/squad_v2
WritingPrompts – r/WritingPrompts	2018	Hierarchical Neural Story Generation	https://github.com/facebookresearch/fairseq/tree/main/examples/stories	https://huggingface.co/datasets/rewardsignal/reddit_writing_prompts
Naive Psychology of Characters in Simple Commonsense Stories – "cause and effect of mental state changes of characters in a story"	2018	Modeling Naive Psychology of Characters in Simple Commonsense Stories
NarrativeQA – movie scripts from various sources and Project Gutenberg books	2018	The NarrativeQA Reading Comprehension Challenge	https://github.com/google-deepmind/narrativeqa	https://huggingface.co/datasets/deepmind/narrativeqa
Deep Dungeons and Dragons (DDD) Corpus – roleplayerguild.com	2018	Deep Dungeons and Dragons: Learning Character-Action Interactions from Role-Playing Game Transcripts
EmotionLines – dialog from the Friends TV show & EmotionPush private chat logs	2018	EmotionLines: An Emotion Corpus of Multi-Party Conversations
RACE (ReAding Comprehension dataset from Examinations)	2017	RACE: Large-scale ReAding Comprehension Dataset From Examinations	https://github.com/qizhex/RACE_AR_baselines	https://huggingface.co/datasets/ehovy/race
CNN/Daily Mail	2016	Abstractive Text Summarization using Sequence-to-sequence RNNs and Beyond		https://huggingface.co/datasets/abisee/cnn_dailymail
FanFiction Archive – fanfiction.net	2016	Beyond Canonical Texts: A Computational Analysis of Fanfiction
ROCStories (ROChester stories) – 5-sentence crowdsourced stories for Story Cloze Test	2016	A Corpus and Cloze Evaluation for Deeper Understanding of Commonsense Stories and LSDSem 2017 Shared Task: The Story Cloze Test
CaTeRS (Causal and Temporal Relation Scheme) – Causal and temporal relations using ROC Stories	2016	CaTeRS: Causal and Temporal Relation Scheme for Semantic Annotation of Event Structures
Character Relations	2015	Annotating Character Relations in Literary Texts
bAbI & The Children’s Book Test (CBT) – kids' books from Project Gutenberg	2015	The Goldilocks Principle: Reading Children’s Books with Explicit Memory Representations and Towards AI-Complete Question Answering: A Set of Prerequisite Toy Tasks	https://github.com/facebookarchive/bAbI-tasks	https://huggingface.co/datasets/facebook/babi_qa
MCTest – 150-300 word stories written by crowdworkers	2013	MCTest: A Challenge Dataset for the Open-Domain Machine Comprehension of Text		https://huggingface.co/datasets/sagnikrayc/mctest
CMU Movie Summary Corpus – Wikipedia movie summaries	2013	Learning Latent Personas of Film Characters
Cornell Movie Dialog – movie scripts and metadata	2011	Chameleons in Imagined Conversations: A New Approach to Understanding Coordination of Linguistic Style in Dialogs	https://convokit.cornell.edu/documentation/movie.html	https://huggingface.co/datasets/cornell_movie_dialog
Various corpora from UCSC's Natural Language and Dialogue Systems (NLDS) lab

Mixed Visual & Textual Datasets and Methods

Dataset	Year	Papers	Paper Code	Hugging Face Link
VP-CSV (Visual Planning-based Character-centric Story Visualization)	2022	Character-centric Story Visualization via Visual Planning and Token Alignment	https://github.com/PlusLabNLP/VP-CSV
VLC Story GAN	2021	Integrating Visuospatial, Linguistic, and Commonsense Structure into Story Visualization	https://github.com/adymaharana/VLCStoryGan
DramaQA – Video Story Understanding on Korean TV Show "Another Miss Oh"	2021	DramaQA: Character-Centered Video Story Understanding with Hierarchical QA	https://github.com/liveseongho/DramaQA
KG-Story	2020	Knowledge-Enriched Visual Storytelling
CP-CSV (Character-Preserving Coherent Story Visualization) – character-based story visualization	2020	Character-Preserving Coherent Story Visualization
StoryGAN – story visualization	2019	StoryGAN: A Sequential Conditional GAN for Story Visualization
COIN (COmprehensive INstructional video analysis)	2019	COIN: A Large-scale Dataset for Comprehensive Instructional Video Analysis	https://github.com/coin-dataset
Pororo-SV – StoryGAN CLEVR dataset	2019	StoryGAN: A Sequential Conditional GAN for Story Visualization		https://huggingface.co/datasets/dhruvrnaik/pororo_storyviz
WikiHow Summarization	2018	WikiHow: A Large Scale Text Summarization Dataset	https://github.com/mahnazkoupaee/WikiHow-Dataset	https://huggingface.co/datasets/wikihow
MovieGraphs – knowledge graphs, images, and descriptions	2018	MovieGraphs: Towards Understanding Human-Centric Situations from Videos
VIST – Visual storytelling data + task	2016	Visual Storytelling
MovieQA	2016	MovieQA: Understanding Stories in Movies through Question-answering	https://github.com/makarandtapaswi/MovieQA_CVPR2016/
BookCorpus	2015	Aligning Books and Movies: Towards Story-Like Visual Explanations by Watching Movies and Reading Books and Skip-thought vectors	https://github.com/ryankiros/skip-thoughts	https://huggingface.co/datasets/bookcorpus

Story Evaluation, Understanding, Cloze Tests, & Benchmarks

Test	Year	Papers
WhoDunIt	2025	WhoDunIt: Evaluation benchmark for culprit detection in mystery stories
MiniCheck – looks up facts across datasets	2024	MiniCheck: Efficient Fact-Checking of LLMs on Grounding Documents
FABLES (Faithfulness Annotations for Book-LEngth Summarization)	2024	FABLES: Evaluating faithfulness and content selection in book-length summarization
HelloBench – long text generation benchmark	2024	HelloBench: Evaluating Long Text Generation Capabilities of Large Language Models
Reflections & Resonance – multiple agents for automatic story annotation	2024	Reflections & Resonance: Two-Agent Partnership for Advancing LLM-based Story Annotation
Do Language Models Enjoy Their Own Stories? – meta analysis of using LLMs for story evaluation	2024	Do Language Models Enjoy Their Own Stories? Prompting Large Language Models for Automatic Story Evaluation
AlignScore – factual consistency	2023	AlignScore: Evaluating Factual Consistency with A Unified Alignment Function
Authorial Leverage – (quality x variations x control) / authorial effort	2023	Towards an Authorial Leverage Evaluation Framework for Expressive Benefits of Deep Generative Models in Story Writing
DeltaScore – fine-grained story evaluation	2023	DeltaScore: Fine-Grained Story Evaluation with Perturbations
CoRRPUS (Code Representations to Reason & Prompt over for Understanding in Stories) – story understanding with code-based prompting	2023	CoRRPUS: Code-based Structured Prompting for Neurosymbolic Story Understanding
UniEval	2022	Towards a Unified Multi-Dimensional Evaluator for Text Generation
StoryER – breaking story evaluation into 3 steps: ranking, rating, reasoning	2022	StoryER: Automatic Story Evaluation via Ranking, Rating and Reasoning
Figurine – Cloze test for idioms and similies	2022	It's not Rocket Science: Interpreting Figurative Language in Narratives
CIS^2 (Contextual Commonsense Inference In Sentence Selection) – commonsense reasoning in stories as a classification task	2022	CIS^2: A Simplified Commonsense Inference Evaluation for Story Prose
Plot-guided Coherence Evaluation	2021	Plot-guided Adversarial Example Construction for Evaluating Open-domain Story Generation
CARP (Contrastive Authoring and Reviewing Pairing)	2021	Cut the CARP: Fishing for zero-shot story evaluation
Choose Your Own Adventure Evaluation – pairwise model human evaluation	2021	Choose Your Own Adventure: Paired Suggestions in Collaborative Writing for Evaluating Story Generation Models
SuQA (SUmmarizer-augmented Question Answering)	2021	Summarize-then-Answer: Generating Concise Explanations for Multi-hop Reading Comprehension
RoFT (Real or Fake Text)	2020	RoFT: A Tool for Evaluating Human Detection of Machine-Generated Text
Do Massively Pretrained Language Models Make Better Storytellers?	2019	Do Massively Pretrained Language Models Make Better Storytellers?
Sentence Mover's Similarity	2019	Sentence Mover’s Similarity: Automatic Evaluation for Multi-Sentence Texts
BookTest – Cloze Test using Project Gutenberg	2017	Embracing Data Abundance
Story Cloze Test	2016	A Corpus and Cloze Evaluation for Deeper Understanding of Commonsense Stories and LSDSem 2017 Shared Task: The Story Cloze Test
Who-did-What – Cloze Test using LDC English Gigaword newswire corpus	2016	Who did What: A Large-Scale Person-Centered Cloze Dataset
Narrative Cloze Test	2008	Unsupervised Learning of Narrative Event Chains

Data Scrapers & Processors

Dataset	Info
ScreenPy	parsing and annotating screenplays
TropeScraper	films and corresponding tropes from TV Tropes
Novel Chapter Summaries	full book chapters and their summaries
Archive of Our Own Scraper	scraper for Archive of Our Own fanfiction
Fanfiction Scraper	scraper for fanfiction.net
BookNLP	process your own book data
Newspaper3k	newspaper scraper Python library
Homemade BookCorpus	recreation of BookCorpus

Interactive Fiction Environments

Environment	Year	Papers
TALES	2025	TALES: Text Adventure Learning Environment Suite
Word2World	2024	Word2World: Generating Stories and Worlds through Large Language Models
JerichoWorld – Text-to-KG dataset	2021	Modeling Worlds in Text
Jericho – RL simulator	2020	Interactive Fiction Games: A Colossal Adventure
WorldGeneration	2020	Bringing Stories Alive: Generating Interactive Fiction Worlds
NetHack	2020	The NetHack Learning Environment
Camelot – visualizing IF with 3D models	2020	Camelot: a modular customizable sandbox for visualizing interactive narratives
LIGHT (Learning in Interactive Games with Humans and Text)	2019	Learning to Speak and Act in a Fantasy Text Adventure Game
TextWorld	2018	TextWorld: A Learning Environment for Text-based Games

Interactive Fiction Agents

Agent	Year	Papers
ChatRPG	2025	Static Vs. Agentic Game Master AI for Facilitating Solo Role-Playing Experiences
Generative Agents	2023	Generative Agents: Interactive Simulacra of Human Behavior
Playing text-based adventure games with SAC	2023	Learning to Play Text-Based Adventure Games with Maximum Entropy Reinforcement Learning
AFK (Asking for Knowledge)	2022	Asking for Knowledge: Training RL Agents to Query External Knowledge Using Language
MPRC-DQN (Multi-Paragraph Reading Comprehension DQN)	2020	Interactive Fiction Game Playing as Multi-Paragraph Reading Comprehension with Reinforcement Learning
KG-A2C	2020	Graph Constrained Reinforcement Learning for Natural Language Action Spaces
Q*BERT	2020	How to Avoid Being Eaten by a Grue: Structured Exploration Strategies for Textual Worlds
KG-DQN	2019	Playing Text-Adventure Games with Graph-Based Deep Reinforcement Learning
NAIL (Navigate Acquire Interact Learn)	2019	NAIL: A General Interactive Fiction Agent

Story Planning Systems

Planner	Year	Papers
PDDLEGO - itarative plan construction with PDDL and LLMs	2024	PDDLEGO: Iterative Planning in Textual Environments
Sabre - next-gen Glaive	2021	Sabre: A Narrative Planner Supporting Intention and Deep Theory of Mind
Winnow - "declarative domain-specific query language for story sifting"	2021	Winnow: A Domain-Specific Language for Incremental Story Sifting
Belief and Intentional PDDL	2020	Using Domain Compilation to Add Belief to Narrative Planners
StoryAssembler - "a narrative system for procedurally generating choice-based interactive narratives"	2019	StoryAssembler: An Engine for Generating Dynamic Choice-Driven Narratives
Felt - "simple story sifting and simulation engine for emergent narrative play experiences"	2019	Felt: A Simple Story Sifter
Partial Order Causal-Link (POCL) Planner (Python)	2019
Recurve (C++) - decompositional planner	2016
Glaive - a fast planner for multi-agent stories	2014	Glaive: a state-space narrative planner supporting intentionality and conflict
STRIPS Planner (Python)	2012

Story Generation Code

Code	Year	Papers
Temporal Entity Annotator (TEA) - extracting narrative timelines	2017	Temporal Information Extraction for Question Answering Using Syntactic Dependencies in an LSTM-based Architecture
IBSEN - director-actor agent collaboration	2024	IBSEN: Director-Actor Agent Collaboration for Controllable and Interactive Drama Script Generation
Collaborative Comic Generation	2024	Collaborative Comic Generation: Integrating Visual Narrative Theories with AI Models for Enhanced Creativity
LegalStories - generating stories and questions about legal concepts	2024	Leveraging Large Language Models for Learning Complex Legal Concepts through Storytelling
DRAGN Town Quests	2023	Personalized Quest and Dialogue Generation in Role-Playing Games: A Knowledge Graph- and Language Model-based Approach
ConPer - personality-focused characters in stories	2022	Persona-Guided Planning for Controlling the Protagonist’s Persona in Story Generation
DOC (Detailed Outline Control) - generate stories from outlines with OPT	2023	DOC: Improving Long Story Coherence With Detailed Outline Control
Re^3 (Recursive Reprompting and Revision) - code to generate stories from premises with GPT-3	2022	Re3: Generating Longer Stories With Recursive Reprompting and Revision
COINS (COntextualized Inference Rules for Narrative Story Completion)	2021	COINS: Dynamically Generating COntextualized Inference Rules for Narrative Story Completion
C2PO (Causal, Commonsense Plot Ordering)	2021	Automated Storytelling via Causal, Commonsense Plot Ordering
TD-VAE for Story Generation	2021	A Temporal Variational Model for Story Generation
Story Gen BART	2020	Content Planning for Neural Story Generation with Aristotelian Rescoring
Story Realization	2020	Story Realization: Expanding Plot Events into Sentences
Infilling by Language Modeling (ILM)	2020	Enabling Language Models to Fill in the Blanks
Switching Linear Dynamical System (SLDS)	2020	Generating Narrative Text in a Switching Dynamical System
Label Semantics for Predicting Emotional Reactions	2020	Modeling Label Semantics for Predicting Emotional Reactions
Paranoid Transformer	2020	Paranoid Transformer: Reading Narrative of Madness as Computational Approach to Creativity
SoCP (Storytelling of multi-Character Psychology)	2020	Controllable Multi-Character Psychology-Oriented Story Generation
PlotMachines	2020	PlotMachines: Outline-Conditioned Generation with Dynamic Plot State Tracking
Toward Better Storylines with Sentence-Level Language Models	2020	Toward Better Storylines with Sentence-Level Language Models
AI Dungeon 2	2019
Plan-And-Write	2019	Plan-and-Write: Towards Better Automatic Storytelling
EnGen	2018	Neural text generation in stories using entity representations as context
ASTER (Automated Story-Telling using Event Representations)	2018	Event Representations for Automated Story Generation with Deep Neural Nets
Talk of the Town	2016	Characters Who Speak Their Minds: Dialogue Generation in Talk of the Town and Simulating Character Knowledge Phenomena in Talk of the Town
Creative Help	2015	Creative Help: A Story Writing Assistant

Libraries & Toolkits

Library	Info
OpenAI	GPT-3, ChatGPT, GPT-4
Hugging Face	Hugging Face provides state-of-the-art general-purpose neural language model architectures like BERT, GPT-2, and others.
Hugging Face Transformer Library
AllenNLP	Deep learning for NLP with state of the art models
Spacy	"Industrial-Strength Natural Language Processing" in Python
NLTK - Natural Language Toolkit	Basic NLP tools for Python & interfacing with some external models
Stanford NLP	various NLP models in Java
Stanza	Stanford NLP for Python
ConvKit	Cornell Conversation Analysis Toolkit
Open IE	information extraction on sentences

Knowledge Bases & Commonsense Reasoning

Knowledge Base	Year	Papers	Hugging Face Link
DoWhy - causal inference	2020	DoWhy: An End-to-End Library for Causal Inference
CultureBank	2024	CultureBank: An Online Community-Driven Knowledge Base Towards Culturally Aware Language Technologies	https://huggingface.co/collections/SALT-NLP/culturebank-6626ee7dcd54f5fffb6769fe
Riveter 💪 - extract connotation frames	2023	Connotation Frames of Power and Agency in Modern Films and Riveter: Measuring Power and Social Dynamics Between Entities
(Comet-) ATOMIC 2020 - updated ATOMIC	2021	COMET-ATOMIC 2020: On Symbolic and Neural Commonsense Knowledge Graphs
SCRUPLES (Subreddit Corpus Requiring Understanding Principles in Life-like Ethical Situations) - ethical judgements	2021	SCRUPLES: A Corpus of Community Ethical Judgments on 32,000 Real-life Anecdotes
COLLIE-V (Comprehensive OntoLogy and Lexicon In English - Verb)	2020	A Broad-Coverage Deep Semantic Lexicon for Verbs
Eraser - Movie Rationales	2020	ERASER: A Benchmark to Evaluate Rationalized NLP Models	https://huggingface.co/datasets/eraser-benchmark/movie_rationales
GLUCOSE (GeneraLized and COntextualized Story Explanations)	2020	GLUCOSE: GeneraLized and COntextualized Story Explanations	https://huggingface.co/datasets/glucose
CausalBank	2020	Guided Generation of Cause and Effect
PeKo (Precondition Knowledge) - event preconditions	2020	Modeling Preconditions in Text with a Crowd-sourced Dataset
Social Chemistry	2020	Social Chemistry 101: Learning to Reason about Social and Moral Norms
ATOMIC (ATlas Of MachIne Commonsense) - social commonsense reasoning	2019	ATOMIC: An Atlas of Machine Commonsense for If-Then Reasoning	https://huggingface.co/datasets/atomic
COMET (COMmonsEnse Transformers) - uses ATOMIC and ConceptNet	2019	COMET: Commonsense Transformers for Automatic Knowledge Graph Construction
HellaSwag (Harder Endings, Longer contexts, and Low-shot Activities for Situations With Adversarial Generations) - commonsense inference (harder SWAG)	2019	HellaSwag: Can a Machine Really Finish Your Sentence?	https://huggingface.co/datasets/hellaswag
CLUTRR (Compositional Language Understanding with Text-based Relational Reasoning)	2019	CLUTRR: A Diagnostic Benchmark for Inductive Reasoning from Text	https://huggingface.co/datasets/CLUTRR/v1
SWAG (Situations With Adversarial Generations) - NLI from video captions	2018	SWAG: A Large-Scale Adversarial Dataset for Grounded Commonsense Inference	https://huggingface.co/datasets/allenai/swag
VerbNet	2005, 2018	VerbNet: A Broad-Coverage, Comprehensive Verb Lexicon and Integrating Generative Lexicon Event Structures into VerbNet
ECIpedia (Elementary Composable Ideas) - ECIs from DARPA's Communicating with Computers program	2018	Assembling the ECIpedia: Refining concepts in contex
ConceptNet 5	2017	ConceptNet 5.5: An Open Multilingual Graph of General Knowledge	https://huggingface.co/datasets/conceptnet5
Open-SESAME (open-source SEmi-markov Softmax-margin ArguMEnt parser) - FrameNet frame extraction with RNNs	2017	Frame-Semantic Parsing with Softmax-Margin Segmental RNNs and a Syntactic Scaffold
FrameNet	1997, 2016	FrameNet II: Extended Theory and Practice	https://paperswithcode.com/dataset/framenet
The NOC (Non-Official Characterization) List - pop culture knowledge	2016	Round Up The Usual Suspects: Knowledge-Based Metaphor Generation
VADER (Valence Aware Dictionary and sEntiment Reasoner)	2014	VADER: A Parsimonious Rule-based Model for Sentiment Analysis of Social Media Text
NU-LEX - combines WordNet, VerbNet, and Wiktionary	2011	NULEX: An Open-License Broad Coverage Lexicon
WordNet - hierarchical dictionary	1985, 1998, 2005	WordNet: An Electronic Lexical Database

Extras

Programming Languages & Authoring Tools for Writing Interactive Fiction

Language	Repo
Expressionist
Adventuron Classroom
Tracery	https://github.com/galaxykate/tracery/tree/tracery2
ink/inklewriter	https://github.com/inkle/ink
Twine	https://github.com/tweecode/twine
Inform 7	https://github.com/ganelson/inform
ADRIFT
TADS
Quest
Storychoices
Varytale
Adventure Game Toolkit (AGT)
Professional Adventure Writer (PAW)
Pangur

Notable IF Games

Game	Year	Authors	Papers
Judith van Stegeren's list of games that use Natural Language Generation
Agence	2020	Transitional Forms	Agence: a dynamic film exploring multi-agent systems and human agency
Hammurabi	2017	LabLabLab	Striving for author-friendly procedural dialogue generation
The Icebound Concordance	2014	Aaron Reed & Jacob Garbe	Ice-Bound: Combining Richly-Realized Story with Expressive Gameplay
Save the Date	2013	Chris Cornell (Montolli)
Prom Week	2012	Josh McCoy, Mike Treanor, Ben Samuel, Aaron A. Reed, Noah Wardrip-Fruin, & Michael Mateas	Prom week
Curveship	2009	Nick Montfort	Curveship: An Interactive Fiction System for Interactive Narrating Curveship's Automatic Narrative Style
Façade	2005	Michael Mateas & Andrew Stern	Façade: An experiment in building a fully-realized interactive drama Integrating Plot, Character and Natural Language Processing in the Interactive Drama Façade
The Gostak	2001	Carl Muckenhoupt
Galatea	2000	Emily Short
Anchorhead	1998	Michael S. Gentry

RPG/IF Inspiration

Name	Info
Polygon's Favorite Actual Play Podcasts	Personal recommendation: The Adventure Zone
Actual Play Podcasts
Roll 20	Play tabletop games with friends virtually
chooseyourstory.com
AI Dungeon
Interactive Fiction on Itch.io	Find cool indie IF games
Interactive Fiction Database	IMDb for IF
Interactive Fiction Wiki

Related Courses

Course	Taught By	Year
Interactive Narrative	Nick Montfort	2019 (Fall)
Interactive Fiction and Text Generation	Lara J. Martin & Chris Callison-Burch	2022 (Spring)
AI Storytelling in Virtual Worlds	Mark Riedl	2022 (Spring)
Computational Poetics	Kathy Wu	2021 (Spring & Fall)

Generators for TRPGs and IF

Name	Info
Picrew	Make customizable character images
Fantasy Map Generator
RPG Tinker	D&D 5e NPC Generator
AnyDice	Dice Probability Calculator
Print graph paper	Just blank graph paper!
donjon	Random generators for tabletop games
RPG Maps in Wolfram Language	Code to tile hex pieces together to make a map
RPG Map Editor 2	Downloadable app for making maps
RPGgen	A collection of generators

Various Tools

Name	Info
Versu	"an engine for telling interactive stories about people"
WOOL	"dialogue platform for creating virtual agent conversations"
Sudo Write	"Bust writer’s block with our magical writing AI."
Verse by Verse	"An experimental AI-powered muse that helps you compose poetry inspired by classic American poets"

Lara's Storytelling Resources