AnnotationStandard


Librecode Annotation Standard

Document Version: 2.0
Last Updated: 2025-10-01
Target Audience: Session recorders, annotators, reviewers


Table of Contents

  1. Glossary
  2. Overview & Purpose
  3. The Annotation Process
  4. Annotation Standard
  5. Application Guidelines
  6. Technical Specification
  7. Quick Reference
  8. Complete Worked Example
  9. Appendices

Glossary

Session: An asciinema recording of a terminal session, typically showing the deployment or configuration of software.

Annotated Session: A session that has structured annotations added to it, conforming to this specification.

Recording: The raw asciinema capture of terminal activity.

Annotation: Structured metadata added to a recording that describes what is happening, why, and the outcome.

Timeline: A hierarchical structure in the annotator tool where annotations are organized. Different goal levels exist on separate timelines.

Goal Hierarchy: The nested structure of goals, subgoals, and sub-subgoals that describe the overall task and its component steps.

Annotation Tag: A specific marker type (e.g., goal, mistake, successResult) used to categorize parts of the session.

Person A (Recorder): The individual(s) who created the original terminal session recording.

Person B (Annotator): The individual who interviews Person A and adds structured annotations to the recording.

.session File: A zip file containing the annotated recording, audio transcriptions, and metadata conforming to this specification.


Overview & Purpose

What are Annotated Sessions?

An annotated session is an asciinema recording of terminal activity (typically software deployment) that has been enriched with structured annotations describing:

  • What actions were performed
  • Why those actions were taken
  • Whether goals were achieved
  • What mistakes occurred and their consequences
  • What discoveries changed the approach

Why Create Them?

The Librecode project aims to train Large Language Models (LLMs) to understand and work with terminal recordings. Annotated sessions serve as training data, teaching models to:

  • Recognize common deployment patterns
  • Understand error recovery strategies
  • Learn from human problem-solving approaches
  • Eventually perform automatic annotation

Once sufficient manual annotations exist, trained models can automatically annotate new sessions, enabling advanced tooling like automated documentation generation.

Document Purpose

This document provides:

  1. An annotation process for creating annotated sessions
  2. An annotation standard defining what to annotate and how
  3. Guidelines for annotators to ensure consistency and quality

The Annotation Process

The process of creating an annotated session involves four distinct phases:

Phase 1: Recording → Phase 2: Interview → Phase 3: Annotation → Phase 4: Submission

Phase 1: Recording

Objective: Create a terminal recording of meaningful software deployment activity.
Who: Person A (one or more individuals)

Steps:

  1. Set up the recording environment:
    • Use a Debian Stable VM (see Appendix D)
    • Upgrade to unstable if required packages are missing
  2. Start the asciinema recording client
  3. Attempt to deploy a piece of GNU software
  4. Keep all input/output inside the terminal:
    • Use CLI web tools (curl, wget, lynx, etc.)
  5. Stop recording when deployment completes

Output: .cast asciinema recording file
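
For example, the recording in steps 2-5 might be captured as follows. This is a minimal sketch assuming the standard asciinema CLI is installed; the file name deploy-example.cast is illustrative only:

  # Start capturing the terminal session to a .cast file
  asciinema rec deploy-example.cast

  # ... perform the entire deployment inside this shell,
  # using CLI web tools (curl, wget, lynx) where needed ...

  # Stop the recording by exiting the recorded shell
  exit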

Common Issues:

  • Using GUI tools
  • Stopping recording too early

Phase 2: Interview

Objective: Ensure the annotator fully understands what was done and why.
Who: Person A and Person B

Steps:

  1. Record audio of both sides separately
  2. Review the recording together
  3. Explain what was done and why
  4. Record follow-up clarification conversations if needed

Output:

  • Audio recordings
  • Full understanding of actions and rationale
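
As a sketch of step 1, each side can be captured on its own machine with any CLI audio recorder. The commands below assume ALSA's arecord is available; the file names are illustrative only:

  # Person A records their own microphone (stop with Ctrl-C)
  arecord -f cd interview-person-a.wav

  # Person B records theirs separately
  arecord -f cd interview-person-b.wav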

Why Separate Audio Recordings?

  • Better transcription quality
  • Clear speaker attribution
  • Higher-quality training data

Phase 3: Annotation

Objective: Add structured annotations using the annotator tool.
Who: Person B

Steps:

  1. Load the recording
  2. Create timelines for goal hierarchy levels
  3. Apply annotations per standard
  4. Review completeness
  5. Note spec version used

Output: Annotated recording
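
As a purely hypothetical illustration of what an annotation carries (the authoritative tags and fields are defined in the Annotation Standard and Technical Specification sections), a single goal annotation might pair a tag with a time range and a description:

  # Hypothetical sketch only; field names are illustrative, not normative
  annotations:
    - tag: goal
      timeline: 1          # goal-hierarchy level; one timeline per level
      start: 12.5          # seconds into the recording (assumed unit)
      end: 87.2
      text: Install and configure the target software
    - tag: mistake
      start: 45.0
      end: 52.3
      text: Ran the installer without root privileges; the command failed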


Phase 4: Submission

Objective: Package and submit the session.

Steps:

  1. Transcribe audio with Whisper
  2. Create .session archive
  3. Include:
    • session.yaml
    • recording.asciinema
    • Audio + transcriptions
  4. Upload to repository
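
A minimal sketch of steps 1-3, assuming the openai-whisper CLI and zip are installed; the interview file names and the archive name are illustrative only:

  # Transcribe each speaker's audio track separately
  whisper interview-person-a.wav --model small --output_format txt
  whisper interview-person-b.wav --model small --output_format txt

  # Package everything into the .session archive (a plain zip file)
  zip example.session \
      session.yaml \
      recording.asciinema \
      interview-person-a.wav interview-person-a.txt \
      interview-person-b.wav interview-person-b.txt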

---