Genna (GENerative ANNotation) is an LLM-centric annotation system for efficiently annotating text and comparing AI model outputs. Genna makes it easy to try out different LLMs, prompts, and temperatures, and to scale up structured annotation calls (a minimal sketch of such a call follows the feature list below).
- 🤖 AI Model Integration: Configure multiple AI models as annotators, with support for OpenAI, Anthropic, and Ollama
- 📊 Custom Dataset Management: Evolve your dataset by adding and removing files while keeping track of metrics
- 📋 Human Annotation Interface:
- Interactive annotation with thumbs up/down feedback
- Real-time disagreement detection between AI models
- Expandable annotation view for each row
- Dark mode support for comfortable viewing
- 🔍 Smart Filtering:
- Filter by content across any column
- Show only rows with AI model disagreements
- Category-based multi-filtering
- 📈 Annotation Analysis:
- Visual indicators for model agreements/disagreements
- Summary view showing differences between model outputs
- Quick toggle to show/hide all annotations
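To make the "structured calls" mentioned above concrete, the sketch below shows the kind of request an annotator issues. It uses the OpenAI Python client directly, with placeholder model, prompt, and temperature values; it is an illustration only, not Genna's internal code, and the Anthropic and Ollama integrations work analogously.

```python
# Illustration only: the kind of structured call an annotator makes.
# Model, prompt, and temperature are placeholders, not Genna's defaults.
from openai import OpenAI

client = OpenAI()  # reads OPENAI_API_KEY from the environment

response = client.chat.completions.create(
    model="gpt-4o-mini",   # swap in any supported model
    temperature=0.2,       # lower values give more deterministic labels
    messages=[
        {"role": "system", "content": "You are a text annotator. Reply with JSON only."},
        {"role": "user", "content": 'Label the sentiment (positive/negative) of: "Great product!"'},
    ],
)
print(response.choices[0].message.content)
```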
- Create an environment and install the requirements:
```bash
conda create --name genna python
conda activate genna
pip install -r requirements.txt
```
- Start the application
```bash
python app.py
```
- Create a new project
- Upload your CSV file
- Configure column settings:
- Show: Columns to display
- Label: Columns to annotate
- Content: Columns containing text to annotate
- Filter: Category columns for filtering
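For illustration, these column settings amount to a mapping like the one below. The column names and the exact keys are hypothetical, not Genna's on-disk format.

```python
# Hypothetical column configuration for a product-review CSV.
column_settings = {
    "show":    ["review_id", "product"],  # columns displayed in the table
    "label":   ["sentiment", "is_spam"],  # columns the annotators fill in
    "content": ["review_text"],           # text passed to the AI models
    "filter":  ["category"],              # category columns used for filtering
}
```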
- Go to project settings
- Click "Add New Model"
- Configure the annotator:
- Give it a unique name
- Select the AI model
- Set temperature
- Write base prompt
- Configure label prompts
- Save the annotator
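Taken together, an annotator configuration carries roughly the following information. The dictionary below is a hypothetical sketch to make the fields concrete, not the format Genna stores.

```python
# Hypothetical annotator configuration; all values are illustrative.
annotator = {
    "name": "gpt4-sentiment-v1",   # unique name for this annotator
    "model": "gpt-4",              # any configured OpenAI, Anthropic, or Ollama model
    "temperature": 0.0,            # deterministic labelling
    "base_prompt": "You are labelling customer reviews.",
    "label_prompts": {
        "sentiment": "Classify the sentiment as positive, negative, or neutral.",
        "is_spam": "Answer true if the review is spam, otherwise false.",
    },
}
```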
- Select a file to annotate
- Use the expand/collapse buttons to view annotations
- Review AI model outputs
- Provide feedback using thumbs up/down
- Use filters to focus on specific content:
- Text search in any column
- Show only rows with model disagreements
- Category-based filtering
- Toggle dark mode for comfortable viewing
- Use bulk actions to show/hide all annotations
- Create a judge model the same way you create an annotator
- When running the judge:
- Click "Annotate" on the judge
- Select two annotators to compare
- The judge will evaluate their annotations
- Results appear in the scores section
- Files are automatically organized by project
- Annotations stored separately from source files
- Easy deletion of files when needed
- Category-based filtering for better organization
- Create and manage multiple annotation projects
- Upload CSV files to projects
- Configure which columns to show, label, use as content, or filter
- Set project objectives and annotation guidelines
- Configure AI models (such as GPT-3 or GPT-4) as annotators or judges
- Customize model parameters:
- Temperature for controlling randomness
- Custom prompts for different annotation tasks
- Label types (text, boolean, numeric)
Annotators
- AI models configured to annotate text
- Each annotator has a unique name and configuration
- Can handle multiple label types per annotation
- Annotations are stored separately from source data
- Real-time visual feedback for model agreements/disagreements
- Interactive thumbs up/down interface for manual annotation
Annotation Interface
- Expandable rows showing all model annotations
- Summary row indicating differences between model outputs
- Dark mode support for reduced eye strain
- Quick filters to show only rows with disagreements
- Bulk show/hide annotations across all rows
Judges
- Special AI models that evaluate annotations
- Compare annotations from two different annotators
- Selected dynamically at annotation time
- Help evaluate annotation quality and consistency
Filtering and Categories
- Filter data using category columns
- Multi-select filtering capabilities
- Organize and manage annotations by categories
- Quick text search within any column
Each project is organized on disk as follows:

```
project_name/
├── data/
│   ├── source_files/
│   │   └── uploaded_csv_files
│   └── annotations/
│       └── annotator_results
├── settings/
│   └── project_settings.json
└── models/
    └── model_configurations
```
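Assuming this layout, a project's files can be located with standard path handling. The directory names below mirror the tree above; the `*.csv` pattern for source files is an assumption.

```python
from pathlib import Path

project = Path("project_name")

# Source CSVs uploaded to the project
source_files = sorted((project / "data" / "source_files").glob("*.csv"))

# Annotation results produced by the configured annotators
annotation_files = sorted((project / "data" / "annotations").glob("*"))

print(f"{len(source_files)} source file(s), {len(annotation_files)} annotation result(s)")
```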
Annotations are stored in a structured format:
```json
{
  "annotator_id": "unique_id",
  "timestamp": "ISO_timestamp",
  "labels": {
    "column1": "value1",
    "column2": true,
    "column3": 5
  }
}
```
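Given records in this shape, spotting a disagreement between two annotators comes down to comparing their label values column by column. The helper below is a hypothetical illustration of that check, not Genna's internal code.

```python
def find_disagreements(ann_a: dict, ann_b: dict) -> dict:
    """Return the labels on which two annotation records (as shown above) differ."""
    labels_a, labels_b = ann_a["labels"], ann_b["labels"]
    return {
        column: (labels_a.get(column), labels_b.get(column))
        for column in set(labels_a) | set(labels_b)
        if labels_a.get(column) != labels_b.get(column)
    }

# Example: the two annotators disagree on "column2" only.
a = {"annotator_id": "gpt4-v1", "labels": {"column1": "value1", "column2": True}}
b = {"annotator_id": "claude-v1", "labels": {"column1": "value1", "column2": False}}
print(find_disagreements(a, b))  # {'column2': (True, False)}
```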
Judges compare annotations by:
- Analyzing both annotators' results
- Evaluating consistency and quality
- Providing numerical scores
- Highlighting discrepancies
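The resulting scores could be recorded along the lines of the record below; the shape and field names are hypothetical and shown only to make the judge workflow concrete.

```python
# Hypothetical judge output comparing two annotators on one file.
judge_result = {
    "judge": "gpt4-judge",
    "compared": ["gpt4-sentiment-v1", "claude-sentiment-v1"],
    "scores": {"gpt4-sentiment-v1": 0.86, "claude-sentiment-v1": 0.79},
    "discrepancies": ["row 12: sentiment", "row 31: is_spam"],
}
```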
Project Setup
- Clear project objectives
- Well-defined annotation guidelines
- Consistent label schemas
Model Configuration
- Descriptive model names
- Clear, specific prompts
- Appropriate temperature settings
Annotation Review
- Use the disagreement filter to focus on problematic cases
- Review all model outputs before providing feedback
- Pay attention to the summary indicators
- Regular evaluation using judges
Data Management
- Regular backups
- Clear category organization
- Periodic cleanup of unused files
- Use filters effectively to manage large datasets