Skip to content

datasittersclub/dsc23

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

2 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

DSC23 Repository

This repository contains three projects that were showcased in issue 23 of the Data Sitter's Club by Anastasia Salter and John Murray.

Projects Overview

Author: @amsucf (Anastasia Salter)

A nostalgic 90s-style interactive website showcasing technology mentions from The Baby-Sitters Club book series. Features authentic retro aesthetics with a CRT monitor display and classic BSC vibes.

Key Features:

  • Authentic 90s web design with CSS animations
  • Interactive technology term explorer
  • Auto-rotating mentions with keyboard navigation
  • Responsive design for all devices
  • GitHub Pages ready deployment

Files:

  • index.html - Main retro website
  • styles.css - 90s aesthetic styling
  • script.js - Interactive functionality
  • concordance.json - Technology mention data

Author: @lucidbard (John Murray)

A complete speaker diarization pipeline that identifies "who spoke when" in audio recordings using WhisperX for transcription and PyAnnote.audio for speaker identification. Includes automatic correction for common transcription errors and handles natural conversation dynamics.

Key Features:

  • WhisperX + PyAnnote integration for accurate speaker identification
  • Automatic transcription error correction
  • Web interface for easy audio processing
  • Support for both CPU and GPU processing
  • Comprehensive output formatting

Author: @lucidbard (John Murray)

A comprehensive workflow for analyzing narrative structure, character goals, and scene segmentation in literary corpora. This project provides both a Jupyter notebook for processing and annotation, and an interactive HTML dashboard for visualization. There is also a sample dataset extracted from an analysis of Baby Sitter's Club books using openai-oss.

Key Features:

  • Scene segmentation and narrator identification
  • Character goal and conflict annotation using LLM analysis
  • Interactive D3.js visualization dashboard
  • Support for multiple LLM providers (Anthropic, OpenAI, Ollama)
  • Project Gutenberg integration for text acquisition

Getting Started

Each project has its own README with detailed setup instructions:

Repository Structure

dsc23/
├── README.md                    # This file
├── bsc-corpus-analysis/         # Literary corpus analysis tools
├── speaker-diarization/         # Audio speaker identification
└── dsc-retro-tech/             # 90s-style tech concordance

Releases

No releases published

Packages

 
 
 

Contributors