A customer engineering application that automatically triages and responds to GitHub issues on public repositories (defaulting to LangChain). The application classifies issues, assigns severity, and provides relevant documentation-based responses. It also supports comprehensive evaluation using LangSmith.
The application processes support queries through a series of chains (a minimal code sketch follows the list):
- Issue Type Classification: Identifies if the query is a bug report, feature request, or support question
- Severity Assessment: Assigns a severity score (1-4) based on impact
- Category Classification: Categorizes the query (setup, chains, agents, memory, retrieval, other)
- Documentation Retrieval: Finds relevant documentation from LangChain docs
- Response Generation: Provides a helpful response with documentation links
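
As a rough illustration of the first three steps, the classification stages can be expressed as a single structured-output chain. This is a minimal sketch assuming `langchain-openai` is installed; the schema fields, model name, and prompt wording are illustrative, not the exact definitions in `chains.py`:

```python
# Minimal sketch of the triage stages as one structured-output chain.
# Field names, model, and prompt are assumptions, not the real chains.py.
from pydantic import BaseModel, Field
from langchain_openai import ChatOpenAI
from langchain_core.prompts import ChatPromptTemplate

class Triage(BaseModel):
    issue_type: str = Field(description="bug report | feature request | support question")
    severity: int = Field(description="severity score from 1 to 4, based on impact")
    category: str = Field(description="setup | chains | agents | memory | retrieval | other")

llm = ChatOpenAI(model="gpt-4o-mini", temperature=0)
prompt = ChatPromptTemplate.from_messages([
    ("system", "You triage GitHub issues for the LangChain repository."),
    ("human", "Classify this issue:\n\n{issue_text}"),
])
triage_chain = prompt | llm.with_structured_output(Triage)

result = triage_chain.invoke({"issue_text": "pip install langchain fails on Python 3.12"})
print(result.issue_type, result.severity, result.category)
```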
The application processes issues from a CSV file (`langchain_issues_dataset.csv`) built by pulling issues from the GitHub API. Each issue is analyzed and classified, with support questions receiving documentation-based responses.
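
The per-issue loop might look like the following, reusing `triage_chain` from the sketch above; the `description` and `url` column names are assumptions based on the dataset notes at the end of this README:

```python
# Illustrative loop over the dataset; column names are assumed.
import pandas as pd

issues = pd.read_csv("langchain_issues_dataset.csv")
for _, row in issues.iterrows():
    result = triage_chain.invoke({"issue_text": row["description"]})
    print(row["url"], "->", result.issue_type, result.severity)
```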
- Create a `.env` file with your API keys:

```
OPENAI_API_KEY=your_openai_key
LANGSMITH_API_KEY=your_langsmith_key
LANGSMITH_PROJECT=your_project_name
GITHUB_TOKEN=your_github_token  # Optional, for higher rate limits
```
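
If `python-dotenv` is among the dependencies (an assumption; check `requirements.txt`), the scripts can load these variables with:

```python
from dotenv import load_dotenv
load_dotenv()  # reads .env from the working directory into os.environ
```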
- Install dependencies:

```
pip install -r requirements.txt
```

- Run `python chains.py` to process issues from the dataset and print out the triage and response results for each issue.
- Run `python eval.py` to execute the comprehensive evaluation workflow using LangSmith, applying multiple LLM-as-a-judge evaluators to all outputs.
- After completion, a link to the experiment will be printed. Open this link to view and compare results in the LangSmith UI.
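
In outline, `eval.py` likely follows the LangSmith `evaluate()` pattern sketched below. This is a hedged sketch, not the actual script: the dataset name, experiment prefix, and output keys are assumptions, and only one evaluator is shown:

```python
# Sketch of the LangSmith evaluation pattern; names are illustrative.
from langsmith import evaluate

def issue_type_accuracy(run, example):
    # Compare the predicted issue type against the dataset label.
    predicted = run.outputs.get("issue_type")
    expected = example.outputs.get("issue_type")
    return {"key": "issue_type_accuracy", "score": int(predicted == expected)}

def target(inputs: dict) -> dict:
    result = triage_chain.invoke(inputs)  # triage_chain from the sketch above
    return result.model_dump()            # expose fields as output keys

results = evaluate(
    target,
    data="langchain-issues",  # LangSmith dataset name (assumed)
    evaluators=[issue_type_accuracy],
    experiment_prefix="issue-triage",
)
```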
- Run `python vectorize_docs.py` to build or update the vector store of LangChain documentation used for retrieval in the main chain.
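
One plausible shape for that script, assuming FAISS and OpenAI embeddings (the actual loader, splitter, and store in `vectorize_docs.py` may differ):

```python
# Hypothetical vector-store build; source URL and parameters are assumed.
from langchain_community.document_loaders import WebBaseLoader
from langchain_community.vectorstores import FAISS
from langchain_openai import OpenAIEmbeddings
from langchain_text_splitters import RecursiveCharacterTextSplitter

docs = WebBaseLoader("https://python.langchain.com/docs/introduction/").load()
chunks = RecursiveCharacterTextSplitter(
    chunk_size=1000, chunk_overlap=100
).split_documents(docs)
FAISS.from_documents(chunks, OpenAIEmbeddings()).save_local("langchain_docs_index")
```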
The application includes a comprehensive evaluation system with multiple evaluators:
- Issue Type Accuracy: Evaluates the accuracy of issue type classification
- Severity Accuracy: Assesses the correctness of severity assignments
- Response Action Accuracy: Evaluates if the response correctly addresses the issue
- Tone Appropriateness: Assesses professionalism, empathy, clarity, and positivity
- Response Completeness: Evaluates technical details, explanation quality, and references
- Technical Accuracy: Assesses code references, documentation usage, and terminology
- Relevance Score: Evaluates how relevant retrieved documents are to the issue
- Coverage Score: Assesses if retrieved documents cover all necessary information
All evaluators use LLM-as-a-judge for robust, context-aware scoring and provide detailed explanations for their assessments.
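
A single LLM-as-a-judge evaluator in this style might look like the sketch below. GPT-4 as the judge model comes from the notes at the end of this README; the prompt wording, score scale, and output key names are assumptions:

```python
# Illustrative LLM-as-a-judge evaluator; prompt and scale are assumed.
from pydantic import BaseModel, Field
from langchain_openai import ChatOpenAI

class ToneGrade(BaseModel):
    score: float = Field(description="0-1: professionalism, empathy, clarity, positivity")
    explanation: str = Field(description="brief justification for the score")

judge = ChatOpenAI(model="gpt-4", temperature=0).with_structured_output(ToneGrade)

def tone_appropriateness(run, example):
    grade = judge.invoke(
        "Rate the tone of this support response for professionalism, "
        "empathy, clarity, and positivity:\n\n"
        f"{run.outputs.get('response')}"
    )
    return {"key": "tone_appropriateness", "score": grade.score, "comment": grade.explanation}
```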
- `chains.py`: Main application logic and chain definitions
- `eval.py`: Comprehensive evaluation pipeline using LangSmith
- `vectorize_docs.py`: Vector store setup for LangChain documentation
- `get_github_issues.py`: GitHub issue fetching and dataset creation
- `langchain_issues_dataset.csv`: Sample issues for testing
- `.env`: Environment variables for API keys
- `requirements.txt`: Python dependencies
- Make sure your dataset (`langchain_issues_dataset.csv`) includes both the issue description and URL for each example.
- All evaluation results and experiment comparisons are available in the LangSmith UI.
- You can add or modify evaluators in `eval.py` as needed for your use case.
- The system uses GPT-4 for evaluation to ensure high-quality assessments.