Socrates AI - Philosophical Dialogue Application

An AI-powered web application that engages users in Socratic dialogue using LLM APIs (Anthropic Claude, OpenAI GPT, or Google Gemini). The application processes user input with NLP techniques and generates thoughtful, philosophical responses.

Features

Socratic Method Implementation: Engages users through thoughtful questions and philosophical dialogue
Multiple LLM Support: Works with Anthropic Claude, OpenAI GPT, or Google Gemini models
NLP Processing: Tokenization, lemmatization, and POS tagging of user input
Web Interface: Clean, responsive UI for dialogue interaction
Error Handling: Robust handling of API rate limits and errors
Input Analysis: Shows processed NLP data for transparency

Installation

Quick Setup (Using Make)

git clone <repository-url>
cd socrates-app
make setup  # Installs dependencies and downloads NLTK data
# Edit .env file with your API keys
make run    # Start the application

Manual Setup

Clone the repository:

git clone <repository-url>
cd socrates-app

Create a virtual environment:

python -m venv venv # use python or python3 based on your system.
source venv/bin/activate  # On Windows: venv\Scripts\activate

Install dependencies:

pip install -r requirements.txt

Set up environment variables:

cp .env.example .env

Edit .env and add the API key for your chosen provider (Anthropic, OpenAI, or Google).

Download NLTK data:

python download_nltk_data.py  # use python or python3 based on your system
# OR manually:
python -c "import nltk; nltk.download(['punkt', 'punkt_tab', 'wordnet', 'stopwords', 'averaged_perceptron_tagger', 'averaged_perceptron_tagger_eng'])"

Running the Application

Local Development

uvicorn app.main:app --reload --host 0.0.0.0 --port 8000

Visit http://localhost:8000 in your browser.

Production

uvicorn app.main:app --host 0.0.0.0 --port $PORT

API Endpoints

GET / - Web interface
POST /api/dialogue - JSON API endpoint
POST /dialogue - Form submission endpoint
GET /health - Health check endpoint

API Usage Example

curl -X POST "http://localhost:8000/api/dialogue" \
  -H "Content-Type: application/json" \
  -d '{"message": "What is the nature of truth?"}'
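The same request can be made from Python. The following is a minimal sketch using only the standard library. It assumes the server is running at http://localhost:8000, and because the response schema is not documented here it simply returns whatever JSON the endpoint sends back. The helper names `build_dialogue_request` and `ask` are hypothetical and not part of the application.

```python
import json
import urllib.request

BASE_URL = "http://localhost:8000"  # assumption: local development server


def build_dialogue_request(message: str, base_url: str = BASE_URL) -> urllib.request.Request:
    """Build (but do not send) a POST request for /api/dialogue."""
    body = json.dumps({"message": message}).encode("utf-8")
    return urllib.request.Request(
        url=f"{base_url}/api/dialogue",
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def ask(message: str, base_url: str = BASE_URL, timeout: float = 30.0):
    """Send the request and return the decoded JSON body as-is."""
    request = build_dialogue_request(message, base_url)
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))
```

With the server running, `ask("What is the nature of truth?")` sends the same payload as the curl command above.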

Deployment

Vercel Deployment

Install Vercel CLI:

npm i -g vercel

Create vercel.json in the project root:

{
  "builds": [
    {"src": "app/main.py", "use": "@vercel/python"}
  ],
  "routes": [
    {"src": "/(.*)", "dest": "app/main.py"}
  ]
}

Deploy:

vercel

Heroku Deployment

Quick Deployment (Recommended):

./deploy_heroku.sh

Manual Deployment: See the complete guide in DEPLOY_HEROKU.md.

Quick Manual Steps:

heroku create your-app-name
heroku config:set LLM_PROVIDER=google
heroku config:set GOOGLE_API_KEY=your-google-key-here
heroku config:set GOOGLE_MODEL=gemini-1.5-flash
git push heroku main
heroku run python download_nltk_data.py

Docker Deployment

Create a Dockerfile in the project root:

FROM python:3.11-slim

WORKDIR /app

COPY requirements.txt .
RUN pip install --no-cache-dir -r requirements.txt

COPY . .

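# Same NLTK resources as the manual setup step, including punkt_tab and averaged_perceptron_tagger_eng
RUN python -c "import nltk; nltk.download(['punkt', 'punkt_tab', 'wordnet', 'stopwords', 'averaged_perceptron_tagger', 'averaged_perceptron_tagger_eng'])"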

EXPOSE 8000

CMD ["uvicorn", "app.main:app", "--host", "0.0.0.0", "--port", "8000"]

Build and run:

docker build -t socrates-ai .
docker run -p 8000:8000 --env-file .env socrates-ai

Configuration

Environment Variables

LLM_PROVIDER: Choose between 'anthropic', 'openai', or 'google'
ANTHROPIC_API_KEY: Your Anthropic API key (if using Claude)
ANTHROPIC_MODEL: Claude model to use (default: claude-3-5-sonnet-20241022)
OPENAI_API_KEY: Your OpenAI API key (if using GPT)
OPENAI_MODEL: OpenAI model to use (default: gpt-4-turbo-preview)
GOOGLE_API_KEY: Your Google API key (if using Gemini)
GOOGLE_MODEL: Google model to use (recommended: gemini-1.5-flash, gemini-1.5-pro, or gemini-pro)
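To illustrate how these variables fit together, here is a minimal sketch of resolving the provider, key, and model at startup. It is not the application's actual configuration code: `LLMConfig` and `load_llm_config` are hypothetical names, and the Google default model is an assumption (the first of the recommended models), since no default is stated for it.

```python
import os
from typing import Mapping, NamedTuple, Optional

# Per provider: (API key variable, model variable, default model).
# The Anthropic and OpenAI defaults come from the list above; the Google
# default is an assumption (the first of the recommended models).
PROVIDERS = {
    "anthropic": ("ANTHROPIC_API_KEY", "ANTHROPIC_MODEL", "claude-3-5-sonnet-20241022"),
    "openai": ("OPENAI_API_KEY", "OPENAI_MODEL", "gpt-4-turbo-preview"),
    "google": ("GOOGLE_API_KEY", "GOOGLE_MODEL", "gemini-1.5-flash"),
}


class LLMConfig(NamedTuple):
    provider: str
    api_key: str
    model: str


def load_llm_config(env: Optional[Mapping[str, str]] = None) -> LLMConfig:
    """Resolve provider, API key, and model from environment variables."""
    env = os.environ if env is None else env
    provider = env.get("LLM_PROVIDER", "").strip().lower()
    if provider not in PROVIDERS:
        raise ValueError(f"LLM_PROVIDER must be one of {sorted(PROVIDERS)}, got {provider!r}")
    key_var, model_var, default_model = PROVIDERS[provider]
    api_key = env.get(key_var, "").strip()
    if not api_key:
        raise ValueError(f"{key_var} is required when LLM_PROVIDER={provider}")
    # Fall back to the default when the model variable is unset or empty.
    return LLMConfig(provider, api_key, env.get(model_var) or default_model)
```

Failing fast on a missing key at startup gives a clearer error than a failed API call on the first user request.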

Machine Learning Model Training

The application includes a philosophical question categorizer that uses machine learning to classify user questions into different philosophical domains (Ethics, Metaphysics, Epistemology, etc.).

When to Train the Model

Train the model in the following scenarios:

Initial Setup: When first setting up the application, if the model files don't exist in the models/ directory
Model Updates: When you want to improve the categorization by updating the training data or algorithm
After Major Changes: After modifying the philosophical categories or training examples in app/ml_categorizer.py

Training the Model

To train or retrain the categorizer model:

python train_categorizer.py

This command will:

Generate training data from philosophical examples
Train a Decision Tree classifier
Save the model files to the models/ directory:
- models/philosophy_categorizer.pkl - The trained classifier
- models/tfidf_vectorizer.pkl - The text vectorizer
Display test predictions to verify the model is working

Automatic Model Loading

When the application starts:

It automatically attempts to load the pre-trained model from the models/ directory
If the model files are not found, it will automatically train a new model
This ensures the app always has a working categorizer, even on first run

Note: The pre-trained model files are included in the repository, so manual training is typically not required unless you want to update or improve the model.
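The load-or-train behavior described above can be sketched as follows. This is an illustration under stated assumptions, not the code in app/ml_categorizer.py: `load_or_train` is a hypothetical helper, and `train` stands in for whatever function builds the classifier and vectorizer.

```python
import pickle
from pathlib import Path
from typing import Any, Callable, Tuple

MODEL_FILE = "philosophy_categorizer.pkl"
VECTORIZER_FILE = "tfidf_vectorizer.pkl"


def load_or_train(models_dir: Path, train: Callable[[], Tuple[Any, Any]]) -> Tuple[Any, Any]:
    """Return (classifier, vectorizer), training and saving them if a file is missing.

    `train` is a hypothetical callable returning a (classifier, vectorizer) pair.
    """
    model_path = models_dir / MODEL_FILE
    vectorizer_path = models_dir / VECTORIZER_FILE

    if model_path.exists() and vectorizer_path.exists():
        # Only unpickle files you trust: pickle can execute arbitrary code.
        with model_path.open("rb") as f:
            classifier = pickle.load(f)
        with vectorizer_path.open("rb") as f:
            vectorizer = pickle.load(f)
        return classifier, vectorizer

    # At least one file is missing: train from scratch and persist both.
    classifier, vectorizer = train()
    models_dir.mkdir(parents=True, exist_ok=True)
    with model_path.open("wb") as f:
        pickle.dump(classifier, f)
    with vectorizer_path.open("wb") as f:
        pickle.dump(vectorizer, f)
    return classifier, vectorizer
```

Requiring both files before loading avoids pairing a classifier with a vectorizer from a different training run.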

Project Structure

socrates-app/
├── app/
│   ├── __init__.py
│   ├── main.py              # FastAPI application
│   ├── llm_service.py       # LLM API integration
│   ├── nlp_processor.py     # NLP processing logic
│   ├── socratic_dialogue.py # Socratic method implementation
│   └── ml_categorizer.py    # ML categorizer for philosophical questions
├── models/
│   ├── philosophy_categorizer.pkl  # Trained ML model
│   └── tfidf_vectorizer.pkl       # Text vectorizer
├── static/
│   ├── style.css           # CSS styles
│   └── script.js           # Frontend JavaScript
├── templates/
│   └── index.html          # HTML template
├── train_categorizer.py    # Script to train the ML model
├── requirements.txt        # Python dependencies
├── .env.example           # Environment variables template
└── README.md              # This file

Error Handling

The application handles:

API rate limits with exponential backoff
API errors with retries
Invalid input validation
Missing API keys
Network timeouts
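As an illustration of retrying with exponential backoff, here is a minimal, generic sketch. It is not the application's own retry code: `retry_with_backoff` is a hypothetical helper, and the retried exception types, attempt count, and delays are assumptions chosen for the example.

```python
import random
import time
from typing import Callable, Tuple, Type, TypeVar

T = TypeVar("T")


def retry_with_backoff(
    call: Callable[[], T],
    retry_on: Tuple[Type[BaseException], ...] = (TimeoutError, ConnectionError),
    max_attempts: int = 4,
    base_delay: float = 1.0,
    max_delay: float = 30.0,
    sleep: Callable[[float], None] = time.sleep,
) -> T:
    """Call `call`, retrying on the given exceptions with exponential backoff."""
    if max_attempts < 1:
        raise ValueError("max_attempts must be at least 1")
    for attempt in range(1, max_attempts + 1):
        try:
            return call()
        except retry_on:
            if attempt == max_attempts:
                raise  # out of attempts: surface the last error
            # The delay doubles each attempt (1s, 2s, 4s, ...) up to max_delay,
            # plus up to 10% random jitter to avoid synchronized retries.
            delay = min(base_delay * 2 ** (attempt - 1), max_delay)
            sleep(delay + random.uniform(0, delay * 0.1))
    raise RuntimeError("retry loop exited unexpectedly")
```

In practice `retry_on` would be set to the rate-limit and timeout exception classes of the chosen provider's SDK. The `sleep` parameter is injectable so the helper can be tested without waiting.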

Contributing

Fork the repository
Create a feature branch
Make your changes
Add tests if applicable
Submit a pull request

Troubleshooting

NLTK Data Errors

If you encounter errors like Resource punkt_tab not found or Resource averaged_perceptron_tagger_eng not found, run:

python download_nltk_data.py

Or manually download all required NLTK data:

python -c "import nltk; nltk.download('all')"

API Key Issues

Make sure your .env file contains the correct API key for your chosen provider:

For Anthropic: ANTHROPIC_API_KEY=your-key-here
For OpenAI: OPENAI_API_KEY=your-key-here
For Google: GOOGLE_API_KEY=your-key-here

Port Already in Use

If port 8000 is already in use, you can specify a different port:

uvicorn app.main:app --reload --host 0.0.0.0 --port 8001

Google Gemini Safety Filter Issues

If you encounter errors with Google Gemini like "Invalid operation: The response.text quick accessor requires the response to contain a valid Part", this usually means the content was blocked by safety filters. The app handles this gracefully, but you can also:

Try rephrasing your question
Use a different model like gemini-1.5-flash or gemini-1.5-pro
Switch to a different LLM provider (Anthropic or OpenAI)
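One way to guard against this error in calling code is sketched below. It is not necessarily how the app handles it: `safe_response_text` and the fallback message are hypothetical, and the sketch assumes the Gemini SDK raises `ValueError` from the `response.text` accessor when the response contains no valid Part.

```python
from typing import Any

FALLBACK = "I could not produce a response to that. Could you rephrase the question?"


def safe_response_text(response: Any, fallback: str = FALLBACK) -> str:
    """Return response.text, or a fallback if the accessor raises ValueError.

    Assumption: `response.text` raises ValueError when the response has no
    valid Part (for example, after a safety block).
    """
    try:
        text = response.text
    except ValueError:
        return fallback
    # Treat an empty string the same as a blocked response.
    return text if text else fallback
```

Returning a prompt to rephrase keeps the dialogue going instead of surfacing a raw SDK error to the user.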

Application is deployed here: https://socrates-ai-app-1711abfba870.herokuapp.com/
