LLM Input Sanitizer

A Python package for sanitizing and securing user inputs before sending them to large language models (LLMs).

Features

PII Detection & Masking: Automatically detects and masks emails, phone numbers, SSNs, and credit card numbers
Profanity Filtering: Removes or masks profanity and inappropriate language
Input Truncation: Prevents excessively long inputs
Unicode Normalization: Handles special characters and ensures consistent text encoding
Prompt Injection Defense: Detects and blocks common LLM prompt injection attacks
Jailbreak Prevention: Identifies attempts to bypass LLM safety measures

Installation

pip install llm-input-sanitizer

Quick Start

from llm_input_sanitizer import InputSanitizer, prepare_llm_messages, is_input_appropriate

# Initialize the sanitizer
sanitizer = InputSanitizer(max_length=1000)

# Sanitize user input
user_input = "My email is john@example.com and my phone is 555-123-4567"
sanitized_input = sanitizer.sanitize_input(user_input)
# Result: "My email is [EMAIL] and my phone is [PHONE]"

# Check if input is appropriate (no injection attempts)
if is_input_appropriate(sanitized_input):
    # Prepare messages for the LLM
    messages = prepare_llm_messages(
        sanitized_input, 
        system_message="You are a helpful assistant."
    )
    # Send messages to your LLM
else:
    print("Potentially harmful input detected")

Custom Profanity list

sanitizer = InputSanitizer(profanity_file="path/to/profanity_words.txt")

Custom forbidden paths

from llm_input_sanitizer import is_input_appropriate

my_patterns = [
    r'custom_pattern_1',
    r'custom_pattern_2',
]

is_safe = is_input_appropriate(text, forbidden_patterns=my_patterns)

Integration with OpenAI

import openai
from llm_input_sanitizer import InputSanitizer, prepare_llm_messages, is_input_appropriate

sanitizer = InputSanitizer()

def safe_llm_call(user_input):
    # Sanitize the input
    clean_input = sanitizer.sanitize_input(user_input)
    
    # Check if appropriate
    if not is_input_appropriate(clean_input):
        return "I'm sorry, I can't process that request."
    
    # Prepare messages
    messages = prepare_llm_messages(clean_input)
    
    # Call the LLM API
    response = openai.ChatCompletion.create(
        model="gpt-4",
        messages=messages
    )
    
    return response.choices[0].message.content

This package provides a baseline of protection against common attacks but is not a complete security solution. Always implement defense in depth for production systems:

Server-side validation
Rate limiting
Monitoring for unusual patterns
Regular updates to security patterns

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
dist		dist
llm_input_sanitizer.egg-info		llm_input_sanitizer.egg-info
llm_input_sanitizer		llm_input_sanitizer
tests		tests
.DS_Store		.DS_Store
LICENSE		LICENSE
MANIFEST.in		MANIFEST.in
PKG-INFO		PKG-INFO
README.md		README.md
setup.cfg		setup.cfg
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

LLM Input Sanitizer

Features

Installation

Quick Start

Custom Profanity list

Custom forbidden paths

Integration with OpenAI

About

Uh oh!

Releases 1

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

LLM Input Sanitizer

Features

Installation

Quick Start

Custom Profanity list

Custom forbidden paths

Integration with OpenAI

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases 1

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages