33 changes: 28 additions & 5 deletions .env.local.example
@@ -1,12 +1,35 @@
# Databricks Configuration
# Get these values from your Databricks workspace settings

# Your Databricks workspace hostname (without https://)
DATABRICKS_SERVER_HOSTNAME=your-workspace.cloud.databricks.com

# Your Databricks Personal Access Token
# Generate from: User Settings > Developer > Access tokens
DATABRICKS_PAT_TOKEN=dapi123...your-databricks-pat-token

# Your SQL Warehouse HTTP Path
# Get from: SQL Warehouses > Select warehouse > Connection details
DATABRICKS_HTTP_PATH=/sql/1.0/warehouses/your-warehouse-id
DATABRICKS_PAT_TOKEN=dapi123...your-pat-token
DATABRICKS_METASTORE_REGION=us-west-1

# Skyflow Configuration
# Get these values from your Skyflow vault

# Your Skyflow vault URL
SKYFLOW_VAULT_URL=https://your-vault.vault.skyflowapis.com

# Your Skyflow Personal Access Token
# Generate from: Skyflow Studio > Settings > Tokens
SKYFLOW_PAT_TOKEN=eyJhbGci...your-skyflow-pat-token

# Your Skyflow vault ID
SKYFLOW_VAULT_ID=your-vault-id
SKYFLOW_PAT_TOKEN=eyJhbGciOiJSUzI1NiIsInR5cCI6IkpXVCJ9...your-pat-token
SKYFLOW_TABLE=pii
SKYFLOW_BATCH_SIZE=25

# Your Skyflow table name (where PII will be stored)
SKYFLOW_TABLE=customer_data

# Optional: Group Mappings for Role-Based Access Control
# These control which user groups can see real vs tokenized data
PLAIN_TEXT_GROUPS=auditor # Groups that see real data
MASKED_GROUPS=customer_service # Groups that see masked data
REDACTED_GROUPS=marketing # Groups that see redacted data
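
The group mappings above carry inline `#` comments, which a reader of this file has to strip before the values are usable. A minimal sketch of one way to do that, using only the standard library. The function names `parse_env` and `group_access_levels`, the level labels, and the support for comma-separated group lists are all assumptions made for illustration; the PR does not show how the project actually reads these variables.

```python
from typing import Dict

# Sample text copied from the example file above.
SAMPLE = """\
PLAIN_TEXT_GROUPS=auditor # Groups that see real data
MASKED_GROUPS=customer_service # Groups that see masked data
REDACTED_GROUPS=marketing # Groups that see redacted data
"""


def parse_env(text: str) -> Dict[str, str]:
    """Parse KEY=VALUE lines, skipping blanks and full-line comments."""
    values: Dict[str, str] = {}
    for raw in text.splitlines():
        line = raw.strip()
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, _, value = line.partition("=")
        # Drop an inline comment introduced by a space followed by '#'.
        value = value.split(" #", 1)[0].strip()
        values[key.strip()] = value
    return values


def group_access_levels(env: Dict[str, str]) -> Dict[str, str]:
    """Map each group name to an access level (hypothetical labels)."""
    levels = {
        "PLAIN_TEXT_GROUPS": "plain_text",
        "MASKED_GROUPS": "masked",
        "REDACTED_GROUPS": "redacted",
    }
    mapping: Dict[str, str] = {}
    for var, level in levels.items():
        # Assumption: several groups may be listed, separated by commas.
        for group in env.get(var, "").split(","):
            group = group.strip()
            if group:
                mapping[group] = level
    return mapping


if __name__ == "__main__":
    print(group_access_levels(parse_env(SAMPLE)))
```

Because the three variables are marked optional, the sketch treats a missing variable as an empty list rather than an error.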
69 changes: 58 additions & 11 deletions .gitignore
@@ -1,7 +1,47 @@
# Environment files with credentials
# Environment files
.env.local
.env

# OS generated files
# Python
__pycache__/
*.py[cod]
*$py.class
*.so
.Python
build/
develop-eggs/
dist/
downloads/
eggs/
.eggs/
lib/
lib64/
parts/
sdist/
var/
wheels/
pip-wheel-metadata/
share/python-wheels/
*.egg-info/
.installed.cfg
*.egg
MANIFEST

# Virtual environments
venv/
env/
ENV/
env.bak/
venv.bak/

# IDE
.vscode/
.idea/
*.swp
*.swo
*~

# OS
.DS_Store
.DS_Store?
._*
@@ -10,17 +50,24 @@
ehthumbs.db
Thumbs.db

# IDE files
.vscode/
.idea/
*.swp
*.swo
*~

# Logs
*.log
logs/

# Temporary files
tmp/
temp/
*.tmp
*.temp
temp/

# Databricks
.databricks/
databricks.yml

# Test outputs
test_results/
coverage/

# Runtime files
*.pid
*.seed
*.pid.lock