DIM: Enforcing Domain-Informed Monotonicity in Deep Neural Networks
This repository contains the implementation and experimental code for the paper "DIM: Enforcing Domain-Informed Monotonicity in Deep Neural Networks". DIM is a new regularization method that maintains domain-informed monotonic relationships in deep learning models to improve predictions and reduce overfitting.
- Model-Agnostic: Works with any neural network architecture without requiring structural modifications
- Domain-Informed: Incorporates expert knowledge about monotonic relationships between features and outputs
- Linear Baseline Reference: Establishes objective violation measurement through fitted linear trends
- Consistent Performance: Demonstrates MSE improvements of 20-30% across multiple architectures
DIM addresses the fundamental limitation of existing monotonicity methods by establishing an explicit linear reference trend before measuring violations. For each monotonic feature, the method (sketched in code after this list):
- Fits a linear baseline to current model predictions
- Sorts predictions and corresponding feature values
- Measures deviations from expected monotonic behavior
- Applies squared penalty for violations
- Integrates penalty into training loss function
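A minimal TensorFlow sketch of one possible reading of these steps is given below. The helper name `dim_penalty`, the least-squares baseline fit, and the use of consecutive differences of the baseline residuals as the violation measure are illustrative assumptions, not necessarily the paper's exact formulation.

```python
import tensorflow as tf


def dim_penalty(predictions, feature_values, increasing=True):
    """Illustrative DIM-style penalty for one monotonic feature (a sketch).

    predictions:    1-D tensor of current model outputs for a batch.
    feature_values: 1-D tensor of the corresponding monotonic feature.
    increasing:     domain knowledge about the expected direction.
    """
    preds = tf.reshape(tf.cast(predictions, tf.float32), [-1])
    feats = tf.reshape(tf.cast(feature_values, tf.float32), [-1])

    # Sort predictions and the corresponding feature values by the feature.
    order = tf.argsort(feats)
    f_sorted = tf.gather(feats, order)
    p_sorted = tf.gather(preds, order)

    # Fit a linear baseline to the current predictions by least squares;
    # the fitted trend serves as the explicit reference.
    f_mean = tf.reduce_mean(f_sorted)
    p_mean = tf.reduce_mean(p_sorted)
    slope = tf.reduce_sum((f_sorted - f_mean) * (p_sorted - p_mean)) / (
        tf.reduce_sum(tf.square(f_sorted - f_mean)) + 1e-12)
    baseline = p_mean + slope * (f_sorted - f_mean)

    # Measure deviations from the expected monotonic behaviour relative to
    # the linear reference: how much each consecutive change in the sorted
    # predictions falls short of the change implied by the baseline.
    residual = p_sorted - baseline
    sign = 1.0 if increasing else -1.0
    violations = tf.nn.relu(-sign * (residual[1:] - residual[:-1]))

    # Squared penalty for violations, returned as a scalar.
    return tf.reduce_mean(tf.square(violations))
```

In training, the returned scalar would be weighted and added to the task loss inside a custom training step, e.g. `loss = mse + lam * dim_penalty(y_pred, x[:, j])`, where the weight `lam` and the feature index `j` are again placeholders for illustration.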
```
├── README.md
├── models/
│   ├── ann_model.py                      # Single-layer ANN implementation
│   ├── cnn_model.py                      # Conv1D model implementation
│   ├── mlp3_model.py                     # 3-layer MLP implementation
│   └── mlp5_model.py                     # 5-layer MLP implementation
├── data/
│   ├── alldata_downtownTodowntown.csv    # Chicago ridesourcing dataset
│   └── synthetic_monotonic_trips.csv     # Synthetic monotonic trips dataset
```
- Python 3.8 or higher
- TensorFlow 2.7.0
- CUDA-capable GPU (recommended for faster training)
```bash
# Create the environment with TensorFlow 2.7.0
conda create -n tf270 python=3.8 tensorflow=2.7.0

# Activate the environment
conda activate tf270
```

By default, all models are configured to use the Chicago ridesourcing dataset. To run experiments on the synthetic dataset, you need to modify the `file_path` variable in each model file.
In each model file (ann_model.py, cnn_model.py, mlp3_model.py, mlp5_model.py), change the file path:
```python
# Change this line:
file_path = './alldata_downtownTodowntown.csv'

# To this:
file_path = './synthetic_monotonic_trips.csv'
```

Similarly, update the `monotonic_features` list:

```python
# Chicago ridesourcing dataset:
monotonic_features = ['downtown_downtown', 'EmpDen_Des', 'EmpDen_Ori', 'Commuters_HW', 'Commuters_WH']

# Synthetic dataset:
monotonic_features = ['x1', 'x2', 'x3']  # Adjust based on your synthetic data structure
```

Make sure to update the `monotonic_features` list accordingly when switching between datasets.
The columns dropped when building the feature matrix also differ between the two datasets:

```python
# Chicago ridesourcing dataset:
X = data.drop(columns=['total_number_trips', 'Unnamed: 0'])

# Synthetic dataset:
X = data.drop(columns=['total_number_trips'])
```

Make sure to update the dropped columns accordingly when switching between datasets.
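For reference, the three changes combined might look like the following when targeting the synthetic dataset. This is a hypothetical sketch of a pandas-based loading block; the target assignment to `y` and the overall structure are assumptions, not a copy of the model scripts.

```python
import pandas as pd

# Hypothetical consolidated loading block for the synthetic dataset;
# the actual model scripts may structure this differently.
file_path = './synthetic_monotonic_trips.csv'
data = pd.read_csv(file_path)

# Features with domain-informed monotonic relationships to the output.
monotonic_features = ['x1', 'x2', 'x3']  # adjust to your synthetic data structure

# Feature matrix and (assumed) target column.
X = data.drop(columns=['total_number_trips'])
y = data['total_number_trips']
```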