Skip to content

sidshas03/AntiMoneyLaundering

Folders and files

NameName
Last commit message
Last commit date

Latest commit

Β 

History

9 Commits
Β 
Β 
Β 
Β 

Repository files navigation

πŸ•΅οΈβ€β™‚οΈ Anti Money Laundering Detection using Transaction Data

This project analyzes synthetic banking transactions to detect patterns associated with money laundering. Using a Kaggle dataset, we perform exploratory data analysis and prepare the ground for classification modeling.

πŸ“‚ Dataset

  • Source: Kaggle – Synthetic Transaction Monitoring Dataset
  • File: SAML-D.csv (~996MB)
  • Key Columns:
    • Time, Date: Transaction timestamp
    • Sender_account, Receiver_account: Pseudonymous account IDs
    • Amount: Monetary value of transaction
    • Payment_type: Type of transaction (e.g., CASH_OUT, TRANSFER)
    • Is_laundering: Target flag (1 = laundering, 0 = legitimate)

πŸ› οΈ Tech Stack

  • Python 3
  • Google Colab
  • Pandas, NumPy
  • Seaborn, Matplotlib
  • Scikit-learn (for metrics and modeling)

πŸ“Š Exploratory Data Analysis

We analyze trends in:

  • Frequency of laundering by Payment_type
  • Amount patterns in laundering vs normal cases
  • Group-wise statistical summaries

Sample Insights:

  • Certain payment types (e.g., TRANSFER) have higher laundering rates.
  • Laundering transactions often involve higher maximum and average amounts than regular ones.

πŸš€ How to Run (Colab)

  1. Open Anti_Money_Laundering.ipynb in Google Colab.
  2. Upload your kaggle.json API key.
  3. Run the cell to authenticate and download the dataset.
  4. Execute the EDA blocks and modify or extend as needed.

Checkout Kaggle of AML:

Kaggle

πŸ“ˆ Planned Extensions

  • Train machine learning models (Logistic Regression, XGBoost)
  • ROC-AUC and confusion matrix evaluations
  • Integrate anomaly detection
  • Convert pipeline into real-time dashboard

🧠 Author's Note

This is a synthetic dataset, but it reflects realistic fraud detection challenges. The aim is to develop interpretable workflows for financial surveillance.

πŸ“© Contact

Siddharthan P S
πŸ“§ Email: sp8004@nyu.edu
πŸ”— LinkedIn: Siddharthan P S

πŸ“˜ Medium Article: Detecting Money Laundering with Python and AML Dataset
Portfolio: Siddharthan P S

βœ… Tip

To reduce RAM usage in Colab:

  • Use usecols to load selective columns
  • Downcast numeric types with pd.to_numeric(..., downcast=)

Releases

No releases published

Packages

 
 
 

Contributors