Bank Account Fraud Analysis

Introduction

Detecting fraud in bank account transactions requires a keen eye for unusual patterns or activities. Monitoring specific aspects of transactions can help banks and customers identify potential fraudulent behavior. Here are key aspects to monitor in order to detect fraud effectively:

Transaction Amounts and Frequency
- Large or Unusual Transactions: Monitor for unusually large transactions, especially those that exceed normal spending patterns for the account holder. Fraudulent transactions often involve large sums of money that are quickly moved or withdrawn.
- Frequent Small Transactions: Fraudsters may break down large transactions into smaller amounts to avoid detection, especially if there are withdrawal limits or anti-fraud thresholds. Monitoring for multiple small transactions in a short period of time can help identify such patterns.
- Transactions Just Below Limits: Fraudsters may try to stay below transaction limits that trigger alerts, so it's important to monitor for activities where the amounts are consistently close to these thresholds.
Geographic Location and Device Use
- Location Inconsistencies: If a customer typically makes transactions in one geographic region but then suddenly initiates a large transaction from a different country or region, it could indicate a potential fraud. Cross-border transactions or rapid location shifts may signal fraud, especially if the account holder hasn’t traveled.
- Device/Channel Consistency: Fraudsters may use different devices (e.g., mobile, desktop) or unfamiliar networks to access accounts. Sudden shifts in device usage or IP addresses can trigger alerts. Monitoring the type of device and internet network (e.g., new IP address, public Wi-Fi) used for transactions can reveal suspicious activity.
Time of Day and Frequency
- Unusual Hours: Fraudulent transactions often occur outside of regular banking hours or during odd hours (e.g., late night or weekends). If a customer typically performs transactions during business hours, any transactions outside these hours should be flagged for review.
- High Transaction Volume in Short Periods: A high number of transactions within a short time frame (e.g., several withdrawals in a few minutes) can indicate fraudulent behavior. This could be due to "carding" attacks or unauthorized access attempts.
New Payees or Beneficiaries
- Changes in Beneficiaries or Payees: If the account holder suddenly starts transferring money to new or unfamiliar payees, especially if these payees are in different locations or countries, it may signal that the account has been compromised.
- Frequent Changes in Payment Details: Changes to payee details (e.g., bank account numbers, recipient names) without a clear reason should be monitored closely, as fraudsters may try to redirect funds to their own accounts.
Unusual Withdrawals or Transfers
- Out-of-Character Withdrawal Patterns: Sudden, significant withdrawals that are inconsistent with the account holder’s normal behavior (e.g., withdrawing a large sum all at once or multiple withdrawals over a short period) are key indicators of fraud.
- Transfers to Unknown Accounts: Monitor for large sums being transferred to external accounts or unverified accounts, especially if the transfers are out of the account holder’s usual pattern.
- ATM Withdrawals in Different Locations: If there are multiple ATM withdrawals across different locations (especially international), this can indicate that an account is being accessed fraudulently.
Multiple Failed Login Attempts
- Suspicious Login Attempts: Multiple failed login attempts in a short period can be an indicator of an attempted account takeover. Fraudsters often try to guess passwords or use brute-force techniques to gain unauthorized access.
- Unsuccessful Authentication Attempts: If there are multiple unsuccessful attempts to authenticate a user’s identity (e.g., incorrect password or failed biometric verification), it could signal that fraudsters are trying to break into the account.
Use of Different Payment Methods
- Unfamiliar Payment Methods: A sudden shift in the type of payment method being used (e.g., shifting from credit card payments to mobile wallets or cryptocurrency) could be a red flag.
- High-Risk Transactions: Certain payment types are more commonly associated with fraud, such as wire transfers to international accounts, or prepaid card transactions. Monitoring for an increase in these types of transactions could be useful.
Multiple Accounts Linked to One Identity
- Multiple Accounts Using the Same Information: Fraudsters may attempt to open several accounts using synthetic identities or stolen personal information. Banks should track accounts that are linked to the same identity (e.g., same phone number, email, or address).
- Coordinated Transactions Across Accounts: Fraudsters may create multiple accounts and transfer funds between them to disguise the origin of stolen money. Monitoring transactions across accounts that appear to have no legitimate business relationship can highlight fraudulent activity.
Unusual Behavior Relative to Historical Patterns
- Deviation from Historical Spending Patterns: Fraud detection systems should be capable of analyzing historical transaction data to establish typical behavior patterns. Any deviation from these patterns, such as a significant increase in spending, can be a potential fraud indicator.

Objective

Installation

Dataset

Bank Account Fraud Dataset Suite (NeurIPS 2022)

Column	Description
fraud_bool	Fraud label (1 if fraud, 0 if legit).
income	Annual income of the applicant in quantiles. Ranges between [0, 1].
name_email_similarity	Metric of similarity between email and applicant’s name. Higher values represent higher similarity. Ranges between [0, 1].
prev_address_months_count	Number of months in previous registered address of the applicant, i.e., the applicant’s previous residence, if applicable. Ranges between [−1, 380] months (-1 is a missing value).
current_address_months_count	Months in currently registered address of the applicant. Ranges between [−1, 406] months (-1 is a missing value).
customer_age	Applicant’s age in bins per decade (e.g., 20-29 is represented as 20).
days_since_request	Number of days passed since application was done. Ranges between [0, 78] days.
intended_balcon_amount	Initial transferred amount for application. Ranges between [−1, 108].
payment_type	Credit payment plan type. 5 possible (anonymized) values.
zip_count_4w	Number of applications within same zip code in last 4 weeks. Ranges between [1, 5767].
velocity_6h	Velocity of total applications made in last 6 hours, i.e., average number of applications per hour in the last 6 hours. Ranges between [−211, 24763].
velocity_24h	Velocity of total applications made in last 24 hours, i.e., average number of applications per hour in the last 24 hours. Ranges between [1329, 9527].
velocity_4w	Velocity of total applications made in last 4 weeks, i.e., average number of applications per hour in the last 4 weeks. Ranges between [2779, 7043].
bank_branch_count_8w	Number of total applications in the selected bank branch in last 8 weeks. Ranges between [0, 2521].
date_of_birth_distinct_emails_4w	Number of emails for applicants with same date of birth in last 4 weeks. Ranges between [0, 42].
employment_status	Employment status of the applicant. 7 possible (anonymized) values.
credit_risk_score	Internal score of application risk. Ranges between [−176, 387].
email_is_free	Domain of application email (either free or paid).
housing_status	Current residential status for applicant. 7 possible (anonymized) values.
phone_home_valid	Validity of provided home phone.
phone_mobile_valid	Validity of provided mobile phone.
bank_months_count	How old is previous account (if held) in months. Ranges between [−1, 31] months (-1 is a missing value).
has_other_cards	If applicant has other cards from the same banking company.
proposed_credit_limit	Applicant’s proposed credit limit. Ranges between [200, 2000].
foreign_request	If origin country of request is different from bank’s country.
source	Online source of application. Either browser (INTERNET) or mobile app (APP).
session_length_in_minutes	Length of user session in banking website in minutes. Ranges between [−1, 107] minutes.
device_os	Operating system of device that made request. Possible values: Windows, Macintosh, Linux, X11, or other.
keep_alive_session	User option on session logout.
device_distinct_emails_8w	Number of distinct emails in banking website from the used device in last 8 weeks. Ranges between [0, 3].
device_fraud_count	Number of fraudulent applications with used device. Ranges between [0, 1].
month	Month where the application was made. Ranges between [0, 7].

Name		Name	Last commit message	Last commit date
Latest commit History 12 Commits
README.md		README.md
bank-account-fraud-analysis-google-colab.ipynb		bank-account-fraud-analysis-google-colab.ipynb
bank-account-fraud-analysis.ipynb		bank-account-fraud-analysis.ipynb
dimensionality reduction techniques.png		dimensionality reduction techniques.png
discrete_vs_categorical_features.png		discrete_vs_categorical_features.png
histogram_vs_kde_plot.png		histogram_vs_kde_plot.png
outlier_detection_methods.png		outlier_detection_methods.png
pca_vs_mi.png		pca_vs_mi.png
standard_scaler_vs_minmax_scaler.png		standard_scaler_vs_minmax_scaler.png
supervised.ipynb		supervised.ipynb
z-score_vs_iqr.png		z-score_vs_iqr.png

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Bank Account Fraud Analysis

Introduction

Objective

Installation

Dataset

References

About

Uh oh!

Uh oh!

Languages

tezzytezzy/bank-account-fraud-analysis

Folders and files

Latest commit

History

Repository files navigation

Bank Account Fraud Analysis

Introduction

Objective

Installation

Dataset

References

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Uh oh!

Languages