This project analyzes a dataset of 170 Turkish couples to identify the key statistical predictors of divorce. The analysis is based on the "Divorce Predictors Scale" (DPS) derived from Gottman's couples therapy principles.
- See the full technical analysis: divorce_analysis.ipynb
- See the summary presentation: presentation/presentation.pdf
The dataset, sourced from Kaggle, contains responses to 54 questions from couples who were either "happily married" (Class 0) or "divorced" (Class 1). All responses were collected on a 5-point scale, where a higher score indicates a higher prevalence of problems.
The goal of this analysis was to move beyond individual questions and identify the themes that were most strongly correlated with divorce.
Data Source: Gottman Divorce Predictors Dataset on Kaggle
The analysis revealed a clear distinction between two types of marital problems: "Foundation Failures" and "Communication Breakdowns."
While both are factors in divorce, their predictive power is dramatically different.
These 10 questions represent a fundamental misalignment in the relationship's core. They are the strongest statistical predictors of divorce. A high score on these indicates a lack of shared values, goals, and emotional harmony.
/presentation/graphs/most_influent_questions.png
This group of 10 questions perfectly describes the destructive communication patterns (Contempt, Defensiveness, Stonewalling) identified by Gottman.
/presentation/graphs/least_influent_questions.png
While poor communication (like "The Four Horsemen") clearly erodes a marriage, a lack of a shared foundation (values, dreams, roles) is the strongest predictor of divorce.
This suggests that while communication can be repaired, a fundamental misalignment of values makes a stable marriage difficult to build from the start.
- Python 3
- Pandas: For data loading and manipulation.
- Seaborn & Matplotlib: For data visualization.
- Jupyter Notebook: For the analysis environment.
- Clone this repository:
git clone [https://github.com/peptesta/divorce_analysis.git](https://github.com/peptesta/divorce_analysis.git)
- If you want you can use the already downloaded dataset, or you can download it from the Kaggle link.
- Place the
divorce_data.csvandreference.tsvfiles into a/datafolder (you may need to create this folder). - Create the
/graphsfolder (in order to get real time graphs about the data) - Open
divorce_analysis.ipynbin a Jupyter environment to view and run the full analysis.