UC Davis Data & Analytics - Final Project

Most Americans are well aware that half of marriages end in divorce. The good news is that divorce rate in the US is decreasing, however, so is the marriage rate. According to the Center for Disease Control, the 2018 marriage rate was 6.9 per 1,000 total population while the divorce rate was 2.9 per 1,000 population. Both rates were decreased by a tenth of a point from the previous year and the year before. It seems fewer couples are willing to gamble the risk these days and opt out of marriage. With the high risk of divorce, we found the subject of divorce to be a compelling machine learning exercise. Can divorce be predicted based on specific attributes?

Linear Regression

For the original Turkish dataset, we chose to do Lasso and Ridge linear regressions to reduce model complexity. Ridge puts constraints on the coefficients which in effect, shrinks the coefficient.

Heatmap

In addition to linear regression, we also used a heatmap to get an overall look into the distribution of each attribute in our dataset. We used seborn to create the heatmap looking at all 54 survey questions and average values.

Lasso and Ridge

For the original Turkish dataset, we chose to do Lasso and Ridge linear regressions to reduce model complexity. Ridge puts constraints on the coefficients which in effect, shrinks the coefficient.

Classification

For this next model, a Random Forest was utilized to determine feature importance. This type provides a strong modeling technique that is better than using a single decision tree.

KNN, LinearSVC, and SGDClassifier

Similar to the Yontem study, our team analyzed the accuracy of three machine learning models on the augmented data set. The models used were K-Nearest Neighbor (KNN), LinearSVC, and SGDClassifier.

Statistical Analysis

Our team used numpy and Plotly to create interactive visuals that reflect the responses of the study at an individual level.

Final Project Presentation

Due to the COVID-19 outbreak, our team of data scientists conducted a pre-recorded final project presentation.

We regret not being able to present in person but have made the link available to you below for your viewing pleasure.