Titanic Data Set Feature Engineering Report

Continous/Continous Predictor Pairs

Correlation Table

Predictors Pearson's r Absolute Value of Correlation Linear Regression Plot
parch and fare 0.388540 0.388540 parch_and_fare_linear_regression
age and parch -0.269757 0.269757 age_and_parch_linear_regression
age and fare -0.090730 0.090730 age_and_fare_linear_regression

Correlation Matrix

"Brute Force" Table

Predictor 1 Predictor 2 Difference of Mean Response Weighted Difference of Mean Response Bin Plot Residual Plot
age parch 0.069881 0.022897 age_and_parch_bin_uw_plot age_and_parch_dwm_of_resp_residual
age fare 0.031062 0.021940 age_and_fare_bin_uw_plot age_and_fare_dwm_of_resp_residual
parch fare 0.033385 0.008893 parch_and_fare_bin_uw_plot parch_and_fare_dwm_of_resp_residual

Continous/Categorical Predictor Pairs

Correlation Table

Predictors Correlation Ratio Absolute Value of Correlation Violin Plot Distribution Plot
age and who 0.575062 0.575062 age_and_who_violin_plot age_and_who_dist_plot
parch and alone 0.549197 0.549197 parch_and_alone_violin_plot parch_and_alone_dist_plot
fare and deck 0.453723 0.453723 fare_and_deck_violin_plot fare_and_deck_dist_plot
age and deck 0.370417 0.370417 age_and_deck_violin_plot age_and_deck_dist_plot
fare and sibsp 0.348016 0.348016 fare_and_sibsp_violin_plot fare_and_sibsp_dist_plot
parch and who 0.347867 0.347867 parch_and_who_violin_plot parch_and_who_dist_plot
age and adult_male 0.338384 0.338384 age_and_adult_male_violin_plot age_and_adult_male_dist_plot
fare and pclass 0.332352 0.332352 fare_and_pclass_violin_plot fare_and_pclass_dist_plot
fare and class 0.332352 0.332352 fare_and_class_violin_plot fare_and_class_dist_plot
age and pclass 0.311973 0.311973 age_and_pclass_violin_plot age_and_pclass_dist_plot
age and class 0.311973 0.311973 age_and_class_violin_plot age_and_class_dist_plot
parch and sibsp 0.304862 0.304862 parch_and_sibsp_violin_plot parch_and_sibsp_dist_plot
fare and alone 0.298233 0.298233 fare_and_alone_violin_plot fare_and_alone_dist_plot
parch and deck 0.251095 0.251095 parch_and_deck_violin_plot parch_and_deck_dist_plot
fare and embarked 0.241075 0.241075 fare_and_embarked_violin_plot fare_and_embarked_dist_plot
fare and embark_town 0.241075 0.241075 fare_and_embark_town_violin_plot fare_and_embark_town_dist_plot
age and alone 0.203460 0.203460 age_and_alone_violin_plot age_and_alone_dist_plot
age and sibsp 0.189181 0.189181 age_and_sibsp_violin_plot age_and_sibsp_dist_plot
parch and adult_male 0.183689 0.183689 parch_and_adult_male_violin_plot parch_and_adult_male_dist_plot
age and sex 0.182452 0.182452 age_and_sex_violin_plot age_and_sex_dist_plot
fare and sex 0.127872 0.127872 fare_and_sex_violin_plot fare_and_sex_dist_plot
fare and who 0.120066 0.120066 fare_and_who_violin_plot fare_and_who_dist_plot
fare and adult_male 0.114147 0.114147 fare_and_adult_male_violin_plot fare_and_adult_male_dist_plot
parch and embarked 0.106304 0.106304 parch_and_embarked_violin_plot parch_and_embarked_dist_plot
parch and embark_town 0.106304 0.106304 parch_and_embark_town_violin_plot parch_and_embark_town_dist_plot
age and embarked 0.091556 0.091556 age_and_embarked_violin_plot age_and_embarked_dist_plot
age and embark_town 0.091556 0.091556 age_and_embark_town_violin_plot age_and_embark_town_dist_plot
parch and sex 0.086556 0.086556 parch_and_sex_violin_plot parch_and_sex_dist_plot
parch and pclass 0.075602 0.075602 parch_and_pclass_violin_plot parch_and_pclass_dist_plot
parch and class 0.075602 0.075602 parch_and_class_violin_plot parch_and_class_dist_plot

Correlation Matrix

"Brute Force" Table

Predictor 1 Predictor 2 Difference of Mean Response Weighted Difference of Mean Response Bin Plot Residual Plot
age sex 0.097871 0.091753 age_and_sex_diff_of_mean_resp_bin age_and_sex_dwm_of_resp_residual
age who 0.065816 0.089372 age_and_who_diff_of_mean_resp_bin age_and_who_dwm_of_resp_residual
age adult_male 0.092885 0.089372 age_and_adult_male_diff_of_mean_resp_bin age_and_adult_male_dwm_of_resp_residual
fare who 0.069394 0.084367 fare_and_who_diff_of_mean_resp_bin fare_and_who_dwm_of_resp_residual
fare adult_male 0.089148 0.083975 fare_and_adult_male_diff_of_mean_resp_bin fare_and_adult_male_dwm_of_resp_residual
parch who 0.038522 0.077019 parch_and_who_diff_of_mean_resp_bin parch_and_who_dwm_of_resp_residual
parch adult_male 0.053288 0.076641 parch_and_adult_male_diff_of_mean_resp_bin parch_and_adult_male_dwm_of_resp_residual
age deck 0.089152 0.073909 age_and_deck_diff_of_mean_resp_bin age_and_deck_dwm_of_resp_residual
fare sex 0.085031 0.073733 fare_and_sex_diff_of_mean_resp_bin fare_and_sex_dwm_of_resp_residual
parch sex 0.041412 0.064161 parch_and_sex_diff_of_mean_resp_bin parch_and_sex_dwm_of_resp_residual
age sibsp 0.057616 0.043384 age_and_sibsp_diff_of_mean_resp_bin age_and_sibsp_dwm_of_resp_residual
age embarked 0.066051 0.039678 age_and_embarked_diff_of_mean_resp_bin age_and_embarked_dwm_of_resp_residual
age embark_town 0.066051 0.039678 age_and_embark_town_diff_of_mean_resp_bin age_and_embark_town_dwm_of_resp_residual
age pclass 0.080358 0.036712 age_and_pclass_diff_of_mean_resp_bin age_and_pclass_dwm_of_resp_residual
age class 0.080358 0.036712 age_and_class_diff_of_mean_resp_bin age_and_class_dwm_of_resp_residual
age alone 0.066677 0.033916 age_and_alone_diff_of_mean_resp_bin age_and_alone_dwm_of_resp_residual
parch deck 0.027457 0.023047 parch_and_deck_diff_of_mean_resp_bin parch_and_deck_dwm_of_resp_residual
fare deck 0.012203 0.021784 fare_and_deck_diff_of_mean_resp_bin fare_and_deck_dwm_of_resp_residual
fare sibsp 0.040538 0.021726 fare_and_sibsp_diff_of_mean_resp_bin fare_and_sibsp_dwm_of_resp_residual
fare alone 0.027437 0.017546 fare_and_alone_diff_of_mean_resp_bin fare_and_alone_dwm_of_resp_residual
parch pclass 0.049397 0.013841 parch_and_pclass_diff_of_mean_resp_bin parch_and_pclass_dwm_of_resp_residual
parch class 0.049397 0.013841 parch_and_class_diff_of_mean_resp_bin parch_and_class_dwm_of_resp_residual
fare pclass 0.008012 0.013194 fare_and_pclass_diff_of_mean_resp_bin fare_and_pclass_dwm_of_resp_residual
fare class 0.008012 0.013194 fare_and_class_diff_of_mean_resp_bin fare_and_class_dwm_of_resp_residual
fare embarked 0.015654 0.012996 fare_and_embarked_diff_of_mean_resp_bin fare_and_embarked_dwm_of_resp_residual
fare embark_town 0.015654 0.012996 fare_and_embark_town_diff_of_mean_resp_bin fare_and_embark_town_dwm_of_resp_residual
parch sibsp 0.019892 0.008740 parch_and_sibsp_diff_of_mean_resp_bin parch_and_sibsp_dwm_of_resp_residual
parch embarked 0.018832 0.007190 parch_and_embarked_diff_of_mean_resp_bin parch_and_embarked_dwm_of_resp_residual
parch embark_town 0.018832 0.007190 parch_and_embark_town_diff_of_mean_resp_bin parch_and_embark_town_dwm_of_resp_residual
parch alone 0.026100 0.005833 parch_and_alone_diff_of_mean_resp_bin parch_and_alone_dwm_of_resp_residual

Categorical/Categorical Predictor Pairs

Correlation Table

Predictors Cramer's V Absolute Value of Correlation Heatmap
pclass and deck NaN NaN pclass_and_deck_heatmap_plot
pclass and class NaN NaN pclass_and_class_heatmap_plot
sex and deck NaN NaN sex_and_deck_heatmap_plot
sex and class NaN NaN sex_and_class_heatmap_plot
sibsp and deck NaN NaN sibsp_and_deck_heatmap_plot
sibsp and class NaN NaN sibsp_and_class_heatmap_plot
embarked and deck NaN NaN embarked_and_deck_heatmap_plot
embarked and embark_town 1.000000 1.000000 embarked_and_embark_town_heatmap_plot
embarked and class NaN NaN embarked_and_class_heatmap_plot
who and adult_male 0.997218 0.997218 who_and_adult_male_heatmap_plot
sex and who 0.960711 0.960711 sex_and_who_heatmap_plot
sex and adult_male 0.925496 0.925496 sex_and_adult_male_heatmap_plot
sibsp and alone 0.698869 0.698869 sibsp_and_alone_heatmap_plot
pclass and who 0.219432 0.219432 pclass_and_who_heatmap_plot
sibsp and embarked 0.170627 0.170627 sibsp_and_embarked_heatmap_plot
pclass and adult_male 0.139165 0.139165 pclass_and_adult_male_heatmap_plot
pclass and embark_town 0.137485 0.137485 pclass_and_embark_town_heatmap_plot
pclass and alone 0.099038 0.099038 pclass_and_alone_heatmap_plot
sex and sibsp 0.000000 0.000000 sex_and_sibsp_heatmap_plot
who and deck NaN NaN who_and_deck_heatmap_plot
who and alone 0.255280 0.255280 who_and_alone_heatmap_plot
who and embark_town 0.115123 0.115123 who_and_embark_town_heatmap_plot
sex and embarked 0.000000 0.000000 sex_and_embarked_heatmap_plot
who and class NaN NaN who_and_class_heatmap_plot
adult_male and deck NaN NaN adult_male_and_deck_heatmap_plot
adult_male and alone 0.179361 0.179361 adult_male_and_alone_heatmap_plot
sibsp and embark_town 0.170627 0.170627 sibsp_and_embark_town_heatmap_plot
pclass and embarked 0.137485 0.137485 pclass_and_embarked_heatmap_plot
sibsp and who 0.129024 0.129024 sibsp_and_who_heatmap_plot
embarked and who 0.115123 0.115123 embarked_and_who_heatmap_plot
pclass and sibsp 0.105780 0.105780 pclass_and_sibsp_heatmap_plot
sex and alone 0.103212 0.103212 sex_and_alone_heatmap_plot
sibsp and adult_male 0.095561 0.095561 sibsp_and_adult_male_heatmap_plot
pclass and sex 0.000000 0.000000 pclass_and_sex_heatmap_plot
sex and embark_town 0.000000 0.000000 sex_and_embark_town_heatmap_plot
embarked and adult_male 0.000000 0.000000 embarked_and_adult_male_heatmap_plot
embarked and alone 0.000000 0.000000 embarked_and_alone_heatmap_plot
adult_male and embark_town 0.000000 0.000000 adult_male_and_embark_town_heatmap_plot
adult_male and class NaN NaN adult_male_and_class_heatmap_plot
deck and embark_town NaN NaN deck_and_embark_town_heatmap_plot
deck and alone NaN NaN deck_and_alone_heatmap_plot
deck and class NaN NaN deck_and_class_heatmap_plot
embark_town and alone 0.000000 0.000000 embark_town_and_alone_heatmap_plot
embark_town and class NaN NaN embark_town_and_class_heatmap_plot
alone and class NaN NaN alone_and_class_heatmap_plot

Correlation Matrix

"Brute Force" Table

Predictor 1 Predictor 2 Difference of Mean Response Weighted Difference of Mean Response Bin Plot Residual Plot
who deck 0.088249 0.090293 who_and_deck_diff_of_mean_resp_bin who_and_deck_dwm_of_resp_residual
adult_male deck 0.097099 0.088502 adult_male_and_deck_diff_of_mean_resp_bin adult_male_and_deck_dwm_of_resp_residual
sibsp who 0.129080 0.078833 sibsp_and_who_diff_of_mean_resp_bin sibsp_and_who_dwm_of_resp_residual
who class 0.070229 0.078416 who_and_class_diff_of_mean_resp_bin who_and_class_dwm_of_resp_residual
pclass who 0.070229 0.078416 pclass_and_who_diff_of_mean_resp_bin pclass_and_who_dwm_of_resp_residual
embarked who 0.099055 0.078153 embarked_and_who_diff_of_mean_resp_bin embarked_and_who_dwm_of_resp_residual
who embark_town 0.099055 0.078153 who_and_embark_town_diff_of_mean_resp_bin who_and_embark_town_dwm_of_resp_residual
sibsp adult_male 0.173672 0.077780 sibsp_and_adult_male_diff_of_mean_resp_bin sibsp_and_adult_male_dwm_of_resp_residual
embarked adult_male 0.142118 0.077700 embarked_and_adult_male_diff_of_mean_resp_bin embarked_and_adult_male_dwm_of_resp_residual
adult_male embark_town 0.142118 0.077700 adult_male_and_embark_town_diff_of_mean_resp_bin adult_male_and_embark_town_dwm_of_resp_residual
pclass adult_male 0.085690 0.077604 pclass_and_adult_male_diff_of_mean_resp_bin pclass_and_adult_male_dwm_of_resp_residual
adult_male class 0.085690 0.077604 adult_male_and_class_diff_of_mean_resp_bin adult_male_and_class_dwm_of_resp_residual
sex who 0.043709 0.077048 sex_and_who_diff_of_mean_resp_bin sex_and_who_dwm_of_resp_residual
who alone 0.056792 0.075130 who_and_alone_diff_of_mean_resp_bin who_and_alone_dwm_of_resp_residual
who adult_male 0.031015 0.075076 who_and_adult_male_diff_of_mean_resp_bin who_and_adult_male_dwm_of_resp_residual
sex adult_male 0.062965 0.074561 sex_and_adult_male_diff_of_mean_resp_bin sex_and_adult_male_dwm_of_resp_residual
adult_male alone 0.074734 0.074395 adult_male_and_alone_diff_of_mean_resp_bin adult_male_and_alone_dwm_of_resp_residual
sex deck 0.064390 0.072268 sex_and_deck_diff_of_mean_resp_bin sex_and_deck_dwm_of_resp_residual
pclass sex 0.045391 0.066660 pclass_and_sex_diff_of_mean_resp_bin pclass_and_sex_dwm_of_resp_residual
sex class 0.045391 0.066660 sex_and_class_diff_of_mean_resp_bin sex_and_class_dwm_of_resp_residual
sex sibsp 0.115398 0.065765 sex_and_sibsp_diff_of_mean_resp_bin sex_and_sibsp_dwm_of_resp_residual
sex embarked 0.134027 0.063385 sex_and_embarked_diff_of_mean_resp_bin sex_and_embarked_dwm_of_resp_residual
sex embark_town 0.134027 0.063385 sex_and_embark_town_diff_of_mean_resp_bin sex_and_embark_town_dwm_of_resp_residual
sex alone 0.062995 0.062355 sex_and_alone_diff_of_mean_resp_bin sex_and_alone_dwm_of_resp_residual
sibsp deck 0.025657 0.018889 sibsp_and_deck_diff_of_mean_resp_bin sibsp_and_deck_dwm_of_resp_residual
pclass deck 0.031627 0.016389 pclass_and_deck_diff_of_mean_resp_bin pclass_and_deck_dwm_of_resp_residual
deck class 0.031627 0.016389 deck_and_class_diff_of_mean_resp_bin deck_and_class_dwm_of_resp_residual
deck alone 0.021524 0.014596 deck_and_alone_diff_of_mean_resp_bin deck_and_alone_dwm_of_resp_residual
sibsp embarked 0.067811 0.012062 sibsp_and_embarked_diff_of_mean_resp_bin sibsp_and_embarked_dwm_of_resp_residual
sibsp embark_town 0.067811 0.012062 sibsp_and_embark_town_diff_of_mean_resp_bin sibsp_and_embark_town_dwm_of_resp_residual
embarked deck 0.011341 0.010004 embarked_and_deck_diff_of_mean_resp_bin embarked_and_deck_dwm_of_resp_residual
deck embark_town 0.011341 0.010004 deck_and_embark_town_diff_of_mean_resp_bin deck_and_embark_town_dwm_of_resp_residual
pclass alone 0.036442 0.007828 pclass_and_alone_diff_of_mean_resp_bin pclass_and_alone_dwm_of_resp_residual
alone class 0.036442 0.007828 alone_and_class_diff_of_mean_resp_bin alone_and_class_dwm_of_resp_residual
pclass embarked 0.014305 0.007198 pclass_and_embarked_diff_of_mean_resp_bin pclass_and_embarked_dwm_of_resp_residual
pclass embark_town 0.014305 0.007198 pclass_and_embark_town_diff_of_mean_resp_bin pclass_and_embark_town_dwm_of_resp_residual
embarked class 0.014305 0.007198 embarked_and_class_diff_of_mean_resp_bin embarked_and_class_dwm_of_resp_residual
embark_town class 0.014305 0.007198 embark_town_and_class_diff_of_mean_resp_bin embark_town_and_class_dwm_of_resp_residual
pclass sibsp 0.023779 0.006274 pclass_and_sibsp_diff_of_mean_resp_bin pclass_and_sibsp_dwm_of_resp_residual
sibsp class 0.023779 0.006274 sibsp_and_class_diff_of_mean_resp_bin sibsp_and_class_dwm_of_resp_residual
embarked alone 0.008620 0.005247 embarked_and_alone_diff_of_mean_resp_bin embarked_and_alone_dwm_of_resp_residual
embark_town alone 0.008620 0.005247 embark_town_and_alone_diff_of_mean_resp_bin embark_town_and_alone_dwm_of_resp_residual
sibsp alone 0.003997 0.003591 sibsp_and_alone_diff_of_mean_resp_bin sibsp_and_alone_dwm_of_resp_residual
pclass class 0.005148 0.002970 pclass_and_class_diff_of_mean_resp_bin pclass_and_class_dwm_of_resp_residual
embarked embark_town 0.003987 0.002402 embarked_and_embark_town_diff_of_mean_resp_bin embarked_and_embark_town_dwm_of_resp_residual