In this project, I analyzed data related to the coverage of complementary health insurance using a dataset from the Health and Retirement Study (HRS) using the statistical software for Data Science (STATA).
The goal of the study was to evaluate the determinants influencing the demand for health insurance subscription (insurance: subscription = 1 and non-subscription = 0) based on a sample of 3,206 individuals.
The threshold is higher than the probability (0.000), meaning the result is significant. There is a correlation between the decision to subscribe to health insurance and retirement status.
The threshold is higher than the probability (0.000), meaning the result is significant. There is a correlation between the decision to subscribe to health insurance and gender.
The threshold is higher than the probability (0.000), meaning the result is significant. There is a correlation between the decision to subscribe to health insurance and health status.
The total number of chronic diseases significantly affects the decision to subscribe to health insurance.
The probability is lower than the threshold (Pr(|T| > |t|) = 0.0046), indicating a difference in the means between the two groups, meaning there is a correlation between the decision to subscribe to health insurance and the total number of chronic diseases.
Household income (log-transformed) significantly affects the decision to subscribe to health insurance.
The probability is lower than the threshold (Pr(|T| > |t|) = 0.0000), showing a difference in the means between the two groups, meaning there is a correlation between the decision to subscribe to health insurance and household income (logarithmic).
Age does not significantly affect the decision to subscribe to health insurance.
The probability is higher than the threshold (Pr(|T| > |t|) = 0.0777), meaning there is no significant difference in the means between the two groups, and there is no correlation between the decision to subscribe to health insurance and age.
A Logit model was estimated to predict health insurance subscription levels based on the proposed explanatory factors. The “gender” variable was removed since its probability was higher than the threshold.
The Chi-square probability is significant (0.0000 < threshold). Thus, we reject the null hypothesis (H0) that all coefficients are zero and accept the alternative hypothesis (H1). The model is globally significant, but the R² is 8.27%, indicating the model’s explanatory power is not strong.
A 1% increase in household income leads to a 164% increase in the probability of subscribing to health insurance.
Adding one chronic disease increases the probability of subscribing by 4.29%.
The model correctly predicted 64.25% of cases, with an error rate of 35.75%.
(in terms of probability) for a 70-year-old individual:
Retirement status = yes, gender = male, number of chronic diseases = 2, and household income (log) = 1.9.
INSURANCE = RETIRE + LINC + GENRE + AGE + CHRONIC
The estimated probability of subscribing to health insurance is 4.45%.