Project: Bayesian vs Frequentist Approach: A Comparative Study of Educational Returns to Income in Germany
The importance of education goes well beyond the classroom and substantially influences many aspects of our lives. In this project I focus on the effect of education on income in Germany. This relationship is not only important for understanding an individual’s economic success but also reflects broader social dynamics such as access to opportunity and economic inequality.
In this project I analyse:
- Potential endogeneity of the variable
$education$ (Blackburn and Neumark, 1993) - Validity of
$father's \ education$ as the instrumental variable - Compare the Bayesian and frequentist approach in analysing the relationship between education and income
Here you find a summary poster with methods and results
Code for the Bayesian approach
Code for the frequentist approach
To estimate the effect of the explanatory variable
where
I assume the variable
where
In the Bayesian approach I allow for a small direct effect (
The ratio of the effect of the instrument and the individual's own education on income is defined as
I use Gibbs sampling, a Markov Chain Monte Carlo method, to approximate the posterior density by using the conditional distribution (Geman and Geman, 1984). The sequence of samples forms a Markov chain whose stationary distribution is the posterior distribution. I discard the first part of the Gibbs sequence by establishing a burn-in of 1,000 draws from the 11,000 simulated draws.
The exclusion restriction for instuments states that a valid instrument should not be correlated with the error term. The instrument should not directly affect the dependent variable but only indirectly through the endogenous explanantory variable (Hoogerheide et al. (2012b)).
I expect the exclusion restriction to be violated in this case. I suspect that the instrument
The results of the robustness test indicate that the increase of posterior uncertainty with a relaxed exclusion restriction is no greater than the uncertainty associated with a strict exclusion restriction. Hence, for the project I assume a perfectly valid instrument.
The Bayesian IV and 2SLS estimates imply that an additional year of education on average results in an approximately 7.93% increase in income. Both the Bayesian IV and 2SLS estimates perfrom simiarly well. The downshift of the OLS estimate implies OLS underestimates the true causal effect if educatio.
To investigate if the effect of education is different for people with different levels of education and I divide the dataset into higher and lower levels of education. The findings show that individuals with lower education levels benefit the most from additional education, with an increase in income of nearly 30% for an additional year of education compared to 10% for individuals with higher education levels. These findings highlight the importance of expanding access to education and early interventions to counter educational inequality.
Blackburn, M.L. and D. Neumark (1993). “Omitted-Ability Bias and the Increase in the Return to Schooling”. In: Journal of Labor Economics 11.3, pp. 521–544.
Geman and Geman (1984). “Stochastic Relaxation, Gibbs Distributions, and the Bayesian Restoration of Images”. In: IEEE Transactions on Pattern Analysis and Machine Intelligence 6.6, pp. 721–741.
Hoogerheide, L., J.H. Block, and R. Thurik (2012a). “Family background variables as instruments for education in income regressions: A Bayesian analysis”. In: Economics of Education Review 31.5, pp. 515–523.
Hoogerheide, L.F., J.H. Block, and R. Thurik (2012b). “Are Education and Entrepreneurial Income Endogenous and do Family Background Variables make Sense as Instruments?” In: Entrepreneurship Research Journal 2.3, pp. 1–29.

