Econ One’s expert economists have experience across a wide variety of services including antitrust, class certification, damages, financial markets and securities, intellectual property, international arbitration, labor and employment, and valuation and financial analysis.
Econ One’s expert economists have extensive industry specific experience. Our industry experience spans numerous industries including electric power markets, financial markets, healthcare, insurance, oil and gas, pharmaceutical, and more
Econ One’s resources including blogs, cases, news, and more provide a collection of materials from Econ One’s experts.
Ph.D. in Statistics, University of California, Los Angeles
M.S. in Statistics, University of California, Los Angeles
B.A. in Mathematics/Economics, Claremont McKenna College
Econ One, August 2008 – Present
University of Pennsylvania, 2007 – 2008
University of California Los Angeles, 2007 – 2008
Self-Employed Statistical Consultant, 2004 – 2008
RAND Statistics Group, 2006
Lockheed Martin Missiles and Space, 2001 – 2003
U.S. District Court
State Court
Arbitration
Private Mediation
While this blog focuses on gender wage disparities between men and women, the methods described herein could be extended to any comparison of specific gender groups such as non-binary or other gender-diverse individuals.
My last blog post was a primer on boosted regression that included a case study using actual anonymized data. The boosted regression model estimated executives’ annual earnings as a by-product of calendar year, productivity, office location, and gender. Among other findings, the analysis revealed (i) a gender wage gap over $20,000, and (ii) that the inclusion of the gender variable in the model resulted in a 0.68% reduction in prediction error.
Standard linear regression output includes, among other things, probability calculations signifying the likelihood that the actual results are due to random chance. These probabilities are known as “p-values.” Conversely with a boosted regression model, additional computational programming generally is needed in order to derive these probability calculations.
In a litigation setting such as employment discrimination, the attorneys and trier of fact may well be interested in statistical significance, i.e., where the p-value is below a pre-determined threshold.1 Is the observed wage gap statistically significant? This blog post introduces a method for answering these statistical issues when employing boosted regression. The same case study and data in the previous blog post are used for illustrative purposes.
Two standard types of output from a boosted regression model include (i) a graph and/or table showing the marginal impact of a given predictor variable, and (ii) “variable influence” percentages. Each was introduced in the previous blog post and will be described briefly below.
From a boosted regression model, the practitioner is able to examine the marginal impact of individual predictor variables, holding all other predictor variables constant. The typical output is a graph or table of estimated outcome values, given a set of potential predictor variable values.
Referring to the case study from the previous blog post, below is a bar graph showing estimated earnings for each combination of gender and office location. The difference between a blue bar and the corresponding red bar signifies the estimated gender wage gap in each city. This graph reveals a noticeable gap between men’s and women’s earnings in each city. Across the entire dataset and holding all other predictor variables constant, the boosted regression model estimates the wage gap to be nearly $27,000.
Also from a boosted regression model, the practitioner can see the reduction in total error attributed to each predictor variable. This is referred to as “relative influence.” Each predictor variable’s relative influence is between 0% and 100%, and the sum of all relative influence calculations must equal 100%.
Referring to the case study from the previous post, the combination of productivity, geographic location, and calendar year account for 99.32% of all relative influence. The remaining 0.68% reduction in prediction error was attributed to the inclusion of the gender variable in the boosted regression model.
There are two topics that will drive the remainder of this blog post:
The statistical questions directly above can be answered using a combination of (i) random sampling, (ii) the definition of a p-value, and (iii) computational simulations. To summarize:
The five steps below outline how to derive p-values for a predictor’s marginal impact and relative influence by simulating models with randomized gender values. While focused on gender, this method can be applied to any predictor.
This creates a “baseline distribution” for both the marginal wage gap and gender variable’s relative influence.
One can generate a new predictor variable that is disassociated with the outcome using one of the following techniques:
The graphs below are derived using (i) the five steps described above, and (ii) the three techniques for randomly assigning gender to each observation in the dataset. There are two sets of three graphs:
In each graph, the dotted line signifies the measurement from the original data.
The first set of graphs shows the baseline distribution for the gender wage gap. Each marginal impact graph shows that the observed wage gap is more extreme compared to virtually all of the simulated wage gaps.3 Each dotted line signifies the original boosted regression model’s estimated wage gap of nearly $27,000. Given that 1,000 simulations were conducted, this observed wage gap is statistically significant at the 1% level.
In both the simulated models and in general, it is not a given that the wage gap would necessarily favor men over women. Therefore, when assessing statistical significance, the magnitude of the actual wage gap is compared to the absolute values of the simulated wage gaps, i.e., without regard to whether they favor men or women.
The second set of graphs shows the baseline distribution for the gender variable’s relative influence. Each of these graphs shows that the observed percent reduction in total error is the tail of the baseline distribution. The “Coin Toss Approach” reveals statistical significance at the 5% level. The other two approaches show statistical significance at the 1% level.
P-values for boosted regression can be derived through sampling and simulation. These diagnostic measurements can help assess whether observed patterns are likely due to random chance. This approach provides a practical way to evaluate the statistical significance of boosted regression results, particularly in litigious applications such as discrimination analysis.
1 Statistical significance is discussed in the Reference Manual on Scientific Evidence, and specifically, Reference Guide on Statistics by David H. Kaye and David A. Freedman (URL: https://nap.nationalacademies.org/catalog/13163/reference-manual-on-scientific-evidence-third-edition).
2 Some statistical software packages refer to this as “partial dependence.” See, e.g., https://journal.r-project.org/archive/2017/RJ-2017-016/RJ-2017-016.pdf
3 In both the simulated models and in general, it is not a given that the wage gap would necessarily favor men over women. Therefore, when assessing statistical significance, the magnitude of the actual wage gap is compared to the absolute values of the simulated wage gaps, i.e., without regard to whether they favor men or women.