PS 813: Problem Set 5

Author

YOUR NAME

Published

April 8, 2026

Please upload your solutions as a .html file saved as “Yourlastname_Yourfirstinitial_pset5.html”). In addition, an electronic copy of your .qmd file (saved as “Yourlastname_Yourfirstinitial_pset5.qmd”) must be submitted to the course website at the same time. We should be able to run your code without error messages. In order to receive credit, homework submissions must be substantially started and all work must be shown.

Problem 1

In “The Long-Term Impact of Mobilization and Repression on Political Trust” (2021), Desposato et. al. examine the impact of exposure to the 1989 student protest movement in China on present day trust in government. The paper uses a survey of Beijing residents who would have been enrolled in college around the time of the protests. The researchers compare the present day attitudes of those who began college in 1985-1988 and would have had direct personal exposure to the movement to those who enrolled in college in the fall of 1989 and would not have been exposed due to the post-Tiananmen crackdown.

To address the possibility of cohort-specific differences, the researchers use a fuzzy regression discontinuity design to estimate the effect of exposure vs. non-exposure by leveraging the fact that enrollment decisions are partly driven by a cut-off date. Students born after September 1, 1970 would be on track to enroll in the fall of 1989 (or later) and miss exposure to the protest movement while those born just before would have enrolled in the fall of 1988 (or earlier) and have been exposed to the movement. However, exposure in this case is fuzzy as not all students who would be eligible to enroll based on the birthday cut-off would have enrolled as expected (some students may start early, some may delay, etc…).

The paper claims that individuals who attended college during the protest movement exhibit less trust in the central government.

The original source for this data is

Desposato, Scott W., Gang Wang, and Jason Y. Wu. “The long-term impact of mobilization and repression on political trust.” Comparative Political Studies 54.14 (2021): 2447-2474.

The code below will read in the dataset in as problem1. Please note that you will have to drop any observations with missing data in the relevant variables.

load("data/DWW_CPS.RData")
problem1 <- data
rm(data)

The relevant variables are:

  • X_c: The centered difference between respondent’s birthday and the enrollment cut-off of September 1, 1970 (in years, continuous). Values greater than zero denote birthdays above the cut-off. Values below zero denote birthdays below the cut-off.
  • T: Exposure to the 1989 student protest movement.
  • TrustCent: Trust in the Central Government (0-10 index) High values denote greater trust
  • TrustProv: Trust in the Provincial Government (0-10 index) High values denote greater trust
  • TrustLocal: Trust in the Local Government (0-10 index) High values denote greater trust

Part A

Estimate the “intent-to-treat” effect of being born just before vs. just after the September 1, 1970 birthday cut-off on trust in the central government using a local linear regression with a uniform kernel. Use a bandwidth of \(h=2\) to the left and right of the cut-point. Provide a 95% confidence interval and discuss your results. Compare your regression discontinuity estimates to the simple difference in average central government trust between respondents exposed to the protest movement and respondents who were not exposed to the protest movement.

Part B

Estimate the first stage effect of being born just before vs. just after the September 1, 1970 birthday cut-off on exposure to the 1989 protest movement using a local linear regression with a uniform kernel. Use a bandwidth of \(h=2\) to the left and right of the cut-point. Provide a 95% confidence interval and discuss your results. Is the birthday cut-off a strong instrument for exposure to the protest movement?

Part C

Now use a bandwidth of \(h=4\). Provide a 95% confidence interval and compare your results to Part B.

Part D

Make two binned scatterplots to visualize the first stage and overlay the regression estimates - one for your results from Part B and one for your results from Part C. Compare the two plots and discuss which of the two regressions provides a better estimate of the CEF and why.

Do you think this fuzzy RD is a valid design for identifying the impact of exposure to the 1989 student protest movement on present-day attitudes?

Part E

Assess whether there is bunching near the discontinuity. Use any appropriate analytical technique or techniques and interpret your results. Is there evidence to suggest that the density of the running variable in this dataset is discontinuous at the cut-off?

Problem 2

In “Rock the registration: Same day registration increases turnout of young voters” Grumbach and Hill (2022) study the effect of same day registration (SDR) laws on voter turnout, arguing that these laws primarily benefit younger voters as these voters are more likely to experience events requiring voter registration changes (e.g. change of address) and interact less often with government agencies that register voters. Their design is a staggered-adoption difference-in-differences using the roll out of SDR policies across U.S. states

The original source for the data is below:

Grumbach, Jacob M., and Charlotte Hill. “Rock the registration: Same day registration increases turnout of young voters.” The Journal of Politics 84, no. 1 (2022): 405-417.

The code below will load in the dataset as problem2. Note that while the original study ran the regression on the individual level turnout data, the included data is aggregated at the level of the state-age-group-year level.

load("data/GH2022_aggregated.RData")
problem2 <- gh2022_agg
rm(gh2022_agg)

The relevant variables are

  • state - U.S. state
  • year - Year of the election (2-year increments)
  • age_group - Age bracket of voters
  • voted - Turnout rate among the given age group in that state and year
  • n_obs - The number of observations from the CPS dataset in that particular cell
  • sdr - Whether a state has a “same day registration” law in place in a given year

Part A

Generate a plot of the distribution of treatment across state and over time using the panelView R package. Comment on the distribution. Which units should be omitted from any difference-in-difference analysis? For which relative treatment times are we likely to have too few observations for reliable standard error estimation?

Part B

Drop units for which we cannot estimate an ATT using differences-in-differences. Additionally, drop Ohio from 2014 onward as it elminated same day registration after having briefly implemented it.

Using a conventional dynamic two-way fixed effects estimator, estimate the average treatment effect of same day registration on 18-24 year old turnout 0, 1, 2 and 3 elections after implementation. Include the placebo estimates for the three elections prior to treatment. Use the election just prior to treatment as the “baseline” time period. Cluster your standard errors appropriately (at the level of the state). I would recommend using fixest to implement this.

Make an event study plot to present you results and interpret your findings.

Part C

Construct a test for whether the turnout effects you estimated in Part B vary between the 18-24 year old voters and all of the other age brackets in the sample. Do you conclude that there is a statistically distinguishable difference in the treatment effect of SDR between “young” voters and older voters?

Part D

Estimate the same relative treatment time effects from Part B using instead the Callaway/Sant’anna first-differences estimator (implemented in did). Compare the two sets of estimates (both the relative treatment time effects and the placebo estimates) and discuss any differences you observe. Explain why the two sets of estimates might differ, or if they don’t, what you might then conclude about the treatment effects.