Chapter 1: Questions about Questions

한국어

Angrist & Pischke, Mostly Harmless Econometrics

Core Message

"Good econometrics cannot save a shaky research agenda, but the promiscuous use of fancy econometric techniques sometimes brings down a good one."

Every empirical research project should begin with four Frequently Asked Questions (FAQs).

FAQ 1: What is the causal relationship of interest?

The most interesting research in social science is about cause and effect.

Why Causal Relationships?

  • Useful for making predictions about counterfactual worlds
  • Helps predict consequences of policy changes
  • Can be derived from economic models

Example: Causal Effect of Education

Question: What is the causal effect of schooling on wages?

Definition: The increment to wages an individual would receive with more schooling

Finding: Causal effect of college degree ≈ 40% higher wages on average

Applications: Predicting effects of changing college costs, strengthening attendance laws

Units of Analysis

Unit Research Example
Individuals Education → Wages (Labor Economics)
Firms R&D Investment → Productivity
Countries Colonial Institutions → Economic Growth (Acemoglu et al., 2001)

FAQ 2: What is the ideal experiment?

Contemplate the ideal experiment that could capture the causal effect of interest.

Why Think About Ideal Experiments?

  • Helps pick fruitful research topics
  • Formulates causal questions precisely
  • Highlights forces to manipulate and factors to hold constant
"If you can't devise an experiment that answers your question in a world where anything goes, then the odds of generating useful results with a modest budget and non-experimental survey data seem pretty slim."

Examples

Schooling & Wages: Offer potential dropouts a reward for finishing school → Angrist & Lavy (2007) actually ran this

Political Institutions: Randomly assign different government structures to former colonies on Independence Day → Hypothetical

🚫 FUQ'd: Fundamentally Unidentified Questions

Questions that cannot be answered by any experiment are FUQ'd.

Example: Effect of school start age on 1st grade test scores

Comparison Problem
Same grade Late starters are older → Maturation effect
Same age Early starters spent more time in school → Time-in-school effect

Fundamental issue: Start age = Current age − Time in school (deterministic link)

Solution: Study adult outcomes (earnings, highest grade completed) instead

What's NOT FUQ'd: Causal Effects of Race/Gender

Race and gender seem hard to manipulate, but labor market discrimination research focuses on perceived race/gender.

  • Shakespeare's Rosalind disguised as Ganymede
  • Philip Roth's novel - Black professor passing as white
  • Audit studies: Experiments with fake resumes (Bertrand & Mullainathan, 2004)

FAQ 3: What is the identification strategy?

Identification Strategy: The manner in which a researcher uses observational data to approximate a real experiment.

Natural Experiments

Example: Angrist & Krueger (1991)

  • Used interaction between compulsory attendance laws and season of birth
  • Season of birth affects how much students are constrained by dropout laws
  • → Estimates effect of finishing high school on wages

Haavelmo (1944)'s Insight

"A design of experiments is an essential appendix to any quantitative theory. Experiments may be grouped into two classes:
(1) Experiments we should like to make to verify certain hypotheses
(2) The stream of experiments that Nature is steadily turning out, which we merely watch as passive observers"

FAQ 4: What is the mode of statistical inference?

Mode of Statistical Inference (Rubin, 1991)

What to Specify

  • The population to be studied
  • The sample to be used
  • Assumptions made when constructing standard errors

Practical Issues

  • Especially important with clustered or grouped data
  • Ultimate success of even well-conceived projects turns on inference details

Econometrics Haiku (Keisuke Hirano)

T-stat looks too good.
Use robust standard errors—
significance gone.

Chapter 1 Summary

FAQ Question Key Point
1 Causal relationship? Predicting changes in counterfactual worlds
2 Ideal experiment? Must conceptualize, even if hypothetical
3 Identification strategy? Approximate experiments with natural variation
4 Mode of inference? Population, sample, standard error assumptions
← Back to Study Notes Chapter 2: The Experimental Ideal →
This note was written with the assistance of LLM (Claude).