HyperAI

One-sentence Summary

By estimating a factor-augmented vector autoregression (FAVAR) model with Bayesian Normal-Gamma shrinkage priors and identifying monetary policy shocks via high-frequency surprises, this study analyzes monthly metropolitan home price data to reveal sizeable positive effects on regional housing prices that vary in magnitude and duration, with the largest impacts concentrated in California, Arizona, and Florida.

Key Contributions

The study implements a factor-augmented vector autoregression (FAVAR) model estimated through a full Bayesian framework that applies Gibbs sampling and Normal-Gamma shrinkage priors to simultaneously recover latent factors and autoregressive coefficients.
Monetary policy shocks are identified using high-frequency surprises around policy announcements as external instruments, which isolates exogenous policy changes without imposing restrictive structural assumptions.
Empirical analysis of monthly housing price data across 417 US metropolitan regions over a 72-month horizon demonstrates substantial spatial heterogeneity, with the strongest positive effects concentrated on the East and West Coasts and high levels of local spatial autocorrelation.

Introduction

Understanding how national monetary policy influences regional housing markets is critical for economic forecasting and targeted policy design. Prior research typically relies on rigid structural models that impose theoretical constraints or standard vector autoregressions that lack robust uncertainty quantification and often aggregate data at the state level. The authors leverage a factor-augmented vector autoregression model combined with full Bayesian estimation to analyze monthly housing prices across 417 US metropolitan areas. By using high-frequency surprises around policy announcements to identify monetary shocks, they capture spatial heterogeneity and quantify parameter uncertainty, revealing that expansionary policy drives significant but geographically varied housing price responses over a multi-year horizon.

Dataset

Dataset Composition and Sources: The authors assemble a monthly panel dataset spanning April 1997 to June 2012 to analyze regional monetary policy transmission. Housing valuations come from the Zillow Home Value Index, national economic indicators are sourced from the FRED database, and credit spread metrics are drawn from the Gertler and Karadi (2015) archive.
Subset Details: The regional subset comprises 417 core-based statistical areas, filtered from an initial pool of 917 based on longitudinal data availability. This includes 264 metropolitan and 153 micropolitan regions. The national aggregate subset tracks seven variables: housing starts, the industrial production index, the consumer price index, the one-year government bond rate, the ten-year treasury minus federal funds rate spread, the prime mortgage spread over ten-year bonds, and the Gilchrist-Zakrajšek excess bond premium.
Data Usage: The authors structure the regional housing price vector and national aggregate vector as a simultaneous panel for macroeconomic modeling. This configuration allows them to isolate how national monetary policy and credit conditions drive divergent housing price responses across different geographic markets, rather than using traditional machine learning training splits or mixture ratios.
Processing and Adjustments: The authors seasonally adjust all series where applicable and apply mathematical transformations to achieve approximate stationarity. The Zillow index relies on statistical valuation models instead of repeat sales methods, capturing fair market values for all residential property types. The fixed 1997 to 2012 timeframe acts as the effective cropping strategy, ensuring uniform temporal coverage and metadata consistency across all 417 selected regions.

Method

The authors leverage a factor-augmented vector autoregressive (FVAR) model to analyze regional housing price dynamics, incorporating both latent common factors and observable macroeconomic aggregates. The framework is structured around two main equations: a measurement equation and a state equation. The measurement equation links observed housing prices $H_t$ and national aggregates $M_t$ to latent factors $F_t$ and the observed aggregates, expressed as:

\left[ \begin{array} { l } { H _ { t } } \\\\ { M _ { t } } \end{array} \right] = \left[ \begin{array} { l l } { \mathbf { \Lambda } ^ { F } } & { \mathbf { \Lambda } ^ { M } } \\\\ { \mathbf { 0 } _ { K \times S } } & { I _ { K } } \end{array} \right] \left[ \begin{array} { l } { F _ { t } } \\\\ { M _ { t } } \end{array} \right] + \left[ \begin{array} { l } { \epsilon _ { t } } \\\\ { \mathbf { 0 } _ { K \times 1 } } \end{array} \right],

where $F_t$ is an $S \times 1$ vector of unobserved latent factors capturing co-movement across regions, and $M_t$ is a $K \times 1$ vector of observable national aggregates. The factor loadings matrices $\Lambda^F$ and $\Lambda^M$ map the factors and aggregates to regional housing prices, while $\epsilon_t$ represents region-specific measurement errors with diagonal variance-covariance matrix $\Sigma_\epsilon$ . The identification of the model is achieved by constraining the upper $S \times S$ submatrix of $\Lambda^F$ to be the identity matrix $I_S$ , and setting the first $S$ rows of $\Lambda^M$ to zero.

The evolution of the factors and aggregates is governed by a VAR process of order $Q$ , specified as:

y_t = A x_t + u_t,

where $y_t = (F_t', M_t')'$ , $x_t = (y_{t-1}', \dots, y_{t-Q}')'$ , and $A$ is a $(S+K) \times Q(S+K)$ coefficient matrix. The shocks $u_t$ follow a normal distribution with zero mean and variance-covariance matrix $\Sigma_u$ . The parameters $\Lambda^F$ , $\Lambda^M$ , $A$ , and the latent factors $F_t$ are unknown and must be estimated.

To address the high dimensionality of the model, the authors adopt a Bayesian estimation framework. They stack the free elements of the factor loadings into a vector $\boldsymbol{\lambda}$ of length $L = R(S+K)$ and the VAR coefficients into a vector $\boldsymbol{a}$ of length $J = (S+K)^2 Q$ . Prior distributions are specified hierarchically to incorporate shrinkage and prior information. For the VAR coefficients $a_j$ , a Normal-Gamma shrinkage prior is employed, where each coefficient is conditionally normal with mean zero and variance proportional to a local scale $\tau_{aj}^2$ , and the global shrinkage parameter $\xi_a$ controls the overall degree of shrinkage. The local scales $\tau_{aj}^2$ and the global parameter $\xi_a$ are assigned Gamma priors, allowing for a balance between sparsity and robustness to large deviations. The marginal prior on $a_j$ exhibits heavy tails, enabling the model to accommodate large effects when necessary. The variance-covariance matrix $\Sigma_u$ is assigned an inverted Wishart prior to ensure proper shrinkage and posterior propriety.

For the factor loadings $\lambda_\ell$ , a similar Normal-Gamma prior structure is used, with a single global shrinkage parameter $\xi_\lambda$ applied across all loadings. This ensures that the factor loadings are shrunk toward zero while allowing for flexibility in individual elements. The measurement error variances $\sigma_r^2$ are assigned independent inverted Gamma priors, with hyperparameters set to minimize prior influence. The estimation is conducted using a Markov Chain Monte Carlo (MCMC) algorithm, specifically Gibbs sampling, with 20,000 draws and a burn-in period of 10,000 draws. The model is implemented with $Q=2$ lags, and the number of latent factors $S$ is selected using the deviance information criterion, resulting in $S=1$ . Hyperparameters are set to induce strong shrinkage: $\vartheta_a = \vartheta_\lambda = 0.1$ , and $c_0 = c_1 = d_0 = d_1 = 0.01$ . The prior on $\Sigma_u$ is weakly informative, with $\nu = S+K+1$ and $\underline{\Sigma} = 10^{-2}I_{S+K}$ , while the inverted Gamma priors on $\sigma_r^2$ are set with $e_0 = e_1 = 0.01$ .

To achieve structural identification, the model incorporates high-frequency surprises in the three-months-ahead futures rate as an exogenous instrument for monetary policy shocks. This approach, adapted from Paul (2018), integrates the surprise variable $z_t$ into the state equation:

\pmb{y}_t = \pmb{A x}_t + \pmb{\zeta z}_t + \pmb{u}_t,

where $\pmb{\zeta}$ is a vector of coefficients capturing the impulse response of the system to the monetary policy shock. This identification strategy avoids the need for zero restrictions and allows for consistent estimation of contemporaneous impulse responses. Higher-order responses are derived recursively using the state space representation of the VAR model.

Experiment

The study employs a Bayesian factor vector autoregression model using high-frequency policy surprises to identify monetary shocks and evaluates their dynamic impact on national macroeconomic fundamentals, a latent housing factor, and a broad panel of regional markets. This setup validates that expansionary policy consistently stimulates real economic activity and aggregate housing prices in line with established theory, while the latent factor successfully captures national price co-movement driven primarily by western and southern coastal states. Qualitatively, the regional analysis reveals substantial heterogeneity, with coastal areas experiencing the strongest and most persistent positive price effects while inland regions show muted or negative reactions. Furthermore, the findings demonstrate pronounced spatial dependence, indicating that neighboring regions exhibit highly similar dynamic responses, though the linear transmission framework may require nonlinear extensions to fully capture turbulent economic periods.

One-sentence Summary

Key Contributions

The study implements a factor-augmented vector autoregression (FAVAR) model estimated through a full Bayesian framework that applies Gibbs sampling and Normal-Gamma shrinkage priors to simultaneously recover latent factors and autoregressive coefficients.
Monetary policy shocks are identified using high-frequency surprises around policy announcements as external instruments, which isolates exogenous policy changes without imposing restrictive structural assumptions.
Empirical analysis of monthly housing price data across 417 US metropolitan regions over a 72-month horizon demonstrates substantial spatial heterogeneity, with the strongest positive effects concentrated on the East and West Coasts and high levels of local spatial autocorrelation.

Introduction

Dataset

Dataset Composition and Sources: The authors assemble a monthly panel dataset spanning April 1997 to June 2012 to analyze regional monetary policy transmission. Housing valuations come from the Zillow Home Value Index, national economic indicators are sourced from the FRED database, and credit spread metrics are drawn from the Gertler and Karadi (2015) archive.
Subset Details: The regional subset comprises 417 core-based statistical areas, filtered from an initial pool of 917 based on longitudinal data availability. This includes 264 metropolitan and 153 micropolitan regions. The national aggregate subset tracks seven variables: housing starts, the industrial production index, the consumer price index, the one-year government bond rate, the ten-year treasury minus federal funds rate spread, the prime mortgage spread over ten-year bonds, and the Gilchrist-Zakrajšek excess bond premium.
Data Usage: The authors structure the regional housing price vector and national aggregate vector as a simultaneous panel for macroeconomic modeling. This configuration allows them to isolate how national monetary policy and credit conditions drive divergent housing price responses across different geographic markets, rather than using traditional machine learning training splits or mixture ratios.
Processing and Adjustments: The authors seasonally adjust all series where applicable and apply mathematical transformations to achieve approximate stationarity. The Zillow index relies on statistical valuation models instead of repeat sales methods, capturing fair market values for all residential property types. The fixed 1997 to 2012 timeframe acts as the effective cropping strategy, ensuring uniform temporal coverage and metadata consistency across all 417 selected regions.

Method

\left[ \begin{array} { l } { H _ { t } } \\\\ { M _ { t } } \end{array} \right] = \left[ \begin{array} { l l } { \mathbf { \Lambda } ^ { F } } & { \mathbf { \Lambda } ^ { M } } \\\\ { \mathbf { 0 } _ { K \times S } } & { I _ { K } } \end{array} \right] \left[ \begin{array} { l } { F _ { t } } \\\\ { M _ { t } } \end{array} \right] + \left[ \begin{array} { l } { \epsilon _ { t } } \\\\ { \mathbf { 0 } _ { K \times 1 } } \end{array} \right],

The evolution of the factors and aggregates is governed by a VAR process of order $Q$ , specified as:

y_t = A x_t + u_t,

\pmb{y}_t = \pmb{A x}_t + \pmb{\zeta z}_t + \pmb{u}_t,

Experiment

One-sentence Summary

Key Contributions

The study implements a factor-augmented vector autoregression (FAVAR) model estimated through a full Bayesian framework that applies Gibbs sampling and Normal-Gamma shrinkage priors to simultaneously recover latent factors and autoregressive coefficients.
Monetary policy shocks are identified using high-frequency surprises around policy announcements as external instruments, which isolates exogenous policy changes without imposing restrictive structural assumptions.
Empirical analysis of monthly housing price data across 417 US metropolitan regions over a 72-month horizon demonstrates substantial spatial heterogeneity, with the strongest positive effects concentrated on the East and West Coasts and high levels of local spatial autocorrelation.

Introduction

Dataset

Dataset Composition and Sources: The authors assemble a monthly panel dataset spanning April 1997 to June 2012 to analyze regional monetary policy transmission. Housing valuations come from the Zillow Home Value Index, national economic indicators are sourced from the FRED database, and credit spread metrics are drawn from the Gertler and Karadi (2015) archive.
Subset Details: The regional subset comprises 417 core-based statistical areas, filtered from an initial pool of 917 based on longitudinal data availability. This includes 264 metropolitan and 153 micropolitan regions. The national aggregate subset tracks seven variables: housing starts, the industrial production index, the consumer price index, the one-year government bond rate, the ten-year treasury minus federal funds rate spread, the prime mortgage spread over ten-year bonds, and the Gilchrist-Zakrajšek excess bond premium.
Data Usage: The authors structure the regional housing price vector and national aggregate vector as a simultaneous panel for macroeconomic modeling. This configuration allows them to isolate how national monetary policy and credit conditions drive divergent housing price responses across different geographic markets, rather than using traditional machine learning training splits or mixture ratios.
Processing and Adjustments: The authors seasonally adjust all series where applicable and apply mathematical transformations to achieve approximate stationarity. The Zillow index relies on statistical valuation models instead of repeat sales methods, capturing fair market values for all residential property types. The fixed 1997 to 2012 timeframe acts as the effective cropping strategy, ensuring uniform temporal coverage and metadata consistency across all 417 selected regions.

Method

\left[ \begin{array} { l } { H _ { t } } \\\\ { M _ { t } } \end{array} \right] = \left[ \begin{array} { l l } { \mathbf { \Lambda } ^ { F } } & { \mathbf { \Lambda } ^ { M } } \\\\ { \mathbf { 0 } _ { K \times S } } & { I _ { K } } \end{array} \right] \left[ \begin{array} { l } { F _ { t } } \\\\ { M _ { t } } \end{array} \right] + \left[ \begin{array} { l } { \epsilon _ { t } } \\\\ { \mathbf { 0 } _ { K \times 1 } } \end{array} \right],

The evolution of the factors and aggregates is governed by a VAR process of order $Q$ , specified as:

y_t = A x_t + u_t,

\pmb{y}_t = \pmb{A x}_t + \pmb{\zeta z}_t + \pmb{u}_t,

House Price Calculation Methods for Beginners

Abstract

One-sentence Summary

Key Contributions

Introduction

Dataset

Method

Experiment

Build AI with AI

HyperAI Newsletters

House Price Calculation Methods for Beginners

Abstract

One-sentence Summary

Key Contributions

Introduction

Dataset

Method

Experiment

Build AI with AI

HyperAI Newsletters

House Price Calculation Methods for Beginners

Abstract

One-sentence Summary

Key Contributions

Introduction

Dataset

Method

Experiment

Build AI with AI

HyperAI Newsletters

Command Palette

House Price Calculation Methods for Beginners

Abstract

One-sentence Summary

Key Contributions

Introduction

Dataset

Method

Experiment

Build AI with AI

HyperAI Newsletters

Command Palette

House Price Calculation Methods for Beginners

Abstract

One-sentence Summary

Key Contributions

Introduction

Dataset

Method

Experiment

Build AI with AI

HyperAI Newsletters

Command Palette

House Price Calculation Methods for Beginners

Abstract

One-sentence Summary

Key Contributions

Introduction

Dataset

Method

Experiment

Build AI with AI

HyperAI Newsletters