Systems of Regression Equations
Consider a system of G regression equations, each with N observations, indexed by i:

Yi = Xiβi + εi   (i=1,2,...,G)

where Yi = [Yi1,Yi2,...,YiN]', Xi = [Xi1,Xi2,...,XiN]', and εi = [εi1,εi2,...,εiN]'.
The model satisfies the following assumptions:
E(εi) = 0 (N×1)
Cov(Xi,εi) = E(Xi'εi) = 0 (K×1)
Cov(εi,εj) = E(εiεj') = σijΩij
- σii = σi² > 0.
- σij = 0 (i≠j) if there is no cross-equation correlation.
- Ωij = I_N (the N×N identity) if there is no serial correlation.
The stacked (or pooled) model is written as

Y = Xβ + ε, or

| Y1 |   | X1  0   ..  0  | | β1 |   | ε1 |
| Y2 | = | 0   X2  ..  0  | | β2 | + | ε2 |
| :  |   | :   :   :   :  | | :  |   | :  |
| YG |   | 0   0   ..  XG | | βG |   | εG |

This model is also known as the Seemingly Unrelated Regression (SUR) model.
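As a concrete illustration, the block-diagonal stacking can be sketched in a few lines of numpy. The data here are simulated and the dimensions (G=3, N=50, K=2) are arbitrary choices; with no cross-equation correlation imposed, pooled OLS on the stacked system reproduces equation-by-equation OLS.

```python
import numpy as np
from scipy.linalg import block_diag

rng = np.random.default_rng(0)
G, N, K = 3, 50, 2  # equations, observations, regressors per equation

# Simulate G regressions with different coefficients (hypothetical data)
X_list = [np.column_stack([np.ones(N), rng.normal(size=N)]) for _ in range(G)]
beta_list = [np.array([1.0 + i, 0.5 * (i + 1)]) for i in range(G)]
Y_list = [X @ b + rng.normal(size=N) for X, b in zip(X_list, beta_list)]

# Stack into Y = X beta + eps with a block-diagonal regressor matrix
Y = np.concatenate(Y_list)                       # (G*N,) stacked dependent variable
X = block_diag(*X_list)                          # (G*N, G*K) block-diagonal design
beta_ols = np.linalg.lstsq(X, Y, rcond=None)[0]  # pooled OLS on the stacked system
```

Because X is block diagonal, the pooled OLS estimate decomposes into the G separate per-equation OLS estimates.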
Special Cases
- Common Parameters: βi = β for all i, that is,
  Yi = Xiβ + εi
- More restrictive cases include common regressors (Xi = X for all i) and restrictions on some or all of the parameters.
Consider the general model Y = Xβ + ε satisfying the following classical assumptions (no serial correlation, and homoscedasticity for each equation):

E(ε) = 0 (NG×1)
E(X'ε) = 0
Var(ε) = E(εε') = V = Σ⊗I_N (an NG×NG matrix), where

Σ = | σ11  σ12  ..  σ1G |
    | σ21  σ22  ..  σ2G |
    | :    :    :   :   |
    | σG1  σG2  ..  σGG |
Notice that contemporaneous correlation across equations is allowed even though there is no serial correlation within each equation. GLS (Generalized Least Squares) estimation of the model parameters β follows:
b = (X'V⁻¹X)⁻¹X'V⁻¹Y
Var(b) = (X'V⁻¹X)⁻¹
The elements σij of V = Σ⊗I_N are estimated by sij = ei'ej/N, where ei = Yi - Xib is the residual vector for equation i obtained from OLS estimation (that is, ignoring cross-equation correlation). The resulting feasible GLS estimation may be iterated, updating the residuals e and the variance-covariance matrix V at each step.
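The two-step (feasible) GLS procedure can be sketched with numpy; the data are simulated, the true error correlation (0.6) and all dimensions are made-up values for illustration only.

```python
import numpy as np
from scipy.linalg import block_diag

rng = np.random.default_rng(1)
G, N = 2, 200
# Errors with contemporaneous correlation across the two equations
Sigma_true = np.array([[1.0, 0.6], [0.6, 1.0]])
E = rng.multivariate_normal(np.zeros(G), Sigma_true, size=N)  # N x G errors
X_list = [np.column_stack([np.ones(N), rng.normal(size=N)]) for _ in range(G)]
Y_list = [X_list[i] @ np.array([1.0, 2.0]) + E[:, i] for i in range(G)]

Y = np.concatenate(Y_list)
X = block_diag(*X_list)

# Step 1: equation-by-equation OLS residuals
b_ols = np.linalg.lstsq(X, Y, rcond=None)[0]
e = (Y - X @ b_ols).reshape(G, N)   # residuals, one row per equation
S = e @ e.T / N                     # s_ij = e_i'e_j / N

# Step 2: GLS with V = S kron I_N, so V^-1 = S^-1 kron I_N
Vinv = np.kron(np.linalg.inv(S), np.eye(N))
XtVinv = X.T @ Vinv
b_gls = np.linalg.solve(XtVinv @ X, XtVinv @ Y)
var_b_gls = np.linalg.inv(XtVinv @ X)
```

Iterating the two steps (recomputing residuals from b_gls and re-estimating S) gives the iterated FGLS estimator described above.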
Random Coefficients Model
Similar in spirit to the random effects model of panel data analysis, for each equation i=1,2,...,G the model (with K random coefficients) may be expressed as follows:

Yi = Xiβi + εi
βi = β + υi

Note that not only the intercept but also the slope parameters are random across equations. This model generalizes the system of regression equations with common parameters by letting those parameters be random.
The assumptions of the model are:

- E(εi) = 0 (N×1)
- Var(εi) = E(εiεi') = σi²I_N
- Cov(εi,εj) = 0 (i≠j)

and

- E(υi) = 0 (K×1)
- Var(υi) = E(υiυi') = Γ, a K×K matrix independent of i
- Cov(υi,υj) = 0 (i≠j)
- Cov(υi,εi) = 0
The model for estimation is

Yi = Xiβ + (Xiυi + εi), or
Yi = Xiβ + ωi

where ωi = Xiυi + εi, and

- E(ωi) = 0 (N×1)
- Var(ωi) = E(ωiωi') = Πi
  = E(Xiυiυi'Xi' + Xiυiεi' + εiυi'Xi' + εiεi')
  = σi²I_N + XiΓXi'
Then the stacked model is

Y = Xβ + ω

where ω = [ω1',...,ωG']', and

E(ω) = 0 (GN×1)
Var(ω) = E(ωω') = V =

| Π1  0   ..  0  |
| 0   Π2  ..  0  |
| :   :   :   :  |
| 0   0   ..  ΠG |
GLS is used to estimate the model. That is,

b* = (X'V⁻¹X)⁻¹X'V⁻¹Y
Var(b*) = (X'V⁻¹X)⁻¹
A special case of the random coefficients model assumes fixed slope coefficients with a random intercept:

Yij = Xijβ + ωij
ωij = υj + εij

where i = 1,2,...,G (equations) and j = 1,2,...,N (observations). For each observation j, υj is the random component of the intercept. We assume

- E(υj) = 0, Var(υj) = σu²
- E(εij) = 0, Var(εij) = σe²
- Cov(υj,εij) = 0
The stacked form of the model is written as

Y = Xβ + ω
ω = ι⊗υ + ε

where ι is the unit vector of size G (the number of equations) and υ = [υ1,...,υN]'. Then

Var(ω) = Var(ι⊗υ + ε) = ιι'⊗Var(υ) + Var(ε) = ιι'⊗(σu²I_N) + σe²I_GN = V

The model can be estimated with GLS.
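The structure of V is easy to verify numerically. A minimal sketch, with made-up variance components σu² = 0.5 and σe² = 1.0 and small dimensions chosen only for illustration:

```python
import numpy as np

G, N = 3, 4
sigma_u2, sigma_e2 = 0.5, 1.0   # hypothetical variance components

iota = np.ones((G, 1))
# V = (iota iota') kron (sigma_u^2 I_N) + sigma_e^2 I_{GN}
V = np.kron(iota @ iota.T, sigma_u2 * np.eye(N)) + sigma_e2 * np.eye(G * N)
```

Each diagonal element equals σu² + σe², and the same observation j shares the covariance σu² across equations, which is exactly the error-components pattern above.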
The computation of the random coefficients model is based on the following steps (Swamy, 1971):

- For each regression equation i, Yi = Xiβi + εi, obtain the OLS estimator of βi:

  bi = (Xi'Xi)⁻¹Xi'Yi
  Var(bi) = (Xi'Xi)⁻¹(Xi'ΠiXi)(Xi'Xi)⁻¹
          = σi²(Xi'Xi)⁻¹ + Γ
          = Vi + Γ

  (taking account of heteroscedasticity, where Vi = σi²(Xi'Xi)⁻¹). Note that σi² is estimated by si² = ei'ei/(N-K), where ei = Yi - Xibi. Then Vi is estimated by si²(Xi'Xi)⁻¹.
- For the random coefficients equation, βi = β + υi, the variance of bi (the estimator of βi) is estimated by

  ∑i=1,...,G (bi-bm)(bi-bm)'/(G-1) = [∑i=1,...,G bibi' - G bmbm']/(G-1)

  where bm = ∑i=1,...,G bi/G. Therefore,

  Γ = [∑i=1,...,G bibi' - G bmbm']/(G-1) - ∑i=1,...,G Vi/G

  Since this difference may fail to be positive definite, we use instead

  Γ = [∑i=1,...,G bibi' - G bmbm']/(G-1).
- Write the GLS estimator of β as:

  b* = (X'V⁻¹X)⁻¹X'V⁻¹Y
     = [∑i=1,...,G Xi'Πi⁻¹Xi]⁻¹ [∑i=1,...,G Xi'Πi⁻¹Yi]
     = [∑i=1,...,G Xi'Πi⁻¹Xi]⁻¹ [∑i=1,...,G Xi'Πi⁻¹Xibi]
     = [∑i=1,...,G (Γ+Vi)⁻¹]⁻¹ [∑i=1,...,G (Γ+Vi)⁻¹bi]
     = ∑i=1,...,G Wibi

  where Wi = [∑j=1,...,G (Γ+Vj)⁻¹]⁻¹ (Γ+Vi)⁻¹. Similarly,

  Var(b*) = (X'V⁻¹X)⁻¹ = [∑i=1,...,G (Γ+Vi)⁻¹]⁻¹
The individual parameter vectors may be predicted as follows:

bi* = (Γ⁻¹+Vi⁻¹)⁻¹[Γ⁻¹b* + Vi⁻¹bi]
    = Aib* + (I-Ai)bi

where Ai = (Γ⁻¹+Vi⁻¹)⁻¹Γ⁻¹.
Var(bi*) =

[Ai  I-Ai] | ∑j=1,...,G Wj(Γ+Vj)Wj'   Wi(Γ+Vi) | | Ai'     |
           | (Γ+Vi)Wi'                Γ+Vi     | | (I-Ai)' |
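Swamy's steps can be sketched directly in numpy. The data below are simulated (made-up β mean, Γ, and dimensions), and the Γ estimate uses the simpler positive semidefinite version from the notes, without subtracting the average of the Vi:

```python
import numpy as np

rng = np.random.default_rng(2)
G, N, K = 10, 100, 2
beta_mean = np.array([1.0, 2.0])          # hypothetical mean coefficient vector
Gamma_true = 0.25 * np.eye(K)             # hypothetical coefficient dispersion

b_list, V_list = [], []
for i in range(G):
    X = np.column_stack([np.ones(N), rng.normal(size=N)])
    beta_i = beta_mean + rng.multivariate_normal(np.zeros(K), Gamma_true)
    y = X @ beta_i + rng.normal(size=N)
    b = np.linalg.lstsq(X, y, rcond=None)[0]      # per-equation OLS
    e = y - X @ b
    s2 = e @ e / (N - K)                          # s_i^2 = e_i'e_i/(N-K)
    b_list.append(b)
    V_list.append(s2 * np.linalg.inv(X.T @ X))    # V_i = s_i^2 (X_i'X_i)^-1

B = np.array(b_list)                              # G x K matrix of OLS estimates
bm = B.mean(axis=0)
Gamma = (B - bm).T @ (B - bm) / (G - 1)           # PSD estimate of Gamma

# Swamy GLS: b* = [sum (Gamma+V_i)^-1]^-1 sum (Gamma+V_i)^-1 b_i
Ainvs = [np.linalg.inv(Gamma + Vi) for Vi in V_list]
b_star = np.linalg.solve(sum(Ainvs), sum(A @ b for A, b in zip(Ainvs, b_list)))
```

The final line is the matrix-weighted average ∑Wibi of the per-equation OLS estimates, with weights inversely proportional to Γ+Vi.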
Vector Autoregression
Generalizing from the univariate time series AR(1) model:

Yt = μ + ρYt-1 + εt

the multivariate system of G variables can be written as follows:

Yit = μi + ∑j=1,2,...,G ρijYj,t-1 + εit   (i=1,2,...,G)

This is called a Vector Autoregression of order 1, or VAR(1).
The matrix representation of the model as a simultaneous linear equations system looks like this:

[Y1t, Y2t, ..., YGt] = [μ1, μ2, ..., μG] + [Y1,t-1, Y2,t-1, ..., YG,t-1] ρ' + [ε1t, ε2t, ..., εGt]

where the coefficient matrix (the transpose of ρ in the stacked form below) is

ρ' = | ρ11  ρ21  ..  ρG1 |
     | ρ12  ρ22  ..  ρG2 |
     | :    :    :   :   |
     | ρ1G  ρ2G  ..  ρGG |
The alternative is the stacked form suitable for estimation as a system of regression equations:

| Y1t |   | μ1 |   | ρ11  ρ12  ..  ρ1G | | Y1,t-1 |   | ε1t |
| Y2t | = | μ2 | + | ρ21  ρ22  ..  ρ2G | | Y2,t-1 | + | ε2t |
| :   |   | :  |   | :    :    :   :   | | :      |   | :   |
| YGt |   | μG |   | ρG1  ρG2  ..  ρGG | | YG,t-1 |   | εGt |
In shorthand notation,

Yt = μ + ρYt-1 + εt
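Since every equation of a VAR shares the same regressors (a constant and all lagged variables), system GLS reduces to equation-by-equation OLS, so the model can be estimated by a single least-squares fit. A minimal sketch on simulated data (the μ and ρ values are made up):

```python
import numpy as np

rng = np.random.default_rng(3)
G, N = 2, 500
mu = np.array([0.5, -0.2])
rho = np.array([[0.5, 0.1], [0.2, 0.4]])   # stable: eigenvalues inside unit circle

# Simulate Y_t = mu + rho Y_{t-1} + eps_t
Y = np.zeros((N, G))
for t in range(1, N):
    Y[t] = mu + rho @ Y[t - 1] + rng.normal(size=G)

# OLS of Y_t on [1, Y_{t-1}], all equations at once
Z = np.column_stack([np.ones(N - 1), Y[:-1]])    # (N-1) x (G+1) regressor matrix
coef = np.linalg.lstsq(Z, Y[1:], rcond=None)[0]  # (G+1) x G: [mu'; rho']
mu_hat, rho_hat = coef[0], coef[1:].T
```

The first row of the coefficient matrix recovers the intercepts and the remaining rows (transposed) recover ρ.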
Extension: VAR(p)
First, we can write the univariate AR(p) model as the system:

Yt = μ + ρ1Yt-1 + ρ2Yt-2 + ... + ρpYt-p + εt
Yt-1 = Yt-1
Yt-2 = Yt-2
:
Yt-p+1 = Yt-p+1
Or,

| Yt     |   | μ |   | ρ1  ρ2  ..  ρp | | Yt-1 |   | εt |
| Yt-1   | = | 0 | + | 1   0   ..  0  | | Yt-2 | + | 0  |
| :      |   | : |   | :   :   :   :  | | :    |   | :  |
| Yt-p+1 |   | 0 |   | 0   ..  1   0  | | Yt-p |   | 0  |
That is,

Yt = μ + ρYt-1 + εt

This is a system of p equations with a restricted parameter matrix. The usable time series observations run from p+1 to N (N-p in total).

Similarly, for the multivariate VAR(p) system, the model can be expressed in terms of the stacked G endogenous variables, where Yt, Yt-1, ..., and Yt-p are G×1 vectors. The size of the problem is (N-p)×Gp.
Then the parameter matrix ρ of the lag variable Yt-1 is

ρ = | ρ1  ρ2  ..  ..  ρp |
    | I   0   ..  ..  0  |
    | 0   I   :   :   0  |
    | 0   0   ..  I   0  |

where, for each k = 1,2,...,p, ρk = [ρij,k] (i,j=1,2,...,G). Furthermore, I is the G×G identity matrix and 0 is the G×G zero matrix.
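Building this companion matrix is mechanical; a small helper sketch (the example lag matrices rho1, rho2 are made-up values):

```python
import numpy as np

def companion(rho_list):
    """Stack the G x G lag matrices rho_1,...,rho_p into the Gp x Gp companion form."""
    G = rho_list[0].shape[0]
    p = len(rho_list)
    top = np.hstack(rho_list)            # first block row: [rho_1 rho_2 .. rho_p]
    bottom = np.eye(G * (p - 1), G * p)  # shifted identity blocks: [I 0 ..; 0 I ..]
    return np.vstack([top, bottom])

rho1 = np.array([[0.5, 0.1], [0.0, 0.3]])
rho2 = np.array([[0.2, 0.0], [0.1, 0.1]])
C = companion([rho1, rho2])
# The VAR(p) is stable when all eigenvalues of C lie inside the unit circle
stable = np.all(np.abs(np.linalg.eigvals(C)) < 1)
```

The rectangular identity `np.eye(G*(p-1), G*p)` places the I blocks one block-column to the left of the diagonal, matching the matrix above.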
Impulse Response Functions
Deriving from a general VAR(1) system, Yt = μ + ρYt-1 + εt, we write:

(I - ρB)Yt = μ + εt

where B is the backshift operator. Then,

Yt = (I - ρ)⁻¹μ + ∑i=0,1,2,...,∞ ρ^i εt-i
   = Y* + (εt + ρεt-1 + ρ²εt-2 + ...)
Y* is the equilibrium and εt is the innovation. By shocking one element of εt, say εjt, Yt will move away from the equilibrium Y*. Note that the effect of a change in εjt is not confined to the jth variable alone but spreads to the other variables in the system. The path along which the variables return to equilibrium is called the impulse response of a stable VAR system. The impulse response function traces the effect of a one-time innovation εjt on the k-th variable over time (i=0,1,2,...) as the (k,j) element of ρ^i (k,j = 1,2,...,G).
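The impulse responses of a VAR(1) are therefore just the successive powers of ρ. A minimal sketch, using a made-up stable coefficient matrix:

```python
import numpy as np

rho = np.array([[0.5, 0.1], [0.2, 0.4]])   # hypothetical stable VAR(1) matrix

# Response of variable k at horizon i to a unit shock in eps_j is [rho^i]_{k,j}
horizons = 10
irf = np.stack([np.linalg.matrix_power(rho, i) for i in range(horizons)])

# e.g. the response path of variable 0 to a unit innovation in variable 1:
path_01 = irf[:, 0, 1]
```

At horizon 0 the response is the identity (the shock itself); for a stable ρ the responses decay toward zero, which is the return to the equilibrium Y*.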
Copyright © Kuan-Pin Lin
Last updated: 1/24/2012