Autoregressive Conditional Heteroscedasticity (ARCH) model

The autoregressive conditional heteroscedasticity (ARCH) model is a statistical model for time series data that models the variance of the current error as a function of the actual sizes of the previous time periods’ errors. The ARCH model is appropriate when the error variance in a time series follows an autoregressive (AR) model.

An ARCH(q) process can be written as \(y_{t} = a_{0} + \sum_{i=1}^{q}y_{t-q}+\epsilon_{t}\) where \(\epsilon_{t}\) denote the error terms. These \(\epsilon_{t}\) are split into a stochastic piece \(z_{t}\) and a time-dependent standard deviation \(\sigma_{t}\) characterizing the typical size of the terms so that \(\epsilon_{t}=\sigma_{t}z_{t}\). The random variable \(z_{t}\) is a strong white noise process. The series \(\sigma_{t}^{2}\) is modeled by

\[\sigma_{t}^{2} = \alpha_{0} + \alpha_{1}\epsilon_{t-1}^{2} + \dots + \alpha_{q}\epsilon_{t-1}^{2} = \alpha_{0} + \sum_{i=1}^{q}\alpha_{i}\epsilon_{t-i}^{2} \, ,\]

where \(\alpha_{0} > 0\) and \(\alpha_{i} \geq 0\), \(i > 0\).

An ARCH(q) model can be estimated using ordinary least squares. This procedure is as follows:

Estimate the best fitting autoregressive model AR(q) \(y_{t}=a_{0}+a_{1}y_{t-1}+\dots +a_{q}y_{t-q}+\epsilon_{t} =a_{0}+\sum_{i=1}^{q}a_{i}y_{t-i}+\epsilon_{t}\).
Obtain the squares of the error \(\hat{\epsilon}^{2}\) and regress them on a constant and q lagged values: \(\hat{\epsilon}^{2}=\hat {\alpha}_{0}+\sum_{i=1}^{q}\hat{\alpha}_{i}\hat{\epsilon}_{t-i}^{2}\) where q is the length of ARCH lags.
Null Hypothesis: \(\alpha_{i}=0\) for all \(i=1,\dots ,q\). Alternative hypothesis: At least one of the estimated \(\alpha_{i}\) coefficients must be significant. Under the null hypothesis of no ARCH errors, the test statistic \(T'R²\) follows \(\chi^{2}\) distribution with q degrees of freedom, where \(T'\) is the number of equations in the model which fits the residuals vs the lags (i.e. \(T'=T-q\)). If \(T'R²\) is greater than the Chi-square table value, we reject the null hypothesis and conclude there is an ARCH effect, otherwise we do not reject the null hypothesis.

For additional information, see Wikipedia: Autoregressive conditional heteroskedasticity.

Keep in Mind

Data should be properly formatted for estimation as a time-series. See creating a time series data set. If not, you may fail to execute or receive erroneous output.
ARCH can be used to model time-varying conditional variance.

Also Consider

ARCH models can be univariate (scalar) or multivariate (vector).
ARCH models are commonly employed in modeling financial time series that exhibit time-varying volatility and volatility clustering, i.e. periods of swings interspersed with periods of relative calm.
If an autoregressive moving average (ARMA) model is assumed for the error variance, the model is a generalized autoregressive conditional heteroskedasticity (GARCH) model. For more information on GARCH models, see Wikipedia: GARCH. For information about estimating an GARCH models, see LOST: GARCH models.

Implementations

Julia

The ARCHModels.jl package offers a variety of ARCH-model simulation, estimation, and forecasting tools.

We start by loading the package.

# Load necessary packages 
using ARCHModels

Next, we use the simulate function to specify an ARCH{1} model with coefficient parameters a0 and a1, and then simulate a realization of the specified data-generating process with 1000 observations.

# Simulate an ARCH(1) process
a0 = 1.0
a1 = 0.5
arch1sim = simulate(ARCH{1}([a0, a1]), 1000)

Lastly, we use the fit function to fit an ARCH{1} model to the generated series contained in the data attribute of the UnivariateARCHModel object we named arch1sim in the above code chunk.

# Fit ARCH(1) model to simulated data
fit(ARCH{1}, arch1sim.data)

Python

from random import gauss
from random import seed
from matplotlib import pyplot
from arch import arch_model
import numpy as np
# seed the process
np.random.seed(1)
# Simulating a ARCH(1) process
a0 = 1
a1 = .5
w = np.random.normal(size=1000)
e = np.random.normal(size=1000)
Y = np.empty_like(w)
for t in range(1, len(w)):
    Y[t] = w[t] * np.sqrt((a0 + a1*Y[t-1]**2))
# fit model
model = arch_model(Y, vol = "ARCH", rescale = "FALSE")
model_fit = model.fit()
print(model_fit.summary)

R

# setup
library(fGarch)
# seed pseudorandom number generator
set.seed(1)
# Simulating a ARCH(1) process
e <- NULL
obs <- 1000
e[1] <- rnorm(1)
for (i in 2:obs) {e[i] <- rnorm(1)*(1+0.5*(e[i-1])^2)^0.5}
# fit the model
arch.fit <- garchFit(~garch(1,0), data = e, trace = F)
summary(arch.fit)

Stata

* seed pseudorandom number generator
set seed 1
* Simulating a ARCH(1) process
set obs 1000
gen time=_n
tsset time
gen e=.
replace e=rnormal() if time==1
replace e=rnormal()*(1 + .5*(e[_n-1])^2)^.5 if time>=2 & time<=2000
* Estimate arch parameters.. 
arch e, arch(1)