Extension of the Method
In general the number of hidden states can be more than two.
Suppose that the number of hidden states is m and the steady state probability distribution of the hidden
states is given by
\[
a = (a_1, a_2, \ldots, a_m).
\]
Moreover, we let the number of observable states be $n$, and when the hidden state is $i$ ($i = 1, 2, \ldots, m$),
the stationary distribution of the observable states is given by
\[
(p_{i1}, p_{i2}, \ldots, p_{in}).
\]
Here we assume that the model parameters $m$, $n$ and $p_{ij}$ are known.
Given an observed sequence of the observable states, one can of course count the occurrences of each state
in the sequence and hence obtain the probability distribution $q$ of the observable states.
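As a small illustration of this counting step, the following sketch computes the empirical distribution $q$ from an observed sequence; the sequence and the value $n = 3$ are made-up inputs used only for the example.

```python
import numpy as np

def empirical_distribution(sequence, n):
    """Relative frequency of each observable state 0, 1, ..., n-1 in the sequence."""
    counts = np.bincount(sequence, minlength=n)
    return counts / counts.sum()

# Hypothetical observed sequence over n = 3 observable states.
observed = np.array([0, 1, 1, 2, 0, 1, 2, 2, 1, 0])
q = empirical_distribution(observed, n=3)
print(q)  # [0.3 0.4 0.3]
```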
Using the same trick as before, if we ignore the hidden states, then the observable states follow a Markov chain
whose transition probability matrix is given by
\[
P_1 =
\begin{pmatrix}
a_1 & a_2 & \cdots & a_m \\
a_1 & a_2 & \cdots & a_m \\
\vdots & \vdots & & \vdots \\
a_1 & a_2 & \cdots & a_m
\end{pmatrix}
\begin{pmatrix}
p_{11} & p_{12} & \cdots & p_{1n} \\
p_{21} & p_{22} & \cdots & p_{2n} \\
\vdots & \vdots & & \vdots \\
p_{m1} & p_{m2} & \cdots & p_{mn}
\end{pmatrix}
= \mathbf{1}\, p
\]
where the first factor is the $n \times m$ matrix each of whose rows equals $a$, $\mathbf{1}$ is the $n \times 1$ column vector of all ones, and
\[
p = \Bigl( \sum_{i=1}^{m} a_i p_{i1}, \ \sum_{i=1}^{m} a_i p_{i2}, \ \ldots, \ \sum_{i=1}^{m} a_i p_{in} \Bigr).
\]
It is easy to check that
\[
p P_1 = p \qquad \text{and} \qquad \sum_{i=1}^{n} p_i = 1.
\]
Hence we have the following proposition.
Proposition 1. The vector $p$ is the steady-state probability distribution of $P_1$.
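The proposition can be checked numerically in a few lines. The sketch below uses made-up values of $a$ and of the emission probabilities $p_{ij}$ (illustrative assumptions, not values from the text): it forms $P_1$ as the product above, computes $p$, and verifies that $p P_1 = p$.

```python
import numpy as np

# Hypothetical model with m = 2 hidden states and n = 3 observable states.
a = np.array([0.3, 0.7])              # steady-state distribution of the hidden states
P = np.array([[0.5, 0.3, 0.2],        # p_ij: row i is the distribution of the
              [0.1, 0.6, 0.3]])       # observable states given hidden state i
m, n = P.shape

# P1: an n x m matrix with every row equal to a, multiplied by (p_ij).
P1 = np.tile(a, (n, 1)) @ P

# p = (sum_i a_i p_i1, ..., sum_i a_i p_in).
p = a @ P

# Every row of P1 equals p, p sums to one, and p P1 = p (Proposition 1).
assert np.allclose(P1, np.tile(p, (n, 1)))
assert np.isclose(p.sum(), 1.0)
assert np.allclose(p @ P1, p)
print(p)  # [0.22 0.51 0.27]
```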
Therefore the steady-state probability distribution of the hidden states
\[
a = (a_1, a_2, \ldots, a_m)
\]
can be obtained by solving
\[
\min_{a} \; \| p - q \| \quad \text{subject to} \quad \sum_{i=1}^{m} a_i = 1 \quad \text{and} \quad a \ge 0.
\]
This is a standard constrained least squares problem when $\|\cdot\|$ is chosen to be the
square of the $L_2$-norm. We remark that when $\|\cdot\|$ is chosen to be the $L_1$-norm,
the resulting optimization problem can be transformed into a linear programming problem; see for instance [2].
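The sketch below illustrates both routes using scipy, which is an implementation choice rather than part of the text: the squared $L_2$ version is solved as a constrained least squares problem with `scipy.optimize.minimize`, and the $L_1$ version is rewritten as a linear program for `scipy.optimize.linprog`. The emission matrix $P = (p_{ij})$ and the observed distribution $q$ are hypothetical inputs, and $p$ depends on $a$ through $p = aP$.

```python
import numpy as np
from scipy.optimize import linprog, minimize

# Hypothetical emission probabilities p_ij (m = 2 hidden, n = 3 observable states)
# and a hypothetical observed distribution q.
P = np.array([[0.5, 0.3, 0.2],
              [0.1, 0.6, 0.3]])
q = np.array([0.25, 0.45, 0.30])
m, n = P.shape

# --- Squared L2-norm: constrained least squares  min_a ||a P - q||_2^2 ---
res_l2 = minimize(
    fun=lambda a: np.sum((a @ P - q) ** 2),
    x0=np.full(m, 1.0 / m),                       # start from the uniform distribution
    bounds=[(0.0, None)] * m,                     # a_i >= 0
    constraints=[{"type": "eq", "fun": lambda a: a.sum() - 1.0}],  # sum_i a_i = 1
    method="SLSQP",
)
a_l2 = res_l2.x

# --- L1-norm: rewrite  min_a ||a P - q||_1  as a linear program ---
# Variables x = (a_1, ..., a_m, t_1, ..., t_n) with t_j >= |(a P)_j - q_j|.
c = np.concatenate([np.zeros(m), np.ones(n)])     # minimise sum_j t_j
A_ub = np.block([[P.T, -np.eye(n)],               #  (a P)_j - q_j <= t_j
                 [-P.T, -np.eye(n)]])             # -(a P)_j + q_j <= t_j
b_ub = np.concatenate([q, -q])
A_eq = np.concatenate([np.ones(m), np.zeros(n)])[None, :]  # sum_i a_i = 1
res_l1 = linprog(c, A_ub=A_ub, b_ub=b_ub, A_eq=A_eq, b_eq=[1.0],
                 bounds=[(0, None)] * (m + n), method="highs")
a_l1 = res_l1.x[:m]

print("L2 estimate of a:", np.round(a_l2, 3))
print("L1 estimate of a:", np.round(a_l1, 3))
```

In both cases only the estimate of $a$ is of interest; the auxiliary variables $t_j$ in the linear program merely bound the absolute residuals $|(aP)_j - q_j|$, which is the standard way of linearizing the $L_1$ objective.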