In: Statistics and Probability
Survival analysis is a branch of statistics for analyzing the expected duration of time until one or more events happen, such as death in biological organisms and failure in mechanical systems. A survival model can be used for different kinds of analysis, for example, to study age at marriage, the duration of marriage, the intervals between successive births to a woman, the duration of stay in a city (or in a job), the length of life etc.
The following terms are commonly used in a survival model analysis:
The following three functions are essential in designing a basic survival model:
1. Survival Function
Let T be a non-negative random variable representing the waiting
time until the occurrence of an event. For simplicity we will adopt
the terminology of survival analysis, referring to the event of
interest as `death' and to the
waiting time as `survival' time.
We will assume for now that T is a continuous random variable with probability density function (p.d.f.): f(t) and cumulative distribution function (c.d.f.): F(t) = Pr {T < t}. We can now define the Survival function S(t) which gives the probability of being alive just before duration t, or more generally, the probability that the event of interest has not occurred by a duration t. Then, we have S(t) as follows:
2. Hazard Function
An alternative characterization of the distribution of T is given by the Hazard function λ(t), or instantaneous rate of occurrence of the event, defined as follows:
The numerator of this expression is the conditional probability
that the event will occur in the interval [t; t+dt) given that it
has not occurred before, and the denominator is the width of the
interval. Dividing one by the other we
obtain a rate of event occurrence per unit of time. Taking the
limit as the width of the interval goes down to zero, we obtain an
instantaneous rate of occurrence.
3. Likelihood Function
Suppose then that we have n units with lifetimes governed by a survival function S(t) with associated density f(t) and hazard λ(t). Suppose unit i is observed for a time ti. If the unit died at ti, its contribution to the likelihood function (Li) is the density at that duration, which can be written as the product of S(t) and λ(t). Let di be a death indicator, taking the value one if unit i died and the value zero otherwise. Then the likelihood function (L) may be written as follows:
............................................................................................................
Short Note: Model Fitting
There are essentially three approaches to fitting survival models:
............................................................................................................
Hope this helps!