Surviving Lawlessness, Pt. 2: Being Discrete
Introduction
This is part 2 of my slow trudge through J. F. Lawlessness’ book “Statistical Models and Methods For Lifetime Data”. Part 1 is here.
Despite our penchant for measuring time in terms of things like ‘seconds’, time is continuous - a smear of unidirectional, infinitesimal, temporally flavored jam across the universe. Sometimes, though, time-to-event things happen in more discrete units. Consider graduation time - students tend to graduate yearly, all at the same time (ie graduation day)1. For this section, we’re going to describe things in terms of graduation, but you might also consider a ‘yearly checkup’ to be another case of discrete-time events.
Old friends in new clothes
Probability function
We no longer call it a ‘probability density function’, because it’s no longer a density. Instead of the paradoxical ‘
Here, I’ve generated fake data for graduation times. I just wanted the mode to be around 4 years. I don’t even know if that’s true or not. It also assumes that no students drop out, which I know is definitely not true. But it’s a distribution we’re going to stick with for sake of example:

The probability of graduating at your 4th year is 0.5 in this plot. We would say that
Our arbitrary probability function is written like this:
Survivor Function
The survivor function means the same thing that it did previously, but it needs to be summed instead of integrated2.
The colon below the sum means ‘such that’, just like the bar (
So,

Plotted, the survivor function looks like this:

Hazard function
The hazard function in the discrete case is slightly different from the continuous case, as it is a probability rather than just a rate. It functions like the conditional probability we talked about previously. In our case of graduation, it is the probability of graduating in some year, given you haven’t already graduated. In math speak, it’s:
We can write this in terms of
Since in this analogy,
and we noted in the above sections that
We can say that
The hazard function looks like this:

Note that it is NOT the same as
There’s also a super weird ‘cumulative hazard function’ for the discrete case, which is not actually a cumulative hazard function. In the continuous case, you can write:
So if you just close your eyes and hope for the best, you can generate a symbolically equivalent version in the discrete case by using the discrete functions instead:
The problem is that it’s not equal to the actual accumulation (that is, the cumulative sum) of the hazard function.

Were I a mathier person I would probably be able to give you a reasonable explanation for why this is. The best I can do is assume that it makes ‘some math’ easier down the line.