# Probability density function vs. probability mass function

I’ve a confession to make. I’ve been using PDF’s and PMF’s without actually knowing what they are. My understanding is that density equals area under the curve, but if I look at it that way, then it doesn’t make sense to refer to the “mass” of a random variable in discrete distributions. How can I interpret this? Why do we call use “mass” and “density” to describe these functions rather than something else?

P.S. Please feel free to change the question itself in a more understandable way if you feel this is a logically wrong question.

Let’s say we have some function $f(x)$ that we haven’t named yet but we know that $\int_a^b f(x) dx$ yields the probability that we see an outcome between $a$ and $b$. What should we call $f(x)$? Well, what are its properties? Let’s start with its units. We know that, in general, the units on a definite integral $\int_a^b f(x) dx$ are the units of $f(x)$ times the units of $dx$. In our setting, the integral gives a probability, and $dx$ has units in say, length. So the units of $f(x)$ must be probability per unit length. This means that $f(x)$ must be telling us something about how much probability is concentrated per unit length near $x$; i.e., how dense the probability is near $x$. So it makes sense to call $f(x)$ a “probability density function.” (In fact, one way to view $\int_a^b f(x) dx$ is that, if $f(x) \geq 0$, $f(x)$ is always a density function. From this point of view, height is area density, area is volume density, speed is distance density, etc. One of my colleagues uses an approach like this when he discusses applications of integration in second-semester calculus.)
Now that we’ve named $f(x)$ a density function, what should we call the corresponding function in the discrete setting? It’s not a density function; its units are probability rather than probability per unit length. So what is it? Well, when we say “density” without a qualifier we are normally talking about “mass density,” and when we integrate a density function over an object we obtain the mass of that object. With this in mind, the relationship between the probability function in the continuous setting to that of the probability function in the discrete setting is exactly that of density to mass. So “probability mass function” is a natural term to grab to apply to the corresponding discrete function.