Bins and balls; Large deviations of the empirical occupancy process

Abstract
In the random allocation model, balls are sequentially inserted at random into $n$ exchangeable bins. The occupancy score of a bin denotes the number of balls inserted in this bin. The (random) distribution of occupancy scores defines the object of this paper: the empirical occupancy measure which is a probability measure over the integers. This measure-valued random variable packages many useful statistics. This paper characterizes the large deviations of the flow of empirical occupancy measures when $n$ goes to infinity while the number of inserted balls remains proportional to $n$. The main result is a Sanov-like theorem for the empirical occupancy measure when the set of probability measures over the integers is endowed with metrics that are slightly stronger than the total variation distance. Thanks to a coupling argument, this result applies to the degree distribution of sparse random graphs.

This publication has 20 references indexed in Scilit: