Quantitative way to compare statistical models
In statistics, the deficiency is a measure to compare a statistical model with another statistical model. The concept was introduced in the 1960s by the French mathematician Lucien Le Cam, who used it to prove an approximative version of the Blackwell–Sherman–Stein theorem.[1][2] Closely related is the Le Cam distance, a pseudometric for the maximum deficiency between two statistical models. If the deficiency of a model
in relation to
is zero, then one says
is better or more informative or stronger than
.
Introduction
Le Cam defined the statistical model more abstract than a probability space with a family of probability measures. He also didn't use the term "statistical model" and instead used the term "experiment". In his publication from 1964 he introduced the statistical experiment to a parameter set
as a triple
consisting of a set
, a vector lattice
with unit
and a family of normalized positive functionals
on
.[3][4] In his book from 1986 he omitted
and
.[5]
This article follows his definition from 1986 and uses his terminology to emphasize the generalization.
Basic concepts
Let
be a parameter space. Given an abstract L1-space
(i.e. a Banach lattice such that for elements
also
holds) consisting of lineare positive functionals
. An experiment
is a map
of the form
, such that
.
is the band induced by
and therefore we use the notation
. For a
denote the
. The topological dual
of an L-space with the conjugated norm
is called an abstract M-space. It's also a lattice with unit defined through
for
.
Let
and
be two L-space of two experiments
and
, then one calls a positive, norm-preserving linear map, i.e.
for all
, a transition. The adjoint of a transitions is a positive linear map from the dual space
of
into the dual space
of
, such that the unit of
is the image of the unit of
ist.[5]
Deficiency
Let
be a parameter space and
and
be two experiments indexed by
. Le
and
denote the corresponding L-spaces and let
be the set of all transitions from
to
.
The deficiency
of
in relation to
is the number defined in terms of inf sup:
[6]
where
denoted the total variation norm
. The factor
is just for computational purposes and is sometimes omitted.
Explanations
means that there exists a transition
such that
for all
.
- The deficiency measures how well
of
can be approximated by
in the sense of total variation.
- The deficiency is a norm for
.
Le Cam distance
The Le Cam distance is the following pseudometric

This induces an equivalence relation and when
, then one says
and
are equivalent. The equivalent class
of
is also called the type of
.
Often one is interested in families of experiments
with
and
with
. If
as
, then one says
and
are asymptotically equivalent.
Let
be a parameter space and
be the set of all types that are induced by
, then the Le Cam distance
is complete with respect to
. The condition
induces a partial order on
, one says
is better or more informative or stronger than
.[6]
References
Bibliography