.. _feldman_cousins:

Feldman and Cousins Confidence Intervals
========================================

Feldman and Cousins solved the problem on how to choose confidence intervals in
a unified way (that is without basing the choice on the data) [Feldman1998]_.
The functions ``gammapy.stats.fc_*`` give you access to a numerical solution to
the Feldman Cousins algorithm. It can be used for any type of statistical
distribution and is not limited to a Poisson process with background or a
Gaussian with a boundary at the origin.

The basic ingredient to `~gammapy.stats.fc_construct_acceptance_intervals_pdfs`
is a matrix of :math:`P(X|\mu)` (see e.g. equation (3.1) and (3.2) in
[Feldman1998]_). Every row is a probability density function (PDF) of x and the
columns are built up by varying the signal strength :math:`\mu`. The other
parameter is the desired confidence level (C.L.). The function will return
another matrix of acceptance intervals where 1 means the point is part of the
acceptance interval and 0 means it is outside. This can be easily converted to
upper and lower limits (`~gammapy.stats.fc_get_limits`), which simply connect
the outside 1s for different :math:`\mu` values. An upper or lower limit is
obtained by drawing a vertical line at the measured value and calculating the
intersection (`~gammapy.stats.fc_find_limit`).

Examples
--------

Assume you have a Poisson background with known mean 3.0. We generate the matrix
of :math:`P(X|\mu)` like this

.. code-block:: python

    import gammapy.stats as gstats
    import numpy as np
    from scipy import stats

    x_bins = np.arange(0, 50)
    mu_bins = np.linspace(0, 15, 15 / 0.005 + 1, endpoint=True)
    matrix = [stats.poisson(mu + 3.0).pmf(x_bins) for mu in mu_bins]

Now we generate the 90% acceptance intervals and construct the lower and upper
limit from them:

.. code-block:: python

    acceptance_intervals = gstats.fc_construct_acceptance_intervals_pdfs(matrix, 0.9)
    LowerLimitNum, UpperLimitNum, _ = gstats.fc_get_limits(mu_bins, x_bins, acceptance_intervals)

Let's say you measured x = 1, then the 90% upper limit would be:

.. code-block:: python

    >>> gstats.fc_find_limit(1, UpperLimitNum, mu_bins)
    1.875

The following plot shows the confidence belt based on the Feldman and Cousins
principle for a 90% confidence level for the unknown Poisson signal mean
:math:`\mu`. It is a reproduction of Fig. 7 from [Feldman1998]_. It should be
noted that the plot in the paper is inconsistent with Table IV from the same
paper: the lower limit is off by one bin to the left.

.. plot:: stats/plot_fc_poisson.py

Assume you have an experiment where the observable x is simply the measured
value of :math:`\mu` in an experiment with a Gaussian resolution with known
width :math:`\sigma`. The following plot shows the confidence belt based on the
Feldman and Cousins principle for a 90% confidence level for the mean of the
Gaussian :math:`\mu`, constrained to be non-negative. it reproduces Fig. 10
from [Feldman1998]_.

.. plot:: stats/plot_fc_gauss.py

Acceptance Interval Fixing
--------------------------

Feldman and Cousins point out that sometimes the set of intersected horizontal
lines is not simply connected. These artefacts are corrected with
`~gammapy.stats.fc_fix_limits`.

.. code-block:: python

    >>> gstats.fc_fix_limits(LowerLimitNum, UpperLimitNum)

The following script in the ``examples`` directory demonstrates the problem:
:download:`example_fc_demonstrate_artefact.py
<../../examples/example_fc_demonstrate_artefact.py>`

For mu = 0.745 the 90% acceptance interval is [0,8] and for mu = 0.750 it is
[1,8]. A lot of the fast algorithms that do not compute the full confidence belt
will come to the conclusion that the 90% confidence interval is [0, 0.745] and
thus the upper limit when zero is measured should be 0.745 (one example is
``TFeldmanCousins`` that comes with ``ROOT``, but is has the additional bug of
making the confidence interval one mu bin to big, thus reporting 0.75 as upper
limit).

For mu = 1.035 the 90% acceptance interval is [0,8] again and only starting mu =
1.060 will 0 no longer be in the 90% acceptance interval. Thus the correct upper
limit according to the procedure described in [Feldman1998]_ should be 1.055,
which is also the value given in the paper (rounded to 1.06).

Sensitivity
-----------

[Feldman1998]_ also defines experimental sensitivity as the average upper limit
that would be obtained by an ensemble of experiments with the expected
background and no true signal. It can be calculated using
`~gammapy.stats.fc_find_average_upper_limit`.

.. code-block:: python

    >>> gstats.fc_find_average_upper_limit(x_bins, matrix, UpperLimitNum, mu_bins)
    4.41

General Case
------------

In the more general case, one may not know the underlying PDF of
:math:`P(X|\mu)`. One way would be to generate :math:`P(X|\mu)` from Monte
Carlo simulation. With a dictionary of mu values and lists of X values from
Monte Carlo one can use `~gammapy.stats.fc_construct_acceptance_intervals` to
construct the confidence belts.

Here is an example, where the X values are generated from Monte Carlo (seed is
fixed here, so the result is known):

.. code-block:: python

    import gammapy.stats as gstats
    import numpy as np
    from scipy import stats

    x_bins = np.linspace(-10, 10, 100, endpoint=True)
    mu_bins = np.linspace(0, 8, 8 / 0.05 + 1, endpoint=True)

    np.random.seed(seed=1)

    distribution_dict = dict((mu, [stats.norm.rvs(loc=mu, scale=1, size=5000)]) for mu in mu_bins)

    acceptance_intervals = gstats.fc_construct_acceptance_intervals(distribution_dict, x_bins, 0.6827)

    LowerLimitNum, UpperLimitNum, _ = gstats.fc_get_limits(mu_bins, x_bins, acceptance_intervals)

    mu_upper_limit = gstats.fc_find_limit(1.7, UpperLimitNum, mu_bins)
    # mu_upper_limit == 2.7

Verification
------------

To verify that the numerical solution is working, the example plots can also be
produced using the analytical solution. They look consistent. The scripts for
the analytical solution are given in the ``examples`` directory:
:download:`example_fc_poisson_analytical.py
<../../examples/example_fc_poisson_analytical.py>`
:download:`example_fc_gauss_analytical.py
<../../examples/example_fc_gauss_analytical.py>`