Track Fitting

The track fitting algorithms estimate the track parameters. It is part of the pattern recognition/track reconstruction/tracking. We can run the track fitting algorithms, after we allocated all hits to single tracks with the help of a track finding algorithm. It is not necessary, that all points of a track are present.

Currently, we have implementations for three different fitters:

Kalman Filter
GSF
Global Chi-Square Fitter (GX2F) [wip] Even though all of them are least-squares fits, the concepts are quite different. Therefore, we should not expect identical results from all of them.

Kalman Filter (KF) [wip]

The Kalman Filter is an iterative fitter. It successively combines measurements to obtain an estimate of the track parameters. The KF needs an estimate as a starting point. The procedure alternates between two methods:

Extrapolate the current state to the next surface.
Update the extrapolation using the measurement of the new surface.[1] The meaning of “this surface” and “the next surface” changes with the context. There are three different interpretations for this. The KF can give us those three interpretations as sets of track parameters:
- predicted: Uses “older” data (i.e. from the last surfaces) to make the prediction. This prediction is an extrapolation from the old data onto the current surface.
- filtered: Uses the “current” data (i.e. the predicted data updated with the measurement on the current surface). It is some kind of weighted mean.
- smoothed: Uses the “future” data to predict the current parameters. This can only be evaluated if the whole propagation is finished once. This can be done in to ways: one uses backwards-propagation and one does not.

Todo

Complete Kalman Filter description

Gaussian Sum Filter (GSF)

The GSF is an extension of the Kalman-Filter that allows to handle non-gaussian errors by modelling the track state as a gaussian mixture:

\[ p(\vec{x}) = \sum_i w_i \varphi(\vec{x}; \mu_i, \Sigma_i), \quad \sum_i w_i = 1 \]

A common use case of this is electron fitting. The energy-loss of Bremsstrahlung for electrons in matter are highly non-Gaussian, and thus cannot be modelled accurately by the default material interactions in the Kalman Filter. Instead, the Bremsstrahlung is modelled as a Bethe-Heitler distribution, where \(z\) is the fraction of the energy remaining after the interaction (\(E_f/E_i\)), and \(t\) is the material thickness in terms of the radiation length:

\[ f(z) = \frac{(- \ln z)^{c-1}}{\Gamma(c)}, \quad c = t/\ln 2 \]

../../_images/gsf_bethe_heitler_approx.svg — Fig. 24 The true Bethe-Heitler distribution compared with a gaussian mixture approximation (in thin lines the individual components are drawn) at t = 0.1 (corresponds to ~ 10mm Silicon).

To be able to handle this with the Kalman filter mechanics, this distribution is approximated by a gaussian mixture as well (see Fig. 24). The GSF Algorithm works then as follows (see also Fig. 25)

On a surface with material, the Bethe-Heitler energy-loss distribution is approximated with a fixed number of gaussian components for each component. Since this way the number of components would grow exponentially with each material interaction, components that are close in terms of their Kullback–Leibler divergence are merged to limit the computational cost.
On a measurement surface, for each component a Kalman update is performed. Afterwards, the component weights are corrected according to each component’s compatibility with the measurement.

../../_images/gsf_overview.svg — Fig. 25 Simplified overview of the GSF algorithm.

The Multi-Stepper

To implement the GSF, a special stepper is needed, that can handle a multi-component state internally: The Acts::MultiEigenStepperLoop, which is based on the Acts::EigenStepper and thus shares a lot of code with it. It interfaces to the navigation as one aggregate state to limit the navigation overhead, but internally processes a multi-component state. How this aggregation is performed can be configured via a template parameter, by default weighted average is used (Acts::WeightedComponentReducerLoop).

Even though the multi-stepper interface exposes only one aggregate state and thus is compatible with most standard tools, there is a special aborter is required to stop the navigation when the surface is reached, the Acts::MultiStepperSurfaceReached. It checks if all components have reached the target surface already and updates their state accordingly. Optionally, it also can stop the propagation when the aggregate state reaches the surface.

Using the GSF

The GSF is implemented in the class Acts::GaussianSumFitter. The interface of its fit(...)-functions is very similar to the one of the Acts::KalmanFitter (one for the standard Acts::Navigator and one for the Acts::DirectNavigator that takes an additional std::vector<const Acts::Surface *> as an argument):

template<typename propagator_t, typename bethe_heitler_approx_t, typename traj_t> struct GaussianSumFitter

Public Functions

template<typename source_link_it_t, typename start_parameters_t, typename track_container_t, template<typename> class holder_t> inline auto fit(source_link_it_t begin, source_link_it_t end, const start_parameters_t &sParameters, const GsfOptions<traj_t> &options, const std::vector<const Surface*> &sSequence, TrackContainer<track_container_t, traj_t, holder_t> &trackContainer) const

template<typename source_link_it_t, typename start_parameters_t, typename track_container_t, template<typename> class holder_t> inline auto fit(source_link_it_t begin, source_link_it_t end, const start_parameters_t &sParameters, const GsfOptions<traj_t> &options, TrackContainer<track_container_t, traj_t, holder_t> &trackContainer) const

The fit can be customized with several options. Important ones are:

maximum components: How many components at maximum should be kept.
weight cut: When to drop components.
component merging: How a multi-component state is reduced to a single set of parameters and covariance. The method can be chosen with the enum Acts::ComponentMergeMethod. Two methods are supported currently:
- The mean computes the mean and the covariance of the mean.
- max weight takes the parameters of component with the maximum weight and computes the variance around these. This is a cheap approximation of the mode, which is not implemented currently.
mixture reduction: How the number of components is reduced to the maximum allowed number. Can be configured via a Acts::Delegate:
- Weight cut: Keep only the N components with the largest weights. Implemented in Acts::reduceMixtureLargestWeights().
- KL distance: Merge the closest components until the required amount is reached. The distance measure is the Kullback-Leibler distance in the q/p component. Implemented in Acts::reduceMixtureWithKLDistance().

Note

A good starting configuration is to use 12 components, the max weight merging and the KL distance reduction.

All options can be found in the Acts::GsfOptions:

template<typename traj_t> struct GsfOptions

Public Functions

GsfOptions() = delete

Public Members

bool abortOnError = false

std::reference_wrapper<const CalibrationContext> calibrationContext

ComponentMergeMethod componentMergeMethod = ComponentMergeMethod::eMaxWeight

bool disableAllMaterialHandling = false

GsfExtensions<traj_t> extensions

std::string_view finalMultiComponentStateColumn = ""

std::reference_wrapper<const GeometryContext> geoContext

std::reference_wrapper<const MagneticFieldContext> magFieldContext

std::size_t maxComponents = 4

PropagatorPlainOptions propagatorPlainOptions

const Surface *referenceSurface = nullptr

double weightCutoff = 1.e-4

If the GSF finds the column with the string identifier “gsf-final-multi-component-state” (defined in Acts::GsfConstants::kFinalMultiComponentStateColumn) in the track container, it adds the final multi-component state to the track as a std::optional<Acts::MultiComponentBoundTrackParameters<SinglyCharged>> object.

A GSF example can be found in the ACTS Examples Framework here.

Customising the Bethe-Heitler approximation

The GSF needs an approximation of the Bethe-Heitler distribution as a Gaussian mixture on each material interaction (see above). This task is delegated to a separate class, that can be provided by a template parameter to Acts::GaussianSumFitter, so in principle it can be implemented in different ways.

However, ACTS ships with the class Acts::AtlasBetheHeitlerApprox that implements the ATLAS strategy for this task: To be able to evaluate the approximation of the Bethe-Heitler distribution for different materials and thicknesses, the individual Gaussian components (weight, mean, variance of the ratio \(E_f/E_i\)) are parametrised as polynomials in \(x/x_0\). This class can load files in the ATLAS format that can be found here. A default parameterization can be created with Acts::makeDefaultBetheHeitlerApprox().

The Acts::AtlasBetheHeitlerApprox is constructed with two parameterizations, allowing to use different parameterizations for different \(x/x_0\). In particular, it has this behaviour:

\(x/x_0 < 0.0001\): Return no change
\(x/x_0 < 0.002\): Return a single gaussian approximation
\(x/x_0 < 0.1\): Return the approximation for low \(x/x_0\).
\(x/x_0 \geq 0.1\): Return the approximation for high \(x/x_0\). The maximum possible value is \(x/x_0 = 0.2\), for higher values it is clipped to 0.2 and the GSF emits a warning.

Global Chi-Square Fitter (GX2F) [wip]

Todo

Write GX2F documentation