
Home Preprocessing Baseline Correction |
||||||||
![]() |
||||||||
Baseline Correction
Real measured spectra in most cases (and especially in imaging applications) are composed of actual spectra and artifacts, coming about by a combination of various effects. For example, a spectrum can be shifted by a constant value (e.g. in the case of using optical fibers), or else it can be underlaid with a wavelength-dependent function (dependent, for example, on scattering effects), or alternatively a very weak spectral signal can be combined with a very large fundamental signal, such as typically occurs with Raman spectra. Here the Raman signal is smaller by about a factor of 1:106 than the basis signal, making the Raman signal barely visible in the raw data. In any case, the baseline needs to be corrected or eliminated as well as possible, so that in the end there is a “pure” spectrum. The correction of baselines is dependent on boundary conditions in many cases and must each be carried out with adjusted parameters. Additionally, the baseline problem in imaging applications is complicated by the fact that structure boundaries can lead to serious scattering effects if the structure size lies in the range of the wavelength of the radiation (Mie scattering). This is a big problem especially in infrared microscopy if, for example, particle sizes are analyzed in the range of tens of microns. Here there is a strong wavelength-dependent scattering that varies from pixel to pixel. Essentially, there are several baseline correction methods that each have different advantages and disadvantages:
Much simpler and less error-prone is the method according to Lieber [Lieber 2003], which iteratively adjusts an n-th degree polynomial (typically n = 2 to 6) by truncating the intensities lying above the estimated baseline and replacing them with the respective estimated value. Repeating this process many times shifts the estimated value of the baseline ever farther down until after about 10 to 20 iterations a stable baseline emerges. The disadvantage of this procedure is that the form of the baseline is given by the degree of the polynomial – this in many cases results in an insufficient or too strong adaptation. The method according to Eilers [Eilers 2005] yields the best results in most cases, which determines the baseline via an asymmetrical least squares adaptation. For this purpose a polynomial is adapted to the spectral data by means of regression, so that values above the estimated baseline enter into the regression calculation with a lower weight than values below. Again the procedure is repeated several times until the estimated baseline no longer shifts. The adaptation is more "natural", however two parameters have to be set for this procedure, which require considerable variability of the adaptation (and thus also uncertainty as to whether the baseline correction is actually functioning optimally). A basic problem that specifically concerns the methods according to Lieber and Eilers is the occurrence of negative peaks, which for example arise in IR spectroscopy through bands of CO2 in the air. These negative peaks have to be explicitly excluded in the baseline calculation, or the CO2 bands have to be removed before the baseline correction (most easily through linear interpolation between the CO2 peak limits).
Another possibility to deal with the baseline problem is to go around the problem by using the first respectively second derivative of the spectrum. In many applications this approach contributes to a viable solution, however the differentiation worsens the signal-to-noise ratio, so that further evaluation is made more difficult. In any case the signal should be smoothed when using derivatives before differentiating. Savitzky and Golay developed a method to calculate a smoothed first or second derivative in one step. |
||||||||