<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art>
   <ui>1756-0381-2-6</ui>
   <ji>1756-0381</ji>
   <fm>
      <dochead>Review</dochead>
      <bibl>
         <title>
            <p>Extraction of pure components from overlapped signals in gas chromatography-mass spectrometry (GC-MS)</p>
         </title>
         <aug>
            <au ca="yes" id="A1">
               <snm>Liki&#263;</snm>
               <mi>A</mi>
               <fnm>Vladimir</fnm>
               <insr iid="I1"/>
               <email>vlikic@unimelb.edu.au</email>
            </au>
         </aug>
         <insg>
            <ins id="I1">
               <p>Bio21 Molecular Science and Biotechnology Institute, University of Melbourne, 30 Flemington Road, Parkville 3010, Australia</p>
            </ins>
         </insg>
         <source>BioData Mining</source>
         <issn>1756-0381</issn>
         <pubdate>2009</pubdate>
         <volume>2</volume>
         <issue>1</issue>
         <fpage>6</fpage>
         <url>http://www.biodatamining.org/content/2/1/6</url>
         <xrefbib>
            <pubidlist>
               <pubid idtype="doi">10.1186/1756-0381-2-6</pubid>
               <pubid idtype="pmpid">19818154</pubid>
            </pubidlist>
         </xrefbib>
      </bibl>
      <history>
         <rec>
            <date>
               <day>24</day>
               <month>4</month>
               <year>2009</year>
            </date>
         </rec>
         <acc>
            <date>
               <day>12</day>
               <month>10</month>
               <year>2009</year>
            </date>
         </acc>
         <pub>
            <date>
               <day>12</day>
               <month>10</month>
               <year>2009</year>
            </date>
         </pub>
      </history>
      <cpyrt>
         <year>2009</year>
         <collab>Liki&#263;; licensee BioMed Central Ltd.</collab>
         <note>This is an Open Access article distributed under the terms of the Creative Commons Attribution License (<url>http://creativecommons.org/licenses/by/2.0</url>), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</note>
      </cpyrt>
      <abs>
         <sec>
            <st>
               <p>Abstract</p>
            </st>
            <p>Gas chromatography-mass spectrometry (GC-MS) is a widely used analytical technique for the identification and quantification of trace chemicals in complex mixtures. When complex samples are analyzed by GC-MS it is common to observe co-elution of two or more components, resulting in an overlap of signal peaks observed in the total ion chromatogram. In such situations manual signal analysis is often the most reliable means for the extraction of pure component signals; however, a systematic manual analysis over a number of samples is both tedious and prone to error. In the past 30 years a number of computational approaches were proposed to assist in the process of the extraction of pure signals from co-eluting GC-MS components. This includes empirical methods, comparison with library spectra, eigenvalue analysis, regression and others. However, to date no approach has been recognized as best, nor accepted as standard. This situation hampers general GC-MS capabilities, and in particular has implications for the development of robust, high-throughput GC-MS analytical protocols required in metabolic profiling and biomarker discovery. Here we first discuss the nature of GC-MS data, and then review some of the approaches proposed for the extraction of pure signals from co-eluting components. We summarize and classify different approaches to this problem, and examine why so many approaches proposed in the past have failed to live up to their full promise. Finally, we give some thoughts on the future developments in this field, and suggest that the progress in general computing capabilities attained in the past two decades has opened new horizons for tackling this important problem.</p>
         </sec>
      </abs>
   </fm>
   <bdy>
      <sec>
         <st>
            <p>Background</p>
         </st>
         <p>Both gas chromatography and mass spectrometry are important analytical techniques in their own right. Electron ionization mass spectrometry is an approach that generates charged molecular fragments and measures their mass-to-charge (<it>m/z</it>) ratios <abbrgrp><abbr bid="B1">1</abbr></abbrgrp>. Under standard conditions, electron ionization of small organic molecules produces complex but reproducible <it>m/z </it>patterns that can be related to the chemical structure of the parent molecule. On the other hand, gas chromatography excels at separation of components in complex mixtures, and is particularly well suited for the analysis of thermally stable compounds of low polarity <abbrgrp><abbr bid="B2">2</abbr></abbrgrp>. The combination of gas chromatography and mass spectrometry allows for highly sensitive analysis of complex mixtures, and is routinely used in biochemical <abbrgrp><abbr bid="B3">3</abbr><abbr bid="B4">4</abbr><abbr bid="B5">5</abbr><abbr bid="B6">6</abbr></abbrgrp>, medical <abbrgrp><abbr bid="B7">7</abbr><abbr bid="B8">8</abbr><abbr bid="B9">9</abbr><abbr bid="B10">10</abbr></abbrgrp>, agricultural <abbrgrp><abbr bid="B11">11</abbr></abbrgrp> and environmental <abbrgrp><abbr bid="B12">12</abbr><abbr bid="B13">13</abbr></abbrgrp> research, as well as in various industrial applications <abbrgrp><abbr bid="B14">14</abbr></abbrgrp>. A surge of interest in GC-MS has been fueled by recent biomarker and metabolite profiling studies <abbrgrp><abbr bid="B6">6</abbr><abbr bid="B11">11</abbr><abbr bid="B15">15</abbr><abbr bid="B16">16</abbr><abbr bid="B17">17</abbr><abbr bid="B18">18</abbr><abbr bid="B19">19</abbr><abbr bid="B20">20</abbr><abbr bid="B21">21</abbr><abbr bid="B22">22</abbr><abbr bid="B23">23</abbr><abbr bid="B24">24</abbr></abbrgrp>, and the potential of GC-MS to contribute to systems biology studies <abbrgrp><abbr bid="B25">25</abbr><abbr bid="B26">26</abbr><abbr bid="B27">27</abbr></abbrgrp>. To this end GC-MS has been used for metabolic profiling in plants <abbrgrp><abbr bid="B11">11</abbr><abbr bid="B15">15</abbr></abbrgrp>, bacteria <abbrgrp><abbr bid="B16">16</abbr><abbr bid="B21">21</abbr><abbr bid="B22">22</abbr></abbrgrp>, yeast <abbrgrp><abbr bid="B17">17</abbr><abbr bid="B18">18</abbr></abbrgrp> and biological fluids <abbrgrp><abbr bid="B19">19</abbr><abbr bid="B20">20</abbr><abbr bid="B23">23</abbr><abbr bid="B24">24</abbr></abbrgrp>.</p>
         <p>The ever increasing scope of GC-MS applications is opening new challenges in data processing and analysis <abbrgrp><abbr bid="B3">3</abbr><abbr bid="B6">6</abbr><abbr bid="B28">28</abbr></abbrgrp>. GC-MS experiments on complex biological and environmental samples may result in hundreds of signals and the detection of many compounds in parallel. For example, Fiehn and co-authors have quantified 326 metabolites in <it>Arabidopsis thaliana </it>leaf tissue extracts <abbrgrp><abbr bid="B15">15</abbr></abbrgrp>. In an independent GC-MS study of <it>Arabidopsis thaliana </it>leaves, Jonsson and co-authors detected 497 unique chemical components in five different genotypes <abbrgrp><abbr bid="B29">29</abbr></abbrgrp>. When such complex samples are analyzed, incomplete chromatographic separations are often observed (note that this is also expected theoretically <abbrgrp><abbr bid="B30">30</abbr><abbr bid="B31">31</abbr></abbrgrp>). This manifests itself as the overlap of chromatographic peaks, which in turn makes the extraction of pure components and their mass spectra (required for unambiguous component identification) challenging. Currently, the most accurate analysis of complex GC-MS data sets can be achieved by an expert operator, however this is both time and labour intensive. The need to improve analysis times by speeding up the separation by gas chromatography without sacrificing the ability to separate/identify individual components is putting additional pressure on data processing methods.</p>
         <p>Over the past 30 years a number of approaches for the extraction of pure components from overlapped GC-MS signals were proposed. This includes empirical methods <abbrgrp><abbr bid="B32">32</abbr><abbr bid="B33">33</abbr><abbr bid="B34">34</abbr><abbr bid="B35">35</abbr><abbr bid="B36">36</abbr></abbrgrp>, comparison with library spectra <abbrgrp><abbr bid="B37">37</abbr><abbr bid="B38">38</abbr></abbrgrp>, differential methods <abbrgrp><abbr bid="B39">39</abbr><abbr bid="B40">40</abbr><abbr bid="B41">41</abbr><abbr bid="B42">42</abbr></abbrgrp>, eigenvalue analysis <abbrgrp><abbr bid="B43">43</abbr><abbr bid="B44">44</abbr><abbr bid="B45">45</abbr><abbr bid="B46">46</abbr><abbr bid="B47">47</abbr><abbr bid="B48">48</abbr><abbr bid="B49">49</abbr></abbrgrp> and regression analysis <abbrgrp><abbr bid="B50">50</abbr><abbr bid="B51">51</abbr><abbr bid="B52">52</abbr><abbr bid="B53">53</abbr><abbr bid="B54">54</abbr></abbrgrp>. Some time ago methods for the extraction of pure components were reviewed <abbrgrp><abbr bid="B55">55</abbr></abbrgrp>. The scope of GC-MS applications has increased significantly in past years, and a review of previous work seems timely. Here we first discuss the nature of GC-MS data and the problem of signal overlap which arises from co-eluting components. Subsequently, we review the most prominent approaches for the extraction of pure component signals proposed in the past, and give some thoughts on future developments in the GC-MS data processing field.</p>
         <sec>
            <st>
               <p>The nature of GC-MS data</p>
            </st>
            <p>In a typical GC-MS setup, the eluate from the gas chromatographic column is led directly into the mass spectrometer ion source, and the mass spectrometer records <it>m/z </it>intensities in the repetitive scanning mode. This results in <it>R </it>mass scans recorded during the time of the experiment, at times <it>t</it><sub>1</sub>, <it>t</it><sub>2</sub>, ..., <it>t</it><sub><it>R</it></sub>. Each mass scan can be converted into a series of <it>N m/z </it>intensities defined by the mass vector <b>m </b>= (<it>m</it><sub>1</sub>, <it>m</it><sub>2</sub>, ..., <it>m</it><sub><it>N</it></sub>), where each <it>m</it><sub><it>i </it></sub>corresponds to one <it>m/z </it>"channel". This results in a series of mass spectra, defined by the mass vector <b>m</b>, and taken at times <it>t</it><sub>1</sub>, <it>t</it><sub>2</sub>, ..., <it>t</it><sub><it>R</it></sub>. As the mixture components elute from the chromatographic column their concentrations change, and the mass spectra of this continuously changing mixture are recorded.</p>
            <p>Consider analysis of a mixture containing <it>K </it>pure components, whose mass spectra are <it>&#948;</it><sub>1</sub>, <it>&#948;</it><sub>2</sub>, ..., <it>&#948;</it><sub><it>K</it></sub>:</p>
            <p>
               <display-formula id="M1">
                  <graphic file="1756-0381-2-6-i1.gif"/>
               </display-formula>
            </p>
            <p>The above equations can be written more concisely by introducing the matrix &#916;,</p>
            <p>
               <display-formula id="M2">
                  <graphic file="1756-0381-2-6-i2.gif"/>
               </display-formula>
            </p>
            <p>Let <b>C </b>be the matrix of concentrations of <it>K </it>pure components over the time of the GC-MS experiment, sampled at points <it>t</it><sub>1</sub>, <it>t</it><sub>2</sub>, ..., <it>t</it><sub><it>R</it></sub>. These concentrations could themselves be arranged into a two-dimensional matrix C, where each row corresponds to one sampling time point:</p>
            <p>
               <display-formula id="M3">
                  <graphic file="1756-0381-2-6-i3.gif"/>
               </display-formula>
            </p>
            <p>The assumption of the linear mixture model is that the observed mass spectrum is a linear combination of pure component mass spectra <abbrgrp><abbr bid="B44">44</abbr><abbr bid="B45">45</abbr></abbrgrp>. This is a robust assumption, implying that mass spectrum observed at each mass spectral scan is the result of a linear combination of the component mass spectra, where the weighting coefficients are given by the concentrations of individual components. Therefore, the mass spectrum observed at time <it>t</it><sub><it>i </it></sub>is:</p>
            <p>
               <display-formula id="M4">
                  <graphic file="1756-0381-2-6-i4.gif"/>
               </display-formula>
            </p>
            <p>where <it>c</it><sub><it>i</it>1</sub>, <it>c</it><sub><it>i</it>2</sub>, &#8943;, <it>c</it><sub><it>iK </it></sub>are the concentrations of <it>K </it>pure components at time <it>t</it><sub><it>i</it></sub>, and <it>&#948;</it><sub><it>k </it></sub>refers to the mass spectrum of the pure component <it>k</it>, given in the equation (1). The equation (4) can be rewritten more succinctly in the matrix notation,</p>
            <p>
               <display-formula id="M5">
                  <graphic file="1756-0381-2-6-i5.gif"/>
               </display-formula>
            </p>
            <p>where the matrices C and &#916; are given by the equations (3) and (2), respectively. The matrix <b>S </b>represents the net result of a GC-MS experiment, after the transformation of raw data scans into <it>m/z </it>intensities over channels defined by <b>m</b>:</p>
            <p>
               <display-formula id="M6">
                  <graphic file="1756-0381-2-6-i6.gif"/>
               </display-formula>
            </p>
            <p>In the above matrix, each row represents the mass spectrum <inline-formula><graphic file="1756-0381-2-6-i7.gif"/></inline-formula> recorded at time <it>t</it><sub><it>i</it></sub>. In the mathematical notation,</p>
            <p>
               <display-formula id="M7">
                  <graphic file="1756-0381-2-6-i8.gif"/>
               </display-formula>
            </p>
            <p>for <it>i </it>= 1, 2, ..., <it>R</it>. The total of <it>R </it>mass spectra of the eluting mixture are recorded at times <it>t</it><sub><it>i</it></sub>. Equation (4) shows the same quantity, <inline-formula><graphic file="1756-0381-2-6-i7.gif"/></inline-formula>, written in a more explicit form.</p>
            <p>In summary, the matrix given by the equation (6) represents the net result of a GC-MS experiment. Conceptually, this matrix can be viewed as the product of two matrices, the matrix <b>C </b>(equation (3)), whose columns contain concentrations of pure components as a function of elution time, and the matrix &#916; (equation (2)), whose rows contain mass spectra of pure components. For the sake of simplicity, experimental noise was neglected in the above considerations.</p>
         </sec>
         <sec>
            <st>
               <p>Total ion chromatogram (TIC)</p>
            </st>
            <p>A single column of the complete GC-MS data matrix, equation (6), is called <it>ion chromatogram</it>. It represents the elution profile of a single <it>m/z </it>channel. In practice, the GC-MS data matrix is usually viewed as the sum of its ion chromatograms, which is called a total ion chromatogram (TIC). A TIC is obtained by summing the complete GC-MS data matrix (6) along its columns,</p>
            <p>
               <display-formula id="M8">
                  <graphic file="1756-0381-2-6-i9.gif"/>
               </display-formula>
            </p>
            <p>where <inline-formula><graphic file="1756-0381-2-6-i10.gif"/></inline-formula> for <it>i </it>= 1, 2, ..., <it>R</it>. A comparison with the equation (7) shows that <it>a</it><sub><it>i </it></sub>is the sum of intensities present in the mass spectrum <inline-formula><graphic file="1756-0381-2-6-i7.gif"/></inline-formula> (or equivalently, the mass spectral scan taken at <it>t</it><sub><it>i</it></sub>), summed over all measured <it>m/z </it>values.</p>
         </sec>
         <sec>
            <st>
               <p>The problem of signal overlap</p>
            </st>
            <p>Dynamic interactions of solute with mobile and stationary phases, as well as solute axial diffusion, lead to broadening of component zones as the solute progresses along the column <abbrgrp><abbr bid="B2">2</abbr><abbr bid="B56">56</abbr></abbrgrp>. These kinetic processes give rise to familiar chromatographic peaks, which represent component concentration in the mobile phase observed at the end of the column as a function of elution time. The chromatographic peaks have a complex shape, and in practice are most often modelled with the exponentially modified Gaussian function <abbrgrp><abbr bid="B57">57</abbr></abbrgrp>. For the sake of simplicity, in the example below we assume simple Gaussian peaks. In this case, each column of the matrix <b>C </b>given by the equation (2) will contain a single Gaussian peak centered at the elution time characteristic of that particular solute component.</p>
            <p>Consider a hypothetical mixture of two components A and B (<it>K </it>= 2), whose pure mass spectra are shown in Figure <figr fid="F1">1</figr>. We assume that the component A elutes from the gas chromatography column earlier than the component B (<it>t</it><sup><it>A </it></sup>&lt;<it>t</it><sup><it>B</it></sup>, where <it>t</it><sup><it>A </it></sup>and <it>t</it><sup><it>B </it></sup>are the retention times of the components A and B, respectively). If the two components elute at significantly different retention times, they will be well resolved (Figure <figr fid="F2">2</figr>, panel (a)), resulting in two visible signal peaks in the TIC, as shown in Figure <figr fid="F2">2</figr>, panel (b). The pure mass spectra of the two components are given by the mass spectral scans taken at the apex of each component peak, and correspond to the mass spectra given in Figure <figr fid="F1">1</figr>. However, if the two components elute close in time, as depicted in Figure <figr fid="F2">2</figr>, panel (c), a severe overlap of component signals will occur. In this case a single chromatographic peak may be observed in the TIC, as shown in Figure <figr fid="F2">2</figr>, panel (d). The mass spectrum at the apex of the composite peak will be a mixture of the pure mass spectra of the two components, equation (4).</p>
            <fig id="F1">
               <title>
                  <p>Figure 1</p>
               </title>
               <caption>
                  <p>The assumed mass spectra of pure components A and B</p>
               </caption>
               <text>
                  <p><b>The assumed mass spectra of pure components A and B</b>. The simulated GC-MS profile is shown in Figure 2.</p>
               </text>
               <graphic file="1756-0381-2-6-1"/>
            </fig>
            <fig id="F2">
               <title>
                  <p>Figure 2</p>
               </title>
               <caption>
                  <p>Two scenarios illustrating the problem of peak overlap in GC-MS data</p>
               </caption>
               <text>
                  <p><b>Two scenarios illustrating the problem of peak overlap in GC-MS data</b>. Components A and B, whose mass spectra are given in Figure 1 are assumed to be present in the mixture. If the retention times of the two components differ significantly the observed signal will consist of two well resolved peaks, as shown in the panel (a); the panel (b) shows the corresponding total ion chromatogram (TIC). If the two components elute closely together (panel (c)), the TIC may exhibit only a single, composite peak, as shown in panel (d).</p>
               </text>
               <graphic file="1756-0381-2-6-2"/>
            </fig>
            <p>The problem of extraction of pure component signals in incomplete chromatographic separation is often called "peak deconvolution" <abbrgrp><abbr bid="B35">35</abbr><abbr bid="B58">58</abbr></abbrgrp>. This terminology is unfortunate because the term "deconvolution" denotes the inversion of a convolution process, a particular kind of integral transform encountered in the field of signal processing <abbrgrp><abbr bid="B59">59</abbr></abbrgrp>. Extraction of pure components from overlapped GC-MS signals is both mathematically and conceptually different. However the term "peak deconvolution" has taken such deep roots in the GC-MS practice that is likely to remain a part of the GC-MS specialist's vocabulary for the foreseeable future.</p>
            <p>A complete solution to the problem of pure components is provided by the matrices <b>C </b>and &#916;, given by the equation (5). However, in GC-MS experiments only the matrix <b>S </b>is measured. It is a non-trivial problem to decompose the matrix <b>S </b>into matrices <b>C </b>and &#916;; in the most general case such matrix decomposition does not have a unique solution. In practice the most important objective is often to identify retention times and mass spectra of individual components that contribute to the composite signal. From this viewpoint and under certain conditions one can sidestep the equation (5), and focus on some empirical way to resolve retention times and mass spectra of pure components. This results in two different approaches to the problem of extraction of pure signals from co-eluting components. "Empirical methods" sidestep the mathematics of the equation (5), and focus on some empirical way to resolve retention times and mass spectra of pure components, while "matrix methods" aim to find the solution of the matrix equation (5). The empirical methods typically apply the logic of a human analyst, and utilize the capacity of computers to process large amounts of data and execute repetitive tasks <abbrgrp><abbr bid="B32">32</abbr><abbr bid="B33">33</abbr><abbr bid="B34">34</abbr><abbr bid="B35">35</abbr><abbr bid="B36">36</abbr></abbrgrp>. On the other hand, matrix methods aspire to a comprehensive solution of the equation (5) relying on some suitable assumptions, and usually attempt to use most if not all data points. These methods include for example eigenvalue analysis <abbrgrp><abbr bid="B43">43</abbr><abbr bid="B44">44</abbr><abbr bid="B45">45</abbr><abbr bid="B46">46</abbr><abbr bid="B47">47</abbr><abbr bid="B48">48</abbr><abbr bid="B49">49</abbr></abbrgrp>, regression <abbrgrp><abbr bid="B50">50</abbr><abbr bid="B51">51</abbr><abbr bid="B52">52</abbr></abbrgrp> and differential analysis <abbrgrp><abbr bid="B39">39</abbr><abbr bid="B40">40</abbr><abbr bid="B41">41</abbr><abbr bid="B42">42</abbr></abbrgrp>. In the next sections we summarize the most prominent empirical and matrix methods proposed in the past.</p>
         </sec>
         <sec>
            <st>
               <p>Methods for the extraction of pure components from overlapped GC-MS signals</p>
            </st>
            <sec>
               <st>
                  <p>Empirical methods</p>
               </st>
               <p>The method of Biller and Biemann <abbrgrp><abbr bid="B32">32</abbr></abbrgrp> was the first widely used method for peak deconvolution. This method examines <it>m/z </it>intensities which maximize at any given chromatographic time point, or at adjacent mass spectral scans. If intensities of several <it>m/z </it>channels exhibit a maximum at the same time point, a chromatographic peak is recorded containing these <it>m/z </it>channels. This procedure results in "reconstructed" mass spectra of pure components, and is effective when two signals do not have common mass to charge ratios <it>and </it>maximize at two or more scans apart.</p>
               <p>Colby extended the idea of Biller and Biemann by introducing more accurate estimates of peak positions, followed by binning <abbrgrp><abbr bid="B35">35</abbr></abbrgrp>. In this approach peaks were identified as local maxima in ion chromatograms, and peak centroids are calculated from the three point quadratic fit centered at the local maximum. From this a "deconvoluted TIC" was calculated by binning the centroid intensities, in ten bins per scan <abbrgrp><abbr bid="B35">35</abbr></abbrgrp>. The mass spectra of pure compounds were estimated by collecting peak centroids within the boundaries of the deconvoluted TIC peak. The author suggested that this method is capable of separating components which differ for only one quarter of a scan along the retention time axis <abbrgrp><abbr bid="B35">35</abbr></abbrgrp>. In the original work, Colby demonstrated deconvolution of a single peak consisting of six components, all of which were resolved by the application of the proposed method <abbrgrp><abbr bid="B35">35</abbr></abbrgrp>.</p>
               <p>Dromey and co-workers proposed an approach that relies on statistical analysis <abbrgrp><abbr bid="B33">33</abbr></abbrgrp>. This method focused on finding well resolved peaks in individual ion chromatograms, ie. peaks that showed unique <it>m/z </it>relative to its neighbors. This is based on the assumption that even for heavily overlapped signals there will be some <it>m/z </it>that are unique to either of the two components. So called "singlet fragmentograms" provide information about the shape of component peaks, and this can be used to separate component signals in overlapped ion chromatograms, even for mass-to-charge ratios that occur in both overlapped components <abbrgrp><abbr bid="B33">33</abbr></abbrgrp>. Dromey and co-authors proposed that two histograms are calculated for singlet peak positions, one recording signal maxima and the other recording total ion intensity above the noise level at these positions. The exact positions of components were determined by a parabolic least squares fit over the top five points in the sampled peak data. After this, the resolved spectrum of each component was obtained by the least squares fit to the model peak. The authors demonstrated that the proposed approach was able to detect indole acetic acid 3-methyl ester in complex GC-MS data acquired on human urine samples <abbrgrp><abbr bid="B33">33</abbr></abbrgrp>. While this specific component did not give a visible signal in the TIC due to heavy overlap, the authors were able to reconstruct its pure mass spectrum <abbrgrp><abbr bid="B33">33</abbr></abbrgrp>.</p>
               <p>Hargrove and co-authors reported that the method of Dromey failed to recognize weak but readily visible signals <abbrgrp><abbr bid="B34">34</abbr></abbrgrp>. The problem was traced to the way the method calculated "peak sharpness", the property used to distinguish true singlet peaks from doublet or background signal <abbrgrp><abbr bid="B33">33</abbr></abbrgrp>. Hargrove and co-authors proposed a different function for peak sharpness, and reported a marked improvement in the performance of the Dromey method <abbrgrp><abbr bid="B34">34</abbr></abbrgrp>.</p>
               <p>Based on the ideas of Dromey et al. <abbrgrp><abbr bid="B33">33</abbr></abbrgrp>, Stein proposed an approach with refinements to improve the ability of the method to discern weak signals <abbrgrp><abbr bid="B36">36</abbr></abbrgrp>. In this method the first step is the detection of individual components ("component perception"). For each "perceived" component the precise peak apex is calculated from the three point parabola fit centered on the maximum. Once the number and positions of components are determined, the mass spectrum for each component is obtained by the least-squares method similar to that of Dromey et al. <abbrgrp><abbr bid="B33">33</abbr></abbrgrp>. An important aspect of this method is the analysis of the signal and noise features, used subsequently to aid in discerning the true signal from noise. An elaborate, empirical procedure involving analysis of all ion chromatograms is used to estimate a data noise factor <abbrgrp><abbr bid="B36">36</abbr></abbrgrp>. This method also explicitly interpolates zero values which are found in the signal when measured intensities fall under the threshold, normally established during instrument tuning <abbrgrp><abbr bid="B36">36</abbr></abbrgrp>. Stein has developed a PC program AMDIS which implements the proposed method <abbrgrp><abbr bid="B36">36</abbr></abbrgrp>.</p>
            </sec>
            <sec>
               <st>
                  <p>Eigenvalue analysis</p>
               </st>
               <p>The first methods for GC-MS peak deconvolution based on the eigenvalue analysis were proposed not long after the Biller-Biemann method. In the method of Davis and co-authors, the principal component analysis was used to obtain the number of pure components in a composite signal, but not their mass spectra <abbrgrp><abbr bid="B43">43</abbr></abbrgrp>. This approach was subsequently extended by several groups <abbrgrp><abbr bid="B44">44</abbr><abbr bid="B45">45</abbr><abbr bid="B46">46</abbr><abbr bid="B47">47</abbr><abbr bid="B48">48</abbr></abbrgrp>. Ritter and co-authors proposed the eigenvalue analysis of the covariance matrix to obtain the number of pure components <abbrgrp><abbr bid="B44">44</abbr></abbrgrp>. Knorr and Futrell proposed the method for the determination of both the number of pure components and their mass spectra based on the factor analysis <abbrgrp><abbr bid="B45">45</abbr></abbrgrp>. A similar method was proposed by Abdallah and co-authors, who calculated "ranges" for the pure component mass spectra <abbrgrp><abbr bid="B46">46</abbr></abbrgrp>. Roach and Guilhaus reported enhanced factor analysis which exploited the ordered nature of GC-MS elution profiles <abbrgrp><abbr bid="B48">48</abbr></abbrgrp>, based on the ideas by Meader (dubbed evolving factor analysis, EFA) <abbrgrp><abbr bid="B47">47</abbr></abbrgrp>. More recently, variants of the eigenvalue analysis were applied to the analysis of complex plant extracts <abbrgrp><abbr bid="B49">49</abbr></abbrgrp>.</p>
            </sec>
            <sec>
               <st>
                  <p>Differential methods</p>
               </st>
               <p>Ghosh and Anderegg proposed differential processing of GC-MS data in which <it>m/z </it>intensities for each two successive scans are subtracted <abbrgrp><abbr bid="B39">39</abbr><abbr bid="B40">40</abbr></abbrgrp>. This procedure resulted in two new data sets created from the original GC-MS spectral matrix, one with the positive and one with the negative differences in intensities. Ghosh and Anderegg reported that such differential processing resulted in pure component mass spectra, which can be used for reliable comparison with mass spectral libraries <abbrgrp><abbr bid="B39">39</abbr></abbrgrp>. Pool and co-authors extended this work in two directions <abbrgrp><abbr bid="B41">41</abbr><abbr bid="B42">42</abbr></abbrgrp>. First, they proposed that two data sets resulting from the subtraction are combined into a single data set that resembles the original data; second, they proposed that this procedure is applied recursively until convergence is achieved ("backfolding") <abbrgrp><abbr bid="B41">41</abbr></abbrgrp>. The authors reported that backfolding is capable of extracting pure mass spectra when signals are severely overlapped <abbrgrp><abbr bid="B42">42</abbr></abbrgrp>.</p>
            </sec>
            <sec>
               <st>
                  <p>Library search</p>
               </st>
               <p>The first computer approaches to aid in identification of compounds in complex mixtures relied on comparing mass spectra to precompiled libraries <abbrgrp><abbr bid="B37">37</abbr><abbr bid="B38">38</abbr></abbrgrp>. This is of course limited by the scope of the available library. Moreover, when the signals overlap the observed mass spectrum will be a mixture, and the library search may fail to match any of the components from the mixture.</p>
               <p>Gan and Liang proposed the method for the search of component mass spectra based on the observed composite signal <abbrgrp><abbr bid="B60">60</abbr></abbrgrp>. This method first identifies potential candidates for component mass spectra, and then uses non-negative least-squares regression to calculate contributions of the assumed components to the observed, composite mass spectrum <abbrgrp><abbr bid="B60">60</abbr></abbrgrp>. This process results in pure signals, and therefore could be viewed as a method for the extraction of pure components from overlapped signals.</p>
            </sec>
            <sec>
               <st>
                  <p>Regression methods</p>
               </st>
               <p>Blaisdell and Sweeley proposed a procedure for the extraction of pure components based on the singular value decomposition and least squares fitting <abbrgrp><abbr bid="B50">50</abbr></abbrgrp>. This method depends on the determination of background noise for each mass, which was assumed to be constant over 10-12 scans. Knorr and co-authors proposed a regression procedure where the full matrix representation of data, equation (6), is modelled as a function of component retention times. The least squares fit is performed to minimize the difference between the predicted and the observed data matrix, where individual ion chromatograms (i.e. columns of the matrix C, equation (3)) are modelled as Gaussian functions modified with an exponential decay function <abbrgrp><abbr bid="B51">51</abbr></abbrgrp>. This requires that the number of components is known. The authors proposed a heuristic procedure based on the relationship between the number of components in the model and the observed changes in goodness-of-fit to determine the optimal number of components <abbrgrp><abbr bid="B51">51</abbr></abbrgrp>.</p>
               <p>Karjalainen proposed alternating regression for the extraction of pure components from GC-MS data <abbrgrp><abbr bid="B52">52</abbr></abbrgrp>. In this approach, <b>C </b>and &#916; are initially set to random values, and the equation (5) is solved for both <b>C </b>and &#916; iteratively, by applying constrains such as non-negativity and unimodal shape, until the convergence is achieved <abbrgrp><abbr bid="B52">52</abbr></abbrgrp>. This method requires the number of components to be known, and the author proposed this to be found by trial-and-error <abbrgrp><abbr bid="B52">52</abbr></abbrgrp>. Since multiple solutions may be obtained by convergence from random values, the repetition of the calculation from different initial values was proposed to establish the stability of the solution <abbrgrp><abbr bid="B52">52</abbr></abbrgrp>.</p>
               <p>An iterative optimization method for peak deconvolution was proposed for the special case when one signal is embedded within another <abbrgrp><abbr bid="B53">53</abbr></abbrgrp>. In this method, least squares are used to obtain mass spectra of pure components <abbrgrp><abbr bid="B53">53</abbr></abbrgrp>. Shao and co-authors reported the application of the artificial immune algorithm for the extraction of pure components in GC-MS data <abbrgrp><abbr bid="B61">61</abbr></abbrgrp> (immune algorithms are inspired by the defense processes of the biological immune system <abbrgrp><abbr bid="B62">62</abbr></abbrgrp>). These authors used independent component analysis <abbrgrp><abbr bid="B63">63</abbr></abbrgrp> to extract the mass spectra of pure components, and then chromatographic profiles corresponding to these pure components were extracted with an adaptive immune algorithm <abbrgrp><abbr bid="B61">61</abbr></abbrgrp>. The method was demonstrated on simulated data, and on experimental data obtained on the pyrolysates of phenylalanine <abbrgrp><abbr bid="B61">61</abbr></abbrgrp>.</p>
               <p>Stokkum and co-authors proposed the regression method based on a parametrized model of the data, where elution profiles are described with exponentially modified Gaussian functions <abbrgrp><abbr bid="B54">54</abbr></abbrgrp>. In this method the data is separated into time windows, so that each time window contains only a small number of pure components, estimated from the principal component analysis <abbrgrp><abbr bid="B54">54</abbr></abbrgrp>. In their model each component is described with three parameters determined by the nonnegative least squares fit, where the difference between the model at the parameter values and the data is minimized <abbrgrp><abbr bid="B54">54</abbr></abbrgrp>.</p>
            </sec>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Discussion</p>
         </st>
         <p>Automated extraction of pure components from co-eluting components in GC-MS data is a challenging problem. To make the problem tractable, most methods rely on implicit or explicit assumptions about the characteristics of the signal and the noise. For example, Knorr et al. <abbrgrp><abbr bid="B51">51</abbr></abbrgrp> modelled signal peaks as exponentially modified Gaussian functions; Stein assumed that a single noise parameter derived from multiple ion chromatograms can adequately describe random fluctuations in data <abbrgrp><abbr bid="B36">36</abbr></abbrgrp>; Colby assumed that a fixed number of bins is optimal to bin centroid intensities <abbrgrp><abbr bid="B35">35</abbr></abbrgrp>, and so on. The degree of validity of such assumptions will depend on the data at hand, and when the assumptions are no longer valid the method is likely to fail.</p>
         <p>In addition, experimental GC-MS data may contain a range of irregularities and imperfections, confounding the problem further. For example, in a typical GC-MS experimental setup only intensities above a threshold are stored <abbrgrp><abbr bid="B36">36</abbr></abbrgrp>. This may result in zero intensities, or entire blocks of zero intensities embedded in the data, which in turn complicates the analysis of noise. There are at least five experimental factors that collectively, and often confoundingly, influence the characteristics of GC-MS data:</p>
         <p indent="1">1. <it>The nature of sample components</it>. More complex samples produce more signals per standard chromatographic separation run, and this results in increased peak crowding and overlap. The more severe the peak overlap the more difficult is the extraction of pure components, and this is especially the case if multi-component overlap occurs.</p>
         <p indent="1">2. <it>The sample matrix</it>. The sample matrix can profoundly influence both the characteristics and quality of the GC-MS data. Samples of biological material can have large amounts of background chemicals which interfere with the detection of trace compounds, both through impeding the efficacy in separation/detection, and also by producing noise-like effects. Specifically, samples of urine, saliva and serum are associated with difficult sample matrices.</p>
         <p indent="1">3. <it>Condition of the instrument</it>. Less than optimal instrument condition may result in chemical noise that is difficult to model (see below). For example, a worn out liner, a component of the GC inlet system, may deform peak shapes and affect peak resolution; a sub-optimal connection of the column may result in oxygen diffusion into the system increasing the background noise; septum bleed may result in wide humps that distort the signal baseline, and so on. In addition, mechanical problems associated with gas chromatography, such as uneven flow of the carrier gas or column packaging may have similar effects.</p>
         <p indent="1">4. <it>Instrument tuning and experiment runtime parameters</it>. The parameters set by the operator, if not optimal, may adversely affect the quality of GC-MS data. For example, faster oven ramp rates result in shorter experiment times, but also increase peak crowding and consequently peak overlap.</p>
         <p indent="1">5. <it>Instrument type</it>. Data acquired on different GC-MS instruments may have different characteristics (retention time resolution, m/z resolution, noise characteristics). For example, time-of-flight (TOF) instruments allow faster scan rates compared to quadrupole instruments, and typically result in higher resolution data.</p>
         <p>Purely from the data viewpoint, the main challenges in automated signal detection include <it>a priori </it>unknown shapes of signal peaks and reliable separation of the true signal from noise. In most practical situations, the latter problem is more challenging; the question of peak shapes is amenable to empirical solutions. A number of empirical functions were successfully used for the modelling of chromatographic peak shapes in the past <abbrgrp><abbr bid="B57">57</abbr></abbrgrp>.</p>
         <p>In GC-MS experiments a combination of true noise and chemical noise is typically observed. True noise refers to random fluctuations that originate from the limitations in instrument electronics (this type of noise is always present in instruments that use ion multipliers). On the other hand, chemical noise arises from extraneous chemical components introduced in the system unintentionally. Such components may be introduced during the sample preparation process (for example, as a consequence of derivatization), or may originate from the instrument condition (due to column bleed, for example). Therefore chemical noise is not noise at all, but unwanted signal that originates from chemical components introduced as a part of the experimental process <abbrgrp><abbr bid="B64">64</abbr></abbrgrp>.</p>
         <p>Although the origin of noise in GC-MS experiments is well understood, in any specific experiment noise is difficult to model or account for accurately. In practice, noise may manifest itself in any number of ways. For example, the signal from chemical noise may overlap or obscure the signal of interest; alternatively the net effect may be the degradation of the signal quality due to increased background, lower signal-to-noise ratio, skewed peak shapes or distorted signal baseline. Furthermore, very low concentration components present in the sample may result in true signals that are at the level of noise. As a result, in experimental data often there is no clear separation between the signal and the noise components (Figure <figr fid="F3">3</figr>).</p>
         <fig id="F3">
            <title>
               <p>Figure 3</p>
            </title>
            <caption>
               <p>Two fragments of experimental GC-MS data matrices, equation (6), showing signals from closely co-eluting components</p>
            </caption>
            <text>
               <p><b>Two fragments of experimental GC-MS data matrices, equation (6), showing signals from closely co-eluting components</b>. The signal peaks in the panel (a) exhibit symmetric peak shapes, while the signal peaks in the the panel (b) show slightly asymmetric peaks. This effect (dubbed "peak tailing") can originate from several instrument conditions, for example column degradation, or contaminants left in the injection port. Both data sets show a continuum between noise and weak signals, a situation typically encountered in practice.</p>
            </text>
            <graphic file="1756-0381-2-6-3"/>
         </fig>
         <p>A review of the literature suggests that the most widely used, publicly described method for peak deconvolution is AMDIS <abbrgrp><abbr bid="B36">36</abbr></abbrgrp> (this view is corroborated by others <abbrgrp><abbr bid="B65">65</abbr></abbrgrp>). We speculate that this is for two reasons. First, AMDIS is probably the only method implemented in a freely available software package targeting the PC computing environment most analysts are familiar with (although it is not open source) <abbrgrp><abbr bid="B36">36</abbr></abbrgrp>. Second, in AMDIS component detection is integrated with library matching <abbrgrp><abbr bid="B36">36</abbr></abbrgrp>, which is useful in practice.</p>
         <p>The main weakness of empirical methods, including AMDIS, is the use of arbitrary rules and empirical parameters. For example, AMDIS divides each ion chromatogram into segments of 13 scans for noise analysis; zero abundance values are replaced based on a complicated set of empirical rules that involve several arbitrarily chosen parameters; pre-set maximum number of scans in component detection is 12; "peak sharpness" is defined by an empirical formula, which in turn features a single "noise" parameter calculated empirically, and this parameter is assumed to faithfully represent the noise; the multiplier for maximum range in peak sharpness calculation is 50; the components that do not have the sharpness within 75% of the maximum value are discarded; and so on <abbrgrp><abbr bid="B36">36</abbr></abbrgrp>. Why exactly these numbers are chosen is impossible to justify in a truly objective way. Furthermore, the sheer number of empirical rules and parameters suggest that a systematic optimization of an empirical method such is AMDIS is difficult, and understanding fully how the parameters affect the final result is probably not a realistic goal. AMDIS was originally optimized for a specific GC-MS application <abbrgrp><abbr bid="B36">36</abbr></abbrgrp>, and subsequently applied to other systems <abbrgrp><abbr bid="B66">66</abbr><abbr bid="B67">67</abbr></abbrgrp>. However, a recent study reported that AMDIS generated as much as 70-80% false components (false positives) <abbrgrp><abbr bid="B58">58</abbr></abbrgrp>.</p>
         <p>On the other hand, the matrix methods remain marginally used in practice, in spite of the considerable enthusiasm that surrounded many initial demonstrations. There are several reasons for this. First, most matrix methods proposed in the past were proof-of-concept demonstrations, and had failed to establish unambiguously their usefulness in real experimental scenarios. Second, often there is no intuitive picture associated with matrix methods. For example, the eigenvalue methods result in matrix decompositions of the original GC-MS data that have no physical meaning <abbrgrp><abbr bid="B48">48</abbr></abbrgrp>. This is certainly a downside for most GC-MS practitioners, at least before the method's advantages in real experimental scenarios are clear. Finally, and related to the first point, software implementations that would allow matrix methods to be tested by a wider community and under realistic experimental scenarios are lacking. To our knowledge none of the matrix methods reviewed here were accompanied by an accessible and widely available software implementation.</p>
         <p>Another problem is the method demonstrations in limited experimental scenarios. The first attempts to use the eigenvalue analysis for the separation of overlapped GC-MS signals were on simple binary mixtures with a limited range of <it>m/z </it>values <abbrgrp><abbr bid="B43">43</abbr><abbr bid="B44">44</abbr><abbr bid="B45">45</abbr></abbrgrp>. Ritter and co-authors used four sets of binary mixtures (cyclohexane/cyclohexene, hexane/cyclohexane, heptane/octane, and unknown xylenes), and only 20 <it>m/z </it>values <abbrgrp><abbr bid="B44">44</abbr></abbrgrp>. Subsequent work used more realistic but still limited experimental scenarios compared to modern standards. For example, Abdallah and co-authors used binary mixtures with 135 <it>m/z </it>values <abbrgrp><abbr bid="B46">46</abbr></abbrgrp>, while Roach and Guilhaus used a mixture of seven organochlorine compounds with a similar <it>m/z </it>range <abbrgrp><abbr bid="B48">48</abbr></abbrgrp>.</p>
         <p>The method based on differential processing of GC-MS data was originally proposed by Ghosh and Anderegg <abbrgrp><abbr bid="B39">39</abbr><abbr bid="B40">40</abbr></abbrgrp>, and subsequently developed further by Pool and co-authors <abbrgrp><abbr bid="B41">41</abbr><abbr bid="B42">42</abbr></abbrgrp>. Interestingly, the authors compared differential processing with the empirical method of Colby <abbrgrp><abbr bid="B35">35</abbr></abbrgrp>, and the regression method of Karjalainen <abbrgrp><abbr bid="B52">52</abbr></abbrgrp>, and reported that backfolding outperformed both methods <abbrgrp><abbr bid="B42">42</abbr></abbrgrp>. Unfortunately this conclusion was based on the analysis of only a small fragment of an experimental data set <abbrgrp><abbr bid="B42">42</abbr></abbrgrp>.</p>
         <p>The first applications of regression to GC-MS peak deconvolution were proposed not long after the first eigenvalue methods were tested <abbrgrp><abbr bid="B50">50</abbr><abbr bid="B51">51</abbr></abbrgrp>. The method of Blaisdell and Sweeley relied on both the eigenvalue analysis and linear least squares, although the original description lacked the full mathematical detail <abbrgrp><abbr bid="B50">50</abbr></abbrgrp>. The regression method of Knorr and co-authors amounts to a mathematical decomposition of the data matrix, equation (5), where the individual ion chromatograms are modelled explicitly with modified Gaussian function <abbrgrp><abbr bid="B51">51</abbr></abbrgrp>. This idea is clearly a viable approach for resolving multi-component overlapping signals. However, its demonstration in the original formulation was on highly simplified data compared to today's standards: binary and ternary mixtures with 30 mass spectrometry scans involving a small number of <it>m/z </it>channels <abbrgrp><abbr bid="B51">51</abbr></abbrgrp>.</p>
         <p>The alternative regression method of Karjalainen appears to be both advanced and model-free <abbrgrp><abbr bid="B52">52</abbr></abbrgrp>. In the original publication, the author reported two problems: with the convergence and with determining the number of components <abbrgrp><abbr bid="B52">52</abbr></abbrgrp>. Recently, Jonsson and co-authors proposed an approach based on the method of Karjalainen <abbrgrp><abbr bid="B68">68</abbr></abbrgrp>. In this approach each data set is divided into suitable time windows, and within each time window the overlapped signals are resolved with the alternating regression method originally proposed by Karjalainen <abbrgrp><abbr bid="B52">52</abbr></abbrgrp>; a multivariate analysis is used to identify time windows which contain significant differences between samples <abbrgrp><abbr bid="B29">29</abbr><abbr bid="B68">68</abbr></abbrgrp>. Jonsson and co-authors also proposed an an improved method for choosing initial values that provided better convergence compared to random values, as originally proposed by Karjalainen <abbrgrp><abbr bid="B52">52</abbr></abbrgrp>.</p>
         <p>The regression method of Gong and co-authors <abbrgrp><abbr bid="B53">53</abbr></abbrgrp> was applied on complex plant samples; however the focus of this method was on resolving a specific type of signal overlap. An interesting outcome of this study was that signal clusters originating from co-eluting components should be analyzed differently, depending on the specific nature of the signal overlap <abbrgrp><abbr bid="B53">53</abbr></abbrgrp>. The library search method of Gan and Liang aimed to tackle both deconvolution and spectral matching simultaneously <abbrgrp><abbr bid="B60">60</abbr></abbrgrp>. However, even in an ideal scenario, this method has strong limitations, since any component that does not have a mass spectrum in the library cannot be identified as a part of the mixture.</p>
         <p>A method for peak deconvolution based on artificial immune algorithm <abbrgrp><abbr bid="B62">62</abbr></abbrgrp> was reported by Shao and co-authors <abbrgrp><abbr bid="B61">61</abbr></abbrgrp>. Their test cases involved GC-MS data obtained from pyrolysates of phenylalanine <abbrgrp><abbr bid="B61">61</abbr></abbrgrp>; however the analysis focused on a narrow retention time range of 0.5 minutes which contained three overlapped components. The authors also compared the performance of the proposed method with the multivariate curve resolution method SIMPLISMA <abbrgrp><abbr bid="B69">69</abbr></abbrgrp>. To our knowledge, beyond this work SIMPLISMA was not applied to GC-MS data, although it was used for resolution of co-eluting components in liquid chromatography-mass spectrometry (LC-MS) <abbrgrp><abbr bid="B70">70</abbr></abbrgrp>. It is interesting that SIMPLISMA <abbrgrp><abbr bid="B69">69</abbr></abbrgrp> was originally inspired by the factor analysis work of Knorr and Futrell <abbrgrp><abbr bid="B45">45</abbr></abbrgrp>.</p>
         <p>Recently, a novel regression method was reported by Stokkum and co-authors <abbrgrp><abbr bid="B54">54</abbr></abbrgrp>. This method borrows several strategies from the work of Jonsson et al. <abbrgrp><abbr bid="B29">29</abbr><abbr bid="B68">68</abbr></abbrgrp>, including dividing the data into time windows. Applications on real and simulated GC-MS data sets under difficult co-eluting scenarios demonstrated that this method is competitive with multivariate curve resolution <abbrgrp><abbr bid="B29">29</abbr><abbr bid="B68">68</abbr></abbrgrp> at simultaneous analysis of multiple GC-MS data sets.</p>
      </sec>
      <sec>
         <st>
            <p>Conclusion</p>
         </st>
         <p>In this work, published methods for the extraction of pure components in GC-MS data with co-eluting components were reviewed. This provides several important insights. First, in reports presenting new peak deconvolution methods, the use of realistic experimental scenarios is important. Second, for any new method, the availability of software implementation that would allow the method to be tested by a wider GC-MS community, is critical.</p>
         <p>Perhaps a more subtle point is that most matrix methods require the number of components to be known prior to the separation of overlapped signals. This is evident in both early studies <abbrgrp><abbr bid="B43">43</abbr><abbr bid="B44">44</abbr><abbr bid="B45">45</abbr><abbr bid="B46">46</abbr><abbr bid="B48">48</abbr><abbr bid="B51">51</abbr><abbr bid="B52">52</abbr></abbrgrp> as well as in more recent works <abbrgrp><abbr bid="B29">29</abbr><abbr bid="B53">53</abbr><abbr bid="B54">54</abbr><abbr bid="B68">68</abbr></abbrgrp>, suggesting that a separate analysis of this problem is warranted. We also note that the method of Jonsson and co-authors <abbrgrp><abbr bid="B29">29</abbr><abbr bid="B68">68</abbr></abbrgrp> may provide the recipe for a systematic deconvolution of the entire data set by applying divide-and-conquer strategy, coupled with the alternating regression originally proposed by Karjalainen <abbrgrp><abbr bid="B52">52</abbr></abbrgrp>.</p>
         <p>Although the empirical methods for peak deconvolution are currently most widely used in practice, it seems inevitable that matrix methods will dominate the future. This is evident from the application of matrix methods to the analysis of complex plant samples <abbrgrp><abbr bid="B53">53</abbr></abbrgrp>, development of new matrix-like approaches <abbrgrp><abbr bid="B54">54</abbr><abbr bid="B61">61</abbr></abbrgrp> and methods aimed to identify differences in high-throughput GC-MS data <abbrgrp><abbr bid="B29">29</abbr><abbr bid="B68">68</abbr><abbr bid="B71">71</abbr></abbrgrp>.</p>
         <p>Remarkable progress in the field of general computing in the past two decades has opened new avenues for tackling the problem of peak deconvolution, and GC-MS data processing in general. Modern computer hardware is thousands of times more capable compared to the elite computing machines of twenty years ago. Several important works reviewed here were performed on (today completely obsolete) PDP-11 computers <abbrgrp><abbr bid="B33">33</abbr><abbr bid="B50">50</abbr><abbr bid="B51">51</abbr></abbrgrp>. For example, Blaisdell and co-authors reported that a mere 20,000 16-bit words of core memory was available for their programs <abbrgrp><abbr bid="B50">50</abbr></abbrgrp>. Furthermore, modern computing clusters based on commodity hardware allow even further scaling in the CPU power. The changes in the software landscape are equally drastic. For example, in their application of principal component analysis Davis and co-authors wrote their own functions for eigenvalue decomposition in the programming language BASIC <abbrgrp><abbr bid="B43">43</abbr></abbrgrp>. Today, software platforms such as MATLAB <abbrgrp><abbr bid="B72">72</abbr></abbrgrp>, GNU Octave <abbrgrp><abbr bid="B73">73</abbr></abbrgrp>, and R <abbrgrp><abbr bid="B74">74</abbr></abbrgrp> provide integrated environments with thousands of highly optimized mathematical and statistical functions readily available (and in the case of open source packages such as GNU Octave and R, at no cost). Moreover, a range of open source projects such as Python <abbrgrp><abbr bid="B75">75</abbr></abbrgrp>, Perl <abbrgrp><abbr bid="B76">76</abbr></abbrgrp>, and Java <abbrgrp><abbr bid="B77">77</abbr></abbrgrp> provide general purpose programming languages with rich and well tested libraries. These developments suggest that a new era of collaborative computing, based on open standards and open source software, is about to emerge in GC-MS data processing. A similar transformation is already evident from the initiatives to standardize representations of mass spectrometry data <abbrgrp><abbr bid="B78">78</abbr></abbrgrp>, and open source packages for LC-MS data processing recently published <abbrgrp><abbr bid="B79">79</abbr><abbr bid="B80">80</abbr></abbrgrp>.</p>
      </sec>
      <sec>
         <st>
            <p>Competing interests</p>
         </st>
         <p>The author declares that they have no competing interests.</p>
      </sec>
   </bdy>
   <bm>
      <ack>
         <sec>
            <st>
               <p>Acknowledgements</p>
            </st>
            <p>The author thanks Joachim Kopka, Gary Siuzdak, H. Paul Benton, Georgie Harding and Amanda Gelsi for reading the manuscript and their valuable comments.</p>
         </sec>
      </ack>
      <refgrp>
         <bibl id="B1">
            <aug>
               <au>
                  <snm>Gross</snm>
                  <fnm>JH</fnm>
               </au>
            </aug>
            <source>Mass Spectrometry: A Textbook</source>
            <publisher>Berlin: Springer-Verlag</publisher>
            <pubdate>2004</pubdate>
         </bibl>
         <bibl id="B2">
            <aug>
               <au>
                  <snm>Heftmann</snm>
                  <fnm>E</fnm>
               </au>
            </aug>
            <source>Chromatography: Fundamentals and Applications of Chromatography and Related Differential Migration Methods</source>
            <publisher>Amsterdam: Elsevier</publisher>
            <pubdate>2004</pubdate>
         </bibl>
         <bibl id="B3">
            <title>
               <p>Metabolite profiling: from diagnostics to systems biology</p>
            </title>
            <aug>
               <au>
                  <snm>Fernie</snm>
                  <fnm>AR</fnm>
               </au>
               <au>
                  <snm>Trethewey</snm>
                  <fnm>RN</fnm>
               </au>
               <au>
                  <snm>Krotzky</snm>
                  <fnm>AJ</fnm>
               </au>
               <au>
                  <snm>Willmitzer</snm>
                  <fnm>L</fnm>
               </au>
            </aug>
            <source>Nat Rev Mol Cell Biol</source>
            <pubdate>2004</pubdate>
            <volume>5</volume>
            <fpage>763</fpage>
            <lpage>769</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/nrm1451</pubid>
                  <pubid idtype="pmpid" link="fulltext">15340383</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B4">
            <title>
               <p>The expanding role of mass spectrometry in metabolite profiling and characterization</p>
            </title>
            <aug>
               <au>
                  <snm>Want</snm>
                  <fnm>EJ</fnm>
               </au>
               <au>
                  <snm>Cravatt</snm>
                  <fnm>BF</fnm>
               </au>
               <au>
                  <snm>G</snm>
                  <fnm>S</fnm>
               </au>
            </aug>
            <source>ChemBioChem</source>
            <pubdate>2005</pubdate>
            <volume>6</volume>
            <fpage>1</fpage>
            <lpage>11</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1002/cbic.200500151</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B5">
            <title>
               <p>Chemical derivatization and mass spectral libraries in metabolic profiling by GC/MS and LC/MS/MS</p>
            </title>
            <aug>
               <au>
                  <snm>Halket</snm>
                  <fnm>JM</fnm>
               </au>
               <au>
                  <snm>Waterman</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Przyborowska</snm>
                  <fnm>AM</fnm>
               </au>
               <au>
                  <snm>Patel</snm>
                  <fnm>RK</fnm>
               </au>
               <au>
                  <snm>Fraser</snm>
                  <fnm>PD</fnm>
               </au>
               <au>
                  <snm>Bramley</snm>
                  <fnm>PM</fnm>
               </au>
            </aug>
            <source>J Exp Bot</source>
            <pubdate>2005</pubdate>
            <volume>56</volume>
            <fpage>219</fpage>
            <lpage>243</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/jxb/eri069</pubid>
                  <pubid idtype="pmpid" link="fulltext">15618298</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B6">
            <title>
               <p>Gas Chromatography Mass Spectrometry</p>
            </title>
            <aug>
               <au>
                  <snm>Kopka</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>Plant Metabolomics</source>
            <publisher>Heidelberg: Springer</publisher>
            <editor>Saito K, Dixon RA, Willmitzer L</editor>
            <pubdate>2006</pubdate>
            <fpage>3</fpage>
            <lpage>20</lpage>
         </bibl>
         <bibl id="B7">
            <title>
               <p>Metabolic profiles: gas-phase methods for analysis of metabolites</p>
            </title>
            <aug>
               <au>
                  <snm>Horning</snm>
                  <fnm>EC</fnm>
               </au>
               <au>
                  <snm>Horning</snm>
                  <fnm>MG</fnm>
               </au>
            </aug>
            <source>Clinical Chemistry</source>
            <pubdate>1971</pubdate>
            <volume>17</volume>
            <fpage>802</fpage>
            <lpage>809</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">5105517</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B8">
            <title>
               <p>Application of gas chromatography-mass spectrometry in routine and research in clinical chemistry</p>
            </title>
            <aug>
               <au>
                  <snm>Eldjarn</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Jellum</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Stokke</snm>
                  <fnm>O</fnm>
               </au>
            </aug>
            <source>J Chromatogr</source>
            <pubdate>1974</pubdate>
            <volume>91</volume>
            <fpage>353</fpage>
            <lpage>366</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S0021-9673(01)97914-2</pubid>
                  <pubid idtype="pmpid">4133390</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B9">
            <title>
               <p>Profiling steroids by gas chromatography-mass spectrometry: clinical applications</p>
            </title>
            <aug>
               <au>
                  <snm>Wudy</snm>
                  <fnm>SA</fnm>
               </au>
               <au>
                  <snm>Homoki</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>Diagnostics of Endocrine Function in Children and Adolescents</source>
            <publisher>Basel: Krager</publisher>
            <editor>Ranke MB</editor>
            <pubdate>2003</pubdate>
            <fpage>427</fpage>
            <lpage>449</lpage>
         </bibl>
         <bibl id="B10">
            <title>
               <p>Gas chromatography/mass spectrometry in metabolic profiling of biological fluids</p>
            </title>
            <aug>
               <au>
                  <snm>Pasikanti</snm>
                  <fnm>KK</fnm>
               </au>
               <au>
                  <snm>Ho</snm>
                  <fnm>PC</fnm>
               </au>
               <au>
                  <snm>Chan</snm>
                  <fnm>EC</fnm>
               </au>
            </aug>
            <source>J Chromatogr B</source>
            <pubdate>2008</pubdate>
            <volume>871</volume>
            <fpage>202</fpage>
            <lpage>211</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1016/j.jchromb.2008.04.033</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B11">
            <title>
               <p>Metabolite profiling of germinating rice seeds</p>
            </title>
            <aug>
               <au>
                  <snm>Shu</snm>
                  <fnm>XL</fnm>
               </au>
               <au>
                  <snm>Frank</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Shu</snm>
                  <fnm>QY</fnm>
               </au>
               <au>
                  <snm>Engel</snm>
                  <fnm>KH</fnm>
               </au>
            </aug>
            <source>J Agric Food Chem</source>
            <pubdate>2008</pubdate>
            <volume>56</volume>
            <fpage>11612</fpage>
            <lpage>11620</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1021/jf802671p</pubid>
                  <pubid idtype="pmpid" link="fulltext">19053355</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B12">
            <title>
               <p>Software-based mass spectral enhancement to remove interferences from spectra of unknowns</p>
            </title>
            <aug>
               <au>
                  <snm>Herron</snm>
                  <fnm>NR</fnm>
               </au>
               <au>
                  <snm>Donnelly</snm>
                  <fnm>JR</fnm>
               </au>
               <au>
                  <snm>W</snm>
                  <fnm>SG</fnm>
               </au>
            </aug>
            <source>J Am Soc Mass Spectrom</source>
            <pubdate>1996</pubdate>
            <volume>7</volume>
            <fpage>598</fpage>
            <lpage>604</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1016/1044-0305(96)00018-9</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B13">
            <aug>
               <au>
                  <snm>Johnstone</snm>
                  <fnm>RAW</fnm>
               </au>
               <au>
                  <snm>Johnstone</snm>
                  <fnm>RA</fnm>
               </au>
               <au>
                  <snm>Rose</snm>
                  <fnm>ME</fnm>
               </au>
            </aug>
            <source>Mass spectrometry for chemists and biochemists</source>
            <publisher>Cambridge: Cambridge University Press</publisher>
            <pubdate>1996</pubdate>
         </bibl>
         <bibl id="B14">
            <aug>
               <au>
                  <snm>Niessen</snm>
                  <fnm>WMA</fnm>
               </au>
               <au>
                  <cnm>(Ed)</cnm>
               </au>
            </aug>
            <source>Current Practice of Gas Chromatography-Mass Spectrometry</source>
            <publisher>New York: Marcel Dekker, Inc</publisher>
            <pubdate>2001</pubdate>
         </bibl>
         <bibl id="B15">
            <title>
               <p>Metabolite profiling for plant functional genomics</p>
            </title>
            <aug>
               <au>
                  <snm>Fiehn</snm>
                  <fnm>O</fnm>
               </au>
               <au>
                  <snm>Kopka</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>D&#246;rmann</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Altmann</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Trethewey</snm>
                  <fnm>RN</fnm>
               </au>
               <au>
                  <snm>Willmitzer</snm>
                  <fnm>L</fnm>
               </au>
            </aug>
            <source>Nat Biotechnol</source>
            <pubdate>2000</pubdate>
            <volume>18</volume>
            <fpage>1157</fpage>
            <lpage>1161</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/81137</pubid>
                  <pubid idtype="pmpid" link="fulltext">11062433</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B16">
            <title>
               <p>Comprehensive metabolite profiling of Sinorhizobium meliloti using gas chromatography-mass spectrometry</p>
            </title>
            <aug>
               <au>
                  <snm>Barsch</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Patschkowski</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Niehaus</snm>
                  <fnm>K</fnm>
               </au>
            </aug>
            <source>Funct Integr Genomics</source>
            <pubdate>2004</pubdate>
            <volume>4</volume>
            <fpage>219</fpage>
            <lpage>230</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1007/s10142-004-0117-y</pubid>
                  <pubid idtype="pmpid" link="fulltext">15372312</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B17">
            <title>
               <p>High-throughput metabolic state analysis: the missing link in integrated functional genomics of yeasts</p>
            </title>
            <aug>
               <au>
                  <snm>Villas-B&#244;as</snm>
                  <fnm>SG</fnm>
               </au>
               <au>
                  <snm>Moxley</snm>
                  <fnm>JF</fnm>
               </au>
               <au>
                  <snm>Akesson</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Stephanopoulos</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Nielsen</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>Biochem J</source>
            <pubdate>2005</pubdate>
            <volume>388</volume>
            <fpage>669</fpage>
            <lpage>677</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1042/BJ20041162</pubid>
                  <pubid idtype="pmcid">1138975</pubid>
                  <pubid idtype="pmpid" link="fulltext">15667247</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B18">
            <title>
               <p>Metabolite profiling of germinating rice seeds</p>
            </title>
            <aug>
               <au>
                  <snm>Devantier</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Scheithauer</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Villas-B&#244;as</snm>
                  <fnm>SG</fnm>
               </au>
               <au>
                  <snm>Pedersen</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>L</snm>
                  <fnm>O</fnm>
               </au>
            </aug>
            <source>J Agric Food Chem</source>
            <pubdate>2008</pubdate>
            <volume>56</volume>
            <fpage>11612</fpage>
            <lpage>11620</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1021/jf802671p</pubid>
                  <pubid idtype="pmpid" link="fulltext">19053355</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B19">
            <title>
               <p>Extraction and GC/MS analysis of the human blood plasma metabolome</p>
            </title>
            <aug>
               <au>
                  <snm>Jiye</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Trygg</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Gullberg</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Johansson</snm>
                  <fnm>AI</fnm>
               </au>
               <au>
                  <snm>Jonsson</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Antti</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Marklund</snm>
                  <fnm>SL</fnm>
               </au>
               <au>
                  <snm>T</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Anal Chem</source>
            <pubdate>2005</pubdate>
            <volume>77</volume>
            <fpage>8086</fpage>
            <lpage>9094</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1021/ac051211v</pubid>
                  <pubid idtype="pmpid" link="fulltext">16351159</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B20">
            <title>
               <p>Mass spectrometry-based metabolic profiling reveals different metabolite patterns in invasive ovarian carcinomas and ovarian borderline tumors</p>
            </title>
            <aug>
               <au>
                  <snm>Denkert</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Budczies</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Kind</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Weichert</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Tablack</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Sehouli</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Niesporek</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>K&#246;nsgen</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Dietel</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Fiehn</snm>
                  <fnm>O</fnm>
               </au>
            </aug>
            <source>Cancer Res</source>
            <pubdate>2006</pubdate>
            <volume>66</volume>
            <fpage>10795</fpage>
            <lpage>10804</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1158/0008-5472.CAN-06-0755</pubid>
                  <pubid idtype="pmpid" link="fulltext">17108116</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B21">
            <title>
               <p>Identification and relative quantification of fatty acids in Escherichia coli membranes by gas chromatography/mass spectrometry</p>
            </title>
            <aug>
               <au>
                  <snm>Oursel</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Loutelier-Bourhis</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Orange</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Chevalier</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Norris</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>M</snm>
                  <fnm>LC</fnm>
               </au>
            </aug>
            <source>Rapid Commun Mass Spectrom</source>
            <pubdate>2007</pubdate>
            <volume>21</volume>
            <fpage>3229</fpage>
            <lpage>3233</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1002/rcm.3177</pubid>
                  <pubid idtype="pmpid" link="fulltext">17828792</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B22">
            <title>
               <p>Phenotype differentiation of three E. coli strains by GC-FID and GC-MS based metabolomics</p>
            </title>
            <aug>
               <au>
                  <snm>Tian</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Shi</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Gao</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Yuan</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Yang</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Lu</snm>
                  <fnm>X</fnm>
               </au>
               <au>
                  <snm>Xu</snm>
                  <fnm>G</fnm>
               </au>
            </aug>
            <source>J Chromatogr B</source>
            <pubdate>2008</pubdate>
            <volume>871</volume>
            <fpage>220</fpage>
            <lpage>226</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1016/j.jchromb.2008.06.031</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B23">
            <title>
               <p>Development and validation of a gas chromatography/mass spectrometry metabonomic platform for the global profiling of urinary metabolites</p>
            </title>
            <aug>
               <au>
                  <snm>Pasikanti</snm>
                  <fnm>KK</fnm>
               </au>
               <au>
                  <snm>Ho</snm>
                  <fnm>PC</fnm>
               </au>
               <au>
                  <snm>Chan</snm>
                  <fnm>EC</fnm>
               </au>
            </aug>
            <source>Rapid Commun Mass Spectrom</source>
            <pubdate>2008</pubdate>
            <volume>22</volume>
            <fpage>2984</fpage>
            <lpage>2902</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1002/rcm.3699</pubid>
                  <pubid idtype="pmpid" link="fulltext">18763274</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B24">
            <title>
               <p>A pilot study of GC/MS-based serum metabolic profiling of acute rejection in renal transplantation</p>
            </title>
            <aug>
               <au>
                  <snm>Mao</snm>
                  <fnm>YY</fnm>
               </au>
               <au>
                  <snm>Bai</snm>
                  <fnm>JQ</fnm>
               </au>
               <au>
                  <snm>Chen</snm>
                  <fnm>JH</fnm>
               </au>
               <au>
                  <snm>Shou</snm>
                  <fnm>ZF</fnm>
               </au>
               <au>
                  <snm>He</snm>
                  <fnm>Q</fnm>
               </au>
               <au>
                  <snm>Wu</snm>
                  <fnm>JY</fnm>
               </au>
               <au>
                  <snm>Chen</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Cheng</snm>
                  <fnm>YY</fnm>
               </au>
            </aug>
            <source>Transpl Immunol</source>
            <pubdate>2008</pubdate>
            <volume>19</volume>
            <fpage>74</fpage>
            <lpage>80</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">18346641</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B25">
            <title>
               <p>Connecting genes to metabolites by a systems biology approach</p>
            </title>
            <aug>
               <au>
                  <snm>Oksman-Caldentey</snm>
                  <fnm>KM</fnm>
               </au>
               <au>
                  <snm>Inz&#233;</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Ore&#353;i&#269;</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>2004</pubdate>
            <volume>101</volume>
            <fpage>9949</fpage>
            <lpage>9950</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1073/pnas.0403636101</pubid>
                  <pubid idtype="pmcid">454195</pubid>
                  <pubid idtype="pmpid" link="fulltext">15226495</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B26">
            <title>
               <p>Metabolomics and systems biology in Saccaromyces cerevisiae</p>
            </title>
            <aug>
               <au>
                  <snm>Castrillo</snm>
                  <fnm>JI</fnm>
               </au>
               <au>
                  <snm>Oliver</snm>
                  <fnm>SG</fnm>
               </au>
            </aug>
            <source>The Mycota XII</source>
            <publisher>Heidelberg: Springer-Verlag</publisher>
            <editor>P BAJ</editor>
            <pubdate>2006</pubdate>
            <fpage>3</fpage>
            <lpage>18</lpage>
         </bibl>
         <bibl id="B27">
            <title>
               <p>Systems biology, metabolic modelling and metabolomics in drug discovery and development</p>
            </title>
            <aug>
               <au>
                  <snm>Kell</snm>
                  <fnm>DB</fnm>
               </au>
            </aug>
            <source>Drug Discov Today</source>
            <pubdate>2006</pubdate>
            <volume>11</volume>
            <fpage>1085</fpage>
            <lpage>92</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.drudis.2006.10.004</pubid>
                  <pubid idtype="pmpid" link="fulltext">17129827</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B28">
            <title>
               <p>Standardizing GC-MS metabolomics</p>
            </title>
            <aug>
               <au>
                  <snm>Kanani</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Chrysanthopoulos</snm>
                  <fnm>PK</fnm>
               </au>
               <au>
                  <snm>Klapa</snm>
                  <fnm>MI</fnm>
               </au>
            </aug>
            <source>J Chromatogr B</source>
            <pubdate>2008</pubdate>
            <volume>871</volume>
            <fpage>191</fpage>
            <lpage>201</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1016/j.jchromb.2008.04.049</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B29">
            <title>
               <p>High-throughput data analysis for detecting and identifying differences between samples in GC/MS-based metabolomic analyses</p>
            </title>
            <aug>
               <au>
                  <snm>Jonsson</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Johansson</snm>
                  <fnm>AI</fnm>
               </au>
               <au>
                  <snm>Gullberg</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Trygg</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Jiye</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Grung</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Marklund</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Sj&#246;str&#246;m</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Antti</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Moritz</snm>
                  <fnm>T</fnm>
               </au>
            </aug>
            <source>Anal Chem</source>
            <pubdate>2005</pubdate>
            <volume>77</volume>
            <fpage>5635</fpage>
            <lpage>5642</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1021/ac050601e</pubid>
                  <pubid idtype="pmpid" link="fulltext">16131076</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B30">
            <title>
               <p>Theoretical limitations of gas chromatographic/mass spectrometric identification of multicomponent mixtures</p>
            </title>
            <aug>
               <au>
                  <snm>Rosenthal</snm>
                  <fnm>D</fnm>
               </au>
            </aug>
            <source>Anal Chem</source>
            <pubdate>1982</pubdate>
            <volume>54</volume>
            <fpage>63</fpage>
            <lpage>66</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1021/ac00238a020</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B31">
            <title>
               <p>Statistical theory of component overlap in multicomponent chromatograms</p>
            </title>
            <aug>
               <au>
                  <snm>Davis</snm>
                  <fnm>JM</fnm>
               </au>
               <au>
                  <snm>Giddings</snm>
                  <fnm>JC</fnm>
               </au>
            </aug>
            <source>Anal Chem</source>
            <pubdate>1983</pubdate>
            <volume>55</volume>
            <fpage>418</fpage>
            <lpage>424</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1021/ac00254a003</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B32">
            <title>
               <p>Reconstruction of mass spectra, a novel approach for the utilization of gas chromatograph-mass spectrometer data</p>
            </title>
            <aug>
               <au>
                  <snm>Biller</snm>
                  <fnm>JE</fnm>
               </au>
               <au>
                  <snm>Biemann</snm>
                  <fnm>K</fnm>
               </au>
            </aug>
            <source>Anal Lett</source>
            <pubdate>1974</pubdate>
            <volume>7</volume>
            <fpage>515</fpage>
            <lpage>528</lpage>
         </bibl>
         <bibl id="B33">
            <title>
               <p>Extraction of mass spectra free of background and neighboring component contributions from gas chromatography/mass spectrometry</p>
            </title>
            <aug>
               <au>
                  <snm>Dromey</snm>
                  <fnm>RG</fnm>
               </au>
               <au>
                  <snm>Stefik</snm>
                  <fnm>MJ</fnm>
               </au>
               <au>
                  <snm>Rindfleisch</snm>
                  <fnm>TC</fnm>
               </au>
               <au>
                  <snm>Duffield</snm>
                  <fnm>AM</fnm>
               </au>
            </aug>
            <source>Anal Chem</source>
            <pubdate>1976</pubdate>
            <volume>48</volume>
            <fpage>1368</fpage>
            <lpage>1375</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1021/ac50003a027</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B34">
            <title>
               <p>Improvement of algorithm for peak detection in automatic gas chromatography-mass spectrometry data processing</p>
            </title>
            <aug>
               <au>
                  <snm>Hargrove</snm>
                  <fnm>WF</fnm>
               </au>
               <au>
                  <snm>Rosenthal</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Cooley</snm>
                  <fnm>PC</fnm>
               </au>
            </aug>
            <source>Anal Chem</source>
            <pubdate>1981</pubdate>
            <volume>53</volume>
            <fpage>538</fpage>
            <lpage>539</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1021/ac00226a035</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B35">
            <title>
               <p>Spectral deconvolution for overlapping GC/MS components</p>
            </title>
            <aug>
               <au>
                  <snm>Colby</snm>
                  <fnm>BN</fnm>
               </au>
            </aug>
            <source>J Am Soc Mass Spectrom</source>
            <pubdate>1992</pubdate>
            <volume>3</volume>
            <fpage>558</fpage>
            <lpage>562</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1016/1044-0305(92)85033-G</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B36">
            <title>
               <p>An integrated method for spectrum extraction and compound identification from gas chromatography/mass spectrometry data</p>
            </title>
            <aug>
               <au>
                  <snm>Stein</snm>
                  <fnm>SE</fnm>
               </au>
            </aug>
            <source>J Am Soc Mass Spectrom</source>
            <pubdate>1999</pubdate>
            <volume>10</volume>
            <fpage>770</fpage>
            <lpage>781</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1016/S1044-0305(99)00047-1</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B37">
            <title>
               <p>Application of gas chromatography, mass spectrometry, and computer methods in clinical biochemistry</p>
            </title>
            <aug>
               <au>
                  <snm>Jellum</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Stokke</snm>
                  <fnm>O</fnm>
               </au>
               <au>
                  <snm>Eldjarn</snm>
                  <fnm>L</fnm>
               </au>
            </aug>
            <source>Anal Chem</source>
            <pubdate>1973</pubdate>
            <volume>45</volume>
            <fpage>1099</fpage>
            <lpage>1106</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1021/ac60329a037</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B38">
            <title>
               <p>Rapid computerized identification of compounds in complex biological mixtures by gas chromatography-mass spectrometry</p>
            </title>
            <aug>
               <au>
                  <snm>Sweeley</snm>
                  <fnm>CC</fnm>
               </au>
               <au>
                  <snm>Young</snm>
                  <fnm>ND</fnm>
               </au>
               <au>
                  <snm>Holland</snm>
                  <fnm>JF</fnm>
               </au>
               <au>
                  <snm>Gates</snm>
                  <fnm>SC</fnm>
               </au>
            </aug>
            <source>J Chromatogr</source>
            <pubdate>1974</pubdate>
            <volume>99</volume>
            <fpage>507</fpage>
            <lpage>517</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S0021-9673(00)90880-X</pubid>
                  <pubid idtype="pmpid">4418788</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B39">
            <title>
               <p>Differential Gas Chromatographic Mass Spectrometry</p>
            </title>
            <aug>
               <au>
                  <snm>Ghosh</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Anderegg</snm>
                  <fnm>RJ</fnm>
               </au>
            </aug>
            <source>Anal Chem</source>
            <pubdate>1989</pubdate>
            <volume>61</volume>
            <fpage>73</fpage>
            <lpage>77</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1021/ac00176a015</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B40">
            <title>
               <p>Complex Mixture Analysis Using Differential Gas Chromatographic Mass Spectrometry</p>
            </title>
            <aug>
               <au>
                  <snm>Ghosh</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Anderegg</snm>
                  <fnm>RJ</fnm>
               </au>
            </aug>
            <source>Anal Chem</source>
            <pubdate>1989</pubdate>
            <volume>61</volume>
            <fpage>2118</fpage>
            <lpage>2121</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1021/ac00193a024</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B41">
            <title>
               <p>Backfolding applied to differential gas chromatography/mass spectrometry as a mathematical enhancement of chromatographic resolution</p>
            </title>
            <aug>
               <au>
                  <snm>Pool</snm>
                  <fnm>WG</fnm>
               </au>
               <au>
                  <snm>de Leeuw</snm>
                  <fnm>JW</fnm>
               </au>
            </aug>
            <source>J Mass Spectrom</source>
            <pubdate>1996</pubdate>
            <volume>31</volume>
            <fpage>509</fpage>
            <lpage>516</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1002/(SICI)1096-9888(199605)31:5&lt;509::AID-JMS323>3.0.CO;2-B</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B42">
            <title>
               <p>Automated extraction of pure mass spectra from gas chromatographic/mass spectrometric data</p>
            </title>
            <aug>
               <au>
                  <snm>Pool</snm>
                  <fnm>WG</fnm>
               </au>
               <au>
                  <snm>de Leeuw</snm>
                  <fnm>JW</fnm>
               </au>
               <au>
                  <snm>Graaf</snm>
                  <mnm>van de</mnm>
                  <fnm>B</fnm>
               </au>
            </aug>
            <source>J Mass Spectrom</source>
            <pubdate>1997</pubdate>
            <volume>32</volume>
            <fpage>438</fpage>
            <lpage>443</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1002/(SICI)1096-9888(199704)32:4&lt;438::AID-JMS499>3.0.CO;2-N</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B43">
            <title>
               <p>Principal-component analysis applied to combined gas chromatographic-mass spectrometric data</p>
            </title>
            <aug>
               <au>
                  <snm>Davis</snm>
                  <fnm>JE</fnm>
               </au>
               <au>
                  <snm>Shepard</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Stanford</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Rogers</snm>
                  <fnm>LB</fnm>
               </au>
            </aug>
            <source>Anal Chem</source>
            <pubdate>1974</pubdate>
            <volume>46</volume>
            <fpage>821</fpage>
            <lpage>825</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1021/ac60343a045</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B44">
            <title>
               <p>Factor analysis of the mass spectra of mixtures</p>
            </title>
            <aug>
               <au>
                  <snm>Ritter</snm>
                  <fnm>GL</fnm>
               </au>
               <au>
                  <snm>Lowry</snm>
                  <fnm>SR</fnm>
               </au>
               <au>
                  <snm>Isenhour</snm>
                  <fnm>TL</fnm>
               </au>
            </aug>
            <source>Anal Chem</source>
            <pubdate>1976</pubdate>
            <volume>48</volume>
            <fpage>591</fpage>
            <lpage>595</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1021/ac60367a028</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B45">
            <title>
               <p>Separation of mass spectra of mixtures by factor analysis</p>
            </title>
            <aug>
               <au>
                  <snm>Knorr</snm>
                  <fnm>FJ</fnm>
               </au>
               <au>
                  <snm>Futrell</snm>
                  <fnm>JH</fnm>
               </au>
            </aug>
            <source>Anal Chem</source>
            <pubdate>1979</pubdate>
            <volume>51</volume>
            <fpage>1236</fpage>
            <lpage>1241</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1021/ac50044a030</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B46">
            <title>
               <p>Extraction of individual mass spectra from gas chromatography-mass spectrometry data of unseparated mixtures</p>
            </title>
            <aug>
               <au>
                  <snm>Sharaf</snm>
                  <fnm>MA</fnm>
               </au>
               <au>
                  <snm>Kowalski</snm>
                  <fnm>BR</fnm>
               </au>
            </aug>
            <source>Anal Chem</source>
            <pubdate>1981</pubdate>
            <volume>53</volume>
            <fpage>518</fpage>
            <lpage>522</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1021/ac00226a031</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B47">
            <title>
               <p>Evolving factor analysis for the resolution of overlapping chromatographic peaks</p>
            </title>
            <aug>
               <au>
                  <snm>Meader</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Anal Chem</source>
            <pubdate>1987</pubdate>
            <volume>59</volume>
            <fpage>527</fpage>
            <lpage>530</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1021/ac00130a035</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B48">
            <title>
               <p>Evolving factor analysis in gas chromatography/mass spectrometry - a feasibility study</p>
            </title>
            <aug>
               <au>
                  <snm>Roach</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Guilhaus</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Org Mass Spectrom</source>
            <pubdate>1992</pubdate>
            <volume>27</volume>
            <fpage>1071</fpage>
            <lpage>1076</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1002/oms.1210271017</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B49">
            <title>
               <p>Analysis of volatile fractions of Schisandra chinensis (Turcz.) Baill. using GC-MS and chemometric resolution</p>
            </title>
            <aug>
               <au>
                  <snm>Li</snm>
                  <fnm>XN</fnm>
               </au>
               <au>
                  <snm>Cui</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Song</snm>
                  <fnm>YQ</fnm>
               </au>
               <au>
                  <snm>Z</snm>
                  <fnm>LY</fnm>
               </au>
               <au>
                  <snm>Chau</snm>
                  <fnm>FT</fnm>
               </au>
            </aug>
            <source>Phytochem Anal</source>
            <pubdate>2003</pubdate>
            <volume>14</volume>
            <fpage>23</fpage>
            <lpage>33</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1002/pca.674</pubid>
                  <pubid idtype="pmpid" link="fulltext">12597252</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B50">
            <title>
               <p>Determination in gas chromatography-mass spectrometry data of mass spectra free of background and neighboring substance contributions</p>
            </title>
            <aug>
               <au>
                  <snm>Blaisdell</snm>
                  <fnm>BE</fnm>
               </au>
               <au>
                  <snm>Sweeley</snm>
                  <fnm>CC</fnm>
               </au>
            </aug>
            <source>Anal Chemica Acta</source>
            <pubdate>1980</pubdate>
            <volume>117</volume>
            <fpage>1</fpage>
            <lpage>15</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1016/0003-2670(80)87001-2</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B51">
            <title>
               <p>Multichannel detection and numerical resolution of overlapping chromatographic peaks</p>
            </title>
            <aug>
               <au>
                  <snm>Knorr</snm>
                  <fnm>FJ</fnm>
               </au>
               <au>
                  <snm>Thorsheim</snm>
                  <fnm>HR</fnm>
               </au>
               <au>
                  <snm>Harris</snm>
                  <fnm>JM</fnm>
               </au>
            </aug>
            <source>Anal Chem</source>
            <pubdate>1981</pubdate>
            <volume>53</volume>
            <fpage>821</fpage>
            <lpage>825</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1021/ac00229a019</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B52">
            <title>
               <p>Spectrum reconstruction in GC/MS. The robustness of the solution found with alternating regression</p>
            </title>
            <aug>
               <au>
                  <snm>Karjalainen</snm>
                  <fnm>EJ</fnm>
               </au>
            </aug>
            <source>Scientific Computing and Automation</source>
            <publisher>Amsterdam: Elsevier Science Publishers</publisher>
            <editor>Karjalainen EJ</editor>
            <pubdate>1990</pubdate>
            <fpage>477</fpage>
            <lpage>488</lpage>
         </bibl>
         <bibl id="B53">
            <title>
               <p>Gas chromatography-mass spectrometry and chemometric resolution applied to the determination of essential oils in Cortex cinnamomi</p>
            </title>
            <aug>
               <au>
                  <snm>Gong</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Liang</snm>
                  <fnm>YZ</fnm>
               </au>
               <au>
                  <snm>Xu</snm>
                  <fnm>QS</fnm>
               </au>
               <au>
                  <snm>Chau</snm>
                  <fnm>FT</fnm>
               </au>
            </aug>
            <source>J Chromatogr A</source>
            <pubdate>2001</pubdate>
            <volume>905</volume>
            <fpage>193</fpage>
            <lpage>205</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S0021-9673(00)00976-6</pubid>
                  <pubid idtype="pmpid">11206787</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B54">
            <title>
               <p>Global analysis of multiple gas chromatography-mass spectrometry (GC/MS) data sets: A method for resolution of co-eluting components with comparison to MCR-ALS</p>
            </title>
            <aug>
               <au>
                  <snm>van Stokkum</snm>
                  <fnm>IHM</fnm>
               </au>
               <au>
                  <snm>Mullen</snm>
                  <fnm>KM</fnm>
               </au>
               <au>
                  <snm>V</snm>
                  <fnm>MV</fnm>
               </au>
            </aug>
            <source>Chemometrics and Intelligent Laboratory Systems</source>
            <pubdate>2009</pubdate>
            <volume>95</volume>
            <fpage>150</fpage>
            <lpage>163</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1016/j.chemolab.2008.10.004</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B55">
            <title>
               <p>Trends in automatic data processing</p>
            </title>
            <aug>
               <au>
                  <snm>Chapman</snm>
                  <fnm>JR</fnm>
               </au>
            </aug>
            <source>Int J Mass Spectrom Ion Phys</source>
            <pubdate>1982</pubdate>
            <volume>45</volume>
            <fpage>207</fpage>
            <lpage>218</lpage>
         </bibl>
         <bibl id="B56">
            <aug>
               <au>
                  <snm>Giddings</snm>
                  <fnm>JC</fnm>
               </au>
            </aug>
            <source>Dynamics of Chromatography: Principles and Theory</source>
            <publisher>New York: Marcel Dekker</publisher>
            <pubdate>1965</pubdate>
         </bibl>
         <bibl id="B57">
            <title>
               <p>Mathematical functions for the representation of chromatographic peaks</p>
            </title>
            <aug>
               <au>
                  <snm>Di Marco</snm>
                  <fnm>VB</fnm>
               </au>
               <au>
                  <snm>Bombi</snm>
                  <fnm>GC</fnm>
               </au>
            </aug>
            <source>J Chromatogr A</source>
            <pubdate>2001</pubdate>
            <volume>931</volume>
            <fpage>1</fpage>
            <lpage>30</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S0021-9673(01)01136-0</pubid>
                  <pubid idtype="pmpid">11695512</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B58">
            <title>
               <p>Comparative evaluation of software for deconvolution of metabolomics data based on GC-TOF-MS</p>
            </title>
            <aug>
               <au>
                  <snm>Lu</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Liang</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Dunn</snm>
                  <fnm>WB</fnm>
               </au>
               <au>
                  <snm>Shen</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Kell</snm>
                  <fnm>DB</fnm>
               </au>
            </aug>
            <source>Trends in Anal Chem</source>
            <pubdate>2008</pubdate>
            <volume>27</volume>
            <fpage>215</fpage>
            <lpage>227</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1016/j.trac.2007.11.004</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B59">
            <aug>
               <au>
                  <snm>Bracewell</snm>
                  <fnm>B</fnm>
               </au>
            </aug>
            <source>The Fourier transform and its applications</source>
            <publisher>New York: McGraw-Hill</publisher>
            <pubdate>1999</pubdate>
         </bibl>
         <bibl id="B60">
            <title>
               <p>A nove approach to the retrieval of the mass spectrum of a mixture</p>
            </title>
            <aug>
               <au>
                  <snm>Gan</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Liang</snm>
                  <fnm>YZ</fnm>
               </au>
            </aug>
            <source>Anal Sci</source>
            <pubdate>2000</pubdate>
            <volume>16</volume>
            <fpage>603</fpage>
            <lpage>607</lpage>
            <xrefbib>
               <pubid idtype="doi">10.2116/analsci.16.603</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B61">
            <title>
               <p>Extraction of mass spectra and chromatographic profiles from overlapping GC/MS signal with background</p>
            </title>
            <aug>
               <au>
                  <snm>Shao</snm>
                  <fnm>X</fnm>
               </au>
               <au>
                  <snm>Wang</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Wang</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Su</snm>
                  <fnm>Q</fnm>
               </au>
            </aug>
            <source>Anal Chem</source>
            <pubdate>2004</pubdate>
            <volume>76</volume>
            <issue>17</issue>
            <fpage>5143</fpage>
            <lpage>5148</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1021/ac035521u</pubid>
                  <pubid idtype="pmpid" link="fulltext">15373454</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B62">
            <title>
               <p>Immune algorithms in analytical chemistry</p>
            </title>
            <aug>
               <au>
                  <snm>Shao</snm>
                  <fnm>X</fnm>
               </au>
               <au>
                  <snm>Yu</snm>
                  <fnm>Z</fnm>
               </au>
               <au>
                  <snm>Sun</snm>
                  <fnm>L</fnm>
               </au>
            </aug>
            <source>Trends in Anal Chem</source>
            <pubdate>2003</pubdate>
            <volume>22</volume>
            <fpage>59</fpage>
            <lpage>69</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1016/S0165-9936(03)00205-X</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B63">
            <title>
               <p>Independent component analysis, A new concept?</p>
            </title>
            <aug>
               <au>
                  <snm>Comon</snm>
                  <fnm>P</fnm>
               </au>
            </aug>
            <source>Signal Proc</source>
            <pubdate>1994</pubdate>
            <volume>36</volume>
            <fpage>287</fpage>
            <lpage>314</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1016/0165-1684(94)90029-9</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B64">
            <title>
               <p>Chemical noise in Mass Spectrometry</p>
            </title>
            <aug>
               <au>
                  <snm>Busch</snm>
                  <fnm>KL</fnm>
               </au>
            </aug>
            <source>Spectroscopy</source>
            <pubdate>2002</pubdate>
            <volume>17</volume>
            <issue>10</issue>
            <fpage>32</fpage>
            <lpage>37</lpage>
         </bibl>
         <bibl id="B65">
            <title>
               <p>TagFinder for the quantitative analysis of gas chromatography-mass spectrometry (GC-MS)-based metabolite profiling experiments</p>
            </title>
            <aug>
               <au>
                  <snm>Luedemann</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Strassburg</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Erban</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Kopka</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>2008</pubdate>
            <volume>24</volume>
            <fpage>732</fpage>
            <lpage>737</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/bioinformatics/btn023</pubid>
                  <pubid idtype="pmpid" link="fulltext">18204057</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B66">
            <title>
               <p>Deconvolution gas chromatography/mass spectrometry of urinary organic acids-potential for pattern recognition and automated identification of metabolic disorders</p>
            </title>
            <aug>
               <au>
                  <snm>Halket</snm>
                  <fnm>JM</fnm>
               </au>
               <au>
                  <snm>Przyborowska</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Stein</snm>
                  <fnm>SE</fnm>
               </au>
               <au>
                  <snm>Mallard</snm>
                  <fnm>WG</fnm>
               </au>
               <au>
                  <snm>Down</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Chalmers</snm>
                  <fnm>RA</fnm>
               </au>
            </aug>
            <source>Rapid Commun Mass Spectrom</source>
            <pubdate>1999</pubdate>
            <volume>13</volume>
            <fpage>279</fpage>
            <lpage>84</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1002/(SICI)1097-0231(19990228)13:4&lt;279::AID-RCM478>3.0.CO;2-I</pubid>
                  <pubid idtype="pmpid" link="fulltext">10097403</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B67">
            <title>
               <p>Comparison of gas chromatography-pulsed flame photometric detection-mass spectrometry, automated mass spectral deconvolution and identification system and gas chromatography-tandem mass spectrometry as tools for trace level detection and identification</p>
            </title>
            <aug>
               <au>
                  <snm>Dagan</snm>
                  <fnm>S</fnm>
               </au>
            </aug>
            <source>J Chromatogr A</source>
            <pubdate>2000</pubdate>
            <volume>4</volume>
            <fpage>229</fpage>
            <lpage>247</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1016/S0021-9673(99)01138-3</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B68">
            <title>
               <p>A strategy for identifying differences in large series of metabolomic samples analyzed by GC/MS</p>
            </title>
            <aug>
               <au>
                  <snm>Jonsson</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Gullberg</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Nordstr&#246;m</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Kusano</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Kowalczyk</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Sj&#246;str&#246;m</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Moritz</snm>
                  <fnm>T</fnm>
               </au>
            </aug>
            <source>Anal Chem</source>
            <pubdate>2004</pubdate>
            <volume>76</volume>
            <fpage>1738</fpage>
            <lpage>1745</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1021/ac0352427</pubid>
                  <pubid idtype="pmpid" link="fulltext">15018577</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B69">
            <title>
               <p>Interactive self-modeling mixture analysis</p>
            </title>
            <aug>
               <au>
                  <snm>Windig</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Guilment</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>Anal Chem</source>
            <pubdate>1991</pubdate>
            <volume>63</volume>
            <fpage>1425</fpage>
            <lpage>1432</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1021/ac00014a016</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B70">
            <title>
               <p>Application of SIMPLISMA for the assessment of peak purity in liquid chromatography with diode array detection</p>
            </title>
            <aug>
               <au>
                  <snm>S&#225;nchez</snm>
                  <fnm>FC</fnm>
               </au>
               <au>
                  <snm>Massart</snm>
                  <fnm>DL</fnm>
               </au>
            </aug>
            <source>Anal Chim Acta</source>
            <pubdate>1994</pubdate>
            <volume>298</volume>
            <fpage>331</fpage>
            <lpage>339</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1016/0003-2670(94)00283-5</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B71">
            <title>
               <p>Predictive metabolite profiling applying hierarchical multivariate curve resolution to GC-MS data-a potential tool for multi-parametric diagnosis</p>
            </title>
            <aug>
               <au>
                  <snm>Jonsson</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Johansson</snm>
                  <fnm>ES</fnm>
               </au>
               <au>
                  <snm>Wuolikainen</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Lindberg</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Schuppe-Koistinen</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Kusano</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Sj&#246;str&#246;m</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Trygg</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Moritz</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Antti</snm>
                  <fnm>H</fnm>
               </au>
            </aug>
            <source>J Proteome Res</source>
            <pubdate>2006</pubdate>
            <volume>5</volume>
            <fpage>1407</fpage>
            <lpage>1414</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1021/pr0600071</pubid>
                  <pubid idtype="pmpid" link="fulltext">16739992</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B72">
            <title>
               <p>Matlab - The MathWorks</p>
            </title>
            <url>http://www.mathworks.com/products/matlab/</url>
         </bibl>
         <bibl id="B73">
            <title>
               <p>GNU Octave homepage</p>
            </title>
            <url>http://www.gnu.org/software/octave/</url>
         </bibl>
         <bibl id="B74">
            <title>
               <p>The R project</p>
            </title>
            <url>http://www.r-project.org/</url>
         </bibl>
         <bibl id="B75">
            <title>
               <p>Python</p>
            </title>
            <url>http://www.python.org/</url>
         </bibl>
         <bibl id="B76">
            <title>
               <p>Perl</p>
            </title>
            <url>http://www.perl.org/</url>
         </bibl>
         <bibl id="B77">
            <title>
               <p>Java</p>
            </title>
            <url>http://java.sun.com/</url>
         </bibl>
         <bibl id="B78">
            <title>
               <p>A common open representation of mass spectrometry data and its application to proteomics research</p>
            </title>
            <aug>
               <au>
                  <snm>Pedrioli</snm>
                  <fnm>PG</fnm>
               </au>
               <au>
                  <snm>Eng</snm>
                  <fnm>JK</fnm>
               </au>
               <au>
                  <snm>Hubley</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Vogelzang</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Deutsch</snm>
                  <fnm>EW</fnm>
               </au>
               <au>
                  <snm>Raught</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Pratt</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Nilsson</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Angeletti</snm>
                  <fnm>RH</fnm>
               </au>
               <au>
                  <snm>Apweiler</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Cheung</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Costello</snm>
                  <fnm>CE</fnm>
               </au>
               <au>
                  <snm>Hermjakob</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Huang</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Julian</snm>
                  <fnm>RK</fnm>
               </au>
               <au>
                  <snm>Kapp</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Mccomb</snm>
                  <fnm>ME</fnm>
               </au>
               <au>
                  <snm>Oliver</snm>
                  <fnm>SG</fnm>
               </au>
               <au>
                  <snm>Omenn</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Paton</snm>
                  <fnm>NW</fnm>
               </au>
               <au>
                  <snm>Simpson</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Smith</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Taylor</snm>
                  <fnm>CF</fnm>
               </au>
               <au>
                  <snm>Zhu</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Aebersold</snm>
                  <fnm>R</fnm>
               </au>
            </aug>
            <source>Nature Biotechnology</source>
            <pubdate>2004</pubdate>
            <volume>22</volume>
            <fpage>1459</fpage>
            <lpage>1466</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/nbt1031</pubid>
                  <pubid idtype="pmpid" link="fulltext">15529173</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B79">
            <title>
               <p>XCMS: processing mass spectrometry data for metabolite profiling using nonlinear peak alignment, matching, and identification</p>
            </title>
            <aug>
               <au>
                  <snm>Smith</snm>
                  <fnm>CA</fnm>
               </au>
               <au>
                  <snm>Want</snm>
                  <fnm>EJ</fnm>
               </au>
               <au>
                  <snm>O'Maille</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Abagyan</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Siuzdak</snm>
                  <fnm>G</fnm>
               </au>
            </aug>
            <source>Anal Chem</source>
            <pubdate>2006</pubdate>
            <volume>78</volume>
            <fpage>779</fpage>
            <lpage>787</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1021/ac051437y</pubid>
                  <pubid idtype="pmpid" link="fulltext">16448051</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B80">
            <title>
               <p>MZmine: toolbox for processing and visualization of mass spectrometry based molecular profile data</p>
            </title>
            <aug>
               <au>
                  <snm>Katajamaa</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Miettinen</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Ore&#353;i&#269;</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>2006</pubdate>
            <volume>22</volume>
            <fpage>634</fpage>
            <lpage>636</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/bioinformatics/btk039</pubid>
                  <pubid idtype="pmpid" link="fulltext">16403790</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
      </refgrp>
   </bm>
</art>

