Journal of Applied Physiology
HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS
 QUICK SEARCH:   [advanced]


     


J Appl Physiol 86: 359-376, 1999;
8750-7587/99 $5.00
This Article
Right arrow Abstract Freely available
Right arrow Full Text (PDF) Free
Right arrow Submit a response
Right arrow Alert me when this article is cited
Right arrow Alert me when eLetters are posted
Right arrow Alert me if a correction is posted
Right arrow Citation Map
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Citing Articles
Right arrow Citing Articles via HighWire
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Small, M.
Right arrow Articles by Stick, S.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Small, M.
Right arrow Articles by Stick, S.
Vol. 86, Issue 1, 359-376, January 1999

Is breathing in infants chaotic? Dimension estimates for respiratory patterns during quiet sleep

M. Small1, K. Judd1, M. Lowe2, and S. Stick2

1 Centre for Applied Dynamics and Optimization, Department of Mathematics, University of Western Australia, Nedlands, Western Australia 6907; and 2 Department of Respiratory Medicine, Princess Margaret Hospital for Children, Subiaco, Western Australia 6008, Australia

    ABSTRACT
Top
Abstract
Introduction
Appendix
Appendix
Appendix
Appendix
References

We describe an analysis of dynamic behavior apparent in times-series recordings of infant breathing during sleep. Three principal techniques were used: estimation of correlation dimension, surrogate data analysis, and reduced linear (autoregressive) modeling (RARM). Correlation dimension can be used to quantify the complexity of time series and has been applied to a variety of physiological and biological measurements. However, the methods most commonly used to estimate correlation dimension suffer from some technical problems that can produce misleading results if not correctly applied. We used a new technique of estimating correlation dimension that has fewer problems. We tested the significance of dimension estimates by comparing estimates with artificial data sets (surrogate data). On the basis of the analysis, we conclude that the dynamics of infant breathing during quiet sleep can best be described as a nonlinear dynamic system with large-scale, low-dimensional and small-scale, high-dimensional behavior; more specifically, a noise-driven nonlinear system with a two-dimensional periodic orbit. Using our RARM technique, we identified the second period as cyclic amplitude modulation of the same period as periodic breathing. We conclude that our data are consistent with respiration being chaotic.

control of breathing; periodic breathing; dynamical systems; chaos; surrogate data techniques

    INTRODUCTION
Top
Abstract
Introduction
Appendix
Appendix
Appendix
Appendix
References

THIS PAPER DESCRIBES and summarizes a study of infant breathing by using data-analysis techniques derived from dynamic systems theory (DST). Such techniques have been useful for examining other complex physiological rhythms such as heart rate (7, 46, 47, 63, 67), electroencephalogram (2, 24, 26, 29, 36, 37), parathyroid hormone secretion (31), and opticokinetic nystagmus (45) and can distinguish variations that are random from those that are deterministic ("chaotic").

Any model that describes the control of respiration must account for two important behaviors, rhythm and pattern generation. There are two distinct approaches to the problem of rhythm: the first requires the existence of discrete "pacemaker" cells with intrinsic activity that drives other respiratory neurons. The output of various respiratory centers or pools of motoneurons is then organized by a pattern generator. A second approach implies that networks of cells with oscillatory behavior interact in a complex way to produce respiratory rhythms, which are either further organized by a pattern generator or might be self-organizing (11). The first model is, in effect, a linearly coupled model with pattern generation dependent on feedback mechanisms from the various components of the respiratory system, whereas the second is more likely to be complex. Advances in neurobiology have allowed recordings to be made from individual neurons and groups of neurons in the brain. Using these techniques, various studies have demonstrated that the concept of discrete respiratory centers made up of neurons with specific functions defined by the nature of a particular "center" is obsolete (11). Whereas there is organization of neurons into functional networks or pools, these are not necessarily anatomically discrete. Also, there are conflicting data in regard to the presence of a specific pattern generator. Given the complexity of the connections between the various groups of oscillating, respiratory-related neurons and the capacity for interactions between simple oscillating systems to produce complex behavior, we have argued that information about the organization of respiratory control can be determined by using DST. In essence, the argument that there is a simple pattern generator that coordinates the output from various respiratory centers is unnecessary if the output from interacting networks is dynamic and self-organizing. To examine such a hypothesis, respiration must first be adequately shown to be chaotic, then examined further to describe its dynamic structure.

Most studies of the dynamic behavior of biological systems have used fractal dimension estimation to establish that a system's behavior is chaotic or to classify distinct types of behavior by their complexity. Recent studies have suggested that respiration in humans is chaotic. If that is the case, then techniques derived from DST should allow the dynamic structure of respiratory behavior to be better described, thus improving our understanding of the control of breathing. However, these earlier studies have important limitations. Most studies have used the Grassberger and Procaccia algorithm (14, 15) for estimating fractal dimension, which is simple and easy to implement. Unfortunately, it is now recognized (19, 33) that this algorithm has some technical problems that can lead to misinterpretations of data. The most serious problems occur with small data sets or when the system incorporates a substantial noise component. The studies described below each suffer these limitations to a greater or lesser extent and none examined respiratory patterns in quiet sleep when breathing is most regular.

Donaldson (9) used DST to examine respiratory behavior in humans. He studied resting adults and, by estimating Lyapunov exponents, concluded that resting respiration is chaotic. However, this relatively simple approach could not distinguish a nonlinear dynamic system from linearly filtered noise.

Pilgram et al. (30) studied the dimensional complexity in the respiratory traces of infants during rapid-eye-movement (REM) sleep. They concluded that breathing during REM sleep is chaotic but did not study patterns of breathing during quiet sleep. In reaching their conclusion, Pilgram et al. also employed the method of surrogate data. Dimension-estimation algorithm used by Pilgram et al. failed to give reliable estimates for their noise-driven surrogates, and so the authors inferred that the control of respiration is possibly deterministic.

In another set of studies, Sammon and colleagues (38-42) examined dynamic respiratory behavior in rats. From their observations they concluded that, in anesthetized vagotomized rats, the respiratory system behaves as an oscillator with a single degree of freedom. With the vagus intact, however, respiratory behavior was more complex, exhibiting low-order chaos, which, the authors speculated, was due to feedback from various types of pulmonary afferent activity.

In this paper, we test for cyclic modulation in the amplitude of respiration. Cyclic fluctuation in the amplitude of respiration indicates the presence of additional complexity in the respiratory system and is consistent with the results of our correlation dimension calculations. Other authors (see, e.g., Ref. 64) have identified cyclic behavior in breath sizes. Waggener and colleagues (64) identify cyclic variation in the breath sizes of resting adults at simulated altitude.

The study reported here employs a new algorithm to determine fractal dimension that is technically more complex but is in practice more reliable, more robust under the restrictions of finite data, and less prone to misinterpretation. Estimates of fractal dimension were used as an aid to identify the dynamic system that produced the data we have measured.

From dimension estimations we conclude that the dynamics of breathing during quiet sleep are consistent with a large-scale, low-dimensional system with a substantial small-scale, high-dimensional component, i.e., a periodic orbit with a few (perhaps 2 or 3) degrees of freedom supplemented by smaller, more complex deterministic fluctuations.

The nature of the low-dimensional system was investigated further by constructing surrogate data, which enabled us to test whether the dynamics were consistent with linearly filtered noise or with a nonlinear dynamic system. Our results show clearly that, in almost all cases, the dynamics are best described as a low-dimensional, nonlinear dynamic system being driven by a high-dimensional noise source. In those cases when such a model is inconsistent with the data, the measured data have strong indications of nonstationarity; that is, the breathing patterns changed during the recording (for example, a sudden switch to periodic breathing occurred).

To describe a complex dynamic system as chaotic may not be helpful when the components that produce the structure cannot be described. Understanding these components is more important than simply classifying the system as chaotic. However, before the mechanism can be understood, it is important to determine basic properties of the system: Is it dynamic? Is it nonlinear? Is it chaotic?

We, therefore, investigated the structure of the dynamics we described in greater detail. We used a new form of linear modeling to expose a possible source of our multidimensional periodic orbit. By applying a reduced autoregressive modeling (RARM) algorithm to some data from sleeping infants we observe cyclic amplitude modulation (CAM) indicative of substantial structure in breath-to-breath variability. These calculations lead us to conclude that respiration in quiet sleep is most probably controlled by multidimensional oscillator supplemented by small-scale, complex (chaotic) behavior. Moreover, our results are inconsistent with a noise-driven, one-dimensional model of eupnea (or any other model that does not include substantial structure in breath-to-breath variability).

After a brief introduction to the new dimension estimation algorithm, we describe the experimental methodology, including a description of our surrogate data-generation methods and our RARM algorithm. Finally, we discuss the dimension calculations and the results of the hypothesis testing by using the surrogate data sets.

    METHODS

Using standard noninvasive inductive-plethysmography techniques, we obtained a measurement proportional to the cross-sectional area of the chest or abdomen, which is a gauge of the lung volume. The present study collected measurements of the cross-sectional area of the abdomen of infants during natural sleep. The study was approved by the Princess Margaret Hospital Ethics Committee.

Subjects

Ten healthy infants were studied at 2 mo of age in the sleep laboratory at Princess Margaret Hospital. Data from 15 infants (including the 10 studied at 2 mo) between 1 and 6 mo of age were recorded and are used in our calculations to detect CAM.

Data Collection

The unfiltered analog signal from a NIMS Respitrace+ [Non-Invasive Monitoring systems (NIMS); trading through Sensormedics, Yorba Linda, CA] inductance plethysmograph was passed through a direct-current amplifier and 12-bit analog-to-digital converter (sampling at 50 Hz). The digital data were recorded in ASCII format by using Lab-dat and Anadat software packages (RHT-Infodat, Montreal, Quebec, Canada) installed on an IBM 286 microcomputer. This ASCII data were then transferred to Unix workstations at the University of Western Australia for analysis with the use of Matlab and C programs.

The only practical limitation on the length of time for which data could be collected is the period during which the infant remains asleep and still. The cross-sectional area of the lung varies with the position of the infant. However, in this study, we were interested only in the variation due to the breathing, and so we have been careful to avoid artifacts due to changes in position or band slippage. We have made observations of up to 2 h that are free from significant movement artifacts, although, typically, observations are in the range of 5-30 min. All 27 observations used to calculate dimension are between 240 and 360 s; those used to identify CAM are between 400 and 1,400 s.

In contrast to the study by Pilgram and colleagues (30) that examined breathing in REM sleep, we have studied infants in quiet sleep. From measurements of electroencephalogram, electromyogram, and electrooculogram, sleep stage was determined by using standard polysomnographic criteria (6). During quiet sleep, breathing often appears relatively regular. The possibly chaotic features of most interest are the small variations from this regular periodic behavior. Because we wished to observe such fine detail, we did not filter signals. The analog output of the respiratory plethysmograph (operating in its direct-current mode) has no built-in filtering. Filtering methods, such as linear filters and singular-value decomposition methods (30), can remove some features that we wished to observe. Furthermore, filtering (even to avoid aliasing) has been shown in some cases to lead to erroneous identification of chaos (25, 28, 34).

The abdominal signal is not necessarily proportional to lung volume. Takens' embedding theorem (56) (and, therefore, the methods of this paper; see APPENDIX A, Embedding) only requires a diffeomorphism (a smooth function) of a measurement of the system. Moreover, present methods are not capable of dealing well with multichannel data, and, therefore, use of both rib and abdominal signals to approximate actual lung volume is difficult. Of the available measurements, we found that the abdominal cross section was the easiest to measure experimentally.

The 27 observations used to calculate dimension were selected based on sleep state (quiet stage 3-4 sleep) and then on the basis of sufficient stationarity and a minimum of 4 min in length. From each of these observations, 240 s of stationary data (the 240 s that had the most stationary moving average) were used to calculate dimension. Data for RARM calculations were selected based on being at least 400 s in length and taken during uninterrupted quiet sleep.

    DATA ANALYSIS

In this study, we employed three main analysis methods: correlation dimension estimation, surrogate data analysis, and RARM. This section provides a description of these methods. The mathematical detail is deferred to APPENDIXES A-D. First, we discuss generalized dimension and correlation dimension estimation, and then we provide an overview of surrogate-data techniques and describe RARM procedures.

Glossary

An autoregressive process of order n

CN(epsilon )

Correlation function

dc, dc(epsilon 0)

Correlation dimension

de, n

Embedding dimension

I(X )

An indicator function

MDL

Minimum description length

N

Length of a data set

Pj, Pj(·)

An orthogonal projection

Z+

Set of positive integers

epsilon

Length scale

epsilon 0

Observation scale

tau

Embedding lag

parallel ·parallel

Euclidean norm---the usual distance function

Dimension Estimation

We are accustomed to thinking of real-world objects as one, two, or three dimensional. However, there exist complex mathematical objects, called fractals, that have noninteger dimension, a so-called "fractal dimension." Many real-world phenomena, in particular chaotic dynamic systems, can be observed to have properties of a fractal, including a noninteger dimension. A meaningful definition of fractal dimension comes from a generalization, or extension, of well-known properties of integer dimension objects. For a more detailed discussion of generalized fractal dimension and estimation of correlation dimension dc, see APPENDIX A.

Most applications of correlation dimension to physiological sciences have utilized the Grassberger and Procaccia algorithm (14, 15). However, this paper employs a new algorithm, which is technically more complex but is in practice more reliable and less prone to misinterpretation. Unlike previous estimation methods, this new algorithm recognizes that the dimension of an object (its structural complexity) may vary depending on how closely you examine it. Hence, the value of the estimate of correlation dimension may change with scale. It therefore offers a more informative and accurate estimate of dimension.

Computing correlation dimension dc as a function of scale dc(epsilon 0) can tell us much more about the structure of an object; for example, it can indicate the presence of large scale "periodic" motion and simultaneously detect smaller scale, higher dimensional chaotic motion. Quoting a single number as the correlation dimension of a data set ignores much of this information; in many respects, it produces an "average dimension." Plots of dimension as a function of scale are particularly important when complex physiological behavior is studied because they yield far more information than a single estimate at a fixed scale.

The estimation algorithm. The estimation algorithm used for the calculations in this paper is described in detail by Judd (19, 20), and an alternative treatment may be found in (for example) Ref. 17. The accuracy of this algorithm has been independently verified by Galka and colleagues (13). One important advantage of the new method is that it is possible to calculate error bars for dimension estimates. The confidence intervals on the dimension that the algorithm provides are dependent on the length of the time series.

For each time series, the dimension was calculated for time-delay embedding (see APPENDIX A, Embedding) in two, three, four, five, seven, and nine dimensions, a far greater range than necessary, but one which encompasses suitable values of embedding dimension suggested by false nearest neighbor methods (APPENDIX A, Embedding dimension).

Hence, for each data set, our dimension estimation methods produced a graph with many lines on it. Each line on the graph is the dimension estimate for the same data set with a different embedding dimension. These lines are a plot of the change in correlation dimension (vertical axis) with scale (horizontal axis). Scale is calculated as the logarithm of "viewing scale," so moving to the right on a plot indicates increasing scale. The right-hand end of the plots is the estimate of dimension at the largest scale (the most obvious features), whereas the left-hand end is the dimension estimate at the smallest scale (the finest details).

Phenomenological Models

Throughout this paper, the term "model" is used in the sense of a phenomenological model and not in the sense of a mechanistic model. In this section, we briefly highlight this distinction.

Surrogate-data techniques involve (implicitly or explicitly) building a model from data, as do RARM procedures. It is important to emphasize that the types of models we are describing are phenomenological rather than mechanistic. That is, these models are not described as a set of equations that have been motivated by physical laws or physiological insight. Rather, the models we describe here are fitted to experimental data to predict future behavior of the system or to simulate the system on a computer and produce an output indistinguishable from observations of the actual system.

Usually, a phenomenological model is selected from a broad abstract class of models by fitting data using some statistical criteria, that is, to find the most probable model within the class. Nonlinear models and reduced autoregressive models are particular examples of broad abstract classes. Mechanistic models include, for example, a model of the lungs as bellows or a physical model of blood-gas concentrations. In a mechanistic model, parameters and variables represent physical quantities, for example, the elasticity of the lung or blood oxygen concentration. In a phenomenological model, this is not the case. However, phenomenological models can provide clear and useful information about a system, such as the periodicities of the system.

Surrogate Data

Estimating the dimension of the data set gave valuable information about the geometric structure of these data, but dimension estimation alone is not enough to give a sure indication of the presence of low-dimensional chaos or even nonlinear dynamics. Any experimentally obtained data will include some observational noise and, when added to a deterministic linear process, can produce dimension estimates not dissimilar to the results of our calculations.

To determine whether our results indicate the presence of anything more complicated than a noisy linear system, we employed the surrogate-data methods described by Theiler et al. (59). The principle of surrogate data is the following. One first assumes that the data come from some specific class of dynamic system, possibly fitting a parametric (phenomenological) model to the data. One then generates surrogate data from this hypothetical system and calculates various statistics of the surrogate data and original data. The surrogate data give the expected distribution of statistical values, and one can check that the original has a typical value. If the original data have atypical statistics, then we reject the hypothesis that the system that generated the original data is of the assumed class. One always begins with simple assumptions and progresses to more sophisticated models if the data are inconsistent with the surrogate data.

We expect our data to be most consistent with some type of nonlinear dynamic system. Before considering this type of surrogate, it is necessary to determine whether a simpler description of the data would be sufficient. To do this, we compared our data with surrogates generated by the traditional (linear) methods (see APPENDIX B, Traditional Surrogate Data). Many studies in biological sciences have employed these traditional surrogate methods (2, 31, 42, 63, 67). These methods determine whether experimental data are significantly different from specific (broad) categories of linear systems. In addition to these linear surrogate tests, we applied a new, more complicated, nonlinear surrogate test (50, 53). This method was used to determine whether the data are distinguishable from these generated by a broad class of nonlinear models (see APPENDIX B, Noise-Driven Nonlinear System Surrogates, and APPENDIX C).

Reduced Autoregressive Models

In this section, we briefly describe the rationale of the final mathematical tool we utilize: RARM. We applied these methods to time series consisting only of the size of successive breaths (the difference between peak and trough values). From this we deduced the period of CAM present in the original time series. One could also examine the length of successive breaths to determine the presence of cyclic durational modulation. However, in this paper, we focus on CAM.

Our new method extends the traditional autoregressive (AR) model of order n [AR(n)], which predicts the next value in a time series as a weighted average of the last n values. We consider instead a RARM where any past values may be used to predict the upcoming value but only those that are important are used. To determine which past values are important, we employed Rissanen's minimum description length (MDL) criterion (35) using a modeling procedure described by Judd and Mees (21, 22); our implementation of these methods has been presented elsewhere (55) and is discussed elsewhere (48, 54). A computer software implementation of these techniques is available online (49) or from the first author. A brief description of the mathematical methods is presented in APPENDIX D. For now, let us assume that RARM can produce a model consisting only of those previous values that are useful in predicting future values. This is not necessarily a particularly good model in terms of prediction---it is only an approximation to the breath-to-breath dynamics that we utilize to extract important information. Using this information, we deduce the period of quasi-periodic behavior in the time series from the temporal separation of the previous values. Hence, RARM can identify the period of CAM in much the same way as autocorrelation may, except our methods prove to be more sensitive and more discriminatory. Our method involves automated application of statistical procedures to time series and can be applied even when amplitude modulation is not apparent to the eye.

By reducing the original data to a breath-to-breath time series, we effectively rescale the time axes so that each breath is of equal length. However, CAM on a breath-to-breath basis does not (necessarily) suppress time-dependent dynamics. A hypothesized cyclic variation in breath duration may be related to a cyclic variation in breath amplitude (and, hence, evident in the breath amplitude time series). These could essentially be two separate observations of the same periodic behavior. The duration of a single breath is also far more difficult to measure accurately because of (relatively) long flat peaks and troughs.

    RESULTS

We first present our results from applying our dimension estimation algorithm. Next, we describe the results of our surrogate-data and RARM calculations.

Dimension Estimation

The results of the calculations of dc(epsilon 0), as shown in Fig. 1, can be summarized as follows. All calculations fall into two broad categories. Most of the estimates of dc(epsilon 0) produced curves that increase, more or less linearly, with decreasing scale log epsilon 0, but some showed an initial decrease in dimension before increasing with decreasing scale (Fig. 1, subjects 1 and 4). For any particular data set, it was generally found that the graph of dc(epsilon 0) was shifted to higher dimensions as the embedding dimension was increased, although the shape of the graph varied little with changes in embedding dimension. In nearly all cases, the dimension estimates at the largest scale lay between two and three.


View larger version (27K):
[in this window]
[in a new window]
 
Fig. 1.   Correlation dimension estimates for 1 representative data set from each of the 10 subjects. Any data sets that produced dimension estimates dissimilar to those illustrated here are discussed in text (see Dimension Estimation). Plots are of scale (-log epsilon 0) against correlation dimension, with confidence intervals shown as dotted lines (often indistinguishable from the estimate). Correlation dimension estimates were produced for embedding dimensions of 2, 3, 4, 5, 7, and 9 for all data sets, except for subjects 2, 4, and 7. Subjects 4 and 7 failed to produce an estimate for the 9 dimensional embedding. Subject 2 did not produce an estimate when embedded in 3 or 9 dimensions. All other dimension estimates are illustrated; higher embedding dimension produces larger correlation dimension.

The more or less linear increase in dimension with decreasing scale epsilon 0, and the shift to higher dimensions as the embedding dimension is increased, are both indications that the system, or measurements, have a substantial component of high-dimensional dynamics, or noise, at small-to-moderate scales. The increase of dimension with decreasing scale is an obvious effect of high-dimensional dynamics or noise. The shifting to higher dimensions with increasing embedding dimension occurs because in a higher dimensional embedding the points "move away" from their neighbors and tend to become equidistant from each other, which in effect amplifies, or propagates, the small-scale, high-dimensional properties to large scales. (This effect is related to the counterintuitive fact that spheres in higher dimensions have most of their volume close to their surfaces rather than near their centers, as is the case in two and three dimensions.)

Some of the dimension estimates, particularly in two and three dimensions, produced curves that linearly increased for large-length scales but appeared to level off as length scale decreased. For most of the estimates we have computed, this is the case when the data are embedded in two dimensions. Furthermore, for these embeddings in two-dimensional space, the correlation dimension estimate seemed to approach two. This indicates that as we look closer at the data (that is, at a smaller length scale), the data appear to fill up all of our embedding space. For many of the dimension estimates (Fig. 1, subjects 7 and 9), the embedding in three dimensions also leveled at values slightly less than three. This behavior can be attributed to an attractor with correlation dimension of ~2.8-2.9. However, it is probably more likely that this too is simply due to the data "filling up" the three-dimensional space. This is consistent with the results of our false nearest neighbor calculations, which suggested that three- or four-dimensional space would be required to successfully embed the data.

There is one particular estimate that appeared to behave quite differently from all the others. Some of the curves of the estimates for subject 8 appeared to increase, decrease, and then increase again. This could indicate that, as we look more closely at the structure, there is some length scale for which the embedding structure seems to be relatively high in dimension, whereas by looking at an even smaller length scale the behavior has significantly lower dimension. These observations are supported by what we can observe directly from the data. This time series includes an episode of periodic breathing---increasing the complexity of the large-scale behavior (see Fig. 2). Similarly, some of the data sets for subject 2 include large sighs, causing the dimension estimate to increase at large scales (see Fig. 3).


View larger version (31K):
[in this window]
[in a new window]
 
Fig. 2.   Dimension estimate for subject 8, one of data sets used in our analysis. Periodic breathing caused the dimension estimates (dimension estimates used embedding dimensions of 2, 3, 4, 5, 7, and 9) at large scale to increase.


View larger version (26K):
[in this window]
[in a new window]
 
Fig. 3.   Dimension estimate for subject 2, one of our data sets along with the dimension estimates (shown are estimate with an embedding dimension of 2, 3, and 4). Note large sighs during recording and the corresponding increase in dimension estimate at moderate scale. Another data set from same infant exhibited similar behavior and produced a similar dimension estimate.

Finally, the remainder of the estimates (e.g., Fig. 1, subjects 1, 2, 4, 6, 7, 8, and 10) behaved in yet another manner. These estimates were approximately constant for a small range of large length scales and gradually increased over small length scales. The estimates at large length scales were generally about two to three, indicating that the large-scale behavior is slightly above two dimensional. The increase in dimension estimate for smaller length scales can again be attributed to either noise or high-dimensional dynamics. However, the scale of "small-scale structure" in the dimension estimates is at a larger scale than the instrumentation noise level. Typically, the smallest scale is loge(epsilon 0approx  -2.5, a scale of ~5% of the attractor (e-3 approx  0.049787 approx  0.05). The digitized signal will typically use at least 10 bits of the analog-to-digital converter (2-10 = 1/1,024 < 0.001); other sources of instrumental error are certainly at levels <5%.

The approximately two-dimensional behavior is probably due to the inspiration-expiration cycle along with the breath-to-breath variation within that cycle. This is easily visualized as the orbit of a point around the surface of a torus. A dimension estimate of two could indicate that the attractor was any two-dimensional surface; the embedded data, however, have an approximately toroidal shape. In this motion, there are two characteristic cycles, first, the motion around the center of the torus and, second, a twisting motion around the surface. Our estimates slightly over two indicate that this behavior is complicated further by some other roughness over the surface of the torus. The shape of such an attractor would very closely resemble the textured surface of a doughnut.

Surrogate-Data Analysis

Dimension estimation gave information about the shape of the dynamic system we are studying. In an attempt to classify this system, we applied surrogate-data techniques. First, we compared breathing dynamics to linear systems. Following this, we compared the breathing dynamics with nonlinear dynamic systems by fitting a type of nonlinear model to the data.

Linear surrogates. By comparing the value of dimension obtained from our data and surrogates consistent with each of these three null hypotheses, we were able to reject all three null hypotheses (see Fig. 4 for an example of such a calculation). These results are summarized in Table 1, which shows the number of SDs between the values of dc(epsilon 0) for data and surrogate, for the value of log (epsilon 0), which gave the greatest difference. This is calculated over the range -2.5 <=  log (epsilon 0<=  -0.5, and for de = 3, 4, 5. Data are from infants at 2 mo of age. The symbol n/a indicates that none of the surrogates produced convergent dimension estimate at any value of epsilon 0. For each data set and each hypothesis test, there are three pairs of numbers. These three pairs of numbers are the results for de = 3, 4, 5, respectively. The first number is the number of SDs by which the mean value of dimension for the surrogates exceeded that for the data. The second number (in parentheses) is the value of log (epsilon 0) for which this occurred.


View larger version (30K):
[in this window]
[in a new window]
 
Fig. 4.   Linear surrogate calculations, an example of surrogate-data calculations for algorithms 0, 1, and 2. Here, we compared correlation dimension estimate for 1 of our data sets (solid line) and 30 surrogates (dotted lines). There is a clear difference between the correlation dimension of the data and that of the surrogates.

                              
View this table:
[in this window]
[in a new window]
 
Table 1.   Results of a linear surrogate test

The work of Pilgram et al. (30) with respiratory traces during REM sleep produced similar observations for a different physiological phenomena. By rejecting these null hypotheses, we may make two important observations. First, the data are not a (monotonic) transformation of linearly filtered noise. Second, correlation dimension alone is sufficient to distinguish between our data and data consistent with these hypotheses.

These results, however comforting, are not particularly surprising. Our data are regular and periodic, and the surrogates' are not (see, e.g., Fig. 5).


View larger version (61K):
[in this window]
[in a new window]
 
Fig. 5.   Surrogate data. Sections of 3 surrogates generated by traditional techniques: algorithms 0, 1, and 2, and a section of a surrogate data set generated from a radial-basis model. Also shown is a section of real data (top) used to generate these surrogates. There are obvious similarities between true data and the nonlinear surrogate, whereas other surrogates are obviously different.

Nonlinear surrogates. For each set of data, we have calculated its correlation dimension. Using a modeling algorithm described elsewhere (21, 22, 51), we constructed a radial-basis model of the data. From this model, we constructed 30 surrogate data sets. The surrogates were embedded in two, three, and four dimensions, by using the same embedding strategy as the true data set. We then calculated the correlation dimension curve for each set of surrogate data.

The results (see Fig. 6) of these calculations fell into two very distinct categories. For many of the data sets, the surrogates very closely resembled the true dimension estimate, whereas for some others the data and the surrogates appeared to be very different. On a closer examination of the time series, it appears that the model failed to produce accurate surrogates only when the data set was significantly nonstationary. Although no data set used in these calculations had an obvious drift or changed sleep state, nonstationarity occurred with sudden change in respiratory behavior (see, e.g., Fig. 2). Hence, when the data were sufficiently stationary (as was the case with 24 of our 27 data sets), the modeling algorithm produced surrogate data that were indistinguishable (according to the method of surrogate data, with respect to correlation dimension) from the true data. Furthermore, the models exhibited a toroidal attractor, as suggested by the correlation dimension estimates.


View larger version (38K):
[in this window]
[in a new window]
 
Fig. 6.   Nonlinear surrogate dimension estimates. Surrogate-data calculations for 2 of our data sets, embedded in 2, 3, and 4 dimensions. First set indicated a close agreement between data and surrogate. Second set of calculations indicated very clear distinction. Hence, the model of first data set is indistinguishable (according to correlation dimension) from a noise-driven periodic orbit, whereas model of the second fails to produce particularly strong similarities. Notice that, for almost any value of epsilon 0, comparison of the value of dc(epsilon 0) of the data and the surrogates would also lead to these conclusions.

CAM Detected by Using RARM

In this section, we present some preliminary results of the detection of CAM in the respiratory traces of infants in quiet sleep. The data used for these calculations are different from those for which the correlation dimension was calculated. The data requirements of this algorithm are moderately large (typically 10 min of recording), calculation of correlation dimension and radial-basis models for such data sets proved prohibitive. Moreover, the two types of models are entirely distinct: RARM is more robust to nonstationarity, whereas radial-basis models are better at capturing qualitative (and many quantitative) features of respiration.

Table 2 outlines the results of our calculations applied to 14 data sets from 14 infants. Data used for these calculations were recorded during quiet sleep. Subjects 1-10 are the same subjects as used for correlation dimension estimates. Data for subjects 1-6 were recorded during the same study as those used for dimension estimates. Data for subjects 7-12 were recorded at 4 mo of age, for subjects 13-14 at 6 mo. Respiratory rate is the average respiratory rate over the duration of the recording. Note that, although there was some variability in both the respiratory rate and the period expressed as a number of breaths, the period in seconds was relatively constant. In most cases, this period also falls within the range of periodic breathing. In subject 11, periodic breathing with cycle times from 13.5 to 15.5 s occurred during the same study.

                              
View this table:
[in this window]
[in a new window]
 
Table 2.   Detection of CAM by using RARM

    DISCUSSION

This study has confirmed that apparently regular breathing during quiet sleep is probably chaotic. This conclusion should be qualified. As Rapp (33) observed, to conclude that a phenomenon is chaotic is both difficult and often irrelevant. In real data sets, noise contamination will always increase the dimensional complexity of the data, and almost any experimental data will exhibit noninteger correlation dimension. Identification of apparently chaotic behavior is, however, a good first step in dynamic analysis. Our data are the most convincing evidence that respiratory variability in humans is deterministic and not random, due to noise. Furthermore, we have extended our observations and analyses to describe the dynamic structure of the system in greater detail. We have demonstrated the presence of a two-dimensional periodic system: one dimension corresponds to the regular breath-by-breath cycle of respiration, the other to a regular periodic modulation of breath amplitude (CAM), with cycle length approximating that of periodic breathing.

By applying a new dimension estimation algorithm, we have produced reliable, accurate, and informative dimension estimates. This method produces error bars which, for moderately long time series (such as ours), are reassuringly small. This, however, must not be taken to indicate that we were able to estimate the dimension of the dynamic system responsible for this behavior with similar confidence. As with any experiment, the accuracy of the results is limited by physical constraints such as observational error and noise interference. Even if the data can be collected in such a way that they accurately represented the volume of the subject's lung, there may be other phenomena contributing to that measurement, which are independent of the dynamic system we are attempting to observe. Hence, the tightness of the error bars we have calculated was due more to the quantity of the data than to lack of experimental error.

Our dimension estimate results indicate that, on a large scale, there is low-dimensional behavior, whereas the small-scale behavior is often dominated by very high-dimensional dynamics or noise (that is, extremely high-dimensional dynamics). Even though false nearest neighbor techniques suggest that we were embedding in high enough dimensions, there was still some small-scale behavior that filled the embedding space. The scale at which the embedding space is filled by the dynamics could indicate the level of experimental noise.

The most conclusive estimates from this study indicated that the structure of the attractor is likely to be similar to a torus with small-scale, very high-dimensional dynamics. Hence, at large length scales, the structure looked like the surface of a torus, whereas at smaller length scales dimension increased. This indicates that the attractor appears to be a torus with a very rough surface. The most important conclusion from these data is that this two-dimensional periodic system indicates two levels of periodicity. Hence, in addition to the periodic inspiration-expiration motion, it is likely that there was some cyclic breath-to-breath variation in breath depth.

By applying the method of surrogate data we demonstrated that the correlation dimension is related to the data from which we estimate it in a nontrivial way. The surrogates produced by algorithms 0, 1, and 2 are clearly inadequate. It is apparent that they should fail, and this was confirmed by our results. We have constructed our own surrogates using a nonlinear modeling process and compared surrogates and data to test the accuracy of the model. For the majority of our data sets, we found that the data and nonlinear surrogates were indistinguishable according to correlation dimension. For those data sets that were distinguishable from their surrogates, we found that there were several possible reasons for this. Usually, if the data were nonstationary, the model simply failed to produce surrogates that were close enough to the data. The model is stationary and periodic, whereas the data are not.

Occasionally, with nonstationary data, the model failed to produce even periodic surrogates. If this was the case, then the model had a stable fixed point. In these cases, the dimension estimates of data and surrogate were obviously different, and a better model is required. The fact that this modeling algorithm failed in cases where the data were not stationary is not particularly surprising---both modeling and dimension estimation algorithms require stationarity. Perhaps, with improved modeling techniques, similar results could be obtained in these cases.

Even if both data and surrogate were stationary, the dimension estimates of the surrogates could still be different from those of the data. In all these cases, however, this has been found to be a problem with the level of dynamic noise introduced to the model to generate the surrogates. By changing the noise level, the dimension would also change, effectively moving the dimension estimate vertically. Because, in these cases, the shapes of the dimension estimate curves were approximately the same, by altering the noise level it was possible to produce surrogate estimates that were indistinguishable from the data. In all cases, however, the dynamic noise was substantially less than the model's root mean square prediction error. The root mean square prediction error is the noise level predicted by the modeling algorithm. However, this is the total noise and includes both dynamic and observational noise.

Dynamic noise and observational noise have a different effect on the correlation dimension estimates. Observational noise will increase the value of correlation dimension at length scales less than and equal to the noise level. It appears from our calculations that an increase in the level of dynamic noise increases the correlation dimension estimate equally across all length scales, effectively producing a vertical shift in the estimate. An increase in dynamic noise will certainly have a greater affect on dc(epsilon 0) for larger epsilon 0 than would a similar increase in observational noise. Assuming one has correctly identified the underlying deterministic dynamics, it may be possible to "tune" the level of dynamic noise so that surrogates and data have approximately the same value of correlation dimension estimates at moderate-to-large length scales and then alter the level of observational noise to tune the dimension estimates at small length scales. Hence, the level of noise required to be unable to reject the surrogate data is an indication of the relative proportion of dynamic and observational noise in the system. That is, we can distinguish between random behavior within the system (dynamic noise) and experimental error (observational noise).

Our calculations have demonstrated that the respiration of infants in quiet sleep can be modeled well with radial-basis methods. We conclude that the behavior is similar to a noise-driven periodic orbit. Furthermore, the radial-basis surrogates confirm that the periodic orbit associated with this behavior has at least two degrees of freedom, as suggested by our initial dimension estimates. Finally, our RARM calculations to detect CAM support these results.

The observation of CAM is intriguing. We serendipitously recorded periodic breathing from one infant. The cycle time of CAM (13.3-15.6 s) in the same infant corresponded almost exactly with that of the observed periodic breathing (13.5-15.5 s) as demonstrated in Fig. 2. The relationship to periodic breathing needs further investigation, but we believe that these two behaviors with identical cycle lengths (CAM and periodic breathing) are likely to be related and determined by similar factors, whatever they might be. These data support the hypothesis that oscillatory activity responsible for periodic breathing is ubiquitously present but masked during apparently regular breathing by the normal regular stimulation from respiratory motoneurons. Periodic breathing occurs when this normal regular drive is decreased [i.e., in infants when core body temperature is raised (18)]. The adoption of one particular physiological state, i.e., regular tonic respiration with CAM or periodic breathing is likely to be dependent on the environmental conditions and maturity of respiratory control as well as the presence of any pathological conditions.

In conclusion, this study has addressed the limitations of previous studies that have examined whether respiration is chaotic. We investigated children in quiet sleep when breathing appears most regular. Correlation dimension estimates are consistent with a chaotic system. Furthermore, unlike in most previous studies, we used surrogate data analyses to test whether the apparently chaotic behavior was due to linearly filtered noise. We found this unlikely and concluded that the simplest system consistent with our data is a noise-driven, nonlinear, radial-basis model. Our data are the most convincing evidence that respiratory variability in infants is deterministic and not random, due to noise. A recent study has demonstrated reduced variability of respiratory movements in infants, who subsequently died of sudden infant death syndrome (SIDS) [Schechtman et al. (43)]. This observation was retrospective but suggests that, because the variability that we have observed during quiet breathing is deterministic, then further study using DST could allow early identification of infants at risk of SIDS from simple measurements of respiratory patterns.

We speculate that since CAM is an important contributor to the complexity observed during quiet breathing, further studies might demonstrate distinct patterns of CAM in infants with respiratory control problems; for example, absence of CAM might explain the reduction in variability observed by Schechtman et al. (43) in infants who died of SIDS.

We believe that the techniques employed by us in this study could be used in the future for studying the complex dynamic behavior of respiration and its control in health and disease.

    APPENDIX A. DIMENSION ESTIMATION
Top
Abstract
Introduction
Appendix
Appendix
Appendix
Appendix
References

In this APPENDIX, we discuss the mathematical aspects of dimension estimation. Next, we present the new algorithm. Dimension estimation is logically, and operationally, divided into two steps: embedding the observed time series in a mathematical space and estimation of dimension from the embedded structure of the data. We begin with a brief description of the embedding process; see references for more details.

Embedding

Let x1, x2, x3, ... , xN represent a time series of N measurements of a system recorded with a fixed sampling rate. Construct an n-dimensional vector time series vt, the time-delay embedding of the scalar time series, by
<B><IT>v</IT></B><SUB><IT>t</IT></SUB> = (<IT>x</IT><SUB><IT>t</IT>−&tgr;</SUB>, <IT>x</IT><SUB><IT>t</IT>−2&tgr;</SUB>, … , <IT>x</IT><SUB><IT>t</IT>−<IT>n</IT>&tgr;</SUB>),  <IT>t</IT> = <IT>n</IT>&tgr; + 1, <IT>n</IT>&tgr; + 2, … , <IT>N</IT>
This is one of a number of different embedding strategies. For example, Sammon et al. (40, 41) employ a two-dimensional differential embedding where the coordinates are either (xt, dxt/dt) or (d2xt/dt2, d3xt/dt3), where dixt/dti is a numerical approximation to the ith derivative of xt.

The Takens embedding theorem (27, 56) guarantees that the structure of the cloud of points (see Fig. 7) defined by a time-delay embedding is diffeomorphic (equivalent) to that of the measured system in most circumstances---actually Takens' theorem says much more than this. The correlation dimension is invariant under diffeomorphism (smooth change of coordinates) and, hence, it is possible to analyze embedded data to discover a system's fractal dimension.


View larger version (46K):
[in this window]
[in a new window]
 
Fig. 7.   A time-lag embedding. One of data sets used in our calculation, together with time-lag embedding in 2 and 3 dimensions. Time lag used was 19 data points (380 ms).

An embedding depends on two parameters, the lag tau  and the embedding dimension n. For an embedding to be suitable for successful estimation of dimension, one must choose suitable values of these parameters.

Embedding dimension. A fundamental topological result (56) guarantees that an n-dimensional manifold can be successfully embedded in at most R2n + 1, but more recent results (8) show that estimation of dimension requires that an embedding dimension need only be greater or equal to n. In practice, one could guess a suitable value for n by successively embedding in higher dimensions and looking for consistency of results; this is the method that is generally employed. However, other methods, such as the false nearest neighbor technique (10, 57), are now available to suggest the value of n. For the data used in our studies, the false nearest neighbor technique suggested that an embedding dimension of three or four would be sufficient. Using this knowledge, we repeated the dimension estimation procedure described below for increasing values of n, starting at n = 2.

Embedding lag. There are two main methods (33) for choosing an appropriate value of the lag tau : the first zero of the autocorrelation function (4, 5) and the first minimum of the mutual information (1, 12, 23). The rationale of both of them, however, is to choose the lag so that the coordinate components of vt are reasonably uncorrelated while still being "close" to one another. When the data exhibit strong periodicity, as is the case with respiratory patterns, a value of tau  that is one-quarter of the length of the average breath generally gives a good embedding. This lag is approximately the same as the time of the first zero of the autocorrelation function. Coordinates produced by this method are within a few breaths of each other (even in relatively high-dimensional embeddings) while being spread out as much as possible over a single breath. Moreover, for embedding in three or four dimensions (as suggested by false nearest neighbor techniques), the data are spread out over one-half to three-quarters of a breath. This means that the coordinates of a single point in the three- or four-dimensional vector time series vt represent most of the information for an entire breath. This choice of lag is extremely easy to calculate and, for the data sets that we consider, it also seems to give much more reliable results than the mutual information criterion.

Dimension

Once we have embedded the data properly, we wish to measure the complexity of the "cloud" of points vt. The measure we use in this paper is the correlation dimension.

To define the correlation dimension in a meaningful way, we generalize the concept of integer dimension to fractal objects with noninteger dimension. In dimensions of one, two, three, or more, it is easily established, and intuitively obvious, that a measure of volume V (e.g., length, area, volume, and hypervolume) varies as
V ∝ &egr;<SUP><IT>d</IT></SUP> (1)
where epsilon  is a length scale (e.g., the length of a cube's side or radius of a sphere) and d is the dimension of the object. For a general fractal, it is natural to assume that a relation like Eq. 1 holds true, in which case its dimension is given by
<IT>d</IT> ≈ <FR><NU>log V</NU><DE>log &egr;</DE></FR> (2)
Let {vt}Nt=1 be an embedding of a time series in Rn. Define the correlation function, CN(epsilon ), by
<IT>C</IT><SUB><IT>N</IT></SUB>(&egr;) = <FENCE><AR><R><C><IT>N</IT></C></R><R><C>2</C></R></AR></FENCE><SUP>−1</SUP> <LIM><OP>∑</OP><LL>0≤<IT>i</IT><<IT>j</IT>≤<IT>N</IT></LL></LIM> <IT>I</IT>(∥<B><IT>v</IT></B><SUB><IT>i</IT></SUB> − <B><IT>v</IT></B><SUB><IT>j</IT></SUB>∥ < &egr;)
Here, I(X ) is a function the value of which is 1 if condition X is satisfied and 0 otherwise, and parallel ·parallel is the usual distance function in Rn. The sum Sigma iI(parallel vi - vjparallel  < epsilon ) is the number of points within a distance epsilon  of vj. If the points vi are distributed uniformly within an object, then this sum is proportional to the volume of the intersection of a sphere of radius epsilon  with the object, and CN(epsilon ) is proportional to the average of such volumes. Comparing with Eq. 1, one expects that
<IT>C</IT><SUB><IT>N</IT></SUB>(&egr;) ∝&egr;<SUP><IT>d</IT><SUB>c</SUB></SUP>
where dc is the dimension of the object. Considering Eq. 2, it is seen to be natural to define the correlation dimension dc by
<IT>d</IT><SUB>c</SUB> = <LIM><OP><UP>lim</UP></OP><LL>&egr;→0</LL></LIM> <LIM><OP><UP>lim</UP></OP><LL><IT>N</IT>→∞</LL></LIM> <FR><NU>log <IT>C</IT><SUB><IT>N</IT></SUB>(&egr;)</NU><DE>log &egr;</DE></FR> (3)
The curious normalization of CN(epsilon ) is chosen so that rather than CN(epsilon ) being an estimate of the average volume of an object within a radius epsilon  of a point, it is instead an estimate of the probability that two points chosen at random on the object are within a distance epsilon  of each other. The difference between the volume and the probability is only a constant of proportionality if the points were distributed uniformly, and this constant vanishes in the limit of Eq. 3. The reason for choosing the probability rather than the volume is that the concept of dimension still makes sense, indeed, generalizes to situations where the sample points vi are not distributed uniformly within the object. A discussion of the general situation is beyond the scope of this paper (see 19, 20).

The method most often employed to estimate the correlation dimension is the Grassberger-Procaccia algorithm. In this method, one calculates the correlation function and plots log CN(epsilon ) against log epsilon . The gradient of this graph in the limit as epsilon  right-arrow 0 should approach the correlation dimension. Unfortunately, when a finite amount of data is used, the graph will jump about irregularly for small values of epsilon . To avoid this, one looks instead at the behavior of this graph for moderately small epsilon . A typical correlation integral plot will contain a "scaling region" over which the slope of log CN(epsilon ) remains relatively constant. A common way to examine the slope in the scaling region is to numerically differentiate the plot of log epsilon against log CN(epsilon ). This ought to produce a function that is constant over the scaling region, and its value on this region should be the correlation dimension (see Fig. 8).


View larger version (28K):
[in this window]
[in a new window]
 
Fig. 8.   Correlation dimension from distribution of interpoint distances. Logarithm of distribution of interpoint distances and an approximation to the derivative for 1 of our sets of data embedded in 3 dimensions. Approximate derivative is a smoothed numerical difference. This calculation used the same data set as Fig. 7, embedded in 3 dimensions, with a lag of 19 data points (380 ms). Even with well-behaved data and a smooth, approximately monotonic distribution of interpoint distances, the choice of scaling region is still subjective.

Unfortunately, as Judd (19) points out, there are several problems with this procedure. The most obvious of these is that the choice of the scaling region is entirely subjective (Fig. 8). For many data sets, a slight change in the region used can lead to substantially different results. Judd demonstrates that for many objects, including many fractals, a better description of CN(epsilon ) is that for epsilon  less than some epsilon 0
<IT>C</IT><SUB><IT>N</IT></SUB>(&egr;) ≈ &egr;<SUP><IT>d</IT><SUB>c </SUB></SUP><IT> q</IT>(&egr;)
where q(epsilon ) is a polynomial of order t, the topological dimension of the set. The topological dimension is something like the smallest t, such that any tiny piece of the object fits in Rt without overlapping itself. One now has a definition of correlation dimension dependent on the scale epsilon 0.

The Grassberger-Procaccia method assumes that CN(epsilon proportional to  epsilon dc, but this new method allows for the presence of a further polynomial term that takes into account variations of the slope within and outside of a scaling region. This new method dispenses with the need for a scaling region and substitutes a single-scale parameter epsilon 0, which has an interesting benefit. For many natural objects, the dimension is not the same at all length scales. If one observes a large river stone, its surface at its largest length scale is very nearly two dimensional, but at smaller length scales, one can discern the details of grains that add to the complexity and increase the dimension. Consequently, it is natural to consider dimension dc as a function of epsilon 0 and write dc(epsilon 0). For an alternative treatment of this algorithm, see, for example, Ref. 17.

By allowing our dimension to be a function of scale, we produce estimates that are both more accurate and more informative. We avoid some of the approximation necessary to define correlation dimension as a single number and we can extract more detailed information about the changes in dimension with scale. Finally, the data requirements of this algorithm are modest enough to be satisfied by our data.

    APPENDIX B. SURROGATE-DATA ANALYSIS
Top
Abstract
Introduction
Appendix
Appendix
Appendix
Appendix
References

This appendix briefly outlines the surrogate-data methods utilized in this article. A more comprehensive description of these methods may be found in Ref. 59, and some caveats and recent extensions to these methods are outlined in Ref. 60. In Ref. 53, Small and Judd describe the application of correlation dimension as a test statistic for linear and nonlinear hypothesis testing. First, we discuss our choice of test statistic and then the rationale of linear and nonlinear surrogate-data methods.

Test Statistics

To compare the data to their surrogates, a suitable test statistic must be selected. A useful statistic must measure a nontrivial invariant of a dynamic system that is independent of the way surrogates are generated. Because we are interested in correlation dimension, which is independent of most surrogate-generation methods, it is natural to use this as our test statistic. Correlation dimension, as we have defined it, is a function of epsilon 0. There are several obvious ways to compare these curves. On many occasions, it is sufficient to compare the value of dimension for some fixed values of epsilon 0, and this is the method we used. Other possibilities include the mean value of the dimension estimate or the slope of the line of best fit. More sophisticated methods include statistical tests such as the chi 2 test or the Kolmogorov-Smirnov statistic applied to the distribution of interpoint distances to determine whether the distributions are the same.

Traditional Surrogate Data

Different types of surrogate data are generated to test membership of specific dynamic system classes, referred to as null hypotheses. The three types of surrogates described by Theiler et al. (59), referred to as algorithms 0, 1, and 2, address the three null hypotheses: (0) i.i.d. (independently and identically distributed) noise; (1) linearly filtered noise; and (2) monotonic nonlinear transformation of linearly filtered noise. Each of these null hypotheses should be rejected for data generated by a nonlinear (nearly) periodic system, which is what we believe the respiratory system to be. Hence, we expect that each of these null hypotheses would be rejected for our respiratory data.

In a recent paper, Theiler (58) [and also Theiler and Rapp (61)] addresses this problem and proposes that a logical choice of surrogate for strongly periodic data, such as respiratory traces or electrocardiogram signals, should also be periodic. To achieve this, he decomposes the signal into cycles and shuffles the individual cycles. Our data, however, do not have a convenient point at which to break the cycles. To break the cycles at peak inspiration (or expiration) would introduce a degree of subjectivity, since the point at which to separate the cycles is not clearly defined, and it would introduce points of nondifferentiability (that is, sharp points, cusps, or corners) that are not present in the original data.

Theiler's (58) alternative null hypothesis for strongly periodic signals is not a characterization of great interest to our present discussion. Theiler proposes that surrogates generated by shuffling the cycles address the hypothesis that there is no dynamical correlation between cycles. The null hypothesis that the data are generated by a noise-driven periodic orbit is of far greater significance to our analysis. Cycle-to-cycle variation in breath structure is a problem distinct from our present inquiry but also of great interest (55). We have considered the application of cycle-shuffled surrogates to similar data elsewhere (52).

Noise-Driven Nonlinear System Surrogates

Hypothesis testing with surrogate data is, essentially, a modeling process. To test whether the data are consistent with a particular hypothesis, one first builds a model that is consistent with that hypothesis and has the same properties as the original data, then one generates surrogate data from the model and checks that the original data are typical under the hypothesis by comparing them with the surrogate data. For surrogates generated by algorithm 0, 1, or 2, the model used is linear. Each of these surrogate tests addresses a null hypothesis that the data are either linear or some (linear or monotonic nonlinear) transformation of a linear process.

To address the null hypothesis that the data come from a noise-driven nonlinear system, which, one supposes, will have a focus or stable periodic orbit, we built a nonlinear model and generated noise-driven simulations from that model. These simulations are our surrogate data. The nonlinear model that we built from the data is a radial-basis model based on the methods of Judd and Mees (21, 22), and the test statistic we used was correlation dimension. Radial-basis models are used because they are known to be effective in modeling a variety of nonlinear dynamic systems, and the authors have at their disposal a sophisticated software implementation of this modeling method. To be precise, the null hypothesis we tested is that the data are consistent with a nonlinear system that can be described by a radial-basis model and that the data of such a system can be modeled adequately by using the algorithms we used. Rejection of the null hypothesis could imply that the data cannot be described by a radial-basis model or that the modeling algorithm failed to build an accurate model.

Building a nonlinear model of data is a decidedly nontrivial process. All that really concerns us here is that we were somehow able to generate a radial-basis model of the data. Figure 4 gives an example of some data generated by the methods we used. A more precise definition of the model is given in APPENDIX C.

The conclusions that can be drawn from testing with these nonlinear models are several. First, the surrogate data can indicate that our data are not consistent with a nonlinear system of the type generated by our modeling procedure. Second, it will determine whether correlation dimension can distinguish between data and surrogates. Finally, and most importantly, this is a test of the modeling procedure itself. If the null hypothesis cannot be rejected on the basis of our analysis, then this will indicate that the model we have built is an accurate model of the data, with respect to correlation dimension. Even if correlation dimension cannot distinguish between data and surrogat