|
|
||||||||
Laryngeal and Speech Section, National Institute of Neurological Disorders and Stroke, Bethesda, Maryland
Submitted 13 April 2004 ; accepted in final form 30 May 2005
| ABSTRACT |
|---|
|
|
|---|
150 ms). The absence of intrinsic laryngeal muscle responses is consistent with a lack of spindles in these muscles. Our results suggest that other sensory receptors, such as mucosal mechanoreceptors, provide feedback for voice control. somatosensory; sternothyroid; thyroarytenoid; stretch reflex
In most striated muscle groups, changes in muscle length are mediated by 1a afferents from muscle spindles. The presence of muscle spindles in the intrinsic laryngeal muscles in humans is controversial (19, 23). One report found muscle spindles in the interarytenoid (28), others reported spindles in the thyroarytenoid muscle (4, 5, 24), whereas a more recent study failed to confirm the presence of spindles in the thyroarytenoid muscle (8).
When servomotor-controlled displacements were applied to the thyroid cartilage during voicing in humans, rapid changes occurred in voice fundamental frequency and were accompanied by short-latency muscle responses recorded using surface electromyography (EMG) on the neck (26). The fundamental frequency changes in the voice demonstrated changes in the length and/or tension of the vocal folds. Because the surface EMG responses occurred 3035 ms following the onset of displacement but before the voice changes at 5565 ms, the authors suggested that the EMG responses were possibly muscle stretch responses induced by changes in vocal fold length/tension. The muscle(s) that produced the EMG response could not be identified, however, because only surface electrodes were used. In another study that used a servomotor to displace the thyroid cartilage, a response occurred
25 ms in the cricothyroid muscle following cartilage compression in two subjects and no response in the thyroarytenoid muscle. The basis for the cricothyroid response is not clear, given limited evidence of muscle spindles in this muscle (19, 23). Other explanations could be a response to cricothyroid joint receptors (7), or a stretch of the overlying strap muscles, which contain muscles spindles and may have been picked up from electrodes in the underlying cricothyroid muscle (16).
Our purpose was to test the hypothesis that muscle responses to thyroid cartilage displacement originate in the intrinsic laryngeal muscles of normal speaking adults (26, 29). We expected that a rapid force would move the thyroid cartilage posteriorly, exciting stretch receptors, which may be present in the cricothyroid muscle. Furthermore, we hypothesized that, when the force is released from the thyroid cartilage, the vocal folds will lengthen, stretching the thyroarytenoid muscle and raising the fundamental frequency. Such a mechanical lengthening could excite stretch receptors that might be present in the thyroarytenoid (24), producing a muscle response that would decrease the fundamental frequency by shortening the vocal folds.
In addition, to determine whether the effects of mechanical force on laryngeal muscles depend on ongoing laryngeal muscle activity, we applied the mechanical perturbation in six conditions: high-pitch phonation, which would increase intrinsic muscle activity; phonation at the persons typical pitch; pressed phonation (which increases thyroarytenoid muscle activity); high-pitch whisper (vocal fold lengthening without closure); effortful closure or Valsalva (vocal fold closure and shortening); and quiet breathing.
| METHODS |
|---|
|
|
|---|
EMG.
After a ground electrode was attached, the subjects were placed in a supine position with the neck extended. A small amount of 1% lidocaine was injected subcutaneously over the cricothyroid membrane to minimize discomfort during electrode insertions. Following standard techniques (13), bipolar needle EMG electrodes (27-gauge, 37.5 mm) were used to locate a laryngeal muscle before placing bipolar hooked wire electrodes in the muscle. Electrode placements in the thyroarytenoid muscles were verified by increased activation during phonation and effortful glottic closure without prominent onset and offset bursts indicative of lateral cricoarytenoid activity. Verification of the sternothyroid muscle included increased activation at the low end of a decreasing pitch glide and during head turning and head lowering. Placement in the cricothyroid was verified by increased activation during the high end of a pitch glide, but without activation during head rising to demonstrate that strap muscle activity was not included in the recording field. The bipolar hooked wire electrodes (0.0002-in. diameter with 1 mm of insulation removed at the tip) were placed in the thyroarytenoid and sternothyroid bilaterally and in the cricothyroid unilaterally. Surface EMG electrodes were placed bilaterally over the thyroid cartilage to replicate the previous laryngeal area surface electrode placement of Sapir et al. (26). On each side of the cartilage, the upper electrodes were placed
1 cm from midline and
1 cm below the thyroid notch bilaterally, and the lower electrodes were placed over the cricoid cartilage medial to the sternocleidomastoid on each side. The skin was cleaned, and conductive gel was applied to ensure optimal EMG recordings.
After electrode placements, maximal activation gestures were recorded, including the Valsalva maneuver, water swallow, and throat clear for thyroarytenoid activity, high pitch /i/ during an ascending scale for cricothyroid, and effortful chin tuck for sternothyroid. The EMG signals were band-pass filtered between 30 and 3,000 Hz before digitization. All acoustic, EMG, and stimulus signals were anti-alias filtered digitally on a 64 times oversampled signal using a brick-wall filter with linear phase at 5 kHz before the signal was downsampled to 12 kHz. The effective sampling rate was 12,000 samples/s for each channel.
Servomotor stimuli. The mechanical stimuli were delivered to the skin overlying the thyroid cartilage using a servomotor probe operating under force feedback control. A small curved 2 x 1 cm plate was attached at the end of the servomotor shaft to fit over the thyroid prominence. The direction of the probe displacement was adjusted to push the thyroid cartilage posteriorly toward the spine and somewhat rostrally. The stimuli had a rise and fall time of 20 ms and plateau duration of either 250 or 500 ms with a force level of between 0.9 and 1.6 N (mean = 1.4 N) to produce the largest displacement without discomfort. In three subjects, the force output saturated without producing discomfort, and the stimulus level was set just below the saturation point.
For the male participants, the probe rested on the thyroid prominence. For the female subjects, a midline position was chosen where a perceptible change in fundamental frequency could be produced. To ensure that the thyroid cartilage was effectively displaced by the stimulus on each trial, a consistent contact between the stimulus probe and the midline of the skin over the thyroid cartilage was maintained using a slight contact pressure and two-sided tape. Adjustments were made to the height of the probe to accommodate changes in laryngeal height (e.g., low-pitch condition) for each block of trials. The probe was monitored to ensure displacement of the thyroid cartilage toward the spine at stimulus onset with a clear rebound at offset.
To indicate cartilage displacement, a piezoelectric movement transducer was taped on the skin over the thyroid cartilage. The servomotor force and movement signals occurred abruptly in opposite directions at stimulus onset and offset (Fig. 1A). Movement artifacts frequently occurred at stimulus onset and offset in the EMG signals in opposite directions during thyroid cartilage displacement and rebound.
|
Task description.
The mechanical perturbations were applied during six phonatory and nonphonatory conditions. The high-pitch phonation condition replicated Sapir et al. (26). We first measured the fundamental frequency of each subjects typical speaking pitch on extended phonation of the vowel /i/. The subjects were then trained to produce the vowel /i/ at a test pitch that was one-half to two-thirds octave higher than their typical speaking pitch. Before each trial, a sinusoidal tone at the test frequency was presented, and the subject was asked to phonate at a matching pitch. The displacement was applied by the experimenter
12 s after phonation was initiated. The two stimulus durations of 250 and 500 ms were applied in a randomized order in the high-pitch condition. The 250-ms duration replicated Sapir et al. (26). The 500-ms stimulus duration was used to measure possible long-latency responses that might occur >250 ms after stimulus onset. In the second condition, servomotor displacements were applied while subjects produced the vowel at their typical speaking pitch. The "pressed" voice condition was used in four subjects taught to produce a strained voice that would increase the levels of thyroarytenoid muscle contraction.
The nonphonatory conditions included rest, high-pitch whisper, and Valsalva. In the rest condition, the displacement was applied during exhalation by monitoring the subject and the combined abdomen/ribcage Respitrace signal. In the high-pitch whisper condition, subjects were instructed to whisper the vowel /i/ while approximating the high-pitch condition, to increase cricothyroid muscle activation. The Valsalva condition was used to assess whether muscle responses to displacement differed with active contraction of the thyroarytenoid muscle. The servomotor stimulus duration for the nonphonatory tasks was 500 ms.
The majority of trials were in the phonation condition to adequately replicate Sapir et al. (26) and randomly vary stimulus duration. Approximately 6080 high-pitch phonation trials were recorded per subject, and 1015 trials were used for the other conditions. The conditions were presented in five trial blocks in random order. Approximately 15% additional sham trials were interspersed during the experiment to prevent the subject from anticipating the displacement. On sham trials, a sound mimicking the stimulus was presented without thyroid displacement.
Analysis.
The digitized signals were visually inspected and marked with custom routines written in MATLAB (version 6.5) to identify the onset, duration, and amplitude of the voice and muscular responses. Force onset and offset were defined as the points at which the force signal was 50% of the mean amplitude during the stimulus period (Fig. 1B). The fundamental frequency contour of the voice signal was obtained by applying a low-pass, digital filter (8th-order, zero-phase, Butterworth filter) and a custom zero-crossing algorithm that computed the inverse of the zero-crossing intervals to give fundamental frequency over time. The filter cutoff was
50 Hz below the first formant of the vowel.
The peaks of the fundamental frequency responses were labeled as F01F04 (Fig. 1B) and correspond to the fundamental frequency responses discussed by Sapir et al. (26). The latencies and amplitudes of the four points relative to stimulus onset were computed automatically with the use of custom pick-peaking algorithms (Matlab, version 6.5). The amplitudes for each fundamental frequency (F0) change were defined as follows: for F01, the difference (Hz) between the F0 at the F01 minima and the mean F0 over the 200-ms baseline period; for F02, the difference (Hz) between the F0 at the F01 minima and the F02 peak; for F03, the difference (Hz) between the F0 at the F03 peak and the F0 at stimulus offset; and for F04, the difference (Hz) between the F0 at the F03 peak and the F04 minima (Fig. 1B).
To remove direct current offset and movement artifacts due to the servomotor force impulse, the EMG signals were high-pass filtered >20 Hz with a fourth-order, zero-phase, Butterworth filter. The EMG traces from each trial were visually inspected to identify stimulus-related changes relative to the baseline period, 200 ms before stimulation. A baseline deflection >5 ms after the force stimulus onset was marked as the onset of a muscle response (Fig. 1B). Response offset was marked as the point at which the EMG signal returned to baseline and remained stable for at least 10 ms. An initial deflection occurring at stimulus onset was labeled as a stimulus-related movement artifact. The EMG measures obtained for each muscle response included 1) response latency (ms); 2) response duration (ms); and 3) mean amplitude (µV). The frequency of responses was computed for each muscle in each subject as a percentage of the total number of trials.
Planned statistical analyses compared voice and muscle responses across the experimental conditions. To determine whether different servomotor stimulus durations affected the fundamental frequency events (F01F04), paired t-tests (P < 0.05) were used to compare the latency and amplitude of the four F0 events between the two durations with a Bonferroni correction for the eight tests (P = 0.05/8 = 0.00625). If significant voice changes occurred across conditions, then the EMG data were compared statistically; otherwise, the two duration conditions were combined for both the voice and EMG data. Paired t-tests (P < 0.05 with Bonferroni correction) compared the amplitude and latencies of the F0 events between the high-pitch and low-pitch conditions. The EMG data from the two pitch conditions were compared statistically, if voice differences occurred (P < 0.05). To determine whether the F0 events were related to muscle responses, separate Pearson correlation coefficients (P = 0.05 with Bonferroni correction) related the latencies and amplitudes of the first two F0 events with initial muscle response latency in subjects with reliable muscle responses.
| RESULTS |
|---|
|
|
|---|
|
|
|
|
|
|
|
The pattern of EMG results was similar in the low-pitch and high-pitch conditions. Responses occurred in the low-pitch condition in the sternothyroid and surface EMG recordings in the four subjects who also had reliable sternothyroid responses in the high-pitch condition. The latencies of the sternothyroid and surface EMG were similar in both pitch conditions (Fig. 6A and Table 1), but the frequency of both responses tended to decrease in the low-pitch condition, particularly for the surface EMG. Statistical comparisons between trials with and without a muscle response (sternothyroid or surface) found no differences in the latency or amplitude of voice changes for the low-pitch condition.
The F0 changes observed in the high- and low-pitch conditions were not seen in the pressed phonation condition (4 subjects). Muscle responses were only observed in two subjects, with one subject (S10) showing sternothyroid responses and the other subject (S9) showing surface responses. Responses were not observed in the thyroarytenoid and cricothyroid, although both muscles were active before and during the stimulus. The sternothyroid responses tended to be more variable in the pressed phonation condition than in the other voiced conditions. The latency and frequency of surface responses in S10 were 29 ms (±3 ms) and 93%, respectively, however, which were similar to this subjects responses in the other pitch conditions (Table 1).
Nonphonation conditions. We compared responses across conditions that varied vocal fold position (29). During Valsalva, the vocal folds are closed and shortened, whereas, during high-pitch whisper, they are open and lengthened. During exhalation at rest, the glottis is open and the vocal folds are relaxed. No responses to force occurred in the thyroarytenoid or cricothyroid muscles in these conditions. Only the two subjects (S2 and S5) who had reliable sternothyroid responses during high-pitch phonation had sternothyroid responses in the whisper and rest conditions. The latencies of the sternothyroid responses in the nonphonation conditions were similar to those in both pitch conditions (Fig. 6A), although fewer responses occurred in the nonphonation conditions (Fig. 6B). Sternothyroid muscle responses were infrequent after cartilage release in the whisper condition. Surface EMG responses were less frequent in the low-pitch and whisper conditions and did not occur during quiet exhalation. No muscle responses occurred in the Valsalva condition.
| DISCUSSION |
|---|
|
|
|---|
Instead, the only muscle responses occurred in the sternothyroid muscle and were most likely a result of thyroid cartilage displacement in a posterior and upward direction, which would stretch this extrinsic strap muscle. The surface EMG recordings had similar latencies as the intrinsic sternothyroid responses, indicating that the surface electrodes most likely picked up signals from the sternothyroid. We used the same surface electrode placements as Sapir et al. (26) and found comparable EMG onset latencies, suggesting that their surface EMG responses may also have originated from the sternothyroid.
One difference between the studies is that all 19 subjects had muscle responses in the Sapir et al. study (26), whereas only 6 of our 10 subjects had EMG responses. Sapir et al. only recruited young male subjects with large thyroid prominences, which may have enhanced muscle response elicitation in that study. In contrast, our subject group included three women who did not have a laryngeal prominence. However, two of our female subjects had sternothyroid responses, whereas two male subjects with evident thyroid prominences did not, indicating that servomotor placement on a prominent thyroid cartilage was not essential for eliciting muscle responses. The lack of a sternothyroid muscle response in four of our subjects, despite consistent voice responses, also demonstrates that a sternothyroid response was not required for a voice response.
The voice responses elicited by force application were very similar in the two studies, although our F0 changes were greater in amplitude and duration (26). This is likely due to the higher force levels and displacements used in our study, 1.4 N and 8.5 mm, compared with 0.7 N and 3.3 mm, respectively, in Sapir et al. (26). Our initial F0 decrease lasted for 71 ms and was longer than theirs of 24 ms, most likely because of higher forces being applied in our study. Despite the use of greater forces, no thyroarytenoid or cricothyroid responses were detected in our study.
The later F02 response peaked at 80 ms in Sapir et al. (26) and at 176 ms in this study. Sapir and colleagues suggested that the F02 voice change was mediated by a muscle response occurring at 34 ms. In our study, each subject showed an F02 voice response, but only six had sternothyroid responses, suggesting that the F02 voice change was not dependent on a sternothyroid muscle response. Rather, the F02 and F04 voice changes may be mechanical rebound responses, because they both occurred in the opposite directions from the initial F01 and F03 voice responses to mechanical displacement. Although the F02 increase was greater in subjects with sternothyroid responses than in subjects without muscle responses, the F0 increase is not commensurate with sternothyroid contractions that typically depress the thyroid cartilage and lower the voice (14, 20). Furthermore, because the F02 change occurred in the absence of a muscle response, this voice change is more likely produced by passive rebound of the laryngeal tissue.
On the other hand, the F02 and F04 voice responses might correspond to the auditory pitch-shift reflex (9, 18, 30). The subjects in our study felt and heard the servomotor stimulus, so the later F02 and F04 voice changes may have been compensatory responses, causing the subjects to shift their pitch in the opposite direction from the perceived error. The latency of the F02 and F04 responses in this study occurred
100 ms after the mechanically induced F01 and F03 voice changes, which approximates the latencies reported by Burnett et al. (9). Future studies are needed to examine how the laryngeal muscular system responds during the pitch-shift reflex.
In contrast with Titze et al. (29), who reported bilateral cricothyroid responses in both of their subjects, no cricothyroid responses occurred in our 10 subjects. A possible explanation is that the EMG recordings of Titze et al. included electrical pickup from sternothyroid muscles overlying the cricothyroid electrodes. We checked that our cricothyroid recordings did not increase during a chin-tuck or head-raising gesture to demonstrate that we were not picking up activity from neighboring strap muscles. Such a precaution was not mentioned in the Titze et al. study. The latency of their cricothyroid response was
30 ms (based on Fig. 6, p. 2278, Ref. 29), which is similar to our sternothyroid response latencies. The absence of cricothyroid and thyroarytenoid responses in our study does not support the role of intrinsic laryngeal stretch reflexes in vocal vibrato, as proposed by Titze et al. (29).
Sternothyroid responses.
Reflexes in the sternothyroid muscles have not been reported previously, but muscle spindles have been identified in human fetal sternothyroid muscles and monkey preparations (16, 23), along with dense muscle spindle populations in the human sternohyoid (8). The 35- to 40-ms sternothyroid latency seems late for a monosynaptic stretch reflex, given that stretch reflexes in the masseter (21) and trapezius (1) have latencies of 6 and 12 ms in humans, respectively. Reflex latency, however, depends on the anatomical orientation of the intrafusal organ and whether the stimulus delivery effectively and rapidly lengthens the intrafusal fibers. The sternothyroid responses occurred most often in the high-pitch condition when the larynx is moved upward and stretches the sternothyroid, possibly increasing the stretch effects on the 1a afferents in this muscle when force was applied to the thyroid cartilage (22). In studies of whiplash and falling with a rapid head acceleration/deceleration response, onset latencies in the sternocleidomastoid from these indirect stretch stimuli vary between 22 and 35 ms (3, 15), similar to the 25- to 30-ms latencies found here.
Laryngeal proprioception.
Although the voice changes in fundamental frequency clearly demonstrated that changes occurred in the length and tension of the vocal folds, our stimulus may not stretch the spindles previously reported in the vocalis portion of the thyroarytenoid (24). Another study (8), however, failed to find muscle spindles in the human thyroarytenoid. One the other hand, our servomotor stimulus did not elicit the laryngeal adductor response in the thyroarytenoid muscle, suggesting that our external servomotor stimulus did not deflect the mucosal mechanoreceptors previously shown to elicit thyroarytenoid responses in humans (6) and are capable of conveying sensory information related to thyroarytenoid muscle tension and vocal fold movement (2, 25). At this point, our study does not provide evidence that either the thyroarytenoid or cricothyroid muscles mediate short-latency stretch responses.
In conclusion, although other receptors may mediate laryngeal sensory feedback during vocalization, such as muscle spindles in the human interarytenoid muscle (28) and mechanoreceptors in the laryngeal mucosa (12), we speculate that the short-latency sternothyroid responses found here provide evidence that spindles in extrinsic laryngeal muscles might provide sensory feedback regarding changes in laryngeal height. Possibly the rapid changes in laryngeal height, which occur during pitch changes (14), could generate somatosensory feedback for laryngeal control during voice production.
| ACKNOWLEDGMENTS |
|---|
|
|
|---|
| FOOTNOTES |
|---|
The costs of publication of this article were defrayed in part by the payment of page charges. The article must therefore be hereby marked "advertisement" in accordance with 18 U.S.C. Section 1734 solely to indicate this fact.
| REFERENCES |
|---|
|
|
|---|
This article has been cited by other articles:
![]() |
V. M. Henriquez, G. M. Schulz, S. Bielamowicz, and C. L. Ludlow Laryngeal reflex responses are not modulated during human voice and respiratory tasks J. Physiol., December 15, 2007; 585(3): 779 - 789. [Abstract] [Full Text] [PDF] |
||||
![]() |
E. J. Hunter and I. R. Titze Refinements in modeling the passive properties of laryngeal soft tissue J Appl Physiol, July 1, 2007; 103(1): 206 - 219. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
| Visit Other APS Journals Online |