Research Article - (2023) Volume 14, Issue 3

Binaural Beat Stimulation Improves Stuttering Symptoms
Dmytro Chernetchenko1,2*, Pramax Prasolov1,3, Sam Aganov1,4,5, Andrii Voropai1,2, Yuliia Polishchuk6, Dmytro Lituiev1 and Eugene Nayshtetik1
 
1Department of Sciences, University of California, California, United States
2Department of Physics, Dnipro National University, Dnipro, Ukraine
3Department of Pharmacy, University of San Francisco, San Francisco, USA
4Department of Human Sciences, The Ohio State University, Ohio, United States
5Department of Science, University of the Cumberland, Williamsburg, United States
6Department of Pedagogy and Psychology, Dnipro National University, Dnipro, Ukraine
 
*Correspondence: Dmytro Chernetchenko, Department of Sciences, University of California, California, United States, Email:

Received: 28-Nov-2022, Manuscript No. JCRB-22-19041; Editor assigned: 01-Dec-2022, Pre QC No. JCRB-22-19041 (PQ); Reviewed: 15-Dec-2022, QC No. JCRB-22-19041; Revised: 27-Jan-2023, Manuscript No. JCRB-22-19041 (R); Published: 03-Feb-2023, DOI: 10.35248/2155-9627.23.14.464

Abstract

Background: Stuttering is a speech disorder that affects more than 70 million people worldwide, limiting their ability to communicate and socialize. In recent decades, several studies have demonstrated a link between stuttering and abnormal Electroencephalographic (EEG) β-power in cortex during speech preparation and production.

Aim: This study investigated the efficacy of a novel auditory neuromodulating technology that leverages euphonic music tracks with broad-spectrum binaural beats to induce selective EEG spectral power changes.

Methods: Adults with Stuttering (AWS, n=6) and participants from the control group without speech pathologies (n=6) were exposed to binaural stimuli for 5 minutes. EEG and Electrocardiographic (ECG) bio-signals were recorded before, during, and after exposure.

Results: During standard reading tasks without stimulation, in controls but not in the AWS group, β-power was significantly higher in the left hemisphere than in the right hemisphere. After stimulation, the power of the β-band in AWS participants in the left hemisphere increased 1.54-fold, while changes in the right hemisphere activity were not significant. The average β-band power within the left front temporal area and temporoparietal junction of the cortex after stimulation in AWS participants shows an increase by 1.65-fold and 1.72-fold respectively. The rate of disfluency dropped significantly immediately after stimulation (median 74.70% of the baseline rate), but the effect was not significantly different from the baseline 10 min later. Similarly, the speech rate significantly increased immediately after stimulation (median 133.15%) but it was not significantly different 10 min later. Furthermore, we found significant correlations of experimental boost of β-power in left temporoparietal projection (Spearman ρ=-0.54) and left front temporal area (Spearman ρ=-0.58) with disfluency rate of speech.

Conclusion: We show for the first time that auditory binaural beat stimulation can substantially improve speech fluency in AWS, and its effect is proportional to boost in EEG β-band power in left front temporal and temporoparietal junction of cortex. Changes in β power were detected immediately after exposure and persisted for 10 min. Additionally, these effects were accompanied by a reduction in stress levels, as monitored by ECG markers. This suggests that auditory binaural beat stimulation temporarily improves speech quality in AWS by increasing the EEG β-band power.

Keywords

Stuttering; Binaural beats; Auditory stimulation

Introduction

Stuttering is a speech fluency disorder ICD-11, World Health Organization, 2019 that is usually diagnosed in infancy, childhood or adolescence. Stuttering involves impairment in speech fluency and is characterized by frequent repetitions on prolongations of sounds or syllables. Even though 80% of children recover, both with natural development and therapy, the remaining 20% continue to stutter in adulthood, which accounts for 1% of the adult population [1]. The causes of stuttering are still poorly understood but are usually attributed to speech motor control disruption due to abnormal electrical activity of the brain, anatomical abnormalities, genetic and molecular factors, stress or psychological trauma, or developmental delay. Neurophysiological studies have offered the promise of actionable discoveries regarding stuttering causes. Neuronal electrical activity of the brain is conventionally separated into several frequency bands: δ (0.5 Hz–4 Hz), θ (4 Hz–7 Hz), α (7 Hz–13 Hz), and β (13 Hz–30 Hz). Several studies have shown that activity in the Electroencephalographic (EEG) β spectrum range, which is commonly heightened in the awake state and focused activities is reduced in individuals with stuttering. EEG and Magneto Encephalographic (MEG) studies have shown that in Adults Who Stutter (AWS), compared to the control group, α power is higher while β power is lower during speech production. This may be due to the role of the β-band in coordinating speech planning and execution within the speechproduction brain network, as it is known that β-oscillations become more coherent between the bilateral primary motor, premotor, and auditory cortices before speech production. Additionally, EEG and MEG studies suggest that coordination between auditory clues and motor excitation, which is carried out through β oscillations is affected in AWS.

Mersov, et al., studied neural oscillations in the speech motor network during preparation and execution of planned speech production in AWS using Magneto Encephalography (MEG). Compared to controls, AWS showed stronger β suppression in the speech preparation stage, followed by stronger β synchronization in the bilateral mouth motor cortex. AWS also recruited the right mouth motor cortex significantly earlier in the speech preparation stage compared to controls [2-7]. Exaggerated motor preparation is believed to be linked to reduced coordination in the speech motor network in AWS. researched EEG during the spontaneous pre-speech preparation process in AWS. But, they found that stuttering severity is linked to a stronger EEG β-power during speech intention. The more severely a person stuttered, the stronger the alpha power in bilateral frontal, and low β power in the mid anterior and right central EEG sensors. AWS with more severe stuttering seem to show stronger maintenance of the current cognitive or sensorimotor state, as stuttering severity was associated with increased beta power. Thus, findings are differing from described in Mersov, et al. and might be explained by differences in study designs and with regard to this study by another cognitive process during speech planning. Such differences in results suggest that β-power suppression occurs differently in AWS with different severity of stuttering, not only during prespeech formation, but also during the pronunciation process [8].

A complex pattern of sympathetic-parasympathetic balance emerges in literature on speech-related stress. On one hand, speech production and especially speech production in AWS is correlated with higher sympathetic activation. On the other hand, AWS tends to develop a compensatory mechanism by increasing parasympathetic activation and reducing heart rate, a phenomenon termed an autonomous coactivation of ANS [9]. The effect of sympathetic activation reduction by relaxation auditory stimulation of binaural beats with frequency carrier components in θ-range of brain activity could accompany the parasympathetic compensatory mechanisms. Thus we hypothesize that reduction of sympathetic activation and parasympathetic activation may work quite similarly to internal compensation stress reduction mechanisms in AWS and should help to alleviate the severity of stuttering symptoms [10].

Several methods have been developed before to entrain the brain oscillations. Synchronization of brain activity at the frequency of sensory stimulation and its harmonics in and beyond the corresponding sensory brain areas has been achieved with visual flicker, somatosensory tactile stimulation and auditory stimulation [11]. Popular approaches for auditory brain entrainment include isochronous sounds, monaural beats and binaural beats. Stuttering reduction using noise and auditory voice feedback treatments has been explored for several decades. The feasibility of entraining the β-band by isochronous sound exposure has been demonstrated in healthy adults and in AWS [12].

To define the regions of interest within the cortex we noted that earlier studies found that brain activity EEG patterns normally seen during speech production in non-stutterers were either absent, bilateral in nature, or lateralized to the left hemisphere. while in AWS compensation effects were found in the right hemisphere. Increased gray matter volume in right hemisphere motor auditory homologues coupled with increased white matter density in the fiber tracts connecting them have been interpreted as evidence of right hemisphere compensation [13]. β-band activation in the premotor cortex plays the important role of fluent speech production.

Given earlier results suggesting a possible connection between stuttering and the deficit in spectral power of the β EEG band and the lack of activation in speech production centers of the brain, we hypothesized that an increase in β-power with binaural beat stimulation might temporarily alleviate disfluency episodes and improve speech quality in AWS [14].

To test our hypothesis, we exposed people with Stuttering (AWS) and fluent individuals (controls) to proprietary binaural beat soundtracks. Simultaneously, we recorded the EEG, ECG, and speech production. To reveal the influence of auditory stimulation on electrophysiological brain activity, we analyzed the whole EEG spectrum and performed separate analysis for the electrodes group located over the Left Frontotemporal (LFT) region and Temporoparietal junction (LTP) of the cortex. As speech production may be linked to changes in activation of sympathetic ANS, we monitor sympathetic activation in response to auditory stimulation using Heart Rate Variability (HRV).

Materials and Methods

Experimental design

Adult volunteers with stuttering (AWS, 6 males; aged 18 years-38 years; mean age: 28.3 ± 7.1 years) and fluently speaking volunteers (controls, 6 males; 26 years-52 years old; mean age: 32.0 ± 10.1 years) signed informed consent and participated in the study at the laboratory of faculty of physics, electronics and computer systems of Dnipro national university. The study was approved by the ethical commission of Dnipro national university, protocol #15 from 2021-05-17. All the participants were right-handed and had no history or complaints of hearing difficulties. The stuttering participants received no medications or stuttering therapy during the experiments. None of the control participants reported a history of speech, language, or hearing difficulties. The experimental protocol consisted of six stages, 5 min each (Figure 1A).

• Baseline exposure under relaxation and with eyes closed,
• Reading activity pre-stimulation,
• Auditory stimulation,
• Reading task activity immediately after the stimulation,
• Reading task activity after 10 min of relaxation, and
• Resting state measurement with eyes closed. During each stage, the EEG and ECG signals were continuously recorded.

The reading aloud was assayed in the participants' native language (Russian) for five minutes. Voice was recorded using a built-in microphone in a MacBook Pro 13" (2015) laptop.

Auditory stimulation

An auditory euphonic Binaural Beat (BB) stimulus with three spectral components (δ, α, and β) was prepared using a custom generator and post processing algorithms United Kingdom patent application No. 2116927.1. Ambient royalty-free music with vocals was used as a carrier to specifically target the brain cortical centers responsible for voice recognition (audio recording can be found in supplementary materials in the folder “stimulus”). To produce a euphonic binaural stimulus, we applied a low-pass filter with a cutoff of 170 Hz, which was derived empirically for the chosen music background (to avoid reverberations of sound). The resulting track was normalized in the final stage, and raw stereo audio data were generated. The stimulus used in this study contained binaural components at 3 Hz, 7 Hz and 21 Hz produced by frequency shift between left and right channel with a low-pass cut-off of the binaural effect at 170 Hz (supplementary Figures 1A and 1B). The spectral characteristics of the resulting auditory output stimuli are shown in supplementary Figure 1C. The output Sound Pressure Level (SPL) of the stimulus was normalized to the 73 dB level to match the optimal level of auditory binaural stimulation and volume was kept at constant level throughout the stimulation stage. The auditory stimulus was delivered using high-quality wireless sennheiser momentum true wireless 2 headphones with an output frequency response band in the range of 5 Hz to 33,000 Hz. All participants confirmed that the used SPL would not distract them and described the sound level to be comfortable (Figure 1). The headphones were worn by participants during the whole experiment, but the stimulus was played only during the stimulation phase [15].

Clinical-Bioethics-Location

Figure 1: The design of the study. A) Scheme representing the design of the experiments; B) Location of EEG electrodes (32 in total) used during all experiments with highlighted groups of electrodes over the left temporoparietal junction (green on upper topography) and left frontolateral area (blue on upper topography) and groups of electrodes used for the wholehemisphere analysis (bottom topography, blue for the left and green for the right hemisphere). (EEG) Electroencephalographic.

EEG registration and signal processing

A medical-grade digital brain electric activity monitor (contec KT88-3200, S/N 591012399249) was used to collect raw EEG data using conventional wet cap electrodes. Raw data were collected using the proprietary companion software tool contec EEG32. The raw output data were saved in the European EDF+ format. We used 10/20 EEG electrode placements according to the international standard system Acharya, et al. with reference electrodes A1 and A2 (the electrodes configuration is shown in Figure 1B).

The sampling rate for each channel was 200 Hz and the analogue front-end of the EEG device has 0.1 μV resolution and a 0.016 Hz-1000 Hz input signal frequency range. The input data were digitally filtered with a f=0.1-30 Hz band pass filter in order to extract useful EEG signals and exclude Electromyography (EMG) artifacts. Brain electrical activity was post-processed using Fast Fourier Transform (FFT) followed by Power Spectrum Density (PSD) [16]. The noise from Electromagnetic Interference (EMI) was reduced by applying a notch filter with a cut-off frequency of 50 Hz. Episodes of signals with bad contact of electrodes were excluded from the record by the automatic built-in lead-off detection function of the EEG monitor. Additionally, EOG artifacts were extracted using Independent Component Analysis (ICA) infomax decomposition algorithm. The absolute power was numerically integrated over each frequency of the following bands: δ (0.5 Hz– 4 Hz), θ (4 Hz– 7 Hz), α (7 Hz– 13 Hz), and β (13 Hz– 30 Hz) over each electrode's position (32-electrode EEG measurement). Consequently, an average of 32 channels was used for the statistical analysis of the spectral power. The spatial distribution of the PSD calculated using a standard formula was visualized as a heatmap for each EEG spectral band. To obtain the power of electrical brain activity in areas associated with speech production processes, we calculated the mean average power at the left temporoparietal junction (anatomically associated with Wernicke's area) within electrodes C3, CP3, P3, P7, TP7, T7, and the left frontotemporal area within electrodes F3, FC3, F7, and FT7 (anatomically associated with Broca's area) (Figure 2). Topographies, compressed spectrum graphs, and trend graphs were produced using open-source EEGLAB v.2021.1 software (Figure 3).

Clinical-Bioethics-frequencies

Figure 2: ECG-based markers of stress during reading tests. A) HRV markers (meanRR, heart rate, stress index, RMSSD, LF, HF) are shown for AWS and control group in baseline rest state, reading before the stimulation, immediately after (0 min), and 10 min after the binaural beats stimulation. RR intervals deviationwith time demonstrate HRV characteristics before stimulation; B) Right after stimulation; C) And 10 min after stimulation; D) Note an increase in amplitude and higher frequencies alongside with reduction in regularity of oscillations exemplifying physiologic relaxation. (ECG) Electrocardiography,(HRV) Heart Rate Variability, (AWS) Adults with stuttering,(RMSSD) Root Mean Square of Successive Differences, (LF/HF) Low-to-high Frequency ratio.

Clinical-Bioethics-Temporoparietal

Figure 3: EEG β-power change at the different stages of experiment. A) EEG β -waves in reading experiments in AWS (top) and control group (bottom) averaged across all 32 electrodes; B) Power in β- band for left and right hemispheres (left and right, respectively) during the reading activities in AWS and control; C) Power in β - band averaged across electrode groups located over the LTP junction (left) and LFT area during the reading activities in AWS and controls. (EEG) Electroencephalography, (AWS) Adults With Stuttering, (LTP) Left Temporoparietal junction, (LFT) Left Frontotemporal area.

ECG registration and signal processing

Electrocardiographic (ECG) recordings were conducted using a contec 8000GW device (HW S/N: 39124881923) with four wet electrodes placed on the limbs to obtain a conventional six-lead ECG system (I, II, III, avL, avR, and avF). The input analog ECG signal was amplified with 1000x gain by a low-noise analog front-end. To remove artifacts, ECG-signal segments with poor contact of electrodes were detected and removed using built-in AC-current lead-off detection [17]. The input signal was low-pass filtered with a Butterworth IIR filter of order n=8 and cut-off frequency f=30 Hz to prevent aliasing and exclude Electromyographic (EMG) artifacts. Baseline drift was removed with a high-pass filter Butterworth IIR filter of order n=8 and cutt-off frequency f=0.67 Hz. The sampling frequency was 1000 Hz, while pass-band attenuation was taken at 1 dB. The influence of EMI was reduced by applying a notch filter with a cut-off frequency of 50 Hz. Raw ECG data were exported to HL7 aECG standard XML format (Figure 4).

Clinical-Bioethics-Topography

Figure 4: Topography and frequency specificity of binaural beats stimulation-induced changes. A) Topographical distribution of β- power before, after and 10 minutes after the stimulation; B) Coherence between LTP junction and LFT area during speech in AWS, before, immediately after and 10 min after stimulation; C) Coherence between LTP junction and LFT area during speech in controls, before, immediately after and 10 min after stimulation. (AWS) Adults With Stuttering, (LTP) Left Temporoparietal junction, (LFT) Left Frontotemporal area.

The state of the Autonomic Nervous System (ANS) and stress level were assessed by Heart Rate Variability (HRV) using a set of standard metrics calculated from ECG RR-intervals (heart beatto- beat intervals) forth by the European society of cardiology and North American society HRV standards (“heart rate variability,” 1996): MeanRR, mean heart rate, standard deviation of NN intervals (SDNN), Root Mean Square of Successive Differences (RMSSD), Low-To-High Frequency ratio (LF/HF), Low Frequencies power (LF), High Frequencies power (HF), and Baevsky stress index. The Baevsky stress index (SIdx), which is a measure of HRV reflecting cardiovascular system stress. High values of SI indicate reduced variability and high sympathetic cardiac activation. SIdx was computed according to the formula (Baevsky RM; Berseneva AP, n.d.);

Equation

Where;

AMo is the mode amplitude presented in percentage points,

Mo is the mode (the most frequent RR interval),

MxDMn is the variation scope reflecting the degree of RR interval variability.

Mode Mo was simply calculated as the median of the RR intervals. The AMo was obtained as the height of the normalized RR interval histogram (bin width 50 ms) and MxDMn as the difference between the longest and shortest RR interval values. To make SIdX sensitive to slow changes in mean heart rate (which would increase the MxDMn and lower AMo), the very low frequency trend is removed from the RR interval time series using the smoothness priors method (Figure 5).

Clinical-Bioethics-disfluency

Figure 5: Speech disfluency and speech rate during experiments. A) Speech disfluency rate before, after and 10 min after binaural beats stimulation; B) Speech rate before, after and 10 min after binaural beats stimulation.

Speech quality assessment

Speech recordings were evaluated by speech therapists for a range of disfluencies, such as repetitions of syllables, whole words, sound prolongations, and blocks. Each audio recording was scored blindly by a speech therapist (a participant ID was provided but neither experimental stage nor category). Disfluencies were measured in terms of the number of repetitions (separately for sounds, syllables, and words), prolongations, and blocks. The total number of disfluencies per experiment was normalized to the total number of words read [18]. As the texts differed from one stage of the experiment to the other (each time a new unfamiliar text was proposed for reading to exclude memorization effect), it was necessary to calculate the relative number of disfluencies for each measurement. The same set of texts was provided to all participants at respective stages of the experiment. Speech rate was estimated as words read per minute. At the end of the experiment, participants rated the pleasantness of auditory stimulation and its effect on speech abilities as well as their overall stress before and after stimulation with available options: Easier to speak after stimulation; no change; harder to speak after stimulation.

Statistical analysis

To examine the effect of treatment for each cohort (AWS, controls) on physiological readouts while accounting for repeated experiments within participants, we applied Linear Mixed effects Models (LMM). Every set with physiological raw data was processed and scored blindly by automated script written in python programming language (Figure 6).

Clinical-Bioethics-Correlation

Figure 6: Correlation between speech disfluencies and β-power over the speech-relevant centers after beats stimulation (0 min and 10 minutes after). A) Correlation between disfluency rate per AWS participants and β-band power average electrodes over the LFT area. (AWS) Adults With Stuttering, (LTP) Left Temporoparietal junction, (LFT) Left Frontotemporal area.

All statistical analyses were done using the R programming language. We researched the relationships between changes in spectrum power of brain cortex electrical activity after stimulation and disfluency rate of speech. To assess relationships across these variables, non-parametric tests (i.e., spearman’s rank) were used where variables were not normally distributed [19]. The effects due to participants' individual physiology were treated as a varying intercept random effect, effects due to the experiment were treated as random effects nested within the participants' effects, and effects due to the cohort or treatment were treated as fixed effects. Analysis was performed using lme4 and lmer test libraries in R software, and the significance of the fixed effects was reported. Speech characteristics were compared using the wilcoxon t-test for a single (first) experiment per participant [20].

Results

Auditory stimulation results in reduction of stress level as monitored by ECG

To assess the reaction of AWS and controls on reading tasks we compare HRV parameters in the resting state and during the initial reading. The reaction of AWS and controls was different. In controls unlike AWS, heart rate and LF power during the reading increased relative to the baseline resting state (by 1.04 ± 0.06 p=0.03 in wilcoxon test and 2.51 ± 1.69 p=0.03 fold, respectively, in wilcoxon test). Next, we evaluated the effect of auditory stimulation on general stress level, heart rate, and HRV parameters to elucidate ANS state. A significant decrease was observed in the heart rate immediately after stimulation compared to the baseline (p=0.024 in wilcoxon test). After 10 min, the heart rate stayed significantly lower only in the control group (p=0.01 in wilcoxon test). Moreover, the Baevsky's stress index was down in both AWS and control group relative to the baseline both immediately after the exposure (by 1.16 ± 0.12 p=0.01 in wilcoxon test and 1.26 ± 0.19 fold, respectively, p=0.03 in wilcoxon test) and 10 min later (p=0.01 and p=0.03 in wilcoxon test, Figure 2A). No significant difference was found in the HRV RMSSD after stimulation. An example of heart rate modulation during the experiment is shown by an ECG series for one AWS participant before, immediately after, and 10 min after stimulation (Figures 2B-2D).

AWS display distinct spatial signatures of brain activation during speech production

To evaluate the baseline electrophysiological differences between AWS and controls, we measured EEG characteristics for both groups. No significant difference in β power averaged across all electrodes was found between the control and AWS cohorts in the baseline rest state (p=0.43 in Wilcoxon test, supplementary Figures 2A and 2B). The total power in the β-band in the control group was significantly higher in the left hemisphere than in the right hemisphere (p=0.03 in wilcoxon test) during the baseline state measurements. However, electrical activity in the AWS during the resting state in the left and right hemispheres was the same (p=0.38 in wilcoxon test).

Next, during the reading task in the baseline state, the average power of the β-band did not differ significantly between the AWS and control groups (p=1.0 in wilcoxon test). Left–right hemispheric asymmetry in β-band power was observed in the control group (p=0.03 in wilcoxon test) but not in the AWS group (p=0.74 in wilcoxon test). Detailed results of the hemispheres electrical activity distribution during the experiment are shown in (supplementary Figure 2A and 2B, supplementary Table 1).

The β power in LTP junction and LFT cortex area during the reading task did not significantly increase neither in AWS (p=0.64 in wilcoxon test, p=0.54 in wilcoxon test) nor in the control group (p=0.06 in wilcoxon test in controls, p=0.15 in wilcoxon test in controls) relative to the baseline rest state. However, the relative average power of the β-band in the LTP junction and LFT area correlated across participants in the control group (spearman ρ=0.89, p=0.03, supplementary Figure 3) and AWS group (spearman ρ=0.9, p=0.05, supplementary Figure 4).

Auditory stimulation enhances β spectral power and leads to spatial redistribution of brain activity

To assess the effectiveness of the auditory binaural beat stimulation in enhancing modulated frequencies of brain electrical activity, we recorded and analyzed EEG activity before, during (supplementary Figure 5), and after stimulation, at rest, and during reading activity, as shown in Figure 1, and analyzed the overall power of the β band, as well as each hemisphere, and within cortex projections related to the speech production.

The overall absolute β power in the control group was insignificantly increased by 1.4-fold after stimulation (p=0.16 in wilcoxon test) and stayed at 1.2 fold 10 min after compared to the baseline reading (p=0.56, wilcoxon test). The overall β-power in the AWS insignificantly increased by 1.32-fold after stimulation (p=0.15 in wilcoxon test) and remained at 1.14 fold 10 min after stimulation (p=1.0 in wilcoxon test; Figure 3A) compared to the baseline reading.

Detailed analysis of spatial distribution showed that after stimulation, the power of the β band in AWS participants in the left hemisphere increased 1.63 fold (with p=0.01 in wilcoxon test), while changes in the right hemisphere activity were not significant (p=0.74 in wilcoxon test, Figure 3B). In the control group, no significant changes in β power were observed in either brain hemisphere after stimulation (p=0.44 in wilcoxon test for left; p=0.09 in wilcoxon test for right hemisphere, Figure 3B). Spatial changes were visualized as differences in the topography of grand-averaged normalized β brain activity during text reading after stimulation (Figure 4A).

Next, we analyzed the average β-band power within the LFT and LTP junctions. After the stimulation, the average power of the β-band within LTP projection in AWS participants increased by 1.74 fold after stimulation (p=0.03 in wilcoxon test) and stayed at 1.65 fold 10 min after stimulation (p=0.15 in wilcoxon test compared to the baseline). After stimulation, the average power of the β-band within the LFT area in AWS participants increased by 1.72 fold after stimulation (p=0.01 in wilcoxon test) and stayed at 1.61 fold 10 min after stimulation (p=0.84 in wilcoxon test compared to the baseline). Coherence analysis between the electrodes electrical activity associated with LFT area and LTP junction in AWS cohort shown that average coherence within β band before stimulation was equal to 0.628± 0.009 (I=64.44, Ibeta=7.54), after stimulation 0.66 ± 0.01 (I=60.73, Ibeta=7.91) and 10 min after 0.606 ± 0.01 (I=62.91, Ibeta=7.27). At the same time, in the control group the coherence within β band before stimulation was 0.21 ± 0.01 (I=21.29, Ibeta=2.61), after stimulation 0.23 ± 0.01 (I=32.23, Ibeta=2.75), and after the next 10 min 0.1 ± 0.01 (I=22.64, Ibeta=1.24) (Figure 4D). Additionally, we observed a cross-sectional correlation between the activity of the two areas in both AWS (supplementary Figure 6A) and control (supplementary Figure 6B) groups immediately after the stimulation (Spearman ρ=1.0, p=5e-5 at AWS and ρ=0.94, p=0.01, in controls) and 10 min later (Spearman ρ=0.86, p=0.01, in AWS and ρ=0.97, p=0.001, in controls, respectively). Cross-sectional correlations for each patient are shown in supplementary Figure 7.

Auditory stimulation leads to improvement of speech quality and conditional β-activation in speech centers in AWS

To investigate the effect of auditory stimulation on speech quality, we recorded speech samples before and after the stimulation (Figure 1). The speech rate (words per minute) and rate of disfluencies (the ratio of disfluencies to the total number of words uttered) were counted in the AWS group by a speech pathologist. The rate of disfluencies dropped after the stimulation (median 74.70% of the baseline rate with 90% CI: 37.11%–98.16%, p=0.04 in paired wilcoxon test, Figure 5A), but there was no significant difference from the baseline 10 min later (median 65.51% of the baseline rate, 90% CI: 39.15%–93.70%, p=0.15; Figure 5B). Similarly, speech rate increased immediately after stimulation (median 133.15% of the baseline rate, p=0.04), but was not significantly different 10 min later (median 126.63% of the baseline rate, p=0.08). Immediately after stimulation, six out of the six participants noted the subjective ease of reading the text aloud (short questionnaire described in methods). Speech recordings showed a smoother, clearer, and faster text pronunciation for AWS after stimulation (see the link on records in supplementary materials).

Improvement of speech quality is correlated to the specific brain activity patterns

As speech improvement is expected to be mediated by activation of the LFT area and LTP junction, we analyzed the correlation between changes in the rate of disfluencies and activation in the respective areas. To elucidate the contribution of auditory stimulation-induced changes in electrical activity to changes in speech fluency, we compared changes in β band power in the LFT and LTP junction before and after stimulation to the disfluency rate. Improvement in fluency after stimulation and 10 min after compared to the reference reading stage correlated with an increase in the power of the β band in both LTP junctions (spearman ρ=-0.54, p=0.03, Figure 6A) and LFT area (spearman ρ=-0.58, p=0.02, Figure 6B).

When comparing speech fluency between the AWS participants who responded with an immediate increase in β-power within the speech centers upon auditory stimulation (n=4, average increase by 2.3 fold) to the rest of the AWS participants (n=2), the former displayed an improvement in speech fluency both immediately (30.03%, p=0.03 in the paired wilcoxon test) and after 10 min (25.38%, p=0.03 the paired wilcoxon test). Additionally, in the responder sub-group, the increase in β power persisted after 10 min (1.77 fold relative to the baseline reading stage, p=0.03 in the paired student's t-test, p=0.05 in the paired wilcoxon test). This suggests that improvement in speech quality may be mediated by activation of the β-frequency band in speech centers, and the change in spectral power of the β-band may serve as a surrogate variable predictive of functional response in speech quality.

Discussion

In this study, we assessed the responses of adults with and without stuttering to auditory stimulation with binaural beats using a broad range of physiological metrics.

A hypothesis that drove our efforts stated that participants with stuttering may exhibit lower β-power than controls. We were unable to confirm this hypothesis in our cohort, potentially due to the low number of participants. However, we observed hemispheric asymmetry in β activity in controls but not in AWS, which is in agreement with observations in children who stutter. Next, we were able to induce an increase in β-power during reading activity in AWS but not in controls by means of auditory binaural beat stimulation. This did not create a significant left-right asymmetry in participants with stuttering. However, the increase in β-power in the AWS was significant only in the left hemisphere. This is a desirable effect, as the speech centers localize to the LTP junction and LFT cortex area. Furthermore, we observed that upon auditory stimulation, the β-power increased in left projections in AWS but not in controls.

Next, we demonstrated that auditory stimulation improved speech quality. The effectiveness of β-range BB stimulation in improving speech quality in AWS agrees with earlier studies, suggesting a link between β deficit and stuttering. Moreover, our results suggest that there is a dose-effect relationship between speech improvement and the increase in EEG β-power. As a further step, we need to explore the effect of stimulation on a larger group of AWS with a longer follow-up period.

Stuttering episodes are often accompanied by stress and anxiety which is another common target of stuttering therapy. In this study, we probed stress levels by monitoring participants’ heart rate and Heart Rate Variability (HRV) by ECG RR intervals. We show that the response to speech related stress in AWS and controls is different. During the reading task controls show significant increase in heart rate and sympathetic activation (increasing of LF spectrum power), while AWS do not. This substantiates the existence of compensatory mechanisms in AWS. Additionally, our stimulation reduced HR and increased HRV in AWS. We observed a significant drop in the stress index after exposure to the stimuli in participants with stuttering and controls, suggesting a reduction in stress levels. This is in agreement with earlier work showing that brain waves can be entrained with BB which may cascade into other physiological functions, such as HRV.

As a further step, we plan to reproduce results with sham audio stimuli without BB and study the effect of both amplitude and exact frequency of binaural shift in β-range of brain waves (currently set to 21 Hz) on fluency improvements. Other tone frequencies should be further investigated because the optimal resonant frequency can be individualized for different patients. The hypothesis of calibration of the modulated frequency may depend on individual brain electrical activity and may be reflected in speech phonetic patterns. Cross-over design and placebo trials of the experiment should be conducted to clarify the effects of the stimulus that do not depend on groups of participants. The long-term effects of exposure, including possible habituation or long-term exposure, as well as the impact on participants' general well-being, should be further investigated. The duration and effect of carrier music parameters, as well as variations in the binaural shift frequency on the effect duration, may be investigated in more detail in future research.

No adverse effects were observed in this study. The participants did not experience negative or irritating effects or fatigue after 5-minutes of BB stimulation. Thus, longer stimulation may also be considered because of the non-invasiveness and absence of side effects of the proposed approach. In addition, such an auditory entertainment approach can be used in a multi-day course with several sessions.

An in-depth understanding of effects of the auditory BB stimulation and its capability to temporarily reduce stuttering symptoms may be useful for end-user applications. The findings of this work may be used for design of digital therapeutic techniques for the treatment of speech disfluencies.

Acknowledgments

We are grateful to Dr. Elena Kruglyakova for productive scientific discussions and consultations on the neurophysiological aspects of speech production and EEG biosignal processing.

Data Availability

Data may not be shared due to participant confidentiality concerns.

Conflict of Interest

The authors declare the existence of following commercial interests.

• The authors of this paper represent one organization (SynthezAI Corp., San Francisco, California, USA) that funds the following research.
• Obtaining commercial benefits from the used results of previously provided clinical trials NCT03532152: "Study of the effect of the VR technology on recovery of the autonomic nervous system in volunteers affected by stress". Pure Purr LLC.
• Nebeskey Inc. obtained commercial benefits related to the development of similar products based on different AI applications.
• Obtaining commercial benefits related to the development of the product (ZOV.ai) as an outcome of the current study.
• The method of generation of auditory stimuli protected by United Kingdom Patent Application No. 2116927.1.

References

Citation: Chernetchenko D, Prasolov P, Aganov S, Voropai A, Polishchuk Y, Lituiev D, et al. (2023) Binaural Beat Stimulation Improves Stuttering Symptoms. J Clin Res Bioeth. 14:456.

Copyright: © 2023 Chernetchenko D, et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.