Abstract

Introduction. Binaural beats (BBs) are phantom sound illusions perceived when two sounds of slightly different frequencies are presented separately to the two ears. It is suggested that some BB frequencies might entrain the brain and enhance certain cognitive functions such as working memory or attention. Nevertheless, studies in this regard are scarce and controversial, cover only a small portion of this vast field of research (e.g., testing only a few BB frequencies), and adopt limited methodologies (e.g., no assessment of the loudness of the BB sound, only between-subject analyses, and testing of only one perceptual modality). Hence, we aimed to assess the potential effects of alpha, beta, and gamma BBs on cognitive-behavioral parameters of working memory and attention examined simultaneously in two different modalities (visuospatial and auditory-verbal).

Methods. This within-subject five-arm randomized placebo-controlled clinical trial included 155 experiments in 31 healthy right-handed subjects (17 women and 14 men, aged 19 to 42 years). Each subject listened to 8-minute sessions of 10 Hz, 16 Hz, and 40 Hz binaural beats versus a 240 Hz pure tone and silence (in random order). In each 8-minute block, they played a dual 2-back task with feedback enabled. Their cognitive-behavioral parameters (working memory capacities, signal detection measures (hit rate, false alarm rate, sensitivity, and response bias), and reaction speed measures (response time and intrasubject response time variability)) were calculated. The effects of the sound interventions and short-term training on these working memory and attention measures were assessed statistically using mixed-model linear regressions, repeated-measures ANOVAs and ANCOVAs, Bonferroni post hoc tests, and one-sample t-tests.

Results. The following are some major statistically significant findings. In the visuospatial modality, the 10 Hz BB reduced the response time and intrasubject response time variability and reduced the extent of decline over time in the case of visuospatial working memory, sensitivity, and hit rate. In the auditory-verbal modality, the 10 Hz intervention reduced the hit rate, false alarm rate, and sensitivity. The 10 Hz intervention also caused the lowest intermodality discrepancies in hit rates and false alarm rates, the highest response time discrepancies, and negative discrepancies in working memories and sensitivities (indicating the superiority of the visuospatial modality). The response biases tended to be liberal-to-neutral in the verbal modality and rather conservative in the visuospatial modality. Reactions were faster in the visuospatial modality than in the auditory-verbal one, while the intrasubject variability of reaction times was smaller in the auditory-verbal modality. Short-term training can increase the hit rate, working memory, and sensitivity and can decrease the false alarm rate and response time. Aging and reduced sound intervention volume may slow down responses and increase the intrasubject variability of response time. Faster reactions might be correlated with greater hit rates, working memories, and sensitivities and also with lower false alarm rates.

Conclusions. The 8-minute alpha-band binaural beat entrainment may have a few slight enhancing effects within the visuospatial modality, but not in both modalities combined. Short-term training can improve working memory and some cognitive parameters of attention. Some BB interventions can affect the intermodality discrepancies. There may be differences between the two modalities in terms of the response speeds and intrasubject response time variabilities. Aging can slow down the response, while increasing the volume of audio interventions may accelerate it.

1. Introduction

Working memory is one of the most fundamental and recognized cognitive functions and is known as the foundation of thinking and learning [1–3]. It is the system that temporarily holds, organizes, and processes information online for effective comprehension, reasoning, decision-making, problem-solving, goal-directed behavior, language, solving arithmetical problems, understanding geometric analogies, etc. [3–9], and it is associated with many indices such as fluid intelligence, academic performance, and effective behavior [3]. Therefore, strengthening working memory (WM) can enhance the quality of numerous cognitive and behavioral outcomes.

Rhythm, music, and audio stimuli, in general, are used by humans to improve cognitive performances and enhance moods while studying or in social gatherings [10–13]. A new type of audio stimulus is binaural beats. Binaural beats are a form of auditory illusion; they form when the brain attempts to localize the source of a sound while two pure tones slightly different in frequency are relayed independently to each ear. In this case, a third phantom binaural beat with a frequency equal to the discrepancy between the two independent sounds is generated in the inferior colliculus [14], which projects it to the primary auditory cortex [15–18]. It is claimed that they may influence different cognitive functions and mood states, like memory, attention, vigilance, and creativity [16, 19], perhaps through alterations in the functioning of different brain networks as a result of synchronized hemispheric oscillations and brainwave entrainment [4, 16, 20–24]. Therefore, binaural beats are being introduced as a new potential cognitive booster that might also have various influences such as changing the mood [10], altering the states of consciousness [25], or entraining the whole brain [26, 27] (although recent studies failed to find mood-altering effects [28]). The noninvasive nature of these stimuli, their inexpensiveness, their ease of administration, and their potential ability to modulate cognition without previous training make binaural beats an intriguing candidate for use by both impaired and healthy individuals [10].

Therefore, binaural beats (BBs) might be used to enhance the working memory capacity. However, studies in this regard are few, quite controversial, and limited by methodologies and methodological differences (e.g., a study had enrolled only 4 subjects; all studies had merely between-subject analyses; only a few BB frequencies were ever researched, and the effects of many other binaural beat frequencies remain to be examined; each study has used a different measure of working memory; the duration of exposure to binaural beats differed among studies; and in most studies, the effect of binaural beats had been examined after the exposure to them and not during their exposure). Moreover, no study to date has assessed both visuospatial and auditory-verbal working memories simultaneously. Finally, none so far has assessed the effect of the intervention sound volume: previous studies which had reported the level of sound volume had either fixed the sound volume or adjusted it to the maximum loudness that could be comfortably heard by the subject. The constant sound volume in the former case could be too loud and discomforting for some subjects while too quiet for some other subjects, being suboptimum and interfering with results in both conditions, whereas, in the latter case, the effect of the customized sound volume itself should have been accounted for statistically; however, this had not happened.

Only six studies, with conflicting results, have examined the effects of a few binaural beat frequencies on heterogeneous measures of working memory. Beauchene et al. [4, 16] assessed the effects of 5-minute sessions of binaural beat stimulation separately on verbal and visuospatial working memories. They concluded in two separate studies that a 5-minute induction of binaural beats at 15 Hz only (but not at 5 Hz or 10 Hz) could increase both verbal working memory and delta accuracy in a visuospatial working memory task (delta being the difference between accuracies estimated in the last and first thirds of the 5-minute session), in comparison with classical music, pure tone, and silence [4, 16]. It should be noted that their raw accuracy measures did not show any significance, and their significant findings were on measures such as delta accuracy and ranked accuracy. Moreover, their statistically significant findings in both articles might have been erroneous, as they seemed to have used between-subject statistical analyses for a within-subject (repeated-measures) design, besides other errors. In contrast, Ortiz et al. [29] (who evaluated the effects of binaural beats before and during the task (for a total of 15 minutes a day, for 5 days) on verbal working memory) observed improved function under theta frequencies compared to beta binaural beats or white noise. Adding to the dispute, Kraus and Porubanova [30] assessed the influence of 12-minute sessions of binaural beats at 9.55 Hz merged with the sound of the sea compared to a 12-minute control sound of the sea alone while examining the working memory performance using the Automated Operation Span Task (AOSPAN). The binaural beat intervention improved the working memory of participants. In another study comparing the effects of the exposure to 30 minutes of beta versus theta frequencies of binaural beats on the working memory, Lane et al. [31] had their participants run vigilance tasks. Their findings indicated that theta beats might increase fatigue, confusion, and difficulty concentrating, while beta beats might decrease false alarms, confusion, and fatigue while improving target detection [31]. Wahbeh et al. [23] assessed the effects of exposure to 30 minutes of theta binaural beats while testing the verbal memory of 4 subjects using the Rey Auditory Verbal Learning Test (RAVLT). They observed that pink noise exposure in the control group yielded better results than binaural beats.

This randomized clinical trial was conducted because of the following reasons as well as shortcomings in the literature: (1) The improvement of working memory and other cognitive-behavioral functions using noninvasive and inexpensive methods such as the binaural beat stimulation would be of utmost clinical and scientific interest. (2) Moreover, studies on the potential effects of binaural beats on the working memory or other cognitive-behavioral functions are quite scarce and highly controversial, not to mention that most of them had assessed the effects of binaural beats after the exposure and not during it. (3) There is no study assessing the effects of binaural beats on cognitive-behavioral features simultaneously in both the visuospatial and auditory-verbal modalities; this could allow comparisons of both modalities with each other and develop more comprehensive conclusions or deductions. (4) There is merely one study on the response times; there is no study on the signal detection measures (signal sensitivity and response bias), intrasubject variability of response times, or the BB sound volume effects. And (5) there is no within-subject design with correct, repeated-measures analyses.

The main null hypotheses were the lack of any significant effects of any of the sound interventions as well as short-term training on the hit rate, false alarm rate, working memory, signal detection measures, response times, and intrasubject response time variability in either of the visuospatial or auditory-verbal modalities, in both of them together, and also while assessing the intermodality discrepancies. Additionally, the effects of sex, age, and sound volumes on the cognitive-behavioral parameters of working memory and attention were assessed. There will be qEEG assessments and analyses as well to be reported later.

2. Subjects and Methods

2.1. Trial Design

This repeated-measures (within-subject) five-arm randomized placebo-controlled clinical trial was performed on 155 experiments in 31 subjects (5 within-subject groups of 31 each). The sample size was determined as 31 subjects based on the few previous studies on the effects of BBs on working memory (e.g., 4, 20, 28, 29, and 34 subjects), noting that a minimum of 20 subjects is recommended for EEG studies [32], and also considering the central limit theorem.

The protocol and/or its ethics were assessed by the institutional review boards of two different institutes (Iran University of Medical Sciences and Institute for Cognitive Science Studies, Tehran, Iran) in accordance with the Helsinki declaration (registration number: IR.IUMS.REC.1399.1353). All subjects were briefed in detail about the methods, goals, and limitations of the study. They filled in and signed written consent forms. Participants could leave the study at will. No changes were made to the methods after the trial commencement.

2.2. Pilot Study

Five volunteers were tested with the task in order to evaluate and determine the parameters of the n-back task. A pilot EEG study was conducted on a subject as well (to be detailed in the next article). The pilot cases were not included in the sample.

2.3. Participants, Eligibility Criteria, and Setting

The subjects were enrolled in the study from different sources including a pool of volunteers registered at the National Brain Mapping Lab, members of online forums, and acquaintances of the first author. The method of sampling was sequential: the subjects were evaluated and enrolled until reaching the desired sample size. In the case of any dropouts, new participants would be acquired. All the experiments were performed in 2021 at the National Brain Mapping Laboratory, Tehran, Iran.

The inclusion criteria were being right-handed healthy subjects aged between 18 and 50 years old, with healthy hearing potential (assessed by Barbara Bates’ methods) and with healthy or corrected vision. All the subjects were examined by a physician. The exclusion criteria were the presence of any ongoing or previous clinical neurological or psychiatric diseases (and/or visiting specialists or taking any related medication) and the history of severe head trauma. Furthermore, the included subjects had to fluently know the English numbers 0 to 10.

2.4. Preintervention Treatment

On the previous day, a movie of the task was sent to the subject along with instructions on how to play the task. The participant was also instructed to have breakfast. On the experiment day, the subject was seated in a relaxed position in front of a flat computer screen placed about 60 cm away. The participant put the left index finger on the “a” key of the keyboard and the right index finger on the “l” key. The subject played the n-back task for a block of 8 minutes to become familiarized with the task. They were instructed to pay attention to both the visuospatial and auditory-verbal stimuli as much as comfortably possible.

2.5. Randomization and Blinding

In this repeated-measures within-subject randomized clinical trial, the subjects were not randomized into any groups, but instead, the order of experiments was randomized and all experiments were performed on each subject. An online digital randomizer was used to randomize the order of interventions in each subject. Random orders were concealed within sequentially numbered containers until interventions were assigned. All randomization steps were done by the first author. The researcher was responsible for playing the audio interventions; therefore, he could not be blinded to the sound interventions. Since the subjects heard the interventions, they were not blinded to the interventions either, whereas they did not know the difference between the rather similar sound interventions (except the silence intervention). Still, we did not consider the study as blinded because of these factors.
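The order randomization described above can be sketched as follows. This is only an illustration: the trial used an online digital randomizer, whereas the sketch swaps in Python's `random` module, and the function names and the seed are hypothetical.

```python
import random

# Condition labels follow the text; everything else is illustrative.
INTERVENTIONS = ["10 Hz BB", "16 Hz BB", "40 Hz BB", "240 Hz pure tone", "silence"]


def random_order(rng):
    """Return one random permutation of the five interventions."""
    order = list(INTERVENTIONS)
    rng.shuffle(order)
    return order


def allocate(n_subjects, seed=0):
    """Map each subject number (1-based) to that subject's intervention order."""
    rng = random.Random(seed)  # hypothetical fixed seed, for reproducibility only
    return {subject: random_order(rng) for subject in range(1, n_subjects + 1)}


# 31 subjects x 5 interventions = 155 experiments, all performed on every subject.
schedule = allocate(31)
```

Every subject receives all five interventions, so only the order differs between subjects.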

2.6. Summary of the Study Design

One session sufficed for each subject. Each session had 5 blocks of 8 minutes, with a 3-minute rest between consecutive blocks. In each of the 5 blocks, 3 events happened simultaneously: (1) the subject played a psychometric task (dual 2-back) for 8 minutes; (2) a sound (the intervention) was played in the earphone for 8 minutes; and (3) a 32-channel trigger-enabled EEG device recorded the subject’s brain activity.

2.7. Experimental and Control Sound Interventions

There were 5 “sound interventions” in this study: three binaural beats, a positive control (pure tone), and a negative control intervention (silence, placebo). The binaural beats and pure tone were produced using the Gnaural program (open-source software obtainable from sourceforge.net).

The silence intervention was defined as the absence of any binaural beats or pure tones [4, 16] and not as absolute silence. The binaural beats in use were 10 Hz (the alpha band), 16 Hz (the beta band), and 40 Hz (the gamma band). The base tone of all binaural beats was 240 Hz [4, 16], so that, for example, in the case of the 16 Hz binaural beat, one ear would receive the 240 Hz pure tone, and the other ear would hear 256 Hz pure tone for the 16 Hz BB to be generated in the brain. The left/right ear receiving the base frequency was determined randomly for each subject but remained unchanged for all the interventions of that person. The duration of each intervention was selected as 8 minutes, based on pilot studies. Sound interventions were played immediately before beginning the task and starting the EEG recording and were stopped right after finishing the task and ending the EEG recording. There was a 3-minute rest between every two 8-minute experiments. All the randomized 8-minute interventions of each subject would be carried out in a single session of about an hour (5 blocks of 8 minutes each plus 4 rests of 3 minutes each).
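The relation between the base tone and the beat frequency can be illustrated with a short sketch. The study generated its sounds with Gnaural; the code below is not that program, and the function names, the sample rate, and the choice of adding (rather than subtracting) the beat frequency to the base tone follow the 240 Hz / 256 Hz example in the text and are otherwise assumptions.

```python
import math

BASE_HZ = 240.0  # base tone stated in the text


def ear_frequencies(beat_hz, base_ear="left"):
    """Return (left_hz, right_hz); the other ear gets base + beat frequency."""
    other = BASE_HZ + beat_hz
    return (BASE_HZ, other) if base_ear == "left" else (other, BASE_HZ)


def stereo_samples(beat_hz, seconds, sample_rate=44100, base_ear="left"):
    """Generate (left, right) pure-tone sample pairs in the range [-1, 1]."""
    left_hz, right_hz = ear_frequencies(beat_hz, base_ear)
    n = int(seconds * sample_rate)
    return [
        (
            math.sin(2 * math.pi * left_hz * i / sample_rate),
            math.sin(2 * math.pi * right_hz * i / sample_rate),
        )
        for i in range(n)
    ]


# The three beats used in the study: 10 Hz (alpha), 16 Hz (beta), 40 Hz (gamma).
pairs = {beat: ear_frequencies(beat) for beat in (10, 16, 40)}
```

Passing `base_ear="right"` mirrors the channels, which corresponds to the per-subject random choice of the ear receiving the base frequency.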

The positive control was the stereo pure tone at 240 Hz, in order to rule out and account for the possible effects of the base frequency. The negative control (i.e., silence) was the lack of any sound interventions; during this placebo intervention, the earphones remained in the ears but no sound (except the task sounds) was played. This was done to rule out the potential placebo effects associated with the experiment setup. Also, this was used as the baseline “condition.”

The loudness of the sound intervention was determined for each subject as the maximum loudness that could be comfortably heard and tolerated by the participant. For each subject, this was adjusted in the beginning; thus, the sound volume was the same for all interventions performed on a given subject. The level of sound intensity was recorded for each participant to be later modeled in statistical analyses. It should be noted that the loudness of the task sound (the auditory-verbal modality of the n-back task) was constant and standardized for all subjects and would not be reduced or increased.

The laboratory emphasized keeping a minimum environmental sound noise, e.g., the lab and cell phones would be disconnected and silenced. Still, some mild forms of noise would be inevitable. If some loud noise was accidentally heard at the lab, the session would be discarded and repeated. This happened only once.

2.8. Outcomes: Working Memory and Attention Measures

A dual n-back task [33] written in the Matlab programming language (Mathworks, Natick, Massachusetts, USA) was tested and deemed appropriate for the study. The pilot study set its parameters as follows for each modality: 3.330 seconds between every two stimuli (i.e., the duration of each trial), 145 stimuli in each block (i.e., 145 trials per block), lure errors set at 0.2, a maximum of 40 hits per modality, and the numbers 0 to 10 spoken by a female voice for the verbal modality. New Matlab code was written by VR to enable the software to also record the response time for each of the modalities.

This task has two modalities that run simultaneously: the visuospatial and auditory-verbal modalities. In each modality, there are 145 short trials during an 8-minute block. Each trial in the visuospatial modality begins at the same time as its counterpart in the verbal modality.

The visuospatial modality: in each trial, the visuospatial modality shows a blue square placed randomly in one of the 9 positions of a matrix (against a black background) for 3.330 seconds. After 3.330 seconds, a new trial begins. The order of the blue square positions is pseudorandom; at the beginning of the next trial, the blue square “jumps” to a new random position (or sometimes remains in the previous position). This procedure repeats 145 times per block, which lasts about 8 minutes. The subject is instructed to press the “a” key with the index finger of the left hand if the position of the blue square is the same as its position two trials earlier.

The auditory-verbal modality: exactly at the same time when the visuospatial modality trial begins, the verbal trial begins as well. A female voice utters a number between 0 and 10. This number has been determined in a pseudorandom manner. After 3.330 seconds, a new trial begins and another number (that may or may not be the same as the previous number) is said. This continues for 145 trials. The subject is instructed to press the “l” key with the right index finger if the number heard in any given trial is the same as the number heard two trials ago.

The n-back task has mark trials, in which the stimulus is the same as the stimulus seen or heard 2 trials earlier; the correct response is to press the key in such mark trials. The appearances of mark trials (when the participant should press the key) in the two modalities were independent of each other, i.e., the sequence of the mark trials in the visuospatial modality had nothing to do with the sequence of the mark trials in the verbal modality. Therefore, the mark trials of both modalities could sometimes fall within the same trial; in such cases, the subject needed to press both the “a” and “l” keys almost simultaneously.
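As an illustration of how mark trials arise independently in the two modalities, the following minimal sketch finds them in two short, made-up stimulus sequences. The sequences and the function name are hypothetical and are not taken from the task software.

```python
def mark_trials(stimuli, n=2):
    """Return 0-based indices of trials whose stimulus equals the one n trials earlier."""
    return [i for i in range(n, len(stimuli)) if stimuli[i] == stimuli[i - n]]


# Hypothetical short sequences (the real blocks have 145 trials per modality).
positions = [4, 7, 4, 7, 1, 7, 7, 7]  # square positions in the 3 x 3 matrix (0 to 8)
numbers = [3, 3, 3, 9, 0, 9, 5, 2]    # spoken numbers (0 to 10)

visual_marks = mark_trials(positions)  # trials requiring the "a" key
verbal_marks = mark_trials(numbers)    # trials requiring the "l" key

# Trials in which both keys must be pressed almost simultaneously.
both = sorted(set(visual_marks) & set(verbal_marks))
```

In this example the visuospatial mark trials are 2, 3, 5, and 7, the verbal ones are 2 and 5, and trials 2 and 5 require both keys.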

The n-back software collected the trials at which there needed to be responses for each modality (i.e., the mark trials), as well as the trials at which the subject did respond within each modality (i.e., the responses). It also recorded the time between the presentation of the stimulus and pressing the key in each modality.

Based on these data, the hit rates (i.e., hits/(hits + misses)) and the false alarm rates (i.e., 1 − specificity, or false alarms/(false alarms + correct rejections)) were calculated for each modality. These parameters were calculated once for the whole 8-minute block and once again for the first and second 4-minute halves of each 8-minute block. These calculations were done outside the n-back software, using Excel (Microsoft, Redmond, Washington, USA).
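A minimal sketch of these two rates is given below, using the usual definitions (hit rate as hits over mark trials, false alarm rate as false alarms over non-mark trials). The authors performed the calculation in Excel; the function name, the example data, and the treatment of every non-mark trial as a noise trial are assumptions.

```python
def rates(mark_trials, responses, n_trials):
    """Return (hit_rate, false_alarm_rate) for one modality in one block.

    mark_trials: indices of trials that required a key press
    responses:   indices of trials in which the subject pressed the key
    n_trials:    total number of trials in the block (or half block)
    """
    marks = set(mark_trials)
    pressed = set(responses)
    hits = len(marks & pressed)
    misses = len(marks - pressed)
    false_alarms = len(pressed - marks)
    # Assumption: every trial that is not a mark trial counts as a noise trial.
    correct_rejections = n_trials - hits - misses - false_alarms
    hit_rate = hits / (hits + misses)
    fa_rate = false_alarms / (false_alarms + correct_rejections)
    return hit_rate, fa_rate


# Hypothetical block of 20 trials: 4 mark trials, 3 key presses (one of them wrong).
hit_rate, fa_rate = rates([2, 5, 9, 12], [2, 5, 7], 20)
```

The same function can be applied to the first and second halves of a block by passing only the trials of that half.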

Each response was accompanied by immediate feedback: two different strings (related to visuospatial and verbal responses) were written in white font at the bottom of the black screen (left and right sides, respectively). If the response was correct, the relevant string would turn green, and if the response was incorrect, the string would turn red. At the end of each block as well, the subject would be presented with a summary of hits, false alarms (FA), and lure errors.

Based on the hit rate and false alarm rate of the whole block and each of the half blocks, two different outcomes were calculated. First, the working memory capacity was calculated for each modality from these two rates [16, 34]. Second, the signal detection measures were calculated for each modality: a sensitivity index (showing the subject’s sensitivity to the stimulus) and a response bias index (showing whether the subject’s response bias was liberal, neutral, or conservative) [35]. The third outcome was the response time for each modality, calculated as the average response time for the whole block and for each of the first and second half blocks. Moreover, the standard deviation (SD) of the response time for the 8-minute block and each of its 4-minute half blocks was computed as the measure of intrasubject variability of the response time.
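The text does not spell out the signal detection formulas at this point, so the following sketch assumes the standard equal-variance Gaussian definitions: sensitivity as z(hit rate) minus z(false alarm rate), and response bias as the criterion, i.e., minus half the sum of the two z-scores. These formulas, the clamping constant `eps`, the use of the sample SD (n − 1 denominator), and all names are assumptions and may differ from what the authors actually used.

```python
from statistics import NormalDist, mean, stdev


def sensitivity_and_bias(hit_rate, fa_rate, eps=0.01):
    """Return (sensitivity, bias) under the assumed equal-variance Gaussian model.

    Rates of exactly 0 or 1 are clamped to [eps, 1 - eps] so that the
    inverse normal CDF stays finite (the value of eps is an assumption).
    """
    h = min(max(hit_rate, eps), 1 - eps)
    f = min(max(fa_rate, eps), 1 - eps)
    z = NormalDist().inv_cdf  # inverse of the standard normal CDF
    sensitivity = z(h) - z(f)
    bias = -0.5 * (z(h) + z(f))  # > 0 conservative, about 0 neutral, < 0 liberal
    return sensitivity, bias


def response_time_summary(response_times):
    """Return (mean, sample SD) of the response times of one block or half block."""
    return mean(response_times), stdev(response_times)
```

Under these assumed definitions, a positive bias value corresponds to a conservative response style and a negative one to a liberal style, which matches the way the bias results are interpreted in the text.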

As suggested by Beauchene et al. [4], the difference between outcomes in the second half of each block minus the first half of that block was calculated for 5 of the 7 psychometric parameters, as delta values (e.g., delta working memory capacity or delta response time).

Also, as suggested by another article by Beauchene et al. [16], data pertaining to each subject was ranked among different sessions (for example, working memory scores of each subject were converted to the ranks 1 to 5).

Finally, the discrepancies between the visuospatial and verbal modalities were calculated. For this purpose, each visuospatial parameter was subtracted from the same-name verbal parameter (i.e., verbal minus visuospatial). This way, a positive discrepancy would indicate a larger verbal parameter compared to its visuospatial counterpart, while a negative discrepancy would point to a greater visuospatial parameter compared to the same-name auditory-verbal parameter. No changes were made to trial outcomes after the trial commencement.
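The three derived quantities described above (delta values, within-subject ranks, and intermodality discrepancies) can be sketched as follows. The function names are hypothetical, and the rank direction (1 = lowest score) and the use of mean ranks for ties are assumptions that the text does not specify.

```python
def delta(first_half, second_half):
    """Second-half value minus first-half value of one block."""
    return second_half - first_half


def discrepancy(verbal, visuospatial):
    """Verbal minus visuospatial; negative means the visuospatial value is larger."""
    return verbal - visuospatial


def within_subject_ranks(scores):
    """Rank one subject's scores across the sessions (1 = lowest, ties share the mean rank)."""
    ordered = sorted(range(len(scores)), key=lambda i: scores[i])
    ranks = [0.0] * len(scores)
    pos = 0
    while pos < len(ordered):
        end = pos
        # Extend the run while the next score ties with the current one.
        while end + 1 < len(ordered) and scores[ordered[end + 1]] == scores[ordered[pos]]:
            end += 1
        mean_rank = (pos + end) / 2 + 1
        for k in range(pos, end + 1):
            ranks[ordered[k]] = mean_rank
        pos = end + 1
    return ranks


# Hypothetical working memory scores of one subject in the 5 sessions.
ranks = within_subject_ranks([0.6, 0.9, 0.7, 0.9, 0.5])
```

With five distinct scores the ranks are simply 1 to 5; in the example, the two tied scores of 0.9 share the rank 4.5.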

2.9. Statistical Analysis

The continuous data were considered normally distributed due to the central limit theorem. Descriptive statistics and 95% confidence intervals (CIs) were computed for all the cognitive/behavioral variables. An unpaired t-test was used to compare the mean ages of men and women.

The primary outcome: the effects of different sound interventions on cognitive and behavioral variables in each modality (visuospatial or auditory-verbal) were assessed using one- and two-way repeated-measures analyses of (co)variance (ANOVA/ANCOVA). The models were optimized manually. The Mauchly and Levene tests were used to assess the assumptions. In the case of the violation of the sphericity assumption, the Greenhouse-Geisser correction was used. Each significant ANOVA or ANCOVA was followed by a Bonferroni post hoc test.
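For readers unfamiliar with the Bonferroni adjustment, a minimal sketch is given below. It assumes the common form in which each raw pairwise P value is multiplied by the number of comparisons and capped at 1.0; the statistical software used in the study may implement the post hoc test differently, and the function name and example values are hypothetical.

```python
from math import comb


def bonferroni(p_values):
    """Multiply each raw P value by the number of comparisons, capping at 1.0."""
    m = len(p_values)
    return [min(1.0, p * m) for p in p_values]


# Five interventions give comb(5, 2) = 10 pairwise comparisons.
n_pairs = comb(5, 2)

# Hypothetical raw P values: with 10 comparisons, any raw P >= 0.1 is reported as 1.0.
adjusted = bonferroni([0.2] * n_pairs)
```

This capping is one reason why a significant omnibus test can be followed by pairwise comparisons whose adjusted P values are all reported as 1.0.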

To model and assess both the visuospatial and auditory-verbal modalities simultaneously (and to compare both modalities with each other and to assess the interaction of the intervention by modalities), a mixed-model linear regression was used. For post hoc pairwise comparisons following significant regression analyses, the Bonferroni test was used.

For comparing the two halves of each of the five blocks, the delta values (calculated as the outcome in the second half minus the outcome in the first half) were compared with the constant value 0, using a one-sample t-test.
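The test statistic of such a one-sample comparison against zero can be sketched as below. The function name and the example values are hypothetical; the P value itself requires the t distribution with n − 1 degrees of freedom, which is not part of the Python standard library, so it is left out.

```python
from math import sqrt
from statistics import mean, stdev


def one_sample_t(values, mu0=0.0):
    """Return (t statistic, degrees of freedom) for H0: mean(values) == mu0."""
    n = len(values)
    standard_error = stdev(values) / sqrt(n)  # sample SD with n - 1 denominator
    return (mean(values) - mu0) / standard_error, n - 1


# Hypothetical delta values of three subjects in one intervention.
t_stat, df = one_sample_t([1.0, 2.0, 3.0])
```

A negative t statistic would indicate that the outcome tended to decline from the first to the second half of the block.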

The ranked data were compared across the 5 interventions using a Friedman test.

The effects of the interventions on discrepancies between the modalities (calculated as verbal minus visuospatial) were assessed using the ANOVAs, ANCOVAs, and Bonferroni post hoc tests. The models were optimized manually.

The secondary outcomes: the effects of short-term training (time) were assessed, in a similar fashion to the above analyses: the delta values in different time-blocks were compared with zero. The effects of time on cognitive and behavioral parameters were assessed using manually optimized one- and two-way ANOVAs and ANCOVAs as well as mixed-model linear regressions, all followed by the Bonferroni post hoc tests. The ranked data were not assessed over time.

A Pearson correlation coefficient was used to assess the correlations of the response times with the hit rates, false alarm rates, sensitivity indices, and working memories. The level of significance was set at 0.05.
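The Pearson coefficient itself can be computed as in the following minimal sketch; the function name and the example data are hypothetical.

```python
from math import sqrt


def pearson_r(x, y):
    """Pearson correlation coefficient between two equally long sequences."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)


# Hypothetical data: shorter response times paired with higher hit rates.
response_times = [0.9, 1.1, 1.3, 1.6]
hit_rates = [0.95, 0.90, 0.80, 0.70]
r = pearson_r(response_times, hit_rates)
```

A negative coefficient between response time and hit rate, as in this made-up example, corresponds to faster reactions accompanying greater hit rates.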

3. Results

3.1. Participant Flow

After online invitations were sent to online forums with about 9000 members and the subject pool of the National Brain Mapping Lab was screened, 162 healthy individuals were called and invited to participate in the study. Of them, 58 initially agreed to participate, but 27 were excluded: 19 later declined to participate (mostly because of the fear of COVID-19 cross-infection), 4 were excluded due to neurologic and/or psychiatric conditions not reported originally, 3 were left-handed, and 1 was excluded by the physician due to having cold signs and the possibility of COVID-19 infection (Figure 1). Each of the 5 intervention groups included 31 subjects. Of the 31 participants, 14 were males and 17 were females. The ages of the participants in each intervention group ranged from 19 to 42 years (21 to 42 years in males and 19 to 38 years in females). Males and females were not significantly different in terms of mean age (t-test). There were no losses or dropouts after the randomization. The trial ended when reaching the desired sample size. No subject complained of any discomforts. No harms were identified with this study.

3.2. Effects of the Sound Interventions

Descriptive statistics and 95% CIs for the outcomes in different intervention groups are presented in Tables 1 and 2 and Figures 2–6. The results of the one-sample t-test comparing each mean delta value with zero are shown in Tables 1 and 2.

3.2.1. Effects of the Interventions in the Visuospatial Modality

(1)Hit Rate. The hit rate did not change significantly in different experimental groups (). The roles of gender, age, and volume were nonsignificant (). Similarly, the interactions were nonsignificant ().(2)False Alarm Rate. The false alarm rate in different experimental groups did not differ significantly (). Sex, age, and sound volume were not significant variables (). The interactions were nonsignificant as well ().(3)(Sensitivity in Signal Detection Theory). The index did not change significantly across different sound conditions (). Sex, age, and sound volume were insignificant (). Likewise, the interactions were nonsignificant ().(4)(Response Bias in Signal Detection Theory). The index was positive in all experimental groups without any differences across the groups (). The effects of gender, age, and sound intensity were nonsignificant (). Also, the interaction of age and sound intensity with the experiment was insignificant (); however, the gender interaction was significant ().(5)Visuospatial Working Memory. The working memory capacities were not much different among the sound conditions (). The role of sex, age, and sound intensity was nonsignificant (). Also, the interactions were nonsignificant ().(6)Reaction Time. The response speed was faster in the 10 Hz BB group and to a lesser extent in the silence group compared with the other sound conditions (i.e., their response times were shorter than that of the other groups) (). The role of age was significant, and age had a direct and positive effect on the response time (). Also, the interaction of age and intervention was significant (). The Bonferroni test showed no significant pairwise comparison ().(7)Standard Deviation of the Reaction Time. The SD of response time in the 10 Hz BB group was smaller than that in other groups (). The effect of gender was insignificant (). 
But both age () and sound volume () played a significant role; age was directly/positively related to the reaction time SD, but sound volume was inversely related to the reaction time SD. The interactions of gender () and volume () were nonsignificant, but the interaction of age by intervention was significant (). The Bonferroni test showed no significant pairwise comparison among the interventions (all values = 1.0).(8)Delta Hit Rate. The average visuospatial delta hit rate was negative in all groups, indicating that in the second half of each block, the hit rate decreased compared to the first half. The delta hit rate fluctuated slightly among the different experimental groups in a way the 10 Hz and 40 Hz BB groups had the least drop (). The effects of sex and sound intensity were nonsignificant (both values = 0.5). But the sound intensity interaction () was significant, and the sex interaction was marginally significant (). The Bonferroni test did not show any significant pairwise comparisons (all values ≥ 0.197).(9)Delta False Alarm Rate. All deltas of the average false alarm rates were negative (indicating that the false alarm rate in the second half of each block was lower than that in the first half). No significant differences were observed across the sound conditions (). The effects of gender, age, and sound loudness were nonsignificant (). The interactions were insignificant as well ().(10)Delta. The mean of most groups was negative, except for the 10 Hz BB group whose delta was positive; this indicated that unlike in the other intervention groups, the variable increased in the second half of the 10 Hz intervention block compared to its first half. The difference across the conditions was significant (). Sex and sound intensity were nonsignificant (). The sex interaction was nonsignificant (), but the sound intensity interaction was significant (). 
The Bonferroni test showed no significant pairwise comparisons (all values ≥ 0.123).(11)Delta Working Memory. The average of most groups was negative, except for the 10 Hz BB group whose delta WM was positive, indicating that working memory increased in the second half of the 10 Hz block compared to its first half. The ∆WM varied significantly among the 5 groups (). The effects of gender and sound intensity were nonsignificant (). The gender interaction was nonsignificant (), but the interaction of sound intensity was significant (). The Bonferroni test showed no significant pairwise comparisons (all values ≥ 0.141).(12)Delta Reaction Time. All mean delta response times were negative, indicating that in the second half of each block, compared to its first half, the response time reduced and the response speed increased. The 5 groups were not different in terms of delta reaction times (). The role of age () and its interaction were nonsignificant (). The impact of sound volume was significant: louder sounds made the delta response time more positive (); however, its interaction was nonsignificant (). The role of sex () and the sex interaction were not significant ().
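The delta measures reported above compare the two halves of each block. The following is a minimal sketch of that computation, assuming the sign convention implied by the interpretations above (a negative delta means the measure declined in the second half, i.e., second-half mean minus first-half mean). The function name, the split at the midpoint of the trial list, and the example data are hypothetical and are not taken from the study's software.

```python
from statistics import mean


def half_block_delta(trial_scores):
    """Return second-half mean minus first-half mean for one block.

    Assumption: the block is split at the midpoint of the trial list;
    with an odd number of trials the extra trial goes to the second half.
    """
    if len(trial_scores) < 2:
        raise ValueError("need at least two trials to form two halves")
    mid = len(trial_scores) // 2
    first_half = trial_scores[:mid]
    second_half = trial_scores[mid:]
    return mean(second_half) - mean(first_half)


# Hypothetical per-trial outcomes on target trials (1 = hit, 0 = miss).
hits = [1, 1, 1, 0, 1, 0, 0, 1]
delta_hit_rate = half_block_delta(hits)  # negative: hit rate fell in the second half
```

Under this convention, the positive deltas reported for the 10 Hz block correspond to a second-half mean that exceeds the first-half mean.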

3.2.2. Effects of the Interventions in the Auditory-Verbal Modality

(1)Hit Rate. The hit rate in the 10 Hz BB group was smaller than that of the other groups (). The impact of age and sound intensity was insignificant (). Similarly, the interactions were nonsignificant (). The Bonferroni test showed no significant pairwise comparison ().(2)False Alarm Rate. The false alarm rate was the highest in the positive control (pure tone) group; the overall difference across the 5 groups was significant (). The effects of sex and sound volume were not significant (). The sex interaction was nonsignificant (). However, sound intensity interaction was significant (). No significant pairwise comparison was detected (all values = 1.0).(3)(Sensitivity in Signal Detection Theory). The index varied very subtly among the 5 groups; still, this small difference was statistically significant (). This variable was about 1% lower in the 10 Hz BB group, compared to the other groups. The role of sex was marginally significant (). The sex interaction was nonsignificant (). The impact of sound volume was insignificant (), but its interaction was significant (). There was no significant pairwise comparison (all values = 1.0).(4)(Bias in Signal Detection Theory). All values were negative in all groups. The difference among the groups was marginally significant (). The effects of sex, age, and sound volume were nonsignificant (), and so were the interactions ().(5)Verbal Working Memory. The average verbal WM in the 10 Hz BB group was slightly lower than that of the other groups, and this difference across the 5 groups was marginally significant (). The role of sex was marginally significant (). The sex interaction was nonsignificant (). The role of sound intensity was nonsignificant (), but its interaction was significant ().(6)Reaction Time. There was no significant difference in the speed or time of response to auditory-verbal stimuli in different experimental groups (). 
The effects of sex () and age () were nonsignificant, but the role of sound volume was significant: the louder the sound, the shorter the reaction time (or the faster the response) (). The interactions were not significant ().(7)Standard Deviation of the Reaction Time. The response time SD did not differ significantly across the 5 conditions (). The role of sex () and age was nonsignificant (). But the sound volume played a significant role: the louder the sound, the less scattered the response time (). The interactions were nonsignificant ().(8)Delta Hit Rate. The mean delta hit rates in the 2 control groups were close to zero, but in the 3 BB groups, they were negative (indicating a relative decrease in the percentage of correct responses in the second half of each group). The difference among the groups was insignificant (). The effects of sex, age, and sound volume were nonsignificant (). Also, the interactions were nonsignificant ().(9)Delta False Alarm Rate. All delta false alarm rates were positive, indicating an increase in the percentage of incorrect answers in the second half of each block compared to its first half. The difference across the groups was nonsignificant (). The roles of sex, age, sound volume (), and the interactions were all insignificant ().(10)Delta. All mean delta values were negative, indicating that the values decreased in the second half of each block compared to its first half. The groups were not significantly different (). The effects of sex, age, sound intensity (), and the interactions were all nonsignificant ().(11)Delta Working Memory. All mean ∆WM values were negative, indicating that verbal working memory in the second half of each block had decreased compared to its first half. The 5 groups were not significantly different (). Sex, age, sound volume (), and the interactions () were nonsignificant.(12)Delta Reaction Time. 
All auditory-verbal mean delta reaction times were negative, indicating that the reaction time in the second half of each block was slightly shorter than that in the first half. Delta reaction times were not different under the 5 conditions (). Age, sex, sound loudness (), and the interactions () were insignificant.
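Several results above pair a significant overall effect with pairwise comparisons whose adjusted values all equal 1.0. The sketch below shows how this can arise, assuming the standard Bonferroni adjustment (each raw P value is multiplied by the number of comparisons and capped at 1.0). The raw P values are invented for illustration, and the study's statistical package may apply the correction differently.

```python
from itertools import combinations


def bonferroni_adjust(p_values):
    """Multiply each raw P value by the number of comparisons, capped at 1.0."""
    m = len(p_values)
    return [min(1.0, p * m) for p in p_values]


# Five sound conditions give 10 pairwise comparisons.
conditions = ["silence", "pure tone", "10 Hz", "16 Hz", "40 Hz"]
pairs = list(combinations(conditions, 2))

# Hypothetical raw (unadjusted) P values, one per pair, for illustration only.
raw_p = [0.04, 0.20, 0.35, 0.50, 0.12, 0.60, 0.75, 0.15, 0.90, 0.30]
adjusted = dict(zip(pairs, bonferroni_adjust(raw_p)))
```

With 10 comparisons, any raw P value of 0.1 or more is reported as 1.0 after adjustment, so a modest omnibus effect can leave every pairwise comparison nonsignificant.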

3.2.3. Effects of the Interventions in Both Modalities

(1)Hit Rate. There was no significant difference among the hit rates measured in the 10 groups (). There was a significant difference between the modalities (). The roles of gender (), age (), and sound volume () were nonsignificant. The interaction of modality by the intervention was nonsignificant ().(2)False Alarm Rate. Differences in the false alarm rates in different groups were not significant (). There was a significant difference between the modalities (). The effects of sex (), age (), and sound volume () were nonsignificant. The interaction of modality by the intervention was nonsignificant ().(3). The index was not significantly different across the 10 groups (). There was no significant difference between the modalities (). The impacts of sex (), age (), and sound volume () were nonsignificant. The interaction of modality by experiment was nonsignificant as well ().(4). The average indices were not significantly different across the 10 groups (). There was a significant difference between the modalities (). The roles of sex (), age (), and sound volume () were nonsignificant. Besides, the interaction of modality by experiment was insignificant ().(5)Working Memory. The working memory capacities were not significantly different among the 10 groups (). There was no significant difference between the modalities (). The effects of gender (), age (), and sound volume () were nonsignificant. The interaction of modality by experiment was not significant as well ().(6)Reaction Time. There was no significant difference in response speed or response time among different groups (). There was a significant difference between the modalities (). The role of gender () was nonsignificant, but the effects of age (, positive and direct relationship between age and reaction time, ) and sound volume (, an inverse relationship between the sound loudness and reaction time, ) were significant. 
The interaction of modality by experiment was insignificant ().(7)Standard Deviation of the Reaction Time. The SD of response time in different groups was not significantly different (). The modalities were significantly different (). The role of sex () and age () was insignificant, but the sound volume (, inverse association with reaction time SD, ) had a significant effect. In addition, the interaction of modality by the intervention was not significant ().(8)Delta Hit Rate. No significant effect was observed.(9)Delta False Alarm Rate. The only significant variable observed was the effect of modality () in a way that all means of visuospatial delta values were negative (i.e., fewer false responses in the second half compared to the first half) while all means of auditory delta values were positive (i.e., an increase in the rate of incorrect answers in the second half).(10)Delta. There was no significant variable.(11)Delta Working Memory. No significant variables were detected.(12)Delta Reaction Time. There were no significant parameters.

3.2.4. Effects of the Interventions on Ranked Data

The Friedman test showed no statistically significant difference among the five intervention groups in ranked hit rates, ranked false alarm rates, ranked indices, ranked working memory capacities, or ranked response times, in either the visuospatial or the verbal modality (all P values > 0.1).
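For readers unfamiliar with the ranked analysis, the following sketch computes the Friedman statistic for a subjects-by-conditions table. It assumes the textbook formula without a tie correction; the function names and the data are hypothetical, and the study's software may differ (for example, by correcting for ties).

```python
def average_ranks(values):
    """Rank values from 1..k within one subject, giving tied values their mean rank."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        shared = (i + j) / 2 + 1  # mean of the 1-based positions i+1 .. j+1
        for pos in range(i, j + 1):
            ranks[order[pos]] = shared
        i = j + 1
    return ranks


def friedman_statistic(table):
    """Friedman chi-square for n subjects (rows) by k conditions (columns).

    Uses Q = 12 / (n k (k + 1)) * sum(R_j^2) - 3 n (k + 1), with no tie correction.
    """
    n, k = len(table), len(table[0])
    rank_sums = [0.0] * k
    for row in table:
        for j, r in enumerate(average_ranks(row)):
            rank_sums[j] += r
    return 12.0 / (n * k * (k + 1)) * sum(s * s for s in rank_sums) - 3.0 * n * (k + 1)


# Hypothetical hit rates: 4 subjects (rows) under 5 sound conditions (columns).
hit_rates = [
    [0.71, 0.68, 0.74, 0.70, 0.69],
    [0.80, 0.82, 0.79, 0.81, 0.78],
    [0.65, 0.66, 0.70, 0.64, 0.67],
    [0.90, 0.88, 0.87, 0.91, 0.89],
]
q = friedman_statistic(hit_rates)
```

The statistic is compared against a chi-square distribution with k - 1 degrees of freedom; ranking within each subject is what makes the test suitable for a within-subject design.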

3.2.5. Effects of the Interventions on the Discrepancies between the Two Modalities

(1)Hit Rate. The discrepancy between the hit rates in the modalities was almost zero in the 10 Hz group, while in the other groups, the discrepancies were all positive; they were the largest in the silence and 40 Hz groups. The overall difference among the 5 groups was significant (). The interaction of intervention by sound was significant (), but its interaction by sex was not (). The effects of sound volume and sex were insignificant (). No significant pairwise comparison was detected by the Bonferroni test ().(2)False Alarm Rate. The discrepancies between the 2 modalities were all positive; they were almost similar in most groups except in the silence and 40 Hz groups, which showed a higher discrepancy (especially in the 40 Hz group). The overall difference among the groups was significant (). The interaction with the sound loudness was significant (), but the interaction with sex was not (). The effects of sound volume and sex were insignificant (). No significant pairwise comparisons were detected ().(3). In the 10 Hz group, the visuospatial modality had a greater index than the verbal modality. In the 16 Hz group, the indices were similar in both modalities. In the rest, the verbal was greater than its visuospatial counterpart. The overall difference was significant (). The interaction with sound volume was significant (), but the interaction with sex was not (). The effect of sex was marginally significant (). The effect of sound volume was insignificant (). No significant pairwise comparison was observed ().(4)Working Memory. The discrepancy was negative in the 10 Hz group, was almost zero in the positive control and the 16 Hz groups, and was positive in the silence and 40 Hz groups (). The interaction of sound volume was significant (), but the interaction of sex was not (). The effects of sound volume and sex were insignificant (). No significant pairwise comparison was seen ().(5)Reaction Time. 
All discrepancies were positive (verbal response times being longer than the visuospatial response times). The maximum discrepancy was observed in the 10 Hz group, while the least discrepancy was observed in the 16 Hz and 40 Hz BB groups, with a significant overall difference (). The interactions with age () and sex () were marginally significant. The effect of age was marginally significant (). The effect of sex was insignificant (). The only significant pairwise comparison was observed between the 10 Hz group and the 40 Hz group ().
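The sign of the intermodality discrepancy matters for reading the results above. The sketch below assumes, as the reaction time item implies, that the discrepancy is the auditory-verbal value minus the visuospatial value for the same subject and condition. The function name and the data are hypothetical.

```python
from statistics import mean


def mean_discrepancy(verbal, visuospatial):
    """Mean of (auditory-verbal minus visuospatial) across paired subjects."""
    if len(verbal) != len(visuospatial):
        raise ValueError("both modalities need one value per subject")
    return mean(a - v for a, v in zip(verbal, visuospatial))


# Hypothetical mean response times (ms) for three subjects in one condition.
verbal_rt = [620.0, 580.0, 700.0]
visuospatial_rt = [540.0, 560.0, 610.0]
rt_discrepancy = mean_discrepancy(verbal_rt, visuospatial_rt)  # positive: verbal is slower
```

Under this convention, a positive response time discrepancy means slower verbal responses, while a negative working memory discrepancy means the visuospatial modality scored higher.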

3.3. Effects of Short-Term Training

Descriptive statistics and 95% CIs for cognitive-behavioral outcomes in different time-blocks are presented in Tables 3 and 4 and Figures 7–11. The results of the one-sample t-test comparing each mean delta value with zero are also shown in Tables 3 and 4.

3.3.1. Effects of Training/Time in the Visuospatial Modality

(1)Hit Rate. The mean hit rate gradually increased over time until it decreased slightly in the last block; this trend was significant (). The role of sex was nonsignificant (). Furthermore, the interaction of sex by the time variable was nonsignificant (). The Bonferroni test showed only a significant pairwise comparison between the first and fourth time-blocks ().(2)False Alarm Rate. The average false alarm rate gradually decreased until it was almost fixed in the last block, and this trend was significant (). The role of sex was nonsignificant (). Additionally, the interaction of sex by time was nonsignificant (). The Bonferroni test showed no significant pairwise comparison.(3)(Sensitivity in Signal Detection Theory). The average index gradually increased over time until it almost decreased in the last block (). Sex was not significant (). The interaction of sex by time was nonsignificant as well (). The Bonferroni test showed 3 significant pairwise comparisons between the first and fourth blocks (), between the first and fifth blocks (), and between the second and fourth blocks ().(4)(Bias in Signal Detection Theory). The average index first increased over time and then decreased in the last three blocks (). The effects of sex, age, and sound volume were nonsignificant (). The interactions were not significant as well ().(5)Visuospatial Working Memory. The average working memory gradually increased until in the last block, it almost decreased; this trend was significant (). The role of sex () and the interaction of sex by time () were nonsignificant. The Bonferroni test showed 3 significant pairwise comparisons, between the first and fourth time-blocks (), between the first and fifth blocks (P = 0.023), and between the second and fourth blocks ().(6)Reaction Time. The speed of response gradually increased (or the reaction time decreased over time); this trend was significant (). The role of sex was insignificant (). 
Also, the interaction of sex and time was insignificant (). The Bonferroni test showed 4 significant pairwise comparisons: between the first and fourth time-blocks (), between the first and fifth blocks (), between the second and fourth blocks (), and between the second and fifth blocks ().(7)Standard Deviation of the Reaction Time. The SD of response time in the first two blocks was almost constant and then gradually decreased until in the last block, it increased; this trend was significant (). The effect of sex was insignificant (). But both the variables age () and sound volume () played a significant role. In addition, the interactions were nonsignificant (). The Bonferroni test showed no significant pairwise comparison (all values > 0.6).(8)Delta Hit Rate. The delta hit rate fluctuated slightly and only marginally significantly (). The role of sex () and the sex-by-time interaction were not significant ().(9)Delta False Alarm Rate. It fluctuated significantly over time (). The role of sex was nonsignificant (). The interaction of sex by time was nonsignificant as well (). The Bonferroni test showed a significant pairwise comparison between the second and fourth blocks ().(10)Delta. did not fluctuate significantly over time (). The role of sex was nonsignificant (). Similarly, the interaction of sex and time was insignificant ().(11)Delta Working Memory. Most of the mean ∆WM values were negative, indicating that the working memory capacity in the second half of each block was slightly smaller than that in the first half. ∆WM fluctuated significantly over time (). The role of sex was nonsignificant (). Likewise, the interaction of sex and time was nonsignificant (). The Bonferroni test showed no significant pairwise comparisons ( values ≥ 0.059).(12)Delta Reaction Time. 
Most of the mean delta response time values were negative, indicating that the reaction time in the second half of each block was slightly shorter than the reaction time in the first half (i.e., the reaction speed increased slightly in the second half). The delta response time did not change significantly over time (). The role of age () and its interaction with time () were not significant. The role of sound volume was significant (). But its interaction was nonsignificant ().

3.3.2. Effects of Training in the Auditory-Verbal Modality

(1)Hit Rate. The average hit rate gradually increased until it almost became constant in the third and fourth blocks and then decreased slightly in the last block; this trend was significant (). The role of sex was insignificant (). Also, the interaction of sex and time was insignificant (). The Bonferroni test showed only a significant pairwise comparison between the first and fourth blocks ().(2)False Alarm Rate. The average false alarm rate decreased gradually over time, but this trend was not significant (). The roles of sex (), age (), and sound volume () as well as the interactions of time by sex (), age (), and sound volume () were insignificant.(3)(Sensitivity in Signal Detection Theory). The average index gradually increased until in the last block, it decreased slightly (). The role of sex was marginally significant (). The interaction of sex and time was not significant (). The Bonferroni test showed a significant pairwise comparison between the first and fourth blocks ().(4)(Bias in Signal Detection Theory). The average did not change significantly over time (). The roles of sex (), age (), and sound volume () and the interactions were nonsignificant ().(5)Verbal Working Memory. The average auditory-verbal WM gradually increased until it reached a plateau in the last block (). The role of sex was marginally significant (). The interaction of sex and time was insignificant (). The Bonferroni test showed 2 significant pairwise comparisons between the first and third blocks () and between the first and fourth blocks (); the comparison between the first and fifth blocks was marginally significant ().(6)Reaction Time. The speed of response increased significantly over time, i.e., the average response time decreased by training (). The roles of sex () and age () were nonsignificant, but the role of sound volume was significant (). 
The interactions of time by sex () and sound volume () were insignificant, although the interaction of age by time was significant (). The Bonferroni test showed 5 significant pairwise comparisons between the first and third blocks (), the first and fourth (), the first and fifth (), the second and fourth (), and the second and fifth blocks ().(7)Standard Deviation of the Reaction Time. The SD of auditory-verbal response time was at first almost constant and even slightly increasing, but then, it decreased from the third block onwards (). The roles of sex () and age () were insignificant. But the sound volume played a significant role (). The interactions were nonsignificant: age (), volume (), and sex (). No significant pairwise comparison was detected ( values > 0.2).(8)Delta Hit Rate. The mean delta hit rate did not change significantly over time (). Sex, age, sound intensity (), and the interactions were nonsignificant ().(9)Delta False Alarm Rate. No significant changes were observed (). The effects of sex, age, and sound loudness were nonsignificant (). Also, the interactions were nonsignificant ().(10)Delta. The mean did not change significantly over time (). Sex, age, sound intensity (), and the interactions were insignificant ().(11)Delta Working Memory. The mean ∆WM did not change significantly (). The role of sex, age, and sound volume was nonsignificant (). Also, the interactions were nonsignificant ().(12)Delta Reaction Time. The mean delta of auditory-verbal response time did not change significantly over time (). The role of age, sex, and sound loudness (), and also, their interactions by the time variable were nonsignificant ().

3.3.3. Effects of Training in Both Modalities

(1)Hit Rate. The mean hit rate gradually increased until it began to decrease slightly in the last block (). There was a significant difference between the 2 modalities (). The effects of sex (), age (), and sound volume () were nonsignificant. Also, the interaction of modality and time was nonsignificant (). The Bonferroni test showed 2 significant pairwise comparisons, between the first and fourth time-blocks () and between the first and fifth time-blocks ().(2)False Alarm Rate. The average false alarm rate gradually decreased until it became rather fixed in the last block (). There was a significant difference between the 2 modalities (). The roles of sex (), age (), and sound volume () were nonsignificant. Additionally, the interaction of modality and time was nonsignificant (). The Bonferroni test showed two significant pairwise comparisons, between the first and fourth time-blocks () and between the first and fifth blocks ().(3)(Sensitivity in Signal Detection Theory). The average gradually increases over time until it almost decreases in the last block (). There was no significant difference between the modalities (). The effects of sex (), sound volume (), and age () were nonsignificant. Also, the interaction of modality and time was nonsignificant (). The Bonferroni test showed 4 significant pairwise comparisons, between the first and third time-blocks (), between the first and fourth time-blocks (), between the first and fifth blocks (value ), and between the second and fourth blocks ().(4)(Bias in Signal Detection Theory). The average value did not change significantly over time (). There was a significant difference between the modalities (). The roles of sex (), age (), and sound volume () were nonsignificant. Also, the interaction of modality and time was insignificant ().(5)Working Memory. The average working memory gradually increased until it almost decreased in the last session (). There was no significant difference between the modalities (). 
The role of sex (), age (), and sound volume () was not significant. Similarly, the interaction of modality and time was nonsignificant (). The Bonferroni test showed 4 significant pairwise comparisons, between the first and third time-blocks (), between the first and fourth time-blocks (), between the first and fifth blocks (value ), and between the second and fourth blocks ().(6)Reaction Time. The speed of response constantly increased over time, i.e., the average reaction time decreased (). There was a significant difference between the modalities (). Sex () was nonsignificant, but age (, a positive relationship with reaction time, ) and sound volume (, an inverse association with reaction time, ) were significant predictors. The interaction of modality and time was not significant (). The Bonferroni test showed 5 significant pairwise comparisons, between the first and third sessions (), between the first and fourth sessions (), between the first and fifth sessions (), between the second and fourth sessions (), and between the second and fifth sessions ().(7)Standard Deviation of the Reaction Time. The average SD of response time in the first two blocks was almost constant, and then, it gradually decreased until it increased in the last block; this trend was significant (). There was a significant difference between the 2 modalities (). The effects of sex () and age () were nonsignificant, but the sound loudness (, an inverse relationship, ) was significant. The interaction of modality and time was nonsignificant (). The Bonferroni test showed no significant pairwise comparison ().(8)Delta Hit Rate. No significant variables were observed.(9)Delta False Alarm Rate. All variables were nonsignificant except for the role of modality () and the interaction of modality by time (6).(10)Delta. No variables had a significant role.(11)Delta Working Memory. No significant variables were detected.(12)Delta Reaction Time. There was no significant variable.

3.3.4. Effects of Training on the Discrepancies between the Two Modalities

(1)Hit Rate. All discrepancies were positive. No significant variable was observed in the 3-way repeated-measures ANCOVA.(2)False Alarm Rate. All FA discrepancy means were positive. There was no significant variable.(3)Sensitivity. The discrepancy means revolved around zero, without any significant variable.(4)Working Memory. The WM discrepancy averages were close to zero, without any significant variable.(5)Reaction Time. All discrepancies were positive, without any significant difference.

3.4. Correlations between Response Times with Cognitive Functions

Pearson correlation coefficients showed significant negative correlations between response times and hit rate, sensitivity, and working memory in many sessions, and significant positive correlations between response times and false alarm rates in some sessions (Tables 5 and 6).
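As an illustration of the correlation analysis, the sketch below computes a Pearson coefficient between per-subject response times and hit rates within one session. The data are hypothetical and the function is a plain implementation of the standard formula, not the study's own code.

```python
from math import sqrt


def pearson_r(x, y):
    """Pearson correlation coefficient between two equal-length sequences."""
    n = len(x)
    if n != len(y) or n < 2:
        raise ValueError("need two sequences of equal length (at least 2 values)")
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        raise ValueError("correlation is undefined for a constant sequence")
    return sxy / sqrt(sxx * syy)


# Hypothetical session data: mean response time (ms) and hit rate per subject.
response_times = [520.0, 610.0, 580.0, 700.0, 640.0]
hit_rates = [0.85, 0.72, 0.78, 0.60, 0.70]
r = pearson_r(response_times, hit_rates)  # negative when faster subjects score more hits
```

A negative coefficient here corresponds to the pattern reported above: shorter response times accompany higher hit rates.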

4. Discussion

Because many of the variables we studied have not been examined in the literature on binaural beat stimulation, we could only compare and discuss those aspects with studies from other fields that had similar concerns. The assessment of visuospatial deltas in the present study showed that the mean delta of the visuospatial hit rate was negative in all groups, indicating that the hit rate decreased in the second half of each session compared to the first half. This decline might be due to fatigue or boredom. The extent of this decrease in the 10 Hz group was significantly less than that in the other groups. This suggests that the alpha band might counteract the decline in the second half through hypothetical mechanisms such as increasing concentration or reducing fatigue. Also, the mean deltas of visuospatial sensitivity and visuospatial working memory were negative in most groups, except for the 10 Hz BB group whose delta values were positive, indicating that visuospatial sensitivity and working memory increased in the second half of the 10 Hz block relative to its first half; these differences among the groups were significant. Our findings regarding the suitability of alpha-band binaural beats (in the visuospatial modality) were in line with another study comparing 9.55 Hz binaural beats versus control [30]. In other cognitive domains, some studies have also shown favorable results concerning alpha binaural beats: one study showed improvements in Stroop test performance as a result of 10.2 Hz binaural beat stimulation [36]. Another study showed that 8 weeks of entraining the brain using a rhythmic audiovisual stimulator at 10 and 18 Hz would improve the IQ and memory of children with disabilities [37]. McMurray [38] showed that brain stimulation with alpha binaural beats may improve both attention and working memory in healthy elderly individuals, who may naturally experience decreased alpha activity. 
Higher amplitudes of alpha brain waves might be associated with improved working memory, attention, vigilance, information processing speed, perceptual abilities, and inhibitory processes [23, 30, 39–44]. Improved visual working memory has been linked to increased alpha rhythms [44]. Alpha oscillations may indirectly improve working memory by filtering out irrelevant information and averting disturbances caused by conflicting stimuli [45–47]. Nevertheless, not all results favor alpha stimulation: Beauchene et al. [4, 16] failed to find any effects of 5 minutes of alpha BB stimulation on visuospatial or verbal working memory. More interestingly, Wahbeh et al. [23] asserted that 30 minutes of alpha binaural beat stimulation might deteriorate auditory-verbal learning. Their results were in line with another finding of the present study: in the auditory-verbal modality, we observed that the hit rate and the auditory sensitivity were slightly but statistically significantly lower in the 10 Hz BB group than in the other groups (especially compared to the silence and 40 Hz groups). This pattern was also seen in verbal working memory, although only in a marginally significant way. Although more research is needed for confident interpretation of our findings, it seems that alpha binaural beat stimulation may improve visuospatial working memory at the expense of verbal working memory, possibly by shifting attention and/or allocating cognitive resources to the visuospatial modality. Future research performed simultaneously on both the visuospatial and verbal modalities is needed to verify our results. Our finding in terms of delta values in the visuospatial modality contrasted with the only other study that had used the delta method: Beauchene et al. [4] compared delta visuospatial accuracies among control interventions as well as 5 Hz, 10 Hz, and 15 Hz binaural beats. 
They found that delta accuracy values were negative in all groups, except in their 15 Hz group, which had a positive delta accuracy [4]. Unlike the present study, in their study, the 10 Hz intervention caused one of the largest negative visuospatial delta accuracies [4]. Explaining the difference between the results of the two studies requires more research. It might be speculated that the lack of any rest between sessions as well as the shorter durations of sessions in their study [4] might change the fatigue states of subjects, compared to our research. Moreover, for calculating delta values, they omitted the 2 middle minutes of each session and subtracted the last 1.5 minutes from the first 1.5 minutes (whereas we compared the entire second half with the entire first half in order to avoid data loss). In the verbal modality of the present study, the false alarm rate was slightly but statistically significantly higher in the pure tone and 40 Hz groups, while it was lowest in the silence and 10 Hz groups. In the absence of similar articles, we cannot compare or discuss this finding further. It is suggested that some binaural beats can entrain brain waves [20–22, 24] and alter the functioning of the reticular formation (responsible for the regulation of arousal, attention, concentration, and vigilance) [23]. Although our study focused on working memory, the sensitivity and response bias indices are part of signal detection theory, which has been associated with attention as well [48, 49]. Attention acts like a filter that oversees information and picks a limited amount of it to allow locking on goal-related stimuli and discarding undesired ones [50]. Enhancing this important gateway to information can improve many other cognitive processes as well [51–53]. Working memory is heavily interlaced with attention [53, 54], and binaural beats might improve attention (although studies are controversial and few): Colzato et al. 
[55] assessed the effects of 40 Hz binaural beats versus a constant tone (as control) on attention measured by a global-local task. They concluded that binaural beats might not induce suppression of task-irrelevant information but can condense the spotlight of attention [55]. Crespo et al. [56] assessed the effects of listening to 20 minutes of theta and beta binaural beats on attention; they did not observe any changes in the attention or the EEG activity of participants [56]. Kennel et al. [57] investigated whether listening to 9 sessions of 20-minute beta binaural beats during 3 weeks could reduce inattention in children with attention-deficit/hyperactivity disorder. They did not find any significant result [57]. On the other hand, another study found some positive effects of beta binaural beats on attention [20].
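The signal detection indices discussed above are derived from the hit rate and the false alarm rate. The sketch below uses the conventional definitions of sensitivity (d') and criterion (c). This is an assumption: the exact indices and any correction for extreme rates used in the study are not stated in this section, and the function name is hypothetical.

```python
from statistics import NormalDist


def sdt_indices(hit_rate, false_alarm_rate):
    """Return (sensitivity d', criterion c) under the equal-variance Gaussian model.

    Assumption: both rates lie strictly between 0 and 1; rates of exactly
    0 or 1 need a correction before the z-transform and are not handled here.
    """
    z = NormalDist().inv_cdf  # inverse of the standard normal CDF
    z_hit, z_fa = z(hit_rate), z(false_alarm_rate)
    d_prime = z_hit - z_fa
    criterion = -(z_hit + z_fa) / 2.0
    return d_prime, criterion


# Hypothetical rates from one block of the task.
d_prime, criterion = sdt_indices(0.75, 0.10)
```

With these definitions, a positive criterion indicates a conservative response bias (fewer "target" responses) and a negative criterion a liberal one, which matches the way the bias results are interpreted in this paper.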

Unlike Beauchene et al. [16], who reported a significant increase in ranked working memory in their 15 Hz group, we could not find any effect of binaural beats on any ranked working memory or response time measures in either modality. The discrepancy may stem from methodological differences such as stimulation durations, the frequencies used (5, 10, and 15 Hz in their study versus 10, 16, and 40 Hz in ours), and statistical analyses. Beauchene et al. [16] used a Mann–Whitney test, a between-subject analysis designed for comparing 2 groups, to compare 6 within-subject (repeated-measures) groups. No other studies have ranked their findings, so we cannot compare our results with them.

In this study, response times were negatively correlated with the hit rates, indices, and working memories, while they were positively correlated with false alarm rates. This is in line with previous findings [58]. In our study, response times were shorter in the visuospatial modality compared to the auditory-verbal modality even though the verbal modality responses had been done with the dominant hand. This contrasted with the literature which indicated that auditory-verbal responses may be faster than visuospatial responses; it also was contrary to the literature indicating that those responses entered with the dominant hand may be faster [5961]. Possible reasons for our findings might be the much longer duration of the visual stimulus compared to the verbal one, as well as a probable preference of the individuals to pay more attention to the visuospatial modality; also perhaps, the subjects considered the visuospatial modality as the dominant modality (as also indicated by their indices). Also, other methodological specifications such as the setup of the current study and the simultaneous exposure of the subjects to both the visuospatial and auditory-verbal stimuli in this study might play a role. Response times are a function of factors such as sex, although this is controversial with many studies not finding a difference and one finding a difference merely in right-handed individuals [59, 60, 6264]; also, age [60, 62, 63], limb dominance [59, 60], practice [6567], and properties of stimulus such as duration or intensity [61] might predict the response time. Attention as well might affect the speed of responses, especially the complicated ones [61, 68, 69]. Standard deviations of reaction times can be associated with intelligence [69]. We did not observe a link between the reaction time and sex but found aging to have a significant role. 
Interestingly, despite the lack of improvement in cognitive indices in this study (such as attention indicated by the sensitivity index, and working memory) after the third or fourth sessions, response times continued to shorten with training until the last session, which may reflect the effect of practice on reaction time [67].

In the visuospatial modality, the response time to the visual stimulus was significantly shorter in the 10 Hz group than in the other groups (especially the 16 Hz and pure tone groups). Only one previous study addressed the effects of binaural beats on response times: Beauchene et al. [16] observed no significant difference between response times measured under 5 Hz, 10 Hz, and 15 Hz binaural beats compared to controls. The difference between their findings and ours might stem from different methodologies such as dissimilar durations of sessions, different modalities in question (visuospatial in this study versus verbal in theirs), and statistical analyses; for instance, they used the between-subject 2-group Mann–Whitney test for comparing 6 repeated-measures (within-subject) groups, which is not appropriate. Furthermore, as in their study (which used the verbal modality), we did not observe any significant effect of binaural beats on the response time in the verbal modality. In the current study, a person’s age had a significant effect on response times (i.e., with increasing age, the speed of reaction decreased). The intrasubject variability of the visuospatial reaction time was also slightly smaller in the 10 Hz group than in the other groups. This variability increased in older people and also increased as the sound intervention volume decreased. The effects of age on response times have been documented earlier [60, 62, 63]. The effect of the intervention volume on reaction time variability might imply that these interventions played a positive role in decreasing the intrasubject variability, perhaps by masking potential auditory distractions in the lab environment. No other study has assessed the effect of intervention volume. 
Louder sounds also made the visuospatial delta response times more positive, meaning that with louder sounds, reactions became slower (longer response times) in the second half of each 8-minute session compared to its first half. Louder sounds might have some exhausting effect, but no studies have assessed this factor, and without further evidence, we cannot confidently interpret the results. In the auditory-verbal modality, increasing the volume of the audio intervention could accelerate the response and reduce the reaction time as well as the intrasubject variability of reaction times. This again might be a result of the intervention sounds masking potential auditory distractors; nevertheless, this is not the only possible explanation. For instance, it has been shown that increasing the volume of background noise can intensify alpha brainwaves and reduce the power of beta rhythms [70]. Moreover, there might be some generic effect common to all the tested interventions (including pure tone); for example, white noise might improve learning [71].
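
To make the reaction speed measures discussed above concrete, the following minimal Python sketch computes a mean response time, an intrasubject response time variability, and a delta response time for one block. It rests on assumptions that this section does not confirm: variability is taken as the sample standard deviation of a subject's response times, the delta value is taken as the second-half mean minus the first-half mean (so a positive delta means slower responses later in the block), and the block is split by trial order rather than by clock time. The function name `rt_summary` and the numbers are hypothetical illustrations, not the study's code or data.

```python
from statistics import mean, stdev


def rt_summary(rts_ms):
    """Summarize one subject's response times (ms) from one block, in trial order.

    Assumptions (not taken from the paper): variability is the sample SD,
    and delta is the second-half mean minus the first-half mean.
    """
    half = len(rts_ms) // 2
    first, second = rts_ms[:half], rts_ms[half:]  # odd lengths: extra trial goes to the second half
    return {
        "mean_rt": mean(rts_ms),
        "rt_variability": stdev(rts_ms),          # intrasubject variability (assumed: sample SD)
        "delta_rt": mean(second) - mean(first),   # positive = slower in the second half
    }


# Hypothetical response times, made up for illustration only.
example = rt_summary([500, 520, 480, 500, 560, 580, 540, 560])
print(example)
```

Under this convention, the louder-sound finding above corresponds to `delta_rt` moving toward larger positive values.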

The comparison of the modalities showed that both the hit and false alarm rates were greater in the auditory-verbal modality than in the visuospatial modality. This indicates a tendency of participants to respond to auditory-verbal stimuli more freely and more frequently. Still, the working memory capacities and sensitivity indices remained similar in both modalities. The mean values of the response bias index were positive in the visuospatial modality, whereas they were negative in the auditory-verbal modality. This suggests that the individuals’ biases in the auditory-verbal modality were liberal (and close to neutral in some groups), implying a tendency to respond to auditory stimuli with the least skepticism and at the slightest sense of familiarity. On the other hand, in the visuospatial modality, the subjects’ biases were conservative, meaning that they did not respond to the visuospatial stimuli unless they were rather confident. These biases were not affected by the 5 sound interventions. The reaction time was also shorter in the visuospatial modality than in the auditory-verbal modality. In contrast, the intrasubject variability of reaction time was greater in the visuospatial modality than in the auditory-verbal modality. Aging and decreasing the sound volume could slow down the response, while decreasing the loudness of the sound could increase the variability of the reaction time.
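
The sign convention described above (positive bias values read as conservative, negative values as liberal) can be illustrated with a short Python sketch. It assumes the standard equal-variance Gaussian signal detection definitions, sensitivity d' = z(H) - z(F) and criterion c = -(z(H) + z(F))/2, together with a log-linear correction that keeps the rates away from 0 and 1. The exact indices and any correction used in the study are not specified in this section, so these formulas, the function name `sdt_measures`, and the counts are assumptions for illustration only.

```python
from statistics import NormalDist


def sdt_measures(hits, misses, false_alarms, correct_rejections):
    """Return (d_prime, criterion) from raw response counts.

    Assumptions (not taken from the paper): equal-variance Gaussian model
    and a log-linear correction (add 0.5 to each count, 1 to each total).
    """
    z = NormalDist().inv_cdf  # inverse of the standard normal CDF
    hit_rate = (hits + 0.5) / (hits + misses + 1)
    fa_rate = (false_alarms + 0.5) / (false_alarms + correct_rejections + 1)
    d_prime = z(hit_rate) - z(fa_rate)            # sensitivity
    criterion = -0.5 * (z(hit_rate) + z(fa_rate))  # > 0 conservative, < 0 liberal
    return d_prime, criterion


# Hypothetical counts, made up for illustration only.
d_cons, c_cons = sdt_measures(hits=12, misses=8, false_alarms=2, correct_rejections=18)
d_lib, c_lib = sdt_measures(hits=18, misses=2, false_alarms=8, correct_rejections=12)
print(f"conservative responder: d'={d_cons:.2f}, c={c_cons:.2f}")
print(f"liberal responder:      d'={d_lib:.2f}, c={c_lib:.2f}")
```

The two hypothetical responders are mirror images of each other: they have the same sensitivity but opposite bias. This matches the pattern described above, where sensitivity is similar across modalities while hit and false alarm rates are both higher in the modality with the more liberal bias.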

Another interesting point found in the comparison of the modalities was that all the average visuospatial delta FA rates were negative (i.e., there were fewer incorrect answers in the second half compared to the first half), whereas all the mean verbal delta FA rates were positive (more incorrect responses in the second half). This indicates that during an 8-minute session, performance declines in the auditory-verbal modality while it improves in the visuospatial modality. Such a simultaneous change in both modalities might be interpreted as a shift of attention (or other cognitive resources such as error detection) from the auditory-verbal modality to the visuospatial one. The 5 audio interventions did not play a role in these patterns. However, when we calculated the differences between the two modalities in terms of cognitive-behavioral parameters, the audio interventions seemed to play a significant role in many intermodality discrepancies. In the case of hit rate, two inverse patterns were observed in the two modalities, resulting in the maximum intermodality discrepancies in the silence and 40 Hz groups and the minimum discrepancy in the 10 Hz group (which showed the highest visuospatial hit rate and the lowest verbal hit rate). A similar pattern was observed in the false alarm rates, with the greatest and smallest discrepancies in the 40 Hz and 10 Hz groups, respectively. Working memory and the sensitivity index followed a pattern similar to that of the hit rate, with the silence and 40 Hz groups having the highest positive discrepancies and the 10 Hz group having a negative discrepancy, indicating greater visuospatial than verbal working memory and sensitivity in the 10 Hz group. The average response time was always longer in the verbal modality compared to the visuospatial one. 
In the verbal modality, it was the slowest (longest) in the 10 Hz group and the fastest (shortest) in the 40 Hz group; this was inverse in the visuospatial modality, being the fastest in the 10 Hz group. As a result, the discrepancy was the maximum in the 10 Hz group, which was significantly greater than that seen in the 40 Hz group. It seems that the assessed audio interventions can have different (and perhaps inverse) effects on the two modalities. The closest study to our design may be that of Hommel et al. [72] who assessed the impact of 40 Hz binaural beats on the cross-talk of two tasks. Originally, lower frequencies of binaural beats used to be associated with mental relaxation while higher frequencies were thought to induce attentional concentration and alertness [73, 74]. Accordingly, high-frequency beats were expected to bias the cognitive control toward focus and persistence, i.e., more attentional resources to be assigned to the task at hand [72]. However, some recent findings contradicted this anticipation: Reedijk et al. [75] compared the effects of the alpha and gamma binaural beat stimulations on subjects’ performance in an attentional blink [76] task, which presents subjects with two visual targets in a stream of stimuli. If the second target is presented briefly after the first, participants usually miss the second one. This has been linked to overcontrol, which is an excessively strong focus on the first target, leaving too few resources for the second one [77]. Reedijk et al. [75] observed that the alpha stimulation did not affect attentional blink, whereas the gamma entrainment decreased the attentional blink, suggesting that, unlike the original expectation, gamma stimulations might broaden the distribution of available focus (instead of inducing a stronger focus). Another study of Reedijk et al. 
[78] might suggest the same: they reported that the gamma stimulation might improve performance in a divergent (but not in a convergent) thinking task, perhaps because divergent thinking may benefit more from broadly distributed resources compared to convergent thinking [72]. The dual n-back task used in our study requires divided attention, and therefore, it might also benefit from a broader distribution of attentional and cognitive resources. Thus, perhaps the discrepancy observed between the response times in the two modalities can be considered a marker of the distribution of cognitive resources, i.e., flexibility. From the indices and response times, it can be speculated that the dominant modality (the one taking more attentional resources) may have been the visuospatial one. From the combination of significant intermodality discrepancies, it might be suggested that the 40 Hz intervention, silence, and to a lesser extent the 16 Hz and pure tone interventions could shift more resources to the verbal modality, increasing cognitive flexibility, as seen in the increased hit rates, FA rates, sensitivity indices, and working memories and the faster responses in the verbal modality (and the reverse outcomes in the visuospatial modality). On the other hand, the 10 Hz intervention might shift the attentional resources to the visuospatial modality, increasing cognitive persistence, as indicated by the faster responses and increased hit rates, sensitivity indices, and working memories in the visuospatial modality. Although our findings were in line with the three studies on cognitive flexibility [72, 75, 78], future studies are warranted to assess our speculation.
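
The intermodality discrepancies discussed above can be sketched in Python as simple per-measure differences. The sign convention is inferred from the text rather than stated in it: a negative discrepancy is described as indicating a visuospatial advantage, which suggests verbal minus visuospatial. The function name `intermodality_discrepancy`, the measure names, and all values below are hypothetical and are not the study's data.

```python
def intermodality_discrepancy(verbal, visuospatial):
    """Per-measure discrepancy, assumed here to be verbal minus visuospatial."""
    shared = sorted(verbal.keys() & visuospatial.keys())  # only measures present in both
    return {name: verbal[name] - visuospatial[name] for name in shared}


# Hypothetical block-level measures for one subject, made up for illustration.
verbal = {"hit_rate": 0.60, "fa_rate": 0.20, "mean_rt_ms": 900.0}
visuospatial = {"hit_rate": 0.70, "fa_rate": 0.10, "mean_rt_ms": 750.0}

discrepancy = intermodality_discrepancy(verbal, visuospatial)
print(discrepancy)
```

With this convention, a negative hit rate discrepancy together with a positive response time discrepancy describes a block in which the visuospatial modality was both more accurate and faster.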

The findings of this study showed that short-term training could affect a person’s cognitive function: some cognitive parameters improved throughout the 40 minutes of this study (e.g., reaction time), whereas others usually improved until the third or fourth session and then either reached a plateau or slightly decreased (e.g., working memory). There is a capacity limit on the number of chunks concurrently retained in working memory (somewhere between one and four) [38]. Traditionally, working memory has been thought of as a permanent feature correlated with fluid/general intelligence [79–81] that seems to be highly heritable [82] and resistant to extraneous experiences [83]. Nevertheless, recent evidence suggests that working memory can be enhanced by medication or practice [3, 84, 85], although not all recent studies agree on the malleability of working memory [86]. A study using a dual n-back task showed that training can improve test results as well as general fluid intelligence [87]. Dual-task performance may be improved with dual-task training and repetition, as well as with tasks like the dual n-back that may activate the right dorsolateral prefrontal cortex [88, 89]. Our findings showed that short-term training can be useful to some extent, but after some sessions, the tendency to improve diminishes; this is perhaps a result of fatigue [90] or simply because some limits might have been reached. Notably, practice on a working memory task might not necessarily transfer to other tasks [91]. Interestingly, training did not affect discrepancies between the modalities. No studies were available to compare our results with.

4.1. Limitations and Advantages

This study was limited by some factors. As in all previous studies on the effects of binaural beats on working memory, no a priori power calculation was done to determine the sample size. Still, the current sample size (155 trials in 5 groups of 31 each) was comparable to or larger than those of most of the few articles in this field and provided adequate power to detect numerous significant effects. Furthermore, since the n-back task seems to require upholding, continuous updating, and processing of information, it has face validity as a working memory task [92]; however, recent evidence casts doubt on its construct validity as a working memory task, as it might measure attention as well, especially in older subjects [92]. Moreover, interpreting n-back findings needs utmost care [93]. This is why we explicitly evaluated the n-back results and calculated not accuracy and reaction latency but the hit and false alarm rates, the sensitivity and response bias indices, response times, and intrasubject response time variabilities, as recommended earlier [93]. An advantage over previous studies is that we assessed both the visuospatial and auditory-verbal modalities simultaneously. This allowed us to observe that the significant improvement detected in visuospatial working memory might actually occur at the cost of a decline in verbal working memory and that the bigger picture might be indicative of some shifts of attention or allocated resources between the modalities. In addition, we controlled for numerous confounding variables such as IQ and even genetics by adopting a within-subject design. The generalizability of our findings is limited to right-handed healthy adults and young adults. Still, the study benefited from a rather broad age range and various intervention sound volumes.

5. Conclusions

Within the limitations of this randomized clinical trial on the alteration of working memory and attention measures under the effect of binaural beats, it could be concluded that:

(1) In the visuospatial modality, the alpha binaural beat stimulation was able to accelerate reactions and reduce response latencies as well as intrasubject variabilities of reaction times. The 10 Hz BB entrainment could also change the pattern of decline in some visuospatial parameters over time (indicated by delta values): this intervention reduced or stopped the decline over time in visuospatial working memory, sensitivity, and hit rate. Aging might slow down responding to the stimulus and increase the intrasubject variability of reaction time. This reaction time variability also increased as the sound intervention volume decreased. With louder sounds, reactions might become slower after some time (indicated by visuospatial delta response times becoming more positive under louder sounds).

(2) In the auditory-verbal modality, the 10 Hz intervention reduced sensitivity, hit rate, and false alarm rate (compared to all groups except silence). Working memory was also reduced by the 10 Hz BB, but only in a marginally significant way. Louder sounds might accelerate responses and reduce intrasubject variabilities of reaction times.

(3) The audio interventions could also affect the discrepancies between the two modalities: the intermodality discrepancies in the hit rates were the lowest in the 10 Hz group and the greatest in the silence and 40 Hz groups. Similarly, the minimum and maximum false alarm rate discrepancies were observed in the 10 Hz and 40 Hz groups, respectively. In the case of working memories and sensitivity indices, the 10 Hz intervention caused a negative discrepancy (indicating a greater working memory and sensitivity in the visuospatial domain than in the verbal one), while the other interventions caused positive or almost-zero discrepancies, with the silence and 40 Hz interventions causing the highest positive discrepancies. The 10 Hz entrainment caused the greatest intermodality discrepancy of response time, while the 40 Hz stimulation caused the smallest response time discrepancy.

(4) Working memory and the sensitivity index were each rather similar in the visuospatial and verbal modalities, while both the hit and false alarm rates were greater in the auditory-verbal modality than in the visuospatial one.

(5) Response biases (the bias indices of signal detection theory) indicated that in the auditory-verbal modality, the participants mostly had a liberal bias (implying a tendency to respond to auditory stimuli with minimum hesitation) and, in some groups, a nearly neutral bias. In contrast, in the visuospatial modality, the subjects had conservative biases, implying that they would not respond to visuospatial stimuli unless they were reasonably certain.

(6) In the visuospatial modality, mean delta false alarm rates were negative (indicating fewer errors in the second half of each 8-minute block), whereas they were positive in the auditory-verbal modality (indicating more errors in the second half). This might imply a shift of attention and/or cognitive resources to the visuospatial modality over time.

(7) Response times were shorter in the visuospatial modality than in the auditory-verbal one. However, the intrasubject variability of reaction times was smaller in the auditory-verbal modality than in the visuospatial one.

(8) Faster reactions may accompany better hit rates, working memories, and sensitivity indices, as well as lower false alarm rates.

(9) Regardless of the modality, aging and reduced intervention sound volume may slow down the response (and increase response times). Reduced sound intensities may also increase the intrasubject variability of response times.

(10) Short-term training can improve the hit rate, false alarm rate, working memory, sensitivity index, and response time.

Data Availability

The data are available from Dr. Vahid Rakhshan upon reasonable request.

Conflicts of Interest

The authors declare that they have no conflict of interest.

Authors’ Contributions

Dr. Vahid Rakhshan conceived the study in 2017 as his thesis proposal presented to the interviewing panel of the Institute for Cognitive Science Studies (ICSS) while applying for the cognitive neuroscience Ph.D. position. Then, he developed the concept further and conceived every bit of the ideas and hypotheses as his Ph.D. thesis and more (e.g., the assessment of both modalities together and their discrepancies, the assessment of short-term training effects, the examination of response times and intrasubject response time variabilities, the role of sound volume, and any other ideas). He also designed the whole study and each and all its methodological items and parameters. He wrote the Matlab code needed for the estimation of the response times. He funded the study, searched for the participants, collected the data, checked/validated/recalculated the computations done by the n-back software in the 155 experiments, performed Excel programming, preprocessed the data, and prepared the hundreds of needed data files. He conceived, designed, and implemented all the statistical analyses, manually optimized all the models, interpreted the findings, discussed them, drafted the whole article, prepared the figures and tables, and submitted/revised/proofread the paper. Finally, he wrote the Ph.D. thesis. Prof. Mohammad Taghi Joghataei and Dr. Peyman Hassani-Abharian were, respectively, the first and second primary supervisors of the thesis. Dr. Mohammad Nasehi and Dr. Reza Khosrowabadi were the first and second co-supervisors of the thesis, respectively.

Acknowledgments

The authors express their sincere gratitude to the National Brain Mapping Lab (NBML) for their valuable assistance. The study was self-funded by Dr. Vahid Rakhshan.