Encoding and Recognition Processing of Chinese Characters: A Functional Magnetic Resonance Imaging Study
This study aimed to investigate the conceptual memory processes that underlie encoding and recognition processing of Chinese characters. Healthy participants () performed a semantic-relatedness paradigm using categorically related logogram pairs from four different categories (fruit, animal, tool, and clothing). During intentional encoding, subjects were instructed to make semantic judgments and select category-correlated features to bind and memorize logogram pairs. During recognition, subjects were asked to recognize the memorized items. The MATLAB software and spatial clustering analysis were used for image data processing. Compared with baseline, encoding mainly activated BA13, with significant effects in BA6/8/9/46/45/47, BA24, BA7/39/40, BA37/20, and BA18/19; meanwhile, recognition mainly activated BA6/8/9/10/13/45/46/47, BA31, BA7/40, and BA18/19. Compared with recognition, encoding activated BA18/19/37/20/36 with a peak activation area in BA18. Compared with encoding, recognition significantly activated BA7, BA31/32, and BA10. In conclusion, distributed networks of discrete cortical regions with distinct roles are active during semantic processing of logograms. The ventral occipitotemporal and inferior frontal regions display increased levels of encoding-related activity. The dorsal medial brain regions, including the superior frontal gyrus and occipitoparietal regions, are associated with recognition-related activity.
Conceptual memory is involved in the general knowledge of concepts, objects, facts, people, and word meaning. Recent neuroimaging studies in conceptual knowledge begin to reveal that specific functions are performed within small areas of the brain, with these small areas involved in a hub or network that also includes areas of the brain responsible for perceiving and acting . Conceptual knowledge is usually shared by individuals from the same culture, with some variations according to each individual experience.
Alphabetic languages and logographic Chinese are markedly different because they represent the meaning of words in very different ways. Indeed, alphabetic words are read by assembled phonology, while Chinese characters are read as a concept by their visual shapes, without involvement of phonemes . A previous study revealed that phonological processing of Chinese characters occurs in Brodmann area (BA) 9, BA46, and BA40, with minor activation of BA45/47, while phonological processing in native English speakers is made in BA44/45 and BA22 .
However, the organization of this knowledge within the brain is controversial. Indeed, there are two theories about this: the distributed-only and distributed-plus-hub views, respectively . Studies revealed that recognizing specific knowledge activates specific areas: colors and forms activate the left ventral temporal cortex, whereas recognition of size information activates the parietal cortex [4–6]. Tasks such as animal naming are uniquely associated with the left medial occipital cortex, whereas tool naming specifically involves the left premotor and posterior middle temporal cortices [7, 8].
Recent studies have provided strong evidence that the human conceptual system is not organized by category specificity [9, 10]. It has been postulated that the organization of semantic representations has more than one governing principle. It is important to note that the activity associated with each object-specific category involves a relatively large expanse of brain areas, suggesting that representations of different categories are distributed and overlapping . It is possible that feature-based models provide the flexibility needed to represent an infinite variety of object categories [11, 12].
Few studies have examined the brain areas involved in the processes of encoding and recognizing conceptual information regarding Chinese characters. Therefore, the aim of the present study was to use functional magnetic resonance imaging (fMRI) to study the brain areas involved in encoding and recognizing conceptual information concerning Chinese characters.
2. Materials and Methods
Fourteen native Mandarin Chinese speakers, including seven males and seven females, were recruited to participate in the semantic category memory test. All subjects were right-handed as determined by the handedness inventory . Participants were all undergraduate students from the Southern Medical University, Guangzhou, China. Participants ranged in age from 20 to 23 years and had normal or corrected-to-normal vision. This study was approved by the Medical Ethics Committee of Nanfang Hospital. All participants provided written informed consent before enrolment, in accordance with the Declaration of Helsinki. The subjects underwent a Clinical Dementia Rating (CDR) test. Inclusion criteria were (1) no memory disturbances, (2) normal physical status, and (3) CDR = 0. Subjects who had a history of neurological or psychiatric illness condition or who were taking any medications immediately prior to or during the scans were excluded from the study.
2.2. Chinese Characters
All test logograms (visually presented in pairs to each subject) were selected from the Modern Chinese Dictionary . All logograms were used commonly and had a frequency of occurrence of no less than 30 per million, according to the Modern Dictionary of Frequently Used Chinese . In each logogram pair, the first one was regarded as the stimulating logogram, while the second was considered the responding logogram. Four groups of semantic categories (fruits, animals, tools, and cloths) were selected for this study, using four stimulating and four responding logograms from the same category, alongside eight other logograms from similar categories. By combinations and repetitions, a total of sixty-four logograms were selected for each category memory task.
The experiment was performed using a 1.5 Tesla Magneto scanner (Siemens, Sonata, Germany) equipped with a fast gradient system for echo-planar imaging (EPI). A standard radio frequency head coil was used. Head motion was restricted, with ear plugs used to reduce scanner noise. Visual stimuli were presented to subjects by projecting the video display of computer onto a translucent screen. Subjects viewed the stimuli through a mirror attached to the head coil. The presentation time of stimuli was accurately controlled by the DMDX display software (http://www.u.arizona.edu/~kforster/dmdx/dmdx.htm). Before the tests, subjects were visually familiarized with the procedures and test conditions to minimize anxiety and to enhance task performance. EPI was performed using a gradient-echo with scan parameters of repetition time (TR)/echo time (TE)/flip angle = 2,500 ms/25 ms/90 degrees. The acquisition of the matrix was 64 × 64 in each plane, with a field-of-view (FOV) of 210 mm × 210 mm. Thirty contiguous axial slices (slice thickness = 4 mm) parallel to the AC-PC line were acquired to cover the whole brain. Anatomical MRI was acquired using a T1-weighted 3D gradient-echo pulse-sequence, which provided high resolution (1 × 1 × 1 mm3) images of the entire brain.
2.4. Presentation of Logogram Pairs
All logograms were presented to the subjects in Chinese using the same font and size. A block design was used. The test in each subject contained four cycles composed of four blocks: 25-second (s) block of a memory-encoding condition, 25 s block of a recognition condition, 25 s block of a baseline1 condition, and 25 s block of baseline2 condition. Each block consisted of an image reminding the patients of the testing process displayed for 1 s, followed by eight logogram pairs displayed for 3 s each (Figure 1). Each cycle covered one category: fruits, animals, tools, or cloths.
During each memory-encoding condition block, subjects were asked to identify and memorize four logogram pairs that belonged to the same category (e.g., fruit group test: grape-cherry), and these were mixed with other four similar logogram pairs (e.g., tomato-cucumber). During intentional encoding, subjects were instructed to make semantic judgments and select category-correlated features to bind and memorize logogram pairs; that is, subjects were instructed to judge the same group of logogram pairs from the presented eight pairs using their own conceptual knowledge (e.g., size, color, and form) to find the stimulating logogram and responding logogram and memorize them. Each logogram pair was presented randomly for 3 s.
In each recognition condition block, all pairs of logograms shared the same category (e.g., fruit group). There were four logogram pairs presented that were exactly the same as during the encoding condition; the other four logogram pairs were recombined from the same group (e.g., grape-cherry and grape-pear). Examples of stimuli for the fruit group test are presented in Figure 1. The logogram pairs appeared in a random order. Subjects were required to recognize four pairs of logograms just memorized during the encoding condition. Subjects indicated their logogram pair selection by pressing a key button with the index of the right (dominant) hand.
In each baseline1 condition block, a fixation logogram pair (e.g., apple-apple) was adopted in order to create a condition in which no memorization or recognition was needed. The fixation logogram pair was replaced every 3 s, and subjects were instructed to maintain fixation on the logogram pair.
In each baseline2 condition block, four fixation logogram pairs (e.g., apple-apple) were presented together with four other logogram pairs, with the responding logogram different from the stimulating one (e.g., apple-cherry). The button had to be pressed when the fixation pair appeared.
2.5. Data Analysis
The MATLAB software (MathWorks, Inc., Natick, MA, USA) and SPM8 (http://www.fil.ion.ucl.ac.uk/spm) were used for image data processing. Image preprocessing was performed using the Data Processing Assistant for Resting-State fMRI (DPARSF) V1.0 software (http://restfmri.net/forum/DPARSF). DPARSF is a plug-in software package based on SPM (http://www.fil.ion.ucl.ac.uk/spm) and RS-fMRI Data Analysis Toolkit (REST, by Song et al., http://www.restfmri.net) . Functional data preprocessing included slice timing correction, motion correction, spatial normalization, and smoothing (FWHM = 4 mm). Data with head motions of more than 2.0 mm maximum displacement in any of the , , or directions or 2.5 degrees in any angular motion were discarded.
Preprocessing of fMRI data was performed with a time-series of images acquired from the same subject using the least squares approach and six-parameter (rigid body) transformation to remove movement artifacts. The images were spatially smoothed with an 8 mm Gaussian kernel to decrease spatial noise. Skull stripping of the 3D MRI T1-weighted images was carried out with the Alice software (Perceptive Systems, Inc., Boulder, CO, USA). A generalized linear model approach was used in each participant according to image groups (encoding, recognition, baseline1, and baseline2). Activation maps were calculated by comparing images acquired during the task with control conditions (i.e., the encoding condition was compared with baseline1 and the recognition condition with baseline2) using Student’s -tests. One-sample -tests were performed on activation maps using SPM8 (http://www.fil.ion.ucl.ac.uk/spm). Multiple comparison corrections for the data were performed using the Monte Carlo simulation (AlphaSim by B. Douglas Ward, http://afni.nimh.nih.gov/pub/dist/doc/manual/AlphaSim.pdf). Combination threshold of voxel was , and a cluster size >389 corresponded to a corrected .
The activation maps of all 14 subjects were then overlaid on the corresponding T1 images using random effect analysis .
MNI coordinates were converted to Talairach coordinates using nonlinear transformation. Coordinates shown in Talairach space for the center-of-mass and volume (mm3) of each activation cluster were determined based on the averaged activation maps. Anatomical labels and BAs were identified according to the Talairach Daemon database. The regions that showed significant differences were extracted as regions of interest (ROIs).
The detailed brain regions of significant activation for encoding versus baseline1 are listed in Table 1 and presented in Figure 2. When comparing encoding with baseline1, large clusters of activation were located in right prefrontal cortex with a peak activation area in insular cortex (BA13) extending to the midinferior frontal region (coordinates: 36, 24, 7, -value = 9.20, and ). Significant activations included three clusters in different regions: (1) right prefrontal cortices (BA6/8/9/46/45/47) including sublobar/claustrum and anterior cingulate gyrus (BA24), their activation extending to the prefrontal area; (2) right parietal lobe (BA7/39/40); and (3) ventral occipitotemporal cortex including fusiform gyrus (BA37/20) and inferior occipital cortex (BA18/19).
As shown in Table 1 and Figure 3, when comparing recognition versus baseline2, the activation pattern in bilateral cortices included frontal cortex (BA6/8/9/10/13/45/46/47), posterior cingulate gyrus (BA31), parietal lobe (BA7/40), and cuneus (BA18/19). However, there were limited activities in left cortices. Significant activations included two clusters in different regions: (1) bilateral superior medial frontal and (2) occipitoparietal regions. Strong activation was located in the right superior frontal gyrus with a peak activation area in the premotor cortex (BA6, coordinates: 6, 9, 58, -value = 8.54, and ).
The detailed brain regions of significant activation for encoding versus recognition are presented in Table 2 and Figure 4. Compared with recognition, encoding activated large clusters in bilateral ventral occipitotemporal cortex (BA18/19/37/20/36) with a peak activation area in left inferior occipital gyrus (BA18) (coordinates: −30, −96, −3, -value = 9.22, and ). Significant activations included two clusters in different regions: (1) ventral occipitotemporal cortex and (2) inferior and medial frontal cortices (BA47/11/25).
As shown in Table 2 and Figure 5, compared with encoding, recognition significantly activated three clusters in different regions: (1) bilateral precuneus (BA7), (2) cingulate gyrus (BA31/32), and (3) superior medial frontal gyrus (BA10). Strong activation was located in the left superior parietal gyrus with a peak activation area in precuneus (BA7, coordinates: −3, −73, 45, -value = 8.50, and ).
The present fMRI study aimed to explore neural substrates of integrative semantic categories in a more holistic manner. In contrast to previous studies, we used category-related logogram pairs to contribute to deep semantic strategies for encoding and recognition tasks. To ensure that brain activation was specifically attributed to semantic processing and not to categorical processing, semantics and control conditions were matched for orthographic or phonological (in Chinese) processing demands, and all four categories were pooled together. Logogram pairs from four different categories (fruits, animals, tools, and cloths) were included in a single experiment and analyzed as an entity to examine cerebral activity at a relatively integrative level rather than at a specific level.
The tasks employed in the present study activated ventral occipitotemporal, inferior frontal, superior frontal gyrus, and occipitoparietal regions, but not the posterior left inferior frontal gyrus (LIFG) (BA44/45), which is commonly activated in semantic memory tasks. The role of the LIFG remains unclear. Some studies claimed that the anterior LIFG (BA47) plays an important role in semantic processing , whereas the posterior LIFG (BA44/45) is specialized for phonological processing . The data presented above suggest that the anterior region is associated with semantic processing, irrespective of phonological demands. The LIFG has been previously shown to be involved in generating semantic associations [20, 21], particularly while making decisions concerning semantic associations [22–24]. Another explanation of this modulatory response is that it reflects increased demand for selection between categorical associations [25–27]. In the semantic tasks used in the present study, the subjects were required to select the related features of the logogram pairs to generate categorical associations. Therefore, our findings support the idea that the LIFG is involved in selecting among competing semantic features stored in the cortex .
The dorsolateral prefrontal cortex (DLPFC) is roughly equivalent to BA9 and BA46 and plays an important role in processing mnemonic information, for example, working memory. Some fMRI data have shown that the DLPFC activity is higher during semantic relationship-encoding processing compared with item-specific encoding [29–31], indicating that the DLPFC preferentially modulates semantic relational processing. In this study, subjects were required to build semantic relations between items using category-correlated features. Thus, it may be inferred that left BA9/46 activation may contribute to modulating semantic relational processing, as might be suggested by previous studies [32, 33]. Nevertheless, these results are similar to a previous study showing that BA9/46 is involved in the recognition of Japanese logograms .
As shown above, the total extent of brain activation, particularly in lateral-ventral regions of the occipitotemporal cortex, was significantly larger for the encoding process compared with the recognition one. These results indicate that the neurocognitive mechanisms of semantic processing underlying encoding may differ from those behind recognition . Goodale and Milner  proposed a division of labor in the visual pathways of primate cerebral cortex between a ventral stream that contributes to perception of the visual world and a dorsal stream specialized for the visual control of action. In the current study, lateral and ventral occipitotemporal regions displayed increased levels of encoding-related activity when the subjects were required to encode logogram pairs. However, we found little evidence for ATL involvement in semantic processing; activation was only observed in the ventral-lateral regions of posterior temporal cortex. This is consistent with a report showing that normal individuals display strong activation of ventral-lateral regions of posterior temporal cortex during a categorical association task, whereas patients with ATL atrophy failed to show activity in this region .
Our data suggest that the encoding task relies on feature-general processes to identify target word pairs for memorization and feature-specific processes to encode the pairing, while the retrieval task only relies on feature-specific processes for successfully retrieving the response words. The direct contrast between these two tasks might reflect the feature-general processes of semantic representation. However, further study using Chinese characters is necessary to complement these results. A previous report proposed that adults more effectively engage right hemisphere brain regions involved in the visual-spatial analysis of Chinese characters , indicating that Chinese character processing is age related. In addition, semantic and visual errors were shown to be associated with young age and low education level in children . Therefore, in this study, we selected subjects with similar ages and education levels, with the examination process clearly explained to all subjects, to minimize the effects of these confounding factors. Further studies will focus on populations of various ages as well as different language comprehension and education levels.
The present study suffers from some limitations. It was not exhaustive in semantic scope since all four categories were pooled to only keep the effect of encoding and recognizing logograms. It would be interesting to test subjects with the same method on different tasks, such as categorically related picture pairs. It would also be useful to examine the anatomic and functional connectivity of the ROIs to identify the directional and effective connections between ROIs, thus determining how ROIs interact with each other within a broader network. Such connectivity analysis can be explored to assess whether anatomic connectivity affects functional connectivity. Finally, it would be interesting to use the same alphabetical materials in English to test native English speakers and further investigate whether language surface features affect semantic processing representations. In addition, an increase in sample size would further validate and support the findings of this study.
Overall, our results suggested that distributed networks of discrete cortical regions with distinct roles are active during semantic processing of logograms. The ventral occipitotemporal and inferior frontal regions display increased levels of encoding-related activity. The dorsal medial brain regions including superior frontal gyrus and occipitoparietal regions are associated with recognition-related activity.
Conflict of Interests
The authors declare that there is no conflict of interests regarding the publication of this paper.
This study was funded by the National Natural Science Foundation of China (Grant nos. 81271524 and 81371514).
S. X. Lv and S. S. Deng, Xiandai Hanyu Cidian, The Commerical Press, Beijing, China, 2005.
Y. Liu, Xiandai Hanyu Changyongci Cipin Cidian, China Astronautic Publishing House, Beijing, China, 1990.
K. B. McDermott, S. E. Petersen, J. M. Watson, and J. G. Ojemann, “A procedure for identifying regions preferentially activated by attention to semantic and phonological relations using functional magnetic resonance imaging,” Neuropsychologia, vol. 41, no. 3, pp. 293–303, 2003.View at: Publisher Site | Google Scholar
X. Li, J. Wu, and Q. Guo, “Differential activity of semantic and font size judgment processing of Japanese word: an fMRI study,” in Proceedings of the 6th International Conference on Complex Medical Engineering (CME '12), pp. 677–682, IEEE, Kobe, Japan, July 2012.View at: Publisher Site | Google Scholar