Abstract

The pattern, process, and mechanism of residential heterogeneity vary significantly with different geographical scales. However, most traditional methods ignore the checkboard and modifiable areal unit problem (MAUP), which may cover up the complexity and hierarchy of social space. Taking Hangzhou city as an example, a multiscalar method was proposed based on the information entropy theory to estimate residential heterogeneity and its scale sensitivity. Based on the sixth population census of Hangzhou and the housing price database of 6,536 residential districts from 2008 to 2018, we explore the scale effect and dynamic characteristics of residential heterogeneity. The results of spatial simulation and geostatistical analysis based on Python Spatial Analysis Library (PySAL) module show that the multiscalar algorithm better presents the real segregation pattern than traditional method, which is one of the new models and technologies in urban geography complex system. Exploring residential heterogeneity through multiscalar lens provides an important basis for the gradual and refined urban renewal.

1. Introduction

In the context of neoliberalism and capitalism, the complexity of urban residential geography becomes more pronounced as its diversity increases. The sociospatial differentiation dominantly represented by residential heterogeneity is one of the effective ways to identify periphery and vulnerable area. While physical geography and remote sensing have always placed the concept and rescaling of scale at the forefront of their research, including cartographic scale, geographic scale, operational scale, and measurement scale [1], human geography, on the other hand, paid little attention to the scale problem and has different theories and methodologies [2], with three main sociological implications [3]. Firstly, scale refers to the spatial categories or geographic scope we use to understand and study our world, including the global, nation, local, households, and everyday life [4]. Secondly, scale is used to represent the level of social practices and spatial structure [5]. In addition, based on the theory of scale politics, scale is also a component of space production and the crystallization of social relations [6]. From the classical factorial ecology of Chicago School to the radical Marxism and positivism in the 1970s [7, 8] and to the recent paradigms of spatial simulation and geostatistics, human geographer has systematically studied the heterogeneity of social area for nearly a century and called for a more complex and multidimensional methodologies [9, 10]. The quantitative study on the heterogeneity of residential space can also be used to evaluate the spatial efficiency of urban policy initiatives, thus providing a basis for analysis and public policy intervention [11]. Indeed, the spatial lens through which we measure segregation has a profound effect on more just and socially equitable public policies, community development, and responsible governance [12].

Residential heterogeneity reflects the geographic unevenness of social resource in urban space, presenting a mosaic landscape full of diversity and polarization overlapping [13]. Scholars have struggled to adjust indicators of social segregation to measure residential segregation in the context of more complex, multiracial, and marginalized realities [14]. Since Duncan came up with the earliest Dissimilarity Index (D) in 1955 [15], more than 20 types of indices have been developed. Earlier indices, such as the classic Gini Index (G) [16], Atkinson Index (A) [17], and Information entropy Index (H) [18], were essentially global nonspatial indices that did not take into account spatial proximity and geographical scale, which may be sensitive to the variations in the number and population of tracts [19]. In the 1980s, the comparison of different indices stimulated a wave of reflection on the methodology of dissimilarity index. The second-generation differentiation index, represented by Exposure Index (xPy), improved the index by considering the spatial distribution of different groups based on the distance decay function [20]. At the end of the 20th century, rapid urban expansion and suburbanization that resulted in the global residential heterogeneity in macroscale seems to be less significant [21]. If we still use a single-scalar index, this may produce segregation paradox [22, 23]. That is, although the segregation index seems to be decreasing, the real segregation and contradiction within the space is still increasing. The third-generation index, represented by Spatial Dissimilarity Index (SDI), is modified by adding spatial weight matrix [24, 25]. Generally, the research of residential heterogeneity has changed from single-scalar paradigm to macro–micro research or macro refinement method. Multiscalar lens provides a framework to integrate the macrostability and microdiversity of urban complex systems [26].

The multiscalar phenomenon of residential differentiation was discovered as early as the mid-20th century [27, 28]. That is, residential heterogeneity is higher on a smaller scale and lower on a larger scale, also known as the modifiable areal unit problem (MAUP) [29]. However, due to technical and data constraints, this issue did not attract the attention of the economic geography at that time. The multiscalar analysis techniques for spatial heterogeneity have only recently been implemented and become the cutting edge of research [30]. In order to overcome the checkboard problem and modifiable areal unit problem (MAUP), Wong proposed a boundary modified version dissimilarity index based on the ratio of perimeter to area [31]. Reardon argued that scale sensitivity could be reduced by analyzing the spatial segregation profiles and the macro–micro segregation ratio [32]. O’sullivan and Wong attempt to simulate spatial segregation and its scale effects using a kernel density estimation method [33]. Fowler further demonstrates the key assumptions for multiscalar segregation measures [34]. Manley uses a multiscalar method to modeling the macro-, meso-, and microscale residential patterns in Auckland, New Zealand, from 2001 to 2013 [35]. Johnston and other scholars have creatively proposed a dissimilarity model which considers both macrostability and microdiversity [36]. Hennerdal and Nielsen developed a multiscalar method based on k-nearest neighbors to measure the exposure for different ethnic groups [37]. Similarly, Catney used a spatial weighting method to test the scale effect in England and Wales [12]. Recent work on modeling residential heterogeneity tends to shift from traditional environmental or economic determinism in framing multidimensional lenses and problem-oriented paradigm.

Previous studies mainly measure the residential heterogeneity by aggregating predefined census tracts, which is difficult to overcome the data randomness and MAUP. With the advent of fine-grained and massive data, it is possible to quantitatively study the multiscalar phenomenon between and within different spatial units. In view of these identifiable gaps and the opportunities provided by multiscalar measures, three core themes are explored. Primarily, this paper proposes a novel methodology for evaluating algorithms that calculate social influence in residential heterogeneity. Secondly, this paper systematically studies the relationship between different dimensions of residential differentiation and provides data support and theoretical basis for differential urban renewal measures and progressive refined spatial restoration. Thirdly, it also contributed to the interpretation of changing segregation patterns in response to the complex interaction of a variety of structures and developments on different spatial levels. The paper is structured as follows. The next section presents the social reproduction of geographic scale. The third section presents study area, data processing, and the methodological aspects of multiscalar segregation. The fourth section presents the results of the estimated empirical model for residential heterogeneity. The fifth section presents the conclusion and provides a discussion.

2. The Reproduction of Geographic Scales

Geographic scales are produced rather than given, which is defined as the reproduction of labor and the journey of work [38]. The reproduction of geographic scales explores how residents themselves shape the constitution of these scales in complex ways, including a dual structure of regional scope and relational hierarchy [39]. Different resources and social structures provide different investment returns for different investments. At the same time, competition between different regions to attract capital exacerbates geographical imbalances. With the restructuring of labour-capital relations and subsequent reassembly, the unevenness of resources allocation and development in urban social morphologies, infrastructures, and everyday life are becoming more fragmented and diversified [40]. The theory of social reproduction of geographical scale is derived from the Marxist theory of space economy, the political economy. Since Smith had introduced the concept of politics of scale, Radical Marxist scholars have emphasized how economic and political processes shape the emergence of scales [41]. For Brenner, geographic scales can be understood through the hierarchization and rehierarchization among vertically differentiated spatial units [6]. Studies have shown that a decline of dissimilarity index in one scale or single dimension does not necessarily imply an increase of social integration or mobility. As Smith and Dennis argue, US society in the 1970s was likely to shift from regional to more localized unevenness, and multidimensional approach helps to depict a more complex view of complex reality [42].

Capital accumulation and circulation are the main driving force of urban development, and what kind of leverage to achieve the “scale jump” is a difficult task. Scale reconstruction leads to the change of the weight of local spatial units in the global landscape, and the corresponding spatial power and governance results are also reshaped [43]. Smith’s idea of nested scales emphasizes the interdependence of these processes [38]. Harvey further emphasized that labor surplus and capital surplus can be recycled through scale reconstruction and reproduction to achieve spatial fix [44]. The government-oriented scalar reconstruction tries to reduce the regional disparity by investing in technology and social expenditures. On the other hand, the capital-oriented scalar reproduction improves the efficiency of capital flow through consumption fund and fixed capital. Based on the spatial fix theory, this paper proposes an analytical framework for the reproduction of geographic scales (see Figure 1). The theory of scale politics makes the study of residential heterogeneity turn from flat to structural scale and from unitary to pluralistic scale. However, previous studies are mostly based on national or census scale. This paper exploring residential heterogeneity and its dynamic mechanism through multiscalar perspective provides an important basis for the gradual and refined urban renewal.

3. Materials and Methods

3.1. Study Area

Hangzhou is an important central city of the Yangtze River Delta and a transportation hub in southeastern China and a pioneer in the transformation of the urban service industry and information technology. From 1990 to 2018, the urban built-up area increased from 69 to 615 km2, and the urban scale expanded nearly 10 times. The land expansion of urban built-up area mainly comes from the continuous increase of agricultural land in urban fringe area during the process of administrative territorial entity adjustment. Urban expansion has resulted in a large number of land-lost farmers and migrants living in informal settlements, which makes Hangzhou a typical case to study residential heterogeneity. The study mainly covers the whole urban area of Hangzhou, including the eight districts of Shangcheng, Xiacheng, Xihu, Gongshu, Jianggan, Binjiang, Xiaoshan, and Yuhang, not including Fuyang and Lin'an districts, which were newly established administrative district from cities in these years (see Figure 2).

The data are mainly extracted from the sixth national census, and based on the Chinese house price platform (http://www.creprice.cn), the whole sample housing transaction data of 6,536 residential districts in Hangzhou from 2008 to 2018 are obtained. Census data include the sociodemographic characteristics of all individuals, such as sex, age, race, and education, which are comprehensive but time-lagged. Housing price reflects the unbalanced allocation of residential resources and housing demands, which to some extent overcome the shortage of the time lag. Therefore, we adopt the method of combining population census with housing data and take Hangzhou City as an example to set up a research framework of spatial differentiation on three scales of population census, block, and grid; the heterogeneity of residential space in different population and scale was studied by using spatial information entropy and scale-sensitive profiles.

3.2. The Dimension and Methodology

The dimension and methodology of segregation adopted in this study follows that of Reardon and O’Sullivan [45]. For a long time, scholars have been debating about the different dimensions [14] and measurements [46] in which residential heterogeneity can be conceptualized. Massey and Denton firstly proposed five dimensions of segregation, including unevenness, exposure, concentration, centralization, and clustering. While Reardon proposes that there is overlap between these classic five dimensions and further generalizes them into two dimensions, including evenness and exposure. Here, we use the information entropy index () and spatial information entropy index (), combining with exposure isolation index () and spatial exposure isolation index (), to assess the residential heterogeneity in both nonspatial and spatial versions.

The information entropy index () is considered to be the most effective index to measure the geographical evenness, which refers to the extent to which groups are similarly distributed in residential space. A disorganized complexity system can be defined as an entropy function [47]. Then, we constructed the information entropy index () of residential heterogeneity based on Theil’s information entropy index:where is the global entropy, is the entropy of spatial unit , is the population of group at spatial unit , and is the proportion of group at spatial unit . And isolation () assesses how much a minority group is only exposed to the same group, in other words how much they only interact with the members of the group they belong to. This paper uses Bell’s isolation index () to measure exposure to residential heterogeneity in exposure dimension:

Unless spatial subarea boundaries correspond to meaningful social boundaries, all measures of segregation that rely on population counts aggregated within subareas are sensitive to the definitions of the boundaries of these spatial subareas. Based on local segregation index () developed by Feitosa, we adopted a kernel-based spatial weighted index to improve the aspatial approach:where is the kernel estimator which estimates the influence of each areal unit on the locality . The local population intensity of locality () and the local population of group () are computed using a kernel estimator placed on the centroid of areal unit and the weights are given by the choice of a distance decay function and parameter, and we also computed the local proportion of group in locality (). Based on the spatial proximity and the local environment, we compute the spatially weighted information entropy () at each unit :

Likewise, based on the method of Morgan, we developed an integral version distance decay exposure () as follows:where is the area of unit and is a pairwise distance between area and and is estimated as . could be seen as the probability of contact of members of group to each other weighted by the inverse of distance. The default is and . For the distance measure, we first extract the centroid of each unit and calculate the Euclidean distance.

3.3. Data Processing

In this study, the census-level data is pregiven in the 6th population census, and we preprocess the block-level and grid-level data as follows. Firstly, the study area is divided into 1155 block units surrounded based on the main roads and administrative boundary by using topology and feature to polygon tools in ArcGIS. Comparing remote sensing images, 887 block units were remained by eliminating uninhabited blocks. Secondly, the study area is divided into 3617 grid units by 1000 m Fishnet, and the nonnull grid units were remained by overlaying the whole grid unit as before. Finally, we estimate the population of different housing prices based on housing prices and home size and divide them into five groups through natural breaking points. The population of different housing prices is extracted to different level units by using the extract values to polygon tools in ArcGIS.

The spatial indices have theoretical advantages over the conventional ones but have been rarely used, primarily due to the computational difficulties. With the development of geostatistics models and techniques in the past decade, a number of methods try to improve the implementation of segregation model through GIS and R since 2014 [48, 49]. The advantages of open-source software of urban complex systems in the academic realm have become obvious. Python Spatial Analysis Library (PySAL) is a comprehensive Python Library of spatial analysis, including spatial data analysis, spatial econometrics, geographic processing, and spatial data visualization function, which is the most effective way to study residential heterogeneity. Based on the open-source PySAL module developed by Cortes et al. [50], this paper explores residential heterogeneous through multiscalar lens based on spatial weight matrix and street network distance.

4. Results and Discussion

4.1. Residential Heterogeneity of Different Groups and Dimensions

The method of social area analysis based on census data has been a classic paradigm for the study of residential heterogeneity in the past century. Compared with the globalism and neoliberalism in Western countries, the structure and characteristics of residential heterogeneity in China have gradually changed from a planned economy to a market economy, which has experienced more intense and complex spatial reconstruction and social transformation. Among different groups and backgrounds, migrant, female, education, and occupation have the main factors to study the differentiation of traditional residential areas [51, 52]. While China’s household registration system during the transition period is one of the unique social characteristic variables of residential heterogeneity in China’s metropolitan, which to a large extent results in the separation of welfare treatment and social opportunity for the relevant groups, which greatly affects the segregation of welfare and opportunity of relevant groups. Therefore, besides classical social area analysis, we also add the household registration variable, called Hukou, to analyze the spatial pattern of residential heterogeneity in Hangzhou. According to relevant studies [53, 54], we analyzed the 9 groups in the study area, including migrant, female, hukou, agriculture and husbandry, professional and technical, business and services, highly educated (college degree or above), elderly (over 60), and unemployment based on census data.

From the dimension of evenness, nonspatial segregation degree of different population groups was the highest (0.21603), followed by agriculture and animal husbandry group (0.18064), higher education group (0.13846), and floating population (0.10730). The segregation degree of the female group is the lowest (0.00219), the professional and technical personnel (0.02431), the commercial and service personnel (0.02711), the unemployed group (0.01901), and the information entropy index of the elderly population (0.05324) is also less than 0.1. The results showed that the spatial distribution of different sex groups in Hangzhou was relatively uniform. In general, the nonspatial information entropy index based on census data is not significant on the whole, and the spatial distribution of census population is the most uneven; single-scale analysis will be drawn in 2010. Hangzhou social space has been more homogeneous spatial paradox (see Table 1 and Figure 3).

From the dimension of exposure, except the contact degree of agriculture and animal husbandry groups is less than the uniformity, the isolation of other groups in the contact dimension is higher than the uniformity. Among these groups, professional and technical workers were most exposed (0.71963), indicating a 71.963 percent probability in the life of the population engaged in professional and technical work to have contact with persons within their own group and a high degree of isolation from other groups; perhaps this has to do with the fact that most of the people concerned living mainly near their place of employment. Secondly, the isolation index (0.67199) of household population is higher, which shows that household population has very significant residential isolation in both evenness and exposure dimensions; it is one of the most important factors affecting the residential heterogeneity in Hangzhou. The floating population (0.52312), the female population (0.48744), the highly educated population (0.36930), and the commercial service population (0.27094) are close to each other in the exposure dimension, and there is a certain degree of residential segregation. Finally, the unemployment group (0.03876), the agriculture-related group (0.12330), and the elderly group (0.14636) in the exposure dimension of the isolation are smaller, and the probability of contact with other groups is larger (see Table 1 and Figure 3).

In order to test the statistical significance of results, we generated a prefitted segregation class and used the distribution of these measures under the null hypothesis where segregation does not hold. As shown in Figure 4, this class was plotted to inspect the generated distribution with the estimated value from the sample (vertical red line). It is possible to see that clearly the result of P_value is almost 0.00, which suggests that these groups in Hangzhou city is statistically segregated at 5% of significance level ( value <5%) in terms of and .

4.2. Residential Heterogeneity in Different Periods and Scales

Residential heterogeneity in different geographical scales presents different spatial patterns and mechanisms, and it is difficult to explain the social spatial heterogeneity pattern of the whole city by analyzing only in single scalar or time perspective. In recent years, with the sharp rise in housing prices and the increasing importance of housing in the wealth of urban residents, residential differentiation has intensified social differentiation through such spatial differences as educational opportunities, commuting costs, and social distance [55]. Families of different classes choose different residents and its neighborhood according to their economic conditions and preferences. While low-cost housing is often accompanied by a lack of educational resources, employment opportunities, and public facilities, which aggravate the isolation from the surrounding social environment. Housing price is considered as the marketization projection of the unbalanced distribution and different demands, which makes up for the time delay and static shortcomings of the traditional studies on residential heterogeneity based on a single scale and time perspective. Therefore, based on the full-sample housing price transaction database of 6,536 residential districts in Hangzhou from 2008 to 2018, this paper further explores the scale effect and dynamic characteristics of residential heterogeneity. Firstly, we use the kernel density analysis to simulate the spatial distribution of housing price in different periods in Hangzhou. The results show that the social spatial distribution of Hangzhou is obviously uneven and fragmented, the spatial distribution pattern has changed from the West Lake-centered to Qiantang River-centered, and the residential heterogeneity in different periods shows different spatial distribution characteristics (see Figure 5).

Different from the study of housing price volatility, the study of residential heterogeneity pays more attention to the spatial justice and social mechanism behind housing price. Therefore, we estimate the number of different housing price groups by housing price and housing size and divide them into 5 groups through Jenks Nature breaks method. From the perspective of time, the dynamic characteristics of residential heterogeneity in Hangzhou show an inverted U-shaped trend, which can be roughly divided into four stages (see Table 2). In the first phase (2008–2011), the average house price in Hangzhou rose rapidly from 13,000 yuan to 20,000 yuan, while the government launched the longest-term and most stringent housing purchase restriction policy. In the second phase (2012–2014), the average housing price growth in Hangzhou slowed down and even dropped to 18,000 yuan, and the effect of the policy on the housing market began to appear. In the third phase (2015–2017), the market depression led the government to start to lift the purchase restriction and loan policy to stimulate the market, and the average housing price in Hangzhou rose to 26,000 yuan. In the fourth phase (since 2018), the average house price in Hangzhou rose to 32,000 yuan, and government departments issued a series of regulation and control policies again.

In order to explore the scale effect of residential heterogeneity, we constructed the spatial information entropy profile from 0 to 5000 meters, which is consistent with the basic rule proposed by Reardon that the larger the scale, the smaller the degree of segregation. The heterogeneity based on housing prices is more significant than that based on censuses data, and the problem of evenness is more serious than that of exposure. At the census level, evenness (0.395) and exposure (0.348) are the smallest, and the slope with scale decay was the smallest. It shows that the residential heterogeneity of different housing price groups seems not that significant at the macroscale, which is difficult to reflect the real situation. At the block level, the evenness (0.549) and exposure (0.490) are moderate, and the slope with scale decay is also moderate. It shows that the residential heterogeneity of different housing price groups is more significant at the mesoscale, which took neighbourhood effect into account and better reflected the real situation. At the grid level, the evenness (0.601) and exposure (0.540) are higher and closer to the results at the block scale. It shows that using grid as the basic unit may magnify the residential heterogeneity excessively, which may bring uncertainty in the comparative study of different cities (see Figure 6). The slope of the distance attenuation is most obvious within the 2000 m bandwidth, which is similar to the scale of a neighborhood.

With the simultaneous advancement of outward expansion and interior potential-tapping urbanization, the differentiation of residential space has gradually changed from privilege-led to territory-based. The adoption of multidimensional and multilevel housing policies or regional programs has become an effective way to reshape the space. Based on the conclusions above, we propose the following suggestions: from the dimension of evenness, the interaction between market regulation and government supervision is conducive to the healthy and sustainable development of residential space. Under the background of fully stimulating the market vitality and leverage of the real estate industry, a diversified housing security system should be set up to avoid the further widening of the class differentiation caused by income difference. From the dimension of exposure, providing equal public services and facilities for the relatively vulnerable groups helps to increase the communication opportunities of different groups and promote the mixed-dwelling model of different social strata.

In addition, there are several limitations associated with our study. Firstly, this study is conducted based on the assumption that the housing consumption level represents the mainly social group classification, which is difficult to study the heterogeneity within the same community due to the lack of personal behavior data. Secondly, the multiscalar effects of residential heterogeneity are the comprehensive result of macro strategies and local responses, and it is difficult to consider and measure all relevant factors of residential heterogeneity. Thirdly, the data processing process of real estate speculation cannot effectively remove false information, which may have a slight impact on the results of data processing. These limitations are the direction of follow-up research.

5. Conclusions

Theoretically, residential heterogeneity is a complex multilevel phenomenon, which is closely related to population migration, economic transition, and historical and contemporary crises. On the one hand, it is the spatial representation of differentiation social stratum in social economy and status, and on the other hand, the result of residential heterogeneity will promote and strengthen the reproduction of social strata at the subcultural level. The multiscalar method improves the traditional approach in the following aspects: (1) In the aspect of methodological, while most of the existing research uses the nonspatial model, the multiscalar method overcomes the chalkboard and modifiable areal unit problem (MAUP) and can express the complexity and hierarchy of the social spatial structure. (2) In the aspect of dimension, while most of the existing research only discussed the dimension of evenness, this paper added exposure dimension to explain the residential heterogeneity more comprehensively. (3) In the aspect of time, while most of the existing research uses the census data, the multiscalar method overcomes the problem of time lag and described the dynamic characteristic of residential heterogeneity through the housing price group. (4) In the aspect of exactitude, traditional studies do not test the significance of the results, and the results in many empirical studies seem to be gradually decreasing and not obvious when the actual segregation is still increasing, which is called the paradox of segregation.

Empirically, this study estimated residential heterogeneity and its dynamic characteristic in both nonspatial and spatial versions and reached the following conclusions: (1) The residential heterogeneity based on housing price groups is more significant than that based on census data, which makes up for the time lag and static of census data to some extent. (2) Among different groups, the Hukou factor is tended to be the most important factor that affects the residential heterogeneity of Hangzhou in decay. The social welfare tied up with the household registration system may further lead to the solidification of the social class. (3) Comparing the three basic statistic units of census, block, and grid, the rapid expansion and the suburbanization make the global segregation at macrolevel seem less significant, while the local segregation at microlevel is still serious. And the block units enclosed by the main roads better present the neighborhood characteristics of the residential heterogeneity, which is considered to be the finest scale to analyze the residential heterogeneity. (4) From different periods throughout 2008 to 2018, although the government’s regulation policy does not change the momentum of housing price rise in the short term, it has a long-term positive effect on alleviating residential heterogeneity in Hangzhou. Overall, the modified multiscalar approach is one of the new models and techniques for urban geographical system, which can better explain the real differentiation pattern than the traditional method.

Data Availability

Data for this work are available from the corresponding author upon request.

Conflicts of Interest

The authors declare that there are no conflicts of interest regarding the publication of this paper.

Acknowledgments

This work was supported by the Social Science Foundation of Zhejiang Province (20NDQN306YB); the National Natural Science Foundation of China (41771184, 41901196); the Key Foundation of National Social Sciences of China (17ASH003); and the Key Foundation of Education Department Hunan Province (18A001).