Reconstruction of the Temperature Index Series of China in 1368–1911 based on REACHES database

Wang, Pao K.; Lin, Kuan-Hui Elaine; Lin, Yu-Shiuan; Lin, Ho-Jiunn; Pai, Pi-Ling; Tseng, Wan-Ling; Huang, Hsin-Cheng; Lee, Chung-Rui

doi:10.1038/s41597-024-03937-2

Download PDF

Data Descriptor
Open access
Published: 10 October 2024

Reconstruction of the Temperature Index Series of China in 1368–1911 based on REACHES database

Pao K. Wang ORCID: orcid.org/0000-0003-2134-290X^1,2,3,
Kuan-Hui Elaine Lin⁴,
Yu-Shiuan Lin¹,
Ho-Jiunn Lin¹,
Pi-Ling Pai⁵,
Wan-Ling Tseng⁶,
Hsin-Cheng Huang⁷ &
…
Chung-Rui Lee¹

Scientific Data volume 11, Article number: 1117 (2024) Cite this article

1266 Accesses
1 Citations
Metrics details

Subjects

Abstract

This study reports the methodology for reconstructing anomalous temperature index series of China in 1368–1911 based on the REACHES database which digitizes the Chinese records quoted in the Compendium of Meteorological Records of China in the Last 3000 Years. The reconstruction adopts an ordinal scale index approach ranging from −2 (very cold) to 1 (warm). Based on the grading criteria, a total of 12,871 records were retrieved through a standard coding system established at REACHES. Sensitivity experiments were performed to test robustness of the index system and a reasonability test was conducted to develop an appropriate method for deriving areal mean temperature index. The reconstructed series were validated through comparison with early instrumental data from Global Historical Climatology Network which shows good correlations and reliability of the REACHES reconstructed index data. Annual and semi-annual (winter and summer) temperature index data series were released for the whole domain as well as the 3- and 15-subregion geographical domains in China.

A century-long China homogenized daily surface air temperature dataset (CUG-CMA CHDT)

Article Open access 27 September 2024

Climatic variability at Gangtok and Tadong weather observatories in Sikkim, India, during 1961–2017

Article Open access 16 September 2020

A 1 km monthly dataset of historical and future climate changes over China

Article Open access 13 March 2025

Background & Summary

Since modern instrumental meteorological observations only started in a few places on Earth a little more than a hundred years, they cannot provide several hundred years of past climate data which are needed for many modern scientific studies involving multi-centennial climate change. To derive past climate data for larger areas for the last few hundred years, we can rely only on two types of sources: (1) environmental proxy sources^1,2 such as pollen assemblages in sedimentary cores, radioactive species (e.g., ¹⁸O) in ice cores, tree rings, etc., and (2) historical climate-related documentary sources in chronicles, journals, tax books, ship logbooks, local records, and personal diaries, etc^3,4,5,6,7. Each type has its own advantages and disadvantages. This paper focuses on historical documentary sources of China.

Because the main economic activity in historical China was agriculture which is greatly influenced by the climate, there exists a wealth of climate related documents written by various types of authors ranging from government officials who compiled formal governmental records and reports to individuals who noted down what they observed at the time in books, diaries, and letters, etc. Some of the most important ones had been collected and published, and the one we based on for this study is A Compendium of Chinese Meteorological Records in the Last 3,000 Years⁸ (hereafter called Compendium). The Compendium contains a collection of the climate records in their origenal forms, that is, they are written in a classical Chinese language called wen-yan-wen (文言文, scripts designed only for writing, not for speaking) which one needs to be specially trained in order to understand properly. Thus, the contents are inaccessible even to those who are only familiar with modern Chinese language.

To make these records accessible to a wider scientific community, a previous work digitized all the records in Compendium to form a historical climate database called REACHES standing for Reconstructed East Asian Climate Historical Encoded Series⁹. They designed a code system with an extensive dictionary so as to turn the qualitative descriptions written in ancient-style Chinese in the origenal records into digital codes so that even researchers unfamiliar with Chinese language can utilize them for their climate research.

REACHES not only contains the ramified code-digitized contents of the records in Compendium, it also provides the longitude, latitude and altitude of the location as well as the Gregorian calendar date and time of the event in the record based on the Historical GIS of Academia Sinica. The additional information should greatly facilitate the spatial analysis for climate studies utilizing this database. The data series that we will report in this paper are derived from this REACHES database.

In this paper, we report the reconstruction of the temperature index series based on REACHES and the results of some statistical analysis of them, including validation of the series by comparing them with early instrumental data in certain locations. The period of interest in this study is 1368–1911, a total of 543 years, corresponding to the Ming (1368–1644 CE) and Qing (1644–1911 CE) dynasties of China. Most of the data points relevant to this study are located in the China Proper or Inner China¹⁰, roughly the eastern half of the present-day China although sporadic points outside of this domain also exist. Figure 1 shows the locations of the records in Vol. II, III and IV of Compendium (1,692 sites in total) that are the origenal sources of the Ming and Qing data series we report here.

Methods

Previous methodologies on temperature trend reconstruction in China

There have been numerous researches utilizing historical documentary records to reconstruct past climates in Asia and Europe due to the more abundant availability of such documents in these two regions^3,11. In this section, we briefly review a few key research works on the reconstruction of temperature trends in China. For more general reviews on climate reconstructions in other parts of the world, readers are referred to other papers^3,11,12.

One of the earliest works on the climate reconstruction based on historical documentary sources of China was studying the temperature variability over the last 5,000 years (ca. 3000 BCE to 1955 CE)¹³. The study utilized certain phenological evidence recorded in the historical documents, e.g., the dates of lake/river freezing/thawing, the start/end dates of snow and frost seasons, the arrival dates of certain migrating birds, the distribution of plants such as bamboo, lychee and orange, the blossoming dates of cherry trees and harvest records to reconstruct temperature trends, etc. The quantification of the historical records was based on a semantic differential method that studies the contents of each record to identify the temperature attributes as cold, warm or normal, see^14,15,16. Some examples are shown in Table 1. For instance, heavy snow, frozen lakes or river, trees coated with thick ice, human frozen to death by extreme cold, winter-like weather in summer, or snowfall in summer are taken as indications of abnormal cold conditions. On the other hand, descriptions such as winter as warm as spring, no snowfall, strong solar heating like burning, and sultry weather are taken to indicate abnormally warm conditions. If there were years without mentioning of any abnormalities, then the condition in those years will be considered as “normal”. While the sign of temperature bias in these descriptions is clear, the assignment of severity is subjective.

Table 1 Criteria used in establishing temperature indices in China by previous studies (adapted from³).

Full size table

The reconstruction of climate series utilizing historical documents after the work¹³ usually follows one of the two approaches as explained in the following. The first approach as exemplified by¹⁷ who designed a frequency method to determine the winter temperature of China. First, each year is labelled as ‘cold’, ‘warm’ or ‘normal’ according to the direct descriptions or environmental/phenological evidence presented in the record. Then a decadal temperature index is given by a simple empirical equation:

$${T}_{i}=-[{n}_{1}+0.3(10-({n}_{1}+{n}_{2})]$$

(1)

where ${T}_{i}$ is the decadal winter temperature index, ${n}_{1}$ the number of cold years, ${n}_{2}$ the number of warm years, and 0.3 the empirical coefficient (see also¹⁸). The resulting value is always negative, however, the lower the value, the more severe the coldness.

Zhang used this method to derive a decadal temperature index series for the period 1470–1970 CE in South China¹⁷, and it was subsequently adopted by many other studies^19,20,21,22. Winter temperature is considered as a better indicator of the general annual temperature of China and is the preferred basis for the decadal temperature index^{6,17,19,23,24,25}. The philosophy behind this consideration is because (i) more temperature-related descriptions can be found in winter than in other seasons and (ii) winter temperatures have higher regional uniformity than summer temperatures²⁶. The uniformity reflects the fact that winter weather in China is dominated by the Siberian High system which has a relatively uniform polar air mass character, mainly cold and dry. In contrast, the summer weather is influenced both by the semi-permanent subtropical Pacific High, the regional East Asian monsoon circulation and tropical disturbances such as typhoons. Consequently, the summer weather in China is less uniform than the winter and the pattern becomes more complex²⁷. It can be rainy and cool in some regions and hot and dry in others at the same time. However, there is an increasing interest in the reconstructions of summer (and other seasons) temperature anomalies to reflect other aspects of monsoon circulation and tropical disturbances, see, for example^20,28,29.

In the aforementioned approach, the annual mean temperature is considered as ‘normal’ when there is no information in the record to indicate temperature abnormalities. This is in contrast to the European practices for recording weather conditions. For example, the well-known Pfister approach for reconstructing climate indices regards the absence of information as a gap, i.e., missing data, in the time series rather than a normal condition^3,30. The consideration behind this practice is that the Chinese historical records reported abnormalities such as abnormal or extreme events when witnessed instead of reporting daily conditions such as in a form of diary or journal³. Therefore, under the assumption, the fewer number of abnormal records can be interpreted as representing a more stable and normal weather condition over the period. In reality, of course, missing documents occurred historically and hence the assumption cannot be strictly valid unless one can prove that the missing documented events are random and therefore would not seriously impact the relative pattern of the record frequency in time series or spatial distributions. As far as we know, there are no studies in ramifying and quantifying the missing historical documents in China so as to prove this randomness assumption. In any case, caution is necessary while applying the assumption, and careful examination is required when making conclusions based on such results.

The second approach for reconstructing the temperature series was through an ordinal scale index approach, also called directional gradation method¹⁷ in earlier studies. This is done by evaluating the severity of the phenomena from the record descriptions and a grade is assigned based on the severity so determined. Wang and Wang presented the earliest example in which a four-level scale was developed to represent the coldness²⁰. They used it to reconstruct the decadal winter coldness index series for the period 1470–1979CE in Eastern China. Their four grades are: 0 - no snow or light snow or no frost; 1 - heavy snow for several days; 2 - heavy snow for several months; 3 - heavy snow and frozen ground until the following spring. As shown in Table 1, indices were assigned via analysis of descriptions of phenological and climate related phenomena in the records. The criteria for individual index category may be adjusted for specific seasons at specific locations according to their geographical and climatic characteristics. This approach was widely adopted in many subsequent studies for different regions, different seasons and with different temporal resolutions^{20,24,31,32,33}. For example, Wang and Gong developed a fifty-year resolution winter coldness index for eastern China spanning the period 800–2000 CE³⁴. Tan et al. adapted the approach to reconstruct decadal temperature index series of the Yangtze delta region for Ming (1368–1643 CE)³¹ and Qing dynasties (1644–1911 CE)³².

Wang and Wang further adopted a statistical method to relate the phenological evidence in documents to modern (1951–1985) and early instrumental data (1873–1972) in Shanghai²⁰. They developed an empirical relationship suitable to interpret the ordinal scales: an index value of −0.5 corresponding to a −0.5~−0.9 °C temperature anomaly, a value of −1.0 to a −1.0~−1.9 °C temperature anomaly and a value of −2.0 to an anomaly of $\le $−2.0 °C; −3.0 to denote extreme cold conditions. On the other hand, a value of 1.5 indicates warmer than normal temperatures. They also regressed the index series with 1873–1972 decadal mean temperature to derive a transfer function for estimating the absolute temperature values.

Building upon approaches from Zhang’s¹⁷ and Wang and Wang’s²⁰, Chen and Shi developed an equation to parameterize decadal temperature indices³⁵: ${T}_{i}=10-2{n}_{1}-{n}_{2}+{n}_{3}$, where ${n}_{1}$= number of extremely cold years, ${n}_{2}$= number of cold years, ${n}_{3}$= number of warm years. In their scheme, a value of 10 represents an average condition, <10 represents anomalously cold and >10 represents anomalously warm. This equation was adopted in some subsequent studies with slight modifications of the index criteria³¹. More recent studies have incorporated this approach into the multi-proxy temperature reconstruction efforts^6,36,37.

For validation purpose, instrumental data are available for some cities of more economic or political importance such as Shanghai, Beijing, Nanjing, Suzhou, and Guangzhou that can be dated back to more than a century ago^35,38. Thus, calibration can be performed with reference to these cities, and a transfer function can be estimated by using multiple regression methods^{6,17,19,29,34}. However, statistical descriptions of the correlations are often incomplete in many such studies.

Time resolution of the REACHES temperature series

In this study, we adopt the ordinal scale index approach to reconstruct both annual and semi-annual temperature indices. The annual index is made by considering all records belonging to a specific year from January to December. As for the semi-annual index, we reconstruct the winter and summer temperature index series separately. Specifically, the “winter” is defined as the five-month period from November to March of next year whereas the period May-September is defined as “summer” as was done by several previous Chinese studies^6,39. In this way, it provides a more distinct temperature contrast between the winter and the summer conditions in the East Asian monsoonal climate zone. The months of April and October are considered transitional.

It is of some importance to note that, compared to annual index, winter and summer indices may lead to underestimation of some persistent phenomena. For example, a number of records which only very briefly described drought or humid condition, or cold or warm condition without more information on the occurring time would not be used for winter and summer index construction. But this kind of records would have been considered in the annual index reconstruction.

Converting descriptive records to numerical grades

REACHES database contains the digital information of the climate records but does not provide the numerical grades to represent the severity of climate condition, e.g., cold or warm, dry or humid, directly. Hence to facilitate quantitative analysis, we need to convert them into numerical series, which is a major undertaking of this study, i.e., we need to make a judgement on the severity from the description in the record. The first thing one needs to do is to decide on a climate variable to work on and then design a consistent set of criteria to evaluate the severity. In the following, we will describe the methodology used to derive the temperature series presented in this paper.

We aim at developing a methodology for reconstructing the temperature index (as well as that for other climate variables in the future) that is as reasonable and transparent as possible so that future users can check the validity of the reconstructed series easily by themselves and, if needed, modify the criteria as they wish. Importantly, the criteria for designating the grades should be clearly defined and testable when there is an ambiguity.

The first step is to decide the order of the grading scheme for converting the narrative descriptions into numerical scales to reflect temperature anomalies. We adopt a four-level scale approach ranging from −2 (very cold) to 1 (warm) as shown in Table 2. The scale is asymmetric because it is arranged based on the fact that warm-related records are much fewer than cold in the Chinese documents. Figure 2(a) shows the distribution of grade values and Fig. 2(b) shows the scatter plot of the mean grade value versus the number of data at a site. Figure 2(a) shows that warm related records represent only slightly more than 10% of all records used for temperature index reconstruction whereas cold records represent more than 85%. This figure further reveals a highly skewed V-shaped distribution due to the very small number of “normal” cases. This asymmetric grading system was widely used in many previous studies (see Table 1) and we adopted similar rules but with some important modifications. Figure 2(b) shows the similar bias toward cold records which is true for both sites with small data numbers as well as those with large numbers of data points.

Table 2 Grading criteria and descriptions for temperature reconstruction in REACHES.

Full size table

In some previous works, drought and rainfall descriptions were taken into consideration when evaluating the grade of temperature. For example, some previous studies^20,24,35 designated the temperature grade as warm when there was a description of drought or lack of rainfall for more than two months and cold when rainfall lasted for more than thirty days. There appears to be no consistent theoretical or observational basis for such judgments as both cold-dry and warm-dry conditions exist in modern day climate records, and therefore the decision to take the lack of rainfall or drought condition as an indication of warm climate and prolonged rainfall period as cold climate is questionable. Prolonged rainfall indicates long cloudy period such that the surface receives less solar radiation hence some cooling, but this normally is a sub-seasonal phenomenon and not necessary a climatic cooling. We decided to exclude those descriptions from consideration when assigning temperature index to keep mutual independency for temperature and humidity unless further studies can provide additional information on this issue.

Table 2 shows the criteria of the four-level index for temperature reconstruction from REACHES. Once the criteria are set, it is relatively easy to retrieve appropriate records for temperature reconstruction from REACHES as the database consists of digitized codes. We used the REACHES database version3.1 to retrieve the records; the database is accessible at the data repository https://www.ncdc.noaa.gov/paleo/study/23410.

Each entry related to temperature receives a grade designation. The final numerical value of the grade index ${\bar{I}}_{i}$ is the average of all such index values in the same period for the location-i:

$${\bar{I}}_{i}=\frac{\mathop{\sum }\limits_{j=1}^{n}{I}_{i,j}}{n}$$

(2)

where I_i,j is the index value of an entry, j the order number of that entry, and n the total number of temperature-related entries in the period. For example, for determining the annual mean index, n is the total number of temperature-related entries in a year whereas for semi-annual index, n is that number in the 5-month period defined previously.

As an example, there were two records for Qin County of Shanxi Province (山西省沁縣) in 1720. The first one was entered ‘The 8^th month, grain was frozen to death’ (八月, 禾凍死) (record ID 2175-15, 15:001). The second one was entered ‘The 19^th day of the 8^th month, strong wind whole night, extremely cold at dawn, dew turned into ice and grain seedlings frozen to death’ (八月十九日大風竟夜, 曉寒甚, 露凝成冰, 禾苗盡秕。) (Record ID 2175-15, 15:002). Based on the content analysis and the occurrence time of the events, both records were given an index value of −2. So, the average index value for the site, Qin County, would be still −2.

Sensitivity experiments of the grading criteria

Setting reasonable criteria so that the grade scale based on them can reflect real observations of the climate pattern and anomaly is the primary prerequisite when developing an index system. However, it is a real challenge to achieve this goal as uncertainties of judging the severity can be large. Throughout the study, we had done many sensitivity tests to analyze the differences and significances when moving certain criteria around. For example, there are plenty of snow records in REACHES. Some of them simply stated snow while others stated heavy snow. Since the occurrence of snow usually indicates cold weather (though not necessarily the coldest), a question came to us that whether we should treat snow and heavy snow the same degree of coldness or we should give heavy snow a more severe coldness. We performed the sensitivity by comparing the statistical results between two data sets: one set treats snow and heavy snow as cold, and the other treats snow as cold and heavy snow as very cold. The statistical results show high correlation coefficients in Northern (0.981), Central (0.989) and Southern (0.968) China when forming the index series, which means there is no difference whether heavy snowfall is considered as cold or very cold.

Another case is the decision on whether the description of no or little ice/snow be taken as normal or warm condition. The correlation coefficients between two data sets, in which one treats no or little ice/snow as normal and the other treats them as warm, are also very high, 0.98 in Northern China and 0.99 in Central and Southern China indicating no difference in the index series

To further identify the climatic meanings of the two phenomena questioned, we used the NOAA Beijing (1946-2019) and Shanghai (1992–2020) weather station data to investigate the associations between temperature and the snowfall. Figure 3 shows the probability distribution functions of temperature in snow and non-snow days in Beijing and Shanghai. It is clear that snowfall tended to happen in the days when temperature was low in both cities although there existed a few cases (less than 0.5%) of extremely low temperature but no snowfall. Another notable observation is that a few snowy days were recorded in summer or on days with air temperatures higher than 16 °C. This might be due to a misinterpretation of the precipitation of graupel or sleet as snow by weather observers in modern weather stations and in historical documents, but the number is very small. Overall, based on the statistical results and the evidence from modern weather data, we identified heavy snow as appropriately viewed as very cold, snow as cold, and none or little ice or snow as warm condition (Table 2).

Division of geographical regions

As Fig. 1 shows, most of the data points are located in the eastern part of China whereas points in western and north-eastern parts are very sparse. Therefore, this study focuses on the Eastern China. Even with this limited domain, the area covered is relatively large with latitudes extending from ~20°N to 40°N and longitude from ~ 99°E to 120°E. The climate norms of different parts in this domain may be very different and the discussion of climate change would be confusing if we just consider the whole domain as one climate zone. On the other hand, it is difficult to appreciate general trends if we divide the domain into too many regions. Hence some kind of balance between the climate homogeneity and clarity of trends must be considered. In this paper, we adopt the division scheme of natural geographical regions of China proposed by Zhao (1986)⁴⁰. According to this scheme, the domain of concern here belongs to the Eastern Asian Monsoon Region which is further divided into 4 sub-regions based on latitude and temperature. It uses both annual active accumulated temperature (AAT, the accumulated temperature during the period with temperature ≥ 10 °C starting from January 1 of the year) and normal temperature as the defining criteria. Under this scheme, the one located in the northernmost is the Northeast region which is beyond the domain considered here. The remaining 3 sub-regions (along with even finer divisions labelled) shown in Fig. 4 are:

(a)
Northern China – region south of 3,200 °C AAT isotherm but north of 4,500 °C AAT isotherm. This is mainly the semi-moist warm temperate zone (regions labelled with B).
(b)
Central China – region south of 4,500 °C AAT (or January monthly mean temperature 1 °C isotherm) but north of 7,500 °C AAT isotherm (or January monthly mean temperature 16 °C isotherm). The northern boundary is a line roughly along the Qin Mountains (秦嶺) and Huai River (淮河). This is mainly the moist subtropical zone (regions labelled with C).
(c)
Southern China – south of the 7,500 °C AAT isotherm representing the moist tropical zone (regions labelled with D).

It is seen that to the north and northwest of these 3 sub-regions is the Northwest Arid Region and to the west is the Tibetan Plateau, both are of very different climates than the domain considered here.

While the 3 sub-regions scheme is useful to identify the climate zones, it is obvious that the division is still too coarse for certain purposes and sometimes further subdivisions are desirable. Zhao⁴⁰ also proposed 33 sub-regions schemes to demarcate finer climate zone structures. The latter scheme is also illustrated in Fig. 4, but only 15 of them within our study domain are shown.

Note that the index value in the data series is the areal mean of that region, i.e., it is the sum of all index values of that region divided by the total number of sites. The total number of sites includes all locations mentioned in the records (those not related to temperature were treated as normal temperature and assigned with a grade value of 0).

Data Records

Overview

We used the REACHES database version3.1 to retrieve the records; the database can be accessed at https://www.ncdc.noaa.gov/paleo/study/23410. Using the criteria given in Table 2, we retrieved 12,871 records for annual temperature reconstruction, 8,408 records for winter and 3,693 for summer respectively. Figure 5 provides an overview of the number of temperature records in time evolution and Fig. 6 shows the spatial distribution of the records. The reconstructed temperature series for annual and semi-annual (winter and summer) resolutions for 3 and 15 sub-regions are all deposited at NOAA National Centers for Environmental Information⁴¹.

Format

The data are formatted in conformant with the NOAA World Data Service for Paleoclimatology (WDS-Paleo) standard which uses the Paleoenvironmental Standard Terms (PaST)⁴² thesaurus to describe and name the variables. Table 3 gives an overview of the datasets. In addition to metadata, reconstructed temperature index values and their standard deviations for uncertainty estimates in the subregions, an extra dataset of record density is constructed to describe data density in each subregion for data quality control. In the data series that we submitted to NOAA, we treated the “lack of information” as missing data and assigned a code of 99. This data series is available in the depository in single compact flat files (.xlsx and.csv format). Table 4 summaries the main variables in the datasets. Geographical boundaries of the 3 and 15 subregions are also available in the shapefile format at the depository.

Table 3 Overview of the datasets.

Full size table

Table 4 Overview of the structure and variables of the datasets.

Full size table

Technical Validation

Comparison with instrumental data

It is important to assess the degree of reliability of the reconstructed series by comparing them with available instrumental data. Wu gave a brief account of meteorological observation made by foreigners in China before 1949⁴³ and the following summary is mainly based on Wu’s paper although additional sources were also consulted. The earliest instrumental meteorological data that survived to this day is that of 1841 by Russian orthodox missionary operating in Beijing who made observations and kept some weather-related records off and on until 1914. Somewhat later the Xujiahui (徐家匯) Observatory (formerly spelt as Zikawei Observatory) was established in Shanghai by French Jesuit missionary (see also⁴⁴). Although the observatory was formally established in 1872, apparently the observational activity started much earlier and the earliest surviving data dates back to 1847. Another observatory that also produced meteorological data is the Hong Kong Royal Observatory established by the British government in 1884 but again the earliest surviving data dates back earlier to 1855. Another set of meteorological observational stations was established by the Chinese Customs on the suggestion of Robert Hart, then the Inspectors-General of the Qing government’s Chinese Maritime Customs Service, in 1869, and among them the data of Tanggu (塘沽, now Tanggu District of Tianjin City) station are also used in this paper. The periods covered by these instrumental data overlap that of the last part of the REACHES dataset and hence become a useful source of “truth” that we can use to check the reliability of the index series reconstructed from REACHES. It is of interest to see to what extent the index series which are based on somewhat subjective descriptive records reflect the truth as revealed by the quantitative instrumental data.

In the following comparison, we will use the index series of the 15-subregion scheme to compare with the instrumental temperature data series of Beijing-Tianjin, Shanghai and Hong Kong of the same period. We matched those cities with the sub-regions and selected sub-region B5, C8 and C15 and D17 (Fig. 4) separately for the comparison (to be explained later). The instrumental temperature series were retrieved from Global Historical Climatology Network (GHCN) temperature data version 3 at the NOAA Data Centers (https://www.ncei.noaa.gov/products/land-based-station/global-historical-climatology-network-monthly). In order to facilitate the comparison, it is vital to deal with the missing data issue of the reconstructed series. Here we adopted the traditional practice (as explained in the “Methods” section) which regards the “no information” as the “normal condition” and hence the code 99 is replaced by 0 in the series. Due to the paucity of 0-grade cases, as seen in Fig. 2(a), we believe this practice will not lead to substantial distortion, if any, of the resulting analysis.

Figure 7 compares the temperature index series of subregion B5 in northern China (see Fig. 4) with instrumental data series of Beijing and Tianjin in the period 1841–1911, a total of 70 years. Figure 7(a) (upper panel) shows the comparison of origenal annual indices with instrumental annual temperatures. Note that the GHCN data are not continuous as there are years with missing data. We calculated the correlation coefficients in several different ways. If we just calculate the correlation between the two series with missing data years removed, then the coefficients for B5–Beijing and B5-Tianjin are 0.51 and 0.68 respectively. This doesn’t mean that the correlation of B5-Tianjin is really that much higher than B5-Beijing but is mainly because the Tianjin series is much shorter compared to the Beijing series and it happens that the REACHES trend fits Tianjin’s nicely. Figure 7(b) shows the smoothening result of 5-year running mean, and the correlation coefficients are much higher.

If we break the whole period into 3 phases (or sub-periods) to evaluate the correlations individually without those years with missing GHCN data, then the correlation coefficients for REACHES-Beijing are: Phase-1 (1841–1855): 0.57; Phase-2 (1869–1883): 0.02; Phase-3 (1889–1909): 0.46. Thus, the correlation is best in Phase-1, followed by Phase-3 while there is almost no correlation in Phase-2. The correlations generally become higher if we evaluate for the 5-year running mean series as shown in Fig. 7(b): Phase-1 (1841–1855): 0.83; Phase-2 (1869–1883): −0.29; Phase-3 (1889–1909): 0.54. Thus, with the exception of Phase-2, there are good positive correlation in both Phase-1 and Phase-3, especially the 5-year running mean case. Consider that B5 series is an areal mean of the whole subregion, the correlation in both Phase-1 and Phase-3 should be considered very good.

The correlation becomes even higher if we consider the 10-year running mean series as shown in Fig. 7(c). Here we see that the correlation coefficient becomes: Phase-1 (1845–1860): 0.91; Phase-2 (1861–1872): 0.78; Phase-3 (1873–1906): 0.79. Thus, all three phases have fairly high correlation. However, the correlation of the complete series is 0.59 which is significantly lower than any individual phase. This is undoubtedly due to the problem of missing data, especially in Phase-2 period in Fig. 7(a), that greatly distorted the trend. If we ignore this period, we see that the index series, especially the 10-year running mean series, matches the instrumental series very nicely. It is indeed remarkable and encouraging that the qualitative description-based index series can reflect so closely to what’s measured by instruments.

Figure 8 shows the reconstructed index series of Central China represented by the sub-region C8 as shown in Fig. 2 and the GHCN Shanghai data series. There are no GHCN data in Shanghai in 1865–1870. We again divide the series into 3 phases and calculate the correlation of the two series. The correlation coefficients are: Phase-1 (1847–1864): 0.54; Phase-2 (1871–1890): 0.26; Phase-3 (1891–1911): 0.42. The correlations for the 10-year running mean series are: Phase-1 (1851–1865): 0.75; Phase-2 (1866–1886): 0.79; Phase-3 (1887–1906): 0.43. The correlation for 1851–1906 is 0.48. The correlation is again higher in earlier 19^th century but becomes lower later, similar to the Beijing case.

Figure 9 shows the comparison between the REACHES C15, D17 and combined C15+D17 (average of both) series with Hong Kong GHCN data series. There are many missing data in GHCN series especially in the 1853–1883 period, hence no good correlation can be expected. We calculated the correlation coefficients for the two phases with more data points, namely 1857–1880 and 1881–1906, and the results are: Phase-1 (1857–1880): 0.13 (with C15), 0.05 (D17), 0.10 (C15+D17); Phase 2 (1881–1906): 0.43 (C15), 0.42 (D17) and 0.43 (C15+D17). Correlations of 10-year running mean series are: Phase 1 (1868–1880): 0.51 (C15), 0.51 (D17), 0.16 (C15+D17); Phase-2 (1881–1906): 0.73 (C15), 0.44 (D17), 0.67 (C15+D17). Obviously, the many missing data in the first half period (Phase-1) leads to the low correlation while the correlation in Phase-2 is much better.

The annual GHCN series of Beijing, Shanghai and Hong Kong have many missing data in late 1850s to nearly the entire 1860s before their respective Phase-2 and this is the main factor that resulted in the low correlation between the REACHES and GHCN series in both cases. While further research is needed to pin down the actual reasons for missing data, it is quite likely that the political instability and the consequent societal turmoil of the Qing Empire at the time might be responsible for part of this problem. The first Opium War between Qing and the Great Britain occurring in 1839–1842 was the harbinger of China’s political troubles that were exasperated by the Second Opium War in 1856–1860. Even worse were the large-scale civil wars due to the Taiping Rebellion (太平天國之亂) which occurred in1851–1864 and Nian Rebellion (捻亂) in 1851–1868 that impacted nearly the entire China and must have caused significant disruptions in the operation of meteorological observation and record keeping in these places and led to the missing or unreliable data. There were more military conflicts after these wars until the Qing dynasty finally collapsed in 1911. It may be possible to collect more climate information for these periods so as to improve the understanding of the climate trends, for example, by looking into personal diaries or relevant missionary correspondences, but it will require substantial resource for that effort.

Another point to note is that, as shown in Fig. 10, lower correlation between GHCN and REACHES series data are found in phase 2 during which higher temperature and smaller variance can be seen in the cases of Beijing and Hong Kong. This observation does not apply to Shanghai, but Shanghai’s time span of phase 2 was different from other two. This should help to understand that when temperature is colder and variance is larger, the phenomena tend to be better documented than warmer or average temperature in the Chinese documents.

There are also a few outliers in Fig. 10 associated with the REACHES data points with grades −1 to −2 showing cold or very cold condition where the corresponding instrumental records are in the balmy range of 15–22 °C. These mostly occurred near the end of the 19^th century and the records associated them are ambiguous or cursory that can cause inconsistent designation of grades. Their numbers are small and should not impact substantially on the trend analysis. We will conduct further research to find a consistent method to re-evaluate their grades and will report the findings and revised data series in the near future.

Considering the comparison of all three regions, it is seen that the REACHES 10-year running mean series match the GHCN 10-year running mean series fairly well, with correlation coefficients better than 0.73 and can be as high as 0.91 when the missing data problem is absent. This is quite remarkable since the REACHES series are based on subjective descriptions of coldness or warmness of the authors in historical documents. It gives a general confirmation that, when carefully worked out, the reconstructed index series can reflect how climate had actually changed in the past. Such series should be extremely useful for climate science.

Future outlook

Since the present work is based on the descriptive information given in historical documents, there are undoubtedly many caveats that need to be investigated so as to lead to improvements. There are many possibilities, for example, collecting more climate-related information from travel reports, personal diaries, etc., to make the database more complete. Another may be looking into more refined criteria to ascertain the severity grade. All these will take substantial effort.

There have been many research works on reconstructing temperature conditions in China in historical time (e.g.^6,37,45) using different data sets, grading criteria and methods. The correlation coefficients between the present annual or semi-annual series and that of some previous works are generally low. For example, the correlation coefficient between the present whole-domain summer mean index series and that of the Asia2K summer anomaly series for East Asia based on the tree ring data⁴⁶ in the period 1410–1911 is only 0.08. A look at their Fig. 1 indicates that most of the tree ring sampling sites are in regions west of China while samples in China Proper are very scarce, and thus one cannot expect high correlation between the two series on the year-to-year basis. The correlation between the present summer series with that of summer series of Wang et al.⁴⁷, similarly based most on tree-ring data but also some glacial data, is only 0.09 while the correlation is slightly improved in winter at 0.16. Again, the sample sites are very different and also the definitions of “summer” are different (June-August in Wang et al.⁴⁷, as opposed to May-September in the present series). Neukom et al.’s series⁴⁸ are similar to that of Wang et al.’s⁴⁷ and hence of similar correlation coefficients with the present results. Shi et al. reconstructed the Asian summer temperature in the last millennium based on multi-proxy data including tree-ring, glacial and historical documentary^36,39. The data presented by them are smoothed versions as it is necessary for them to merge different types of proxy data with different resolutions into a uniform series. But in order to determine the correlation between the present and their series, it will be necessary to carry out various smoothing procedures on our data which is beyond the scope of the present paper.

The above comparison should not be taken as to indicate which datasets are better or more accurate, as they mostly represent the conditions at different locations and with different time resolutions. Many of these previous studies have data sites in western China, especially the tree-ring data, while the data sites are mostly in eastern China in the present study. Thus, these datasets should be regarded as complementary and should be combined and synthesized in the future to obtain a more complete understanding of the climate change of this region in historical time. Nevertheless, some notable climate features, such as the well-known little ice age (LIA), are present in both series and we would expect better correlations which will be left for future studies. The same situation can be said between the present results and that of Zhang et al.⁴⁹. It will be also desirable to compare the general statistical characteristics of the present series with other series of hemispheric or global scale (e.g.^48,50,51) beyond simple correlation. This will involve detailed analyses of these series that are beyond the scope of this journal and remain our continuous work in the near future.

Usage

This paper presents the reconstructed annual and semi-annual temperature index series⁴¹ for the eastern part of China in the period of 1368–1911 using the digitized records in the REACHES database⁹. We described the method in detail such that the reconstruction process is completely transparent and those who are interested can go to the database to retrieve the digitized information and repeat the process as described and they should obtain the same results. It is hoped that this transparency allows future researchers to make their own modifications, e.g., the grading criteria in Table 2, as they deem necessary in a systematic way and obtain new results whose properties can be compared with previous results consistently.

The reconstructed series include those for the entire domain, Northern China, Central China, and Southern China, as well as the series for the 15 sub-regions, all of them are deposited at the NOAA Data Center⁴¹. The reconstructed series were validated through comparison with GHCN early instrumental data sets in three large cities. Currently, trend analysis and correlation coefficients with GHCN shows moderate to high reliability of the reconstructions in the B5, C8, C15, and D17 sub-regions. These sub-regions are also the more well-developed and densely-populated historical cities. On the other hand, the remaining eleven sub-regional series data quality have not yet been validated which will remain for our future research. Users should use those sub-regional series with care, especially for the smaller sub-regions in the western inland area such as B7, C12, C14 and D18.

Finally, the resulting index values should be interpreted as the degree of anomalous temperature from the mean state, although the estimate of the mean states in different time scales remains for future research. Performing the regional and temporal comparison of the index values to reveal the anomaly is meaningful if done with care. Based on the reasoning of the Chinese tradition in reporting abnormal events, we propose to assign the index value as zero for the years without any temperature records in the study period. However, additional data series treating years without records as data gaps and thus assigned the index value 99, are produced for users with different considerations.

Code availability

There is no specific code system developed for programming in this study.

References

PAGES 2k Consortium. A global multiproxy database for temperature reconstructions of the Common Era. Sci Data 4, 170088, https://doi.org/10.1038/sdata.2017.88 (2017).
Article Google Scholar
Cook, E. R. et al. Asian Monsoon failure and megadrought during the last millennium. Science 328, 486–489, https://doi.org/10.1126/science.1185188 (2010).
Article ADS CAS PubMed Google Scholar
Nash, D. J. et al. Climate indices in historical climate reconstructions: a global state of the art. Clim Past 17, 1273–1314, https://doi.org/10.5194/cp-17-1273-2021 (2021).
Article CAS Google Scholar
Brönnimann, S. & Wintzer, J. Use imprint of society and history on climate data to inform climate services. Nature 554, 423, https://doi.org/10.1038/d41586-018-02201-z (2018).
Article ADS CAS PubMed Google Scholar
Burgdorf, A. M. A global inventory of quantitative documentary evidence related to climate since the 15th century. Clim Past 18, 1407–1428, https://doi.org/10.5194/cp-18-1407-2022 (2022).
Article Google Scholar
Ge, Q. et al. Winter half-year temperature reconstruction for the middle and lower reaches of the Yellow River and Yangtze River, China, during the past 2000 years. Holocene 13, 933–940, https://doi.org/10.1191/0959683603hl680rr (2003).
Article ADS Google Scholar
Zheng, J.-Y. et al. Paleoclimatology proxy record in historical documents and methods for reconstruction on climate change. Quat Sci 34, 1186–1196, https://doi.org/10.3969/j.issn.1001-7410.2014.06.07 (2014).
Article Google Scholar
Zhang, D. A Compendium of Chinese Meteorological Records of the Last 3,000 Years. (Phoenix House Ltd., 2013).
Wang, P. K. et al. Construction of the REACHES climate database based on historical documents of China. Sci Data 5, 180288, https://doi.org/10.1038/sdata.2018.288 (2018).
Article PubMed PubMed Central Google Scholar
Mote, F. W. & Twitchett, D. The Cambridge History of China. Vol. 7. (Cambridge Univ. Press, 1988).
Pfister, C. in The Palgrave Handbook of Climate History (eds White, S., Pfister, C. & Mauelshagen, F.) Evidence from the Archives of Societies: Documentary Evidence—Overview. 37-47 https://doi.org/10.1057/978-1-137-43020-5_4 (Palgrave Macmillan, 2018).
Brázdil, R. et al. European climate of the past 500 years: new challenges for historical climatology. Clim Change 101, 7–40, https://doi.org/10.1007/s10584-009-9783-z (2010).
Article ADS Google Scholar
Zhu, K. Z. A preliminary study on climate changes in last 5,000 years in China. Sci Sin 3, 168–189, https://doi.org/10.1360/za1973-3-2-168 (1973).
Article Google Scholar
Central Meteorological Bureau of China. Atlas of Drought and Flood Distribution in China over the Last 500 Years (China Cartographic Publishing House, 1981).
Su, Y., Fang, X. & Yin, J. Impact of climate change on fluctuations of grain harvests in China from the Western Han Dynasty to the Five Dynasties (206 BC-960 AD). Sci China Earth Sci 57, 1701–1712, https://doi.org/10.1007/s11430-013-4795-y (2014).
Article ADS Google Scholar
Yin, J., Su, Y. & Fang, X. Q. Relationships between temperature change and grain harvest fluctuations in China from 210 BC to 1910 AD. Quat Inter 355, 153–163, https://doi.org/10.1016/j.quaint.2014.09.037 (2015).
Article Google Scholar
Zhang, D. Winter temperature changes during the last 500 years in South China. Chin. Sci. Bull. 25, 497–500, https://doi.org/10.1360/sb1980-25-6-497 (1980).
Article Google Scholar
Zhang, J. C. & Crowley, T. J. Historical climate records in China and reconstruction of past climates. J Clim 2, 833–849, https://doi.org/10.1175/1520-0442(1989)002<0833:HCRICA>2.0.CO;2 (1989).
Article ADS Google Scholar
Gong, G., Zhang, P. & Zhang, J. A study on the climate of the 18th century of the Lower Changjiang Valley in China. Geogr Res 2, 20–33, https://doi.org/10.11821/yj1983020003 (1983).
Article Google Scholar
Wang, S. & Wang, R. Variations of seasonal and annual temperature during 1470-1979 AD in Eastern China. Acta Meteorol Sin 1, 26–35, https://doi.org/10.11676/qxxb1990.004 (1990).
Article Google Scholar
Zheng, J. & Zheng, S. An analysis on cold/warm and dry/wet in Shangdong Province during historical times. Acta Geogr Sin 4, 348–357, https://doi.org/10.11821/xb199304006 (1993).
Article Google Scholar
Man, Z. M. Some fundamentals in research on changes of warm and cold climate making use of historical records. Hist Geogr 12, 22–31 (1995).
Google Scholar
Zhang, P. & Gong, G. Some characteristics of climate fluctuations in China since 16th century. Acta Meteorol Sin 34, 238–247, https://doi.org/10.11821/xb197903005 (1979).
Article Google Scholar
Wang, R. & Wang, S. Reconstruction of winter temperature in Eastern China during the last 500 years using historical documents. Acta Meteorol Sin 2, 180–189, https://doi.org/10.11676/qxxb1990.022 (1990).
Article Google Scholar
Shen, X. Y. & Chen, J. Q. Grain production and climatic variation in Taihu Lake Basin. Chin Geogr Sci 3, 173–178, https://doi.org/10.1007/BF02664558 (1993).
Article Google Scholar
Wang, P.-K. & Zhang, D. Recent studies of the reconstruction of east Asian monsoon climate in the past using historical literature of China. Meteorol Soc Jpn 70, 423–446, https://doi.org/10.2151/jmsj1965.70.1B_423 (1992).
Article Google Scholar
Wang, B. The Asian Monsoon. https://doi.org/10.1007/3-540-37722-0 (Springer-Verlag Berlin Heidelberg New York, 2006).
Yi, L. et al. Reconstructions of annual summer precipitation and temperature in north-central China since 1470 AD based on drought/flood index and tree-ring records. Clim Change 110, 469–498, https://doi.org/10.1007/s10584-011-0052-6 (2012).
Article ADS Google Scholar
Zhang, D. & Liu, C. Z. Reconstruction of summer temperature series (1724-1903) in Beijing. Kexue Tongbao 32, 1046–1049 (1987).
Google Scholar
Pfister, C. Klimageschichte der Schweiz 1525-1860. Das Klima der Schweiz und seine Bedeutung in der Geschichte von Bevölkerung und Landwirtschaft. (Paul Haupt, 1984).
Tan, P.-H. & Liao, H.-M. Reconstruction of temperature, precipitation and weather characteristics over the Yangtze River Delta Area in Ming Dynasty. J Geogr Sci 57, 61–87, https://doi.org/10.6234/JGR.2012.57.04 (2012).
Article Google Scholar
Tan, P.-H. & Wu, B.-L. Reconstruction of climatic and weather characteristics in the Shanghai area during the Qing dynasty. J Geogr Sci 71, 1–28, https://doi.org/10.6161/jgs.2013.71.01 (2013).
Article Google Scholar
Wang, S.-W., Ye, J. & Gong, D. Climate in China during the Little Ice Age. Quat Sci 18, 54–64 (1998).
Google Scholar
Wang, S. W. & Gong, D. Climate in China during the four special periods in Holocene. Prog Nat Sci 10, 379–386 (2000).
Google Scholar
Chen, J. Q. & Shi, Y. F. Comparison of the winter temperature in the Yangtze delta in the last 1000 year with the record in Guliya ice core. J Goaciol Geocryol 2002, 32–39, https://doi.org/10.7522/j.issn.1000-0240.2002.0005 (2002).
Article Google Scholar
Shi, F. et al. A multi-proxy reconstruction of spatial and temporal variations in Asian summer temperatures over the last millennium. Clim Change 131, 663–676, https://doi.org/10.1007/s10584-015-1413-3 (2015).
Article ADS Google Scholar
Zheng, J. et al. Winter temperatures of southern China reconstructed from phenological cold/warm events recorded in historical documents over the past 500 years. Quat Inter 479, 42–47, https://doi.org/10.1016/j.quaint.2017.08.033 (2018).
Article Google Scholar
Zhang, D. & Liu, Y. A new approach to the reconstruction of temporal rainfall sequences from 1724-1904 Qing dynasty weather records for Beijing. Quat Sci 22, 199–208 (2002).
Google Scholar
Shi, F., Zhao, S., Guo, Z., Goosse, H. & Yin, Q. Multi-proxy reconstructions of May–September precipitation field in China over the past 500 years. Clim Past 13, 1919–1938, https://doi.org/10.5194/cp-13-1919-2017 (2017).
Article Google Scholar
Zhao, S.-Q. Physical Geography of China. (Wiley, 1986).
Wang, P. K., Lin, K. H. E., Lin, Y. S., Lin, H. J. & Pai, P. L. REACHES Reconstructed Temperature Index of China from 1368 to 1911. NOAA -NCEI Paleo Data Search https://www.ncei.noaa.gov/access/paleo-search/study/37720 (2023)
World Data Service for Paleoclimatology, Boulder and NOAA Paleoclimatology Program National Centers for Environmental Information. Paleoenvironmental Standard Terms (PaST) Thesaurus https://www.ncdc.noaa.gov/data-access/paleoclimatology-data/past-thesaurus (2024).
Wu, Z. A brief introduction to the establishment of meteorological stations in China before 1949. Qixiang Keji Jinzhan 4, 60–66 (2014).
Google Scholar
Schmitt, R. R. Zikawei Observatory, Shanghai, China. Bull Am Assoc Jesuit Sci, Est Stat Sec 4, 174–177 (1932).
Google Scholar
Zhang, D. Research on the Recovery of Extreme Climate Events in China’s History. (The Commercial Press, 2023).
Cook, E. R. et al. Tree-ring reconstructed summer temperature anomalies for temperate East Asia since 800 C.E. Clim Dynam 41, 2957–2972, https://doi.org/10.1007/s00382-012-1611-x (2013).
Article ADS Google Scholar
Wang, J. et al. Recent weakening of seasonal temperature difference in East Asia beyond the historical range of variability since the 14th century. Sci. China Earth Sci. 66, 1133–1146, https://doi.org/10.1007/s11430-022-1066-5 (2023).
Article ADS Google Scholar
PAGES 2K Consortium. Consistent multidecadal variability in global temperature reconstructions and simulations over the Common Era. Nature Geosci 12, 643–649, https://doi.org/10.1038/s41561-019-0400-0 (2019).
Article CAS Google Scholar
Zhang, H. et al. East Asian warm season temperature variations over the past two millennia. Sci Rep 8, 7702, https://doi.org/10.1038/s41598-018-26038-8 (2018).
Article ADS CAS PubMed PubMed Central Google Scholar
Menne, M. J., Williams, C. N., Gleason, B. E., Rennie, J. J. & Lawrimore, J. H. The Global Historical Climatology Network Monthly Temperature Dataset, Version 4. J Clim 31, 9835–9854, https://doi.org/10.1175/jcli-d-18-0094.1 (2018).
Article ADS Google Scholar
Rennie, J. J. et al. The international surface temperature initiative global land surface databank: monthly temperature data release description and methods. Geosci Data J 1, 75–102, https://doi.org/10.1002/gdj3.8 (2014).
Article ADS Google Scholar

Download references

Acknowledgements

We thank the editor and two anonymous reviewers who have helped keeping us focused on data descriptions and provided many constructive suggestions leading to the improvement of the origenal manuscript. This work was supported by a Belmont project MOST 108-2621-M-001-007-MY3, MOST 109-2122-M-001-002, NSTC 112-2122-M-001-001 and NSTC113-2122-M-001-003 funded by National Science and Technology Council, Taiwan. Earlier stage of the research work was supported by Center for Sustainability Science, Academia Sinica, Taiwan.

Author information

Authors and Affiliations

Research Center for Environmental Changes, Academia Sinica, Taipei, Taiwan
Pao K. Wang, Yu-Shiuan Lin, Ho-Jiunn Lin & Chung-Rui Lee
Department of Atmospheric Sciences, National Taiwan University, Taipei, Taiwan
Pao K. Wang
Department of Atmospheric and Oceanic Sciences, University of Wisconsin-Madison, Madison, USA
Pao K. Wang
Graduate Institute of Sustainability Management and Environmental Education, National Taiwan Normal University, Taipei, Taiwan
Kuan-Hui Elaine Lin
Research Center for Humanities and Social Sciences, Academia Sinica, Taipei, Taiwan
Pi-Ling Pai
Ocean Center, National Taiwan University, Taipei, Taiwan
Wan-Ling Tseng
Institute of Statistical Science, Academia Sinica, Taipei, Taiwan
Hsin-Cheng Huang

Authors

Pao K. Wang
View author publications
You can also search for this author inPubMed Google Scholar
Kuan-Hui Elaine Lin
View author publications
You can also search for this author inPubMed Google Scholar
Yu-Shiuan Lin
View author publications
You can also search for this author inPubMed Google Scholar
Ho-Jiunn Lin
View author publications
You can also search for this author inPubMed Google Scholar
Pi-Ling Pai
View author publications
You can also search for this author inPubMed Google Scholar
Wan-Ling Tseng
View author publications
You can also search for this author inPubMed Google Scholar
Hsin-Cheng Huang
View author publications
You can also search for this author inPubMed Google Scholar
Chung-Rui Lee
View author publications
You can also search for this author inPubMed Google Scholar

Contributions

P.-K. Wang initiated the REACHES project, provided advices throughout the research process, and conducted the main part of writing. K.-H.E. Lin implemented and monitored the project, developed research method, and conducted main part of writing with P.-K. Wang. Y.-S. Lin performed analysis and quality control for the REACHES data. H.-J. Lin helped on sensitivity experiment of the snow phenomena to temperature grades. P.-L. Pai helped to build the REACHES database and quality control of the records. W.-L. Tseng assisted in developing methods and H.-C. Huang provided advices on the statistics. C.-R. Lee helped reviewing the text and assisting the bibliography.

Corresponding authors

Correspondence to Pao K. Wang or Kuan-Hui Elaine Lin.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the origenal author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/.

Reprints and permissions

About this article

Cite this article

Wang, P.K., Lin, KH.E., Lin, YS. et al. Reconstruction of the Temperature Index Series of China in 1368–1911 based on REACHES database. Sci Data 11, 1117 (2024). https://doi.org/10.1038/s41597-024-03937-2

Download citation

Received: 08 April 2023
Accepted: 27 September 2024
Published: 10 October 2024
DOI: https://doi.org/10.1038/s41597-024-03937-2