Introduction

Atmospheric aerosols modulate the radiative forcing of the Earth and adversely affect human health1. While inorganic aerosols are well constrained benefitting from abundantly available measurements, the compositions of organic aerosol (OA), representing 20–90% of fine aerosol mass, remain uncharacterized for more than two-thirds of the OA mass2,3. In recent decades, significant progress has been made in quantifying emissions, unravelling atmospheric processes, and characterizing the compositions of OA4,5,6. However, the past efforts are dominantly carbon-centered, with little attention given to the nitrogen (N) element that constitutes many of these OA molecules. Little is known about the abundance, sources, and atmospheric behaviors of nitrogenous organic aerosols (OAN)7. On the other hand, the increasing application of ultrahigh resolution mass spectrometry, which is capable of determining elemental compositions of organic molecules in complex mixture samples, has revealed the presence of hundreds of N-containing organic molecules in ambient OA8,9. Separately, certain organic nitrogen (ON) compounds of known atmospheric presence, are light-absorbing and thus contributing to the aerosol’s role in mediating climate10; some are active participants in the formation of new particles and secondary organic aerosols11,12. Last but not the least, the increasing atmospheric deposition of nitrogen from human activities has greatly disturbed the global nitrogen cycle13,14, and aerosol ON could be a significant contributor of this pool of reactive nitrogen in the atmosphere.

Existing quantitative knowledge of aerosol ON is limited to a small number of ON compound groups (e.g., urea, small amines, amino acids, nitroaromatic compounds, nitro-PAHs)15,16, or to the ON subgroups such as organic nitrates, whose mass can be quantified by aerosol mass spectrometer (AMS) or thermal dissociation couple with optical measurement (TD-LIF)17,18. The bulk ON analysis is mostly limited to the water-soluble fraction. The water-soluble ON (WSON) is commonly measured as this sub-fraction of ON is analytically accessible via water extraction. WSON is quantified by taking the difference between aerosol water-soluble total nitrogen (WSTN) and inorganic nitrogen (IN)7,19. The water-insoluble ON (WION) remained largely unknown, making the total quantity of aerosol ON – constituting WSON and WION – rarely quantified. Although a few studies quantified aerosol ON by taking the difference between total N, which is measured by a Carlo Erba elemental analyzer (EA)20,21,22,23, and IN, EA fails to measure accurately trace-level aerosol nitrogen due to its unsatisfactory limit of detection for nitrogen22,23. In addition, the “difference method” brings large propagation error to ON quantification, especially when ON is a minor contributor to aerosol nitrogen24. Recently, we have developed an aerosol IN&ON analyzer that uses programmed thermo-evolution and chemiluminescent detection coupled with multivariate curve resolution data treatment to achieve sensitive and simultaneous quantification of aerosol IN and ON without any pretreatment24. The method breakthrough enables quantitative and accurate surveying of ON in a large number of samples with ease. The quantified ON represents the total N mass in OAN, which represents the sum of all nitrogen-containing molecules, including both water-soluble and water insoluble fractions. OAN encompasses oxidized OAN, such as organic nitrates and nitroaromatics, as well as reduced OAN, such as urea, amino acids, and indazoles. Therefore, the achievement of total ON determination opens the door for investigating the magnitude and sources of bulk OAN.

In this work, we applied the new method and quantified ON in more than 600 aerosol filter samples collected from multiple sites of varying urban influence in China and spanning a whole year at most sites. The relative quantities of aerosol ON and IN and their typical varying ranges in urban and suburban atmospheres are established in this study. Organic carbon (OC) and other major aerosol chemical constituents, as well as various source-specific elements and organic molecules, are also measured. The joint OC and ON data make it possible to estimate the extent of the presence of OAN in atmospheric organic aerosols, and the estimation results show a dominating importance of OAN. Source analysis further identifies biomass burning and secondary formation as major ON sources, with primary biological particle emission a prominent ON source in non-urban environments. We anticipate that the knowledge about aerosol ON quantities could be greatly valuable in bridging gaps in the description of ambient organic aerosols and in assessing the atmospheric deposition of ON in N nutrient cycle in the environment.

Results and discussion

Abundances of ON

Average concentrations of PM2.5-bound ON ranged from 0.4 to 1.4 μg N m−3 across 12 sampling sites in China (Table 1), including two suburban sites in the North China Plain (NCP), three suburban/rural sites, and seven urban sites in the Pearl River Delta (PRD), a region in South China that includes three of China’s largest cities with a total population of more than 60 million people. The percentages of ON accounting for total aerosol N were 17–31% at the sites, indicating ON represents a significant fraction of aerosol N and therefore atmospheric N deposition budget. We segregate the 12 sites into three groups – Suburban NCP, Suburban/rural PRD, and urban PRD – based on geographic locations and the degree of urban influences. Both IN and ON showed significantly higher levels in northern China than southern China (p < 0.0001), while the ON-in-TN percentages were comparable in the two regions (p = 0.08) (Fig. 1). Between the urban and suburban/rural site groups in the PRD region, the average of ON was slightly higher in urban areas (p < 0.001) while the IN abundances were comparable (p = 0.62). This makes the median of ON-to-TN ratio higher in the urban environment (22%) than in suburban/rural environment (17%) (p < 0.001) (Fig. 1).

Table 1 Summary information of the 12 sampling sites and sample collection details
Fig. 1: Distribution histograms of aerosol IN and ON concentrations and ON-to-TN ratio in three groups of sampling sites in China.
figure 1

Statistical data (i.e., averages, medians, and standard deviations) are presented in each plot. The 12 sampling sites (see details in Table 1) are segregated into three groups, namely urban PRD, suburban/rural PRD, and suburban NCP.

Estimation of OA N contribution to OA

We next take advantage of the jointly measured OC and ON, which was achieved by our newly established method24, to estimate the contribution of nitrogenous organic aerosols (OAN) to OA. Mathematically, the fraction of OAN (i.e., the sum mass of all nitrogen-containing organic molecules) in OA can be expressed to be the products of three ratios, as shown in Eq. (1) and a simplified version in Eq. (2):

$$\frac{{{OA}}_{N}}{{OA}}=\left(\frac{{{OA}}_{N}}{{ON}}\right)\times \left(\frac{{ON}}{{OC}}\right)\times \left(\frac{{OC}}{{OA}}\right)$$
(1)
$$\frac{{{OA}}_{N}}{{OA}}=A\times B\times C$$
(2)

Where A is the OAN-to-ON ratio, B is the ON-to-OC ratio, and C is the reciprocal of OA-to-OC ratio, all in the unit of g/g. More specifically, A is the factor converting ON mass to mass of N-containing organic molecules, i.e, OAN mass. At a molecular level, this information is exactly known for individual molecules of known molecular formula (e.g., 2.14 for urea, 3.21 for ethyl amine, 9.36 for leucine, etc). Estimation of A for ambient OA can be made by assuming molecular formulas for constituent groups of OAN and their relative abundance ranges in OAN by surveying the literature, which is detailed later in this section and in Supplementary Note 2.3 and 2.5 in the supporting material. B is the N-to-C mass ratio of the bulk OA. This ratio has been measured for more than 600 ambient PM2.5 aerosol samples collected in multiple sites covering urban, suburban, and rural environments in China. The reciprocal of C, better known as the OA-to-OC or organic matter (OM)-to-OC ratio in the literature25, converts OC mass to OA (or OM) mass.

We have estimated the population of possible OAN/OA values using Monte Carlo (MC) simulation to individually sample the distribution domains of the three underlying terms in Eq. (2) (see Fig. 2). Monte Carlo simulation is a powerful technique to produce the “most probable” range of a resultant variable (y) from a number of independent variables (x1-i)26. Specific to this study, OAN/OA is the resultant variable y and A, B, and C are the three independent variables. The population distribution of B (i.e., ON/OC) is well described by the 609 samples collected from 12 continental sites. It exhibits a lognormal distribution (Fig. 2B and Supplementary Fig. 1), with a population mean of 0.11, a standard deviation of 0.06, and a median of 0.10. Such a lognormal distribution is common for concentration data of atmospheric pollutants27, and its underlying physical basis has been discussed in a few publications28,29,30. Mathematically, it can be proved that the division and multiplication of two lognormal random variables also yield lognormal distributions. Thus, a log-normal distribution has been adopted for all the three ratio quantities in Eq. (2).

Fig. 2: Distributions of three ratio quantities (OAN/ON, ON/OC, and OA/OC) and the resultant OAN/OA.
figure 2

The OAN/ON distribution is obtained by reviewing available studies on OAN constitutes and then Monte Carlo simulation (A). The ON/OC distribution is from measurements of 600+ aerosol samples, shown in grey bars in (B). The OA/OC values are constrained in the range of 1.3–2.5 and assumed to follow a log-normal distribution (C). The resultant OAN/OA ratio is obtained based on Monte Carlo simulation of the constrained distributions of OAN/ON, ON/OC, and OA/OC (D).

The reciprocal of C (i.e., OA/OC) is constrained in the range of 1.3–2.5, based on the numerous past studies of organic aerosols25,31 and assumed to follow a log-normal distribution (Fig. 2C). In comparison, much less information is available to deduce the distribution domain for A (i.e., OAN/ON). We constrain its distribution domain by taking a bottom-up approach, i.e., the bulk OAN/ON is calculated by summing up individual constituent groups of nitrogenous compounds (see Equation (S3)), such as urea, amino acids, etc (Supplementary Table 2), for which the converting factors of ON-to-organic compound mass are known from their chemical formulas. Due to the complex mixture nature of OAN and the spatiotemporal variation in abundance, MC simulations were carried out to produce the distribution in Fig. 2A. The details are provided in Supplementary Note 2 in the supplementary material. The mean of A is estimated to be 7.16, with a 95% confidence interval of [6.45, 7.78] (Fig. 2A). Namely, the mean MW of nitrogenous molecules constituting OAN is ~100 Da per N atom.

Based on the MC simulation of the constrained distributions of A, B, and C, the resultant OAN/OA ratio is estimated to have a mean of 42.2%, a median of 38.6%, and a 95% confidence interval of [14.1%, 87.3%] (Fig. 2D). Sensitivity evaluations were conducted to test if the estimated OAN/OA can be significantly altered with different assumptions. The details of MC simulations for deriving OAN/OA and sensitivity tests are presented in Supplementary Note 2 in the supporting materials. A brief account is given below.

First, A and C in the MC simulations were assumed to take lognormal, normal, and triangular distributions, which produced the nearly identical means of OAN/OA of 42.2%, 42.3%, and 42.3%, respectively (Supplementary Table 3). Second, the confidence level of numerical ranges of each variable for MC simulations was assumed at three different levels at 90%, 95%, and 99%. The resulting means of OAN/OA ratio were nearly the same, and so were the confidence ranges of OAN/OA, varying from [13.9%, 87.2%] at 90% confidence level to [14.3%, 87.1%] at 99% confidence level (Supplementary Table 4). These results indicate that neither the change of distribution assumption of variable nor the assumed confidence level has much influence on the estimated OAN/OA distribution.

Third, sensitivity tests of OAN/OA to the changes of molecular weight normalized against N atoms (MWNN) for OAN components were conducted. The numerical ranges adopted in Supplementary Table 2 may over- or under-estimate the real-world MWNN of OAN components, as their information in the literature remains partial. For the MWNN of each OAN component, a +/−30% change was applied individually to see the variation of OAN/OA, as a larger percentage change may not accord with realistic situation. With all the perturbation scenarios considered, the OAN/OA mean varied from 39.5% to 43.7% while its 95% confidence interval varied from [12.9%, 86.5%] to [14.8%, 88.5%] (Supplementary Table 5). We conducted another sensitivity test of MWNN change for WION fraction since the molecular information of this part of OAN is more uncertain than other OAN components. MWNN of all WION components were synchronously changed by −30%, −20%, −10%, +10%, +20, +30%. The MC simulation-derived OAN/OA mean changed from 36.5% to 45.9%, with 95% confidence interval varying from [11.9%, 82.6%] to [15.7%, 90.9%] among these changes (Supplementary Table 6).

Fourth, sensitivity tests were conducted to assess the impact of changes in the percent contributions of OAN components to the total OAN (i.e., Xi in Supplementary Table 2). The range for each Xi was expanded. The lower and upper bound of ranges were set as almost the lowest and highest values that were reported in previous studies. For instance, the contribution of urea to OAN was changed from 2–10% to 0.1–20% according to the available measurements. All the ranges were expanded simultaneously in the MC simulation. As a result, the mean OAN/OA and its 95% confidence interval vary from 42.2% and [14.1%, 87.3%] to 45.9% and [14.8%, 91.5%], respectively (Supplementary Table 7). Lastly, we tested the variation of OAN/OA under the synchronous changes of MWNN of the WION components and the percent contributions of each OAN subgroup. The results suggested that the mean of OAN/OA could vary from 40% to 49.5% and its 95% confidence interval change from [12.6%, 86.8%] to [16.2%, 93.9%] (Supplementary Table 8).

Based on the MC simulations and comprehensive sensitivity tests, we estimate that, in the environments of urban influence, the mean of OAN/OA is ~42%, likely in the range of 37–50%, while the 95% confidence interval of OAN/OA would be [12%, 94%]. In other words, our analysis indicates that approximately one-third to one half of the OA mass is contributed by nitrogenous organic molecules (i.e., OAN). The revelation of such a large presence is only made possible with our joint measurements of aerosol bulk ON and OC. This implies that N as a constituent element is widely incorporated in atmospheric organic compounds in the particle phase.

Note that there may be potential dependencies between A (i.e., OAN/ON ratio) and C (i.e., OC/OA ratio) in Eq. (2), which could introduce bias in the MC simulation results of OAN/OA mentioned earlier. To address this issue, we employed an alternative algorithm, which is more complex, to calculate OAN/OA and conducted additional MC simulations. The detailed methodology is described in Supplementary Note 3. In the new equation (Equation (S7)) for determining OAN/OA, all variables can be assumed to be independent of each other. Using the alternative MC simulations, we obtained a mean value of 39.8% and a median value of 37.2% for OAN/OA, with a 95% confidence interval of [13.8%, 78.9%]. These results are quite close to the simulation scheme that ignores dependence among variables, where the mean value is 42.2%, the median value is 38.6%, and the 95% confidence interval is [14.1%, 87.3%]. Based on this comparison, we have decided to stick to the simpler numeric model presented in the main text, as it is more straightforward and easier to understand.

Our results also suggest that the mean of MWNN of OAN molecules is unlikely to exceed 109 Da per N (i.e., 7.78*14, the upper bound of 95% confidence interval). For example, if the mean MWNN is assigned to be 200 Da per N, the MC simulation would derive at a population distribution of OAN/OA that has the mean at ~84% and about one-quarter of OAN/OA values in the distribution would exceed 100%, which is obviously not reasonable. This analysis implicates that most OAN molecules are either low MW (e.g., ~100 Da or less) or contain more than one N atom per molecule. Such information, although crude, provides valuable hints for proposing analytical strategies in elucidating molecular composition.

Despite the loss of semi-volatile organic compounds during sampling would introduce biases in the measurements of OC and ON, and then the estimation of OAN/OA, the efforts made by us have revealed a substantial presence of nitrogenous molecules in atmospheric OA. This revelation implies that the contribution of nitrogenous organic compounds to carbon chemistry may have been under-appreciated in the past. We anticipate that this newfound understanding of OA composition will inspire a reassessment of the priorities in studying and modeling OA. It is recommended that future efforts focus on integrating bulk ON measurements with the quantification of individual N-containing organic molecules. This combination of bulk analyses and speciation work would greatly enhance our ability to identify the significant constituent species of OAN in different environments and across various seasons. Additionally, development of online ON measurement techniques would be invaluable for advancing investigations into the sources and chemical processes linked to OAN.

Source apportionment of ON

So far, we know little about the major sources of aerosol ON. In this study, we have gained quantitative information on the sources of aerosol ON by positive matrix factorization (PMF) analysis of a subset of the 600+ samples, specifically those collected from urban TW, rural NS, and suburban BJ sites, which had comprehensive speciation data available (ref. Methods section). The details of PMF analysis are presented in the supporting materials (see Supplementary Note 4). The site-by-site seasonal and annual average source apportionment results are shown in Fig. 3, while the annual average statistics are tabulated in Supplementary Table 9. The PMF-resolved sources fall into two categories, namely primary and secondary sources. About 70% of aerosol ON was derived from primary emissions, including biomass burning, primary biological particles, vehicular emissions, cooking, ship emissions, soil dust, and sea salt. Among them, biomass burning was a major contributor and accounted for 21–24% (0.14–0.29 μg N m−3) of aerosol ON masses (Fig. 3). Previous studies have revealed that abundant N-containing molecules can be emitted from biomass burning including urea, amino acids, N-heterocyclics, and nitroaromatic compounds32,33. It was also found that biomass burning may induce a remarkable increase of bulk WSON in ambient aerosols19,32. Primary biological aerosol particle (PBAP) emission was also a significant source of aerosol ON and contributed 7.4% (0.04 μg N m−3), 16.3% (0.11 μg N m−3), and 17.6% (0.25 μg N m−3) to ON mass at urban TW, rural NS, and suburban BJ sites, respectively (Fig. 3). Note that a large fraction of PBAP is in coarse mode34, so that PBAP emission is expected to contribute more to atmospheric particulate ON pool than the observation here in PM2.5 alone. Vehicle emission was a significant source of aerosol ON (13.6%) at the urban TW site while a minor contributor at the rural NS site (3.4%) and the suburban BJ (4.6%) (Supplementary Table 9). It has been found that many heterocyclic amines were emitted during cooking of meat and fish35. In this study, cooking emission was identified as a significant ON source (21.5%) at the suburban BJ site. We note that at TW and NS, cooking was not resolved as a separate source due to the lack of data for cooking tracers (i.e., fatty acids and cholesterol). We subsequently examined the compositional data set at two urban sites in Hong Kong (YL and MK), for which we have cooking tracer data (Supplementary Note 4). Cooking accounted for 3.5% of ON at YL and 9.0% at MK, confirming it as an ON source in environments of notable urban influence. ON from the remaining primary sources (i.e., industrial and ship emissions, coal combustion, soil dust, and sea salt) contributed, on an annual average basis for individual sources, normally below 10% to ON mass at the sites (Supplementary Table 9).

Fig. 3: Annual and seasonal average contributions of major source groups to aerosol ON at three sites of varying urban influence in China.
figure 3

The three sites include an urban site in Hong Kong (TW), a rural site in south China (NS), and a suburban site in north China (BJ). The size of pie is proportional to aerosol ON mass concentration (μg N m−3). Individual contributions are shown for three major source groups – biomass burning, secondary formation, and biological particle emission. A lumped source contribution is shown for other primary sources, which include industrial emission, fossil fuel combustions (coal combustion, vehicle emission, ship emission), soil dust, and sea salt. Individual contributions of these sources can be found in Supplementary Table 9.

Secondary formation processes produced a large fraction of aerosol ON, at a similar annual average level of ca 30% for each of the three sites (Fig. 3). Note that two secondary source factors, namely sulfate-rich factor and nitrate-rich factor, are commonly resolved across all the three sites, while two secondary organic aerosol (SOA) factors are only resolved as separate sources for the BJ site. This is due to that SOA tracer data are only available for the BJ samples and their availability supports to resolve explicit SOA sources36. The nitrate-rich factor showed a significantly higher contribution to aerosol ON at BJ (0.103 μg N m−3, 14.3%) than at TW (0.034 μg N m−3, 5.8%) and NS (0.029 μg N m−3, 4.3%), broadly reflecting the much higher NOx level at the BJ site.

Although the detailed mechanisms of secondary ON aerosol formation remain unclear, it is known that particulate organic nitrates can form through reactions between organic peroxy radicals and nitric oxide (NO), as well as between alkenes and the nitrate radical (NO3). These pathways create plausible connections with the nitrate-rich factor, as they share the common NOx precursor17,37. Furthermore, organic nitrate formation can also occur in the aqueous phase through non-radical reactions involving glyoxal and nitric acid38. Additionally, nitro-aromatic compounds can be produced in the atmosphere through the addition of NO2 onto the phenol molecules39. Recent laboratory studies have indicated that heterogenous reactions between NO2 and organic compounds, such as methoxyphenols and fatty acids, have potential to serve as sources of nitro-aromatic compounds40,41. ON in the sulfate-rich factor might implicate certain ON products and sulfate could be produced through the common cloud/fog processing42. For example, the reactions between carbonyls and ammonium/amines/amino acids in the aqueous phase have been established in recent years43,44,45. In addition, the production of N-containing organic salts by acid-base neutralization of ammonia (NH3) and organic acids in deliquesced aerosols may represent a significant formation pathway of secondary ON, considering the abundant presence of NH3 and organic acids in the atmosphere46. The two SOA factors resolved for the BJ samples made a combined contribution of 0.20 μg N m−3 (14.3%) to aerosol ON pool at the site (Supplementary Table 9).

Seasonally, the contribution of biomass burning was the highest during the winter (27–38%), second highest in the fall (18–23%), and lowest in the summer (4–7%) at the sites (Fig. 3). This is consistent with previous studies that biomass burning is more intensive during fall-winter period in many regions of China19,47. Coal combustion also had an elevated contribution to ON during fall and winter period at the TW site (Supplementary Fig. 12). In comparison, the proportion of ON derived from PBAP emission exhibits distinctly different seasonality, with the highest occurring in the summer (14–32%) at all the three sites. Secondary ON linked with sulfate-rich factor is an important source of aerosol ON throughout the year at the TW and NS sites in southern China. At the BJ site, sulfate-rich factor represented 20.8% of aerosol ON source in the summer, significantly higher than the annual average of 6.5% (Supplementary Fig. 12). The more significant contribution of sulfate-rich factor to ON aerosol during the spring or summer periods, is consistent with the deduction above that the ON in sulfate-rich factor might be mostly derived from aqueous formation, caused by more humid air and stronger solar radiation in the two seasons. At BJ, the contribution of biogenic SOA (ref. SOA I in Supplementary Fig. 12) to aerosol ON increased to 21.7% in the summer, while that from the anthropogenic SOA (ref. SOA II in Supplementary Fig. 12) increased to 16.1% in the winter.

We acknowledge that the lack of tracer data for cooking emissions and SOA for the samples collected at the TW and NS sites prevents us from accurately resolving the contributions of ON fractions from their respective sources or formation processes in the PMF analysis. Although our additional analyses with the aid of YL and MK data (Supplementary Note 4) indicate that cooking- and SOA formation-derived ON should not dominate the sources of ON at TW and NS, neglecting these sources introduces uncertainty into the source apportionment results. Nevertheless, our results provided quantitative understanding of aerosol ON sources, which bridged the knowledge gap regarding the geochemical processes of the organic part of N in aerosols.

ON/OC ratios of individual ON sources

Aerosol ON/OC ratio is an essential parameter in modeling the budget and deposition flux of atmospheric ON48. The PMF receptor modeling has provided us with the ON/OC ratio data in individual ON sources, which are tabulated in Supplementary Table 9. The biomass burning source has an atomic ON/OC ratio of 0.077–0.083, showing highly consistent results from the three independent PMF analyses of TW, NS, and BJ samples. Notably, the ON/OC ratio in the PMF-derived biomass burning source profile is comparable to the experimentally determined ON/OC ratio of 0.052–0.079 that we recorded for biomass burning smoke source samples collected in simulated field burning of rice straw and sugarcane leaves49. PBAP emission has an atomic ON/OC ratio of 0.094–0.141. The mixed source of industrial emission and coal combustion has an atomic ON/OC ratio of 0.046–0.067, and the ratios of vehicle and ship emissions are 0.048–0.076 and 0.036–0.052, respectively. Soil dust shows a relatively higher atomic ON/OC ratio of 0.129–0.162, consistent with the study by Wang et al. that documented an increased WSON/WSOC ratio during dust events compared with normal days50. Generally, ON/OC ratios in primary particles from fossil fuel combustion are relatively lower than those from natural sources such as PBAP emission and soil dust. Cooking emission profile derived from the BJ samples shows an atomic ON/OC ratio of 0.291, while a much lower value of 0.052 is obtained with the Hong Kong samples. The notable difference might result from different cooking styles at the two places and more measurements of cooking source emissions are suggested for future studies. SOA linked with the nitrate-rich factor shows a large range of atomic ON/OC ratio of 0.035–0.241 at the three sites. In comparison, SOA linked with the sulfate-rich factor process shows relatively consistent values in the range of 0.097–0.136 in southern and northern China. The two SOA factors resolved from the BJ samples have the ON/OC ratio of 0.103 and 0.082, respectively (Supplementary Table 9). Further studies in different atmospheric environments are needed to uncover the chemical compositions of secondary ON aerosol and to map out the ON/OC variation range.

Kanakidou et al.48 presented the first estimate of the atmospheric budget and deposition flux of WSON to oceans relying on OC emission inventory supplemented with assumed ON/OC ratios and solubilities of OC for individual sources. In their study, an ON/OC atomic ratio of 0.3 was adopted for all combustion-sourced primary OC particles, which is notably higher than our results (ca 0.04–0.08) derived from PMF modeling of observational data shown in Supplementary Table 9. An ON/OC atomic ratio of 0.15 was selected for primary biogenic particles in that study, which is slightly higher than the upper bound of the range of 0.094–0.141 derived in our work. For the soil dust source, our resolved ON/OC atomic ratio (0.13–0.16) is moderately higher than the assumed ratio of 0.1 for soil microbial biomass in Kanakidou et al.48. Soil dust may include complex materials such as decomposed biomass, deposited atmospheric particles, and applied fertilizer. We note that the ON/OC ratios derived from the PMF-resolved source profiles represent bulk measurements and provide an averaged representation of elemental compositions of C and N in each source. Kanakidou et al.48 applied a mean N/C ratio of 0.1 for SOA produced by gas-to-particle conversion, which is approximately in the middle of the range of N/C ratio resolved for the four secondary sources of ON in our work (0.035–0.241). Our study, however, revealed that the secondary ON could have multiple formation pathways that produce a wide range of N/C ratio. One single value would certainly be too coarse a representation in modeling secondary OAN. As secondary formation is a major OAN source, more targeted laboratory and field studies are needed to characterize the ON/OC ratio. Note that the calculated ON/OC ratio for each source resolved from the PMF analysis may be subject to biases. Nevertheless, our findings provide valuable references for modeling studies of investigating N cycles and the related environmental impacts, considering the challenges associated with obtaining organic N/C ratios of emission sources.

Methods

Sampling

PM2.5 samples were collected from a total of 12 sites under varying urban influence in China. The locations, as well as sampling periods at each site, are presented in Table 1 and Supplementary Fig. 19. Supplementary Fig. 19 also presents the geomorphologic maps of the sampling sites, indicating the rough distances of the sampling sites away from urban centers. Quartz fiber filter samples were collected using high-volume samplers at each of the sites. Teflon filter samples were concurrently collected using mid-volume samplers at the sites except for QD, GIG, and DHS sites. Among the 12 sites, three sites (TW, NS, and BJ) had comprehensive chemical composition data for samples covering a year, providing opportunities for source apportionment by receptor modeling with year-round samples. More descriptions of TW, NS, and BJ sites are therefore provided here. TW (Tsuen Wan) is an urban site in Hong Kong in a mixed commercial/residential neighborhood. NS (Nansha) is in a suburban/rural peripheral district of the Guangzhou municipality in southern China. The BJ (Beijing) site is on the roof of a building on the campus of China University of Petroleum, located in a suburban district of Beijing on its northwest outskirt. Sampling inlets at the three sites were all ~20 m above the ground. All samplers were calibrated before and after sampling. Field blanks were also collected at each site. All samples were immediately transferred to laboratories and stored at −20 °C before analysis. The samples were shipped with blue ice packs to keep them cool (~4 °C) during transport. To assess the effects of long-term storage on the chemical compositions of filter samples, we examined the consistency of IN concentrations measured by ion chromatography soon after sample collection, and by the aerosol IN&ON analyzer, conducted two to four years later in 2020. As depicted in Supplementary Fig. 20, a strong correlation (R2 = 0.96) was observed between the two independent sets of IN measurements, suggesting little alteration in aerosol IN species during storage.

Chemical analysis

All the samples were analyzed for ON, IN, OC, elemental carbon (EC), and major ions. Additional measurements of elemental and organic source markers were carried out for the samples collected at BJ, TW, NS, YL, and MK sites. Non-polar organic compounds (NPOCs) and sugars were measured for the TW and NS samples, while more comprehensive measurements, including NPOCs, sugars, fatty acids, sterol, and secondary organic aerosol (SOA) tracers, were available for BJ, YL, and MK sites. The comprehensive compositional data at BJ, TW, NS, YL, and MK allowed source apportionment analysis of ON by positive matrix factorization (PMF) receptor model. All analyses are detailed in Supplementary Note 1 in the supporting materials.

Monte Carlo simulation

Monte Carlo simulation is a computational approach that relies on repeated random sampling to solve problems that might be deterministic in principle26. Commonly, Monte Carlo simulation consists of the following steps: (1) Define a domain of input variables; (2) Generate inputs randomly from a probability distribution (e.g., normal or lognormal distributions) over the domain; (3) Perform a deterministic computation on the inputs; and (4) Aggregate the results. The simulation specifics tailored for this study are detailed in supplementary materials (Supplementary Note 2).

Positive matrix factorization

The sources of aerosol ON were identified and resolved using EPA positive matrix factorization (PMF) receptor model version 5.0. The PMF model is a multivariate factor analysis tool based on chemical mass balance. The detailed instructions of the PMF model can be found in previous studies31,36, and it has been widely applied in the source apportionments of ambient PM2.519,31,36,45. The detailed PMF runs (e.g., input variables, interpretability of resolved source profiles, and result validations) for aerosol ON source apportionment are presented in Supplementary Note 4.