Mapping the Global Antigenic Evolution of Human Influenza A/H3N2 Neuraminidase Based on a Machine Learning Model — 1968–2024

Jingru Feng; Rui Shi; Huixin Zhou; Shijie Wu; Junyu Hu; Taijiao Jiang; Wenjie Han; Xiangjun Du

doi:10.46234/ccdcw2025.164

Article Navigation > China CDC Weekly > 2025, 7(29): 973-978

Methods and Applications: Mapping the Global Antigenic Evolution of Human Influenza A/H3N2 Neuraminidase Based on a Machine Learning Model — 1968–2024

Jingru Feng^1,&;
Rui Shi^2,&;
Huixin Zhou^3,&;
Shijie Wu^1,2;
Junyu Hu^1,2;
Taijiao Jiang^4,5,6;
Wenjie Han^2, ,;
Xiangjun Du^1,3,7, ,

View author affiliations

Abstract
Introduction
Human influenza A/H3N2 imposes a substantial global disease burden. Beyond hemagglutinin (HA), neuraminidase (NA) also plays a critical role in the antigenic evolution of influenza viruses. However, a comprehensive understanding of NA antigenic evolution remains lacking.
Methods
NA inhibition (NAI) data were collected and structural epitopes for A/H3N2 NA were identified. A machine learning model was developed to accurately predict antigenic relationships by integrating four feature groups: epitopes, physicochemical properties, N-glycosylation, and catalytic sites. An antigenic correlation network (ACNet) was constructed and antigenic clusters were identified using the Markov clustering algorithm.
Results
The best random forest model (PREDEC-N2) achieved an accuracy of 0.904 in cross-validation and 0.867 in independent testing. Eight main antigenic clusters were identified on the ACNet. Spatiotemporal analysis revealed the continuous replacement and rapid global spread of new antigenic clusters for human influenza A/H3N2 NA.
Conclusions
This study developed a timely and accurate computational model to map the antigenic landscape of A/H3N2 NA, revealing both its relative antigenic conservation and continuous evolution. These insights provide valuable guidance for improved antigenic surveillance, vaccine recommendations, and prevention and control strategies for human influenza viruses.
Conflicts of interest: No conflicts of interest.
Funding: Supported by the National Key Research and Development Program under grant 2022YFC2303800, the Major Program of Guangzhou National Laboratory under grant GZNL2024A01002, the National Natural Science Foundation of China under grant 81961128002, the Science and Technology Planning Project of Guangdong Province, China under grant 2021B1212040017, and the Shenzhen Science and Technology Program under grant ZDSYS20230626091203007

Author Affiliations

1.
School of Public Health (Shenzhen), Sun Yat-sen University, Guangzhou City, Guangdong Province, China
2.
School of Public Health (Shenzhen), Shenzhen Campus of Sun Yat-sen University, Shenzhen City, Guangdong Province, China
3.
Shenzhen Key Laboratory of Pathogenic Microbes & Biosafety, Shenzhen Campus of Sun Yat-sen University, Shenzhen City, Guangdong Province, China
4.
Guangzhou National Laboratory, Guangzhou City, Guangdong Province, China
5.
State Key Laboratory of Respiratory Disease, The Key Laboratory of Advanced Interdisciplinary Studies Center, the First Affiliated Hospital of Guangzhou Medical University, Guangzhou City, Guangdong Province, China
6.
Suzhou Institute of Systems Medicine, Suzhou City, Jiangsu Province, China
7.
Key Laboratory of Tropical Disease Control, Ministry of Education, Sun Yat-sen University, Guangzhou City, Guangdong Province, China
^& Joint first authors.

Corresponding authors: Wenjie Han, hanwj7@mail2.sysu.edu.cn; Xiangjun Du, duxj9@mail.sysu.edu.cn
Online Date: July 18 2025
Issue Date: July 18 2025
doi: 10.46234/ccdcw2025.164

References

[1]	Petrova VN, Russell CA. The evolution of seasonal influenza viruses. Nat Rev Microbiol 2018;16(1):47 − 60.
[2]	Chen YQ, Wohlbold TJ, Zheng NY, Huang M, Huang YP, Neu KE, et al. Influenza infection in humans induces broadly cross-reactive and protective neuraminidase-reactive antibodies. Cell 2018;173(2):417 − 29.e10.
[3]	Weiss CD, Wang W, Lu Y, Billings M, Eick-Cost A, Couzens L, et al. Neutralizing and neuraminidase antibodies correlate with protection against influenza during a late season A/H3N2 outbreak among unvaccinated military recruits. Clin Infect Dis 2020;71(12):3096 − 102.
[4]	Sandbulte MR, Westgeest KB, Gao J, Xu XY, Klimov AI, Russell CA, et al. Discordant antigenic drift of neuraminidase and hemagglutinin in H1N1 and H3N2 influenza viruses. Proc Natl Acad Sci USA 2011;108(51):20748 − 53.
[5]	Khare S, Gurry C, Freitas L, Schultz MB, Bach G, Diallo A, et al. GISAID's role in pandemic response. China CDC Wkly 2021;3(49):1049 − 51.
[6]	Zhai K, Dong JZ, Zeng JF, Cheng PW, Wu XS, Han WJ, et al. Global antigenic landscape and vaccine recommendation strategy for low pathogenic avian influenza A (H9N2) viruses. J Infect 2024;89(2):106199.
[7]	Catani JPP, Smet A, Ysenbaert T, Vuylsteke M, Bottu G, Mathys J, et al. The antigenic landscape of human influenza N2 neuraminidases from 2009 until 2017. eLife 2024;12:RP90782
[8]	Gao J, Li X, Klenow L, Malik T, Wan HQ, Ye ZP, et al. Antigenic comparison of the neuraminidases from recent influenza A vaccine viruses and 2019–2020 circulating strains. npj Vaccines 2022;7(1):79
[9]	Schild GC, Oxford JS, Dowdle WR, Coleman M, Pereira MS, Chakraverty P. Antigenic variation in current influenza A viruses: evidence for a high frequency of antigenic ‘drift’ for the Hong Kong virus. Bull World Health Organ 1974;51(1):1-11. https://pubmed.ncbi.nlm.nih.gov/4218138/.
[10]	Kilbourne ED, Johansson BE, Grajower B. Independent and disparate evolution in nature of influenza A virus hemagglutinin and neuraminidase glycoproteins. Proc Natl Acad Sci USA 1990;87(2):786 − 90
[11]	Meng J, Liu JZ, Song WK, Li HL, Wang JY, Zhang L, et al. PREDAC-CNN: predicting antigenic clusters of seasonal influenza A viruses with convolutional neural network. Brief Bioinform 2024;25(2):bbae033.
[12]	Tubiana J, Schneidman-Duhovny D, Wolfson HJ. ScanNet: an interpretable geometric deep learning model for structure-based protein binding site prediction. Nat Methods 2022;19(6):730 − 9.
[13]	Blom N, Sicheritz-Pontén T, Gupta R, Gammeltoft S, Brunak S. Prediction of post-translational glycosylation and phosphorylation of proteins from the amino acid sequence. Proteomics 2004;4(6):1633 − 49.
[14]	Jagadesh A, Salam AAA, Mudgal PP, Arunkumar G. Influenza virus neuraminidase (NA): a target for antivirals and vaccines. Arch Virol 2016;161(8):2087 − 94.
[15]	Du XJ, Dong LB, Lan Y, Peng YS, Wu AP, Zhang Y, et al. Mapping of H3N2 influenza antigenic evolution in China reveals a strategy for vaccine strain recommendation. Nat Commun 2012;3(1):709.
[16]	Qiu JX, Qiu TY, Dong QL, Xu DP, Wang X, Zhang Q, et al. Predicting the antigenic relationship of foot-and-mouth disease virus for vaccine selection through a computational model. IEEE/ACM Trans Comput Biol Bioinf 2021;18(2):677 − 85.
[17]	Westgeest KB, de Graaf M, Fourment M, Bestebroer TM, van Beek R, Spronken MIJ, et al. Genetic evolution of the neuraminidase of influenza A (H3N2) viruses from 1968 to 2009 and its correspondence to haemagglutinin evolution. J Gen Virol 2012;93(9):1996 − 2007.
[18]	Harris A, Cardone G, Winkler DC, Heymann JB, Brecher M, White JM, et al. Influenza virus pleiomorphy characterized by cryoelectron tomography. Proc Natl Acad Sci USA 2006;103(50):19123 − 7.

FIGURE 1. Model performance and feature contributions. (A) Model performance on the independent testing set; (B) Feature contributions at the sample level, where color indicates the magnitude and position reflects the absolute contribution of each feature; (C) Feature contributions at the population level.
Note: Different groups were color-coded, and size represents the magnitude of individual features. Abbreviation: RF=random forest; XGBoost=extreme gradient boosting; KNN=K-nearest neighbors; LR=logistic regression; SVM=support vector machine; AUC=receiver operating characteristic area under the curve; ASA=access surface area; SHAP=SHapley Additive exPlanation.

Download: Full-Size Img PowerPoint

FIGURE 2. Antigenic landscape of A/H3N2 NA. (A) The ACNet and phylogenetic tree for representative sequences from eight major antigenic clusters; (B) Yearly spatiotemporal distribution of eight antigenic clusters; (C) Replacement patterns of dominant antigenic clusters.

Note: The legend displays colors for HA while the color of NA antigenic clusters correspond to those used in the ACNet. For (C), color changes indicating that a new antigenic cluster is dominant or becoming dominant.
Abbreviation: NA=neuraminidase; HA=hemagglutinin; ACNet=antigenic correlation network.

Download: Full-Size Img PowerPoint

Citation:

通讯作者: 陈斌, bchen63@163.com

1.
沈阳化工大学材料科学与工程学院沈阳 110142

Turn off MathJax

Article Contents

Get Citation

PDF

Article Metrics

Article views(226) PDF downloads(1) Cited by()

Introduction

Human influenza A/H3N2 imposes a substantial global disease burden. Beyond hemagglutinin (HA), neuraminidase (NA) also plays a critical role in the antigenic evolution of influenza viruses. However, a comprehensive understanding of NA antigenic evolution remains lacking.

Methods

NA inhibition (NAI) data were collected and structural epitopes for A/H3N2 NA were identified. A machine learning model was developed to accurately predict antigenic relationships by integrating four feature groups: epitopes, physicochemical properties, N-glycosylation, and catalytic sites. An antigenic correlation network (ACNet) was constructed and antigenic clusters were identified using the Markov clustering algorithm.

Results

The best random forest model (PREDEC-N2) achieved an accuracy of 0.904 in cross-validation and 0.867 in independent testing. Eight main antigenic clusters were identified on the ACNet. Spatiotemporal analysis revealed the continuous replacement and rapid global spread of new antigenic clusters for human influenza A/H3N2 NA.

Conclusions

This study developed a timely and accurate computational model to map the antigenic landscape of A/H3N2 NA, revealing both its relative antigenic conservation and continuous evolution. These insights provide valuable guidance for improved antigenic surveillance, vaccine recommendations, and prevention and control strategies for human influenza viruses.

HTML

Human influenza A/H3N2 has been a predominant seasonal influenza strain globally since its emergence in 1968. The main surface proteins of the influenza virus, hemagglutinin (HA) and neuraminidase (NA), evolve antigenically to escape immune recognition by the human host (1). Vaccination is the most effective intervention against influenza, but the vaccine effectiveness (VE) against H3N2 remains low (2). This low VE is mainly attributable to the rapid antigenic drift of HA and the insufficient induction of a robust NA-mediated immune response by current vaccines (2). Several studies have highlighted the critical role of NA-induced protection (3). Compared to human influenza A/H1N1, the antigenic divergence of NA in A/H3N2 is minimal but antigenic changes still occur (4). However, the antigenic evolution and landscape of NA in human influenza A/H3N2 remain poorly understood. To address this gap, we developed an antigenic classification model for human influenza A/H3N2 NA and identified distinct antigenic clusters to provide a more systematic understanding of the antigenic evolution of NA.

DISCUSSION

In the present study, we developed a novel machine learning model for timely and effective prediction of antigenic relationships in the neuraminidase of human influenza A/H3N2. We identified eight main antigenic clusters between 1968 and 2024. Spatiotemporal analysis revealed continuous global replacement and rapid spread of new antigenic clusters. Our findings were robust across different sampling approaches. Among forty-eight vaccine strains, only one (A/Wellington/01/2004) during the cluster transition period was classified differently, likely due to significant differences in sequence distribution.

The antigenic prediction model for NA was developed using an approach similar to that for HA. A key adjustment was replacing the receptor-binding features of HA with catalytic sites, which are more pertinent to NA function. Additionally, we identified NA epitopes de novo for feature calculation. While these adjustments did not represent significant innovations, the framework has proven effective with only minor modifications across different contexts (16). This suggests that with appropriate adjustments, our model can provide accurate predictions for NA antigenic correlations.

Both antigenic clusters and phylogenetic clades reflect evolutionary relationships between viral strains, representing phenotype and genotype, respectively. Unlike the continuous branching of phylogeny, antigenic clusters represent important discrete phenotypes for HA and NA, with nonlinear relationships to genetic changes (4). Variations at different sites have inconsistent effects on antigenicity. The phylogenetic tree for NA displayed a single-trunk structure, indicating minimal selection pressure. Furthermore, spatiotemporal analysis confirmed the continuous global replacement of older antigenic clusters by newer ones. Only eight major antigenic clusters were identified for NA over the past 60 years (approximately one cluster every 8 years), significantly fewer than for HA (approximately every 2 or 3 years). This phenomenon might be explained by the relatively lower mean rate of nucleotide substitution in NA, which could be partly attributed to stronger structural constraints on this enzyme compared to the receptor-binding protein, as well as the stronger selection pressure and greater immune pressure on HA, likely due to its role as the primary vaccine target and its higher distribution on the virus surface (17–18).

This study has several limitations that warrant consideration. First, while our dataset was sufficient for model development, a larger dataset would enable the construction of more sophisticated models with improved prediction performance. Second, although we developed the first ML antigenic classification model for NA, integration with HA and other important viral components is necessary for a comprehensive understanding of antigenic evolution and its implications for seasonal influenza. Third, due to variations in sequencing coverage, some regions had insufficient sequence data, leading to incomplete characterization of their antigenic landscapes. Similarly, the analysis of early antigenic clusters may be subject to sequencing bias. Finally, while our model demonstrated high predictive accuracy, validation with experimental data or real-world outcomes would further strengthen its applicability.

The findings of this study highlight the crucial role of NA in the antigenic evolution of human influenza A/H3N2 and its contribution to viral circulation and spread. Although NA currently receives less consideration in vaccine strain recommendation and antigenic surveillance, the tools developed in this study can facilitate improved antigenic monitoring, inform vaccine selection, and ultimately aid in the prevention and control of influenza epidemics as knowledge deepens and relevant technologies advance.

Acknowledgements

We gratefully acknowledge all the authors from the original laboratories who submitted and shared data on which this study is based. This work was supported by the BrightWing High-performance Computing Platform, School of Public Health (Shenzhen), and High-performance Computing Public Platform (Shenzhen Campus), Sun Yat-sen University.

Conflicts of interest: No conflicts of interest.

Reference (18)

Citation:

[1]	Petrova VN, Russell CA. The evolution of seasonal influenza viruses. Nat Rev Microbiol 2018;16(1):47 − 60.
[2]	Chen YQ, Wohlbold TJ, Zheng NY, Huang M, Huang YP, Neu KE, et al. Influenza infection in humans induces broadly cross-reactive and protective neuraminidase-reactive antibodies. Cell 2018;173(2):417 − 29.e10.
[3]	Weiss CD, Wang W, Lu Y, Billings M, Eick-Cost A, Couzens L, et al. Neutralizing and neuraminidase antibodies correlate with protection against influenza during a late season A/H3N2 outbreak among unvaccinated military recruits. Clin Infect Dis 2020;71(12):3096 − 102.
[4]	Sandbulte MR, Westgeest KB, Gao J, Xu XY, Klimov AI, Russell CA, et al. Discordant antigenic drift of neuraminidase and hemagglutinin in H1N1 and H3N2 influenza viruses. Proc Natl Acad Sci USA 2011;108(51):20748 − 53.
[5]	Khare S, Gurry C, Freitas L, Schultz MB, Bach G, Diallo A, et al. GISAID's role in pandemic response. China CDC Wkly 2021;3(49):1049 − 51.
[6]	Zhai K, Dong JZ, Zeng JF, Cheng PW, Wu XS, Han WJ, et al. Global antigenic landscape and vaccine recommendation strategy for low pathogenic avian influenza A (H9N2) viruses. J Infect 2024;89(2):106199.
[7]	Catani JPP, Smet A, Ysenbaert T, Vuylsteke M, Bottu G, Mathys J, et al. The antigenic landscape of human influenza N2 neuraminidases from 2009 until 2017. eLife 2024;12:RP90782
[8]	Gao J, Li X, Klenow L, Malik T, Wan HQ, Ye ZP, et al. Antigenic comparison of the neuraminidases from recent influenza A vaccine viruses and 2019–2020 circulating strains. npj Vaccines 2022;7(1):79
[9]	Schild GC, Oxford JS, Dowdle WR, Coleman M, Pereira MS, Chakraverty P. Antigenic variation in current influenza A viruses: evidence for a high frequency of antigenic ‘drift’ for the Hong Kong virus. Bull World Health Organ 1974;51(1):1-11. https://pubmed.ncbi.nlm.nih.gov/4218138/.
[10]	Kilbourne ED, Johansson BE, Grajower B. Independent and disparate evolution in nature of influenza A virus hemagglutinin and neuraminidase glycoproteins. Proc Natl Acad Sci USA 1990;87(2):786 − 90
[11]	Meng J, Liu JZ, Song WK, Li HL, Wang JY, Zhang L, et al. PREDAC-CNN: predicting antigenic clusters of seasonal influenza A viruses with convolutional neural network. Brief Bioinform 2024;25(2):bbae033.
[12]	Tubiana J, Schneidman-Duhovny D, Wolfson HJ. ScanNet: an interpretable geometric deep learning model for structure-based protein binding site prediction. Nat Methods 2022;19(6):730 − 9.
[13]	Blom N, Sicheritz-Pontén T, Gupta R, Gammeltoft S, Brunak S. Prediction of post-translational glycosylation and phosphorylation of proteins from the amino acid sequence. Proteomics 2004;4(6):1633 − 49.
[14]	Jagadesh A, Salam AAA, Mudgal PP, Arunkumar G. Influenza virus neuraminidase (NA): a target for antivirals and vaccines. Arch Virol 2016;161(8):2087 − 94.
[15]	Du XJ, Dong LB, Lan Y, Peng YS, Wu AP, Zhang Y, et al. Mapping of H3N2 influenza antigenic evolution in China reveals a strategy for vaccine strain recommendation. Nat Commun 2012;3(1):709.
[16]	Qiu JX, Qiu TY, Dong QL, Xu DP, Wang X, Zhang Q, et al. Predicting the antigenic relationship of foot-and-mouth disease virus for vaccine selection through a computational model. IEEE/ACM Trans Comput Biol Bioinf 2021;18(2):677 − 85.
[17]	Westgeest KB, de Graaf M, Fourment M, Bestebroer TM, van Beek R, Spronken MIJ, et al. Genetic evolution of the neuraminidase of influenza A (H3N2) viruses from 1968 to 2009 and its correspondence to haemagglutinin evolution. J Gen Virol 2012;93(9):1996 − 2007.
[18]	Harris A, Cardone G, Winkler DC, Heymann JB, Brecher M, White JM, et al. Influenza virus pleiomorphy characterized by cryoelectron tomography. Proc Natl Acad Sci USA 2006;103(50):19123 − 7.

Methods and Applications: Mapping the Global Antigenic Evolution of Human Influenza A/H3N2 Neuraminidase Based on a Machine Learning Model — 1968–2024