Antibody responses to SARS-CoV-2 train machine learning to assign likelihood of past infection during virus emergence in Sweden
Abstract Serology is critical for understanding pathogen-specific immune responses, but is fraught with difficulty, not least because the strength of antibody (Ab) response varies greatly between individuals and mild infections generally generate lower Ab titers1–3. We used robust IgM, IgG and IgA Ab tests to evaluate anti-SARS-CoV-2 responses in individuals PCR+ for virus RNA (n=105) representing different categories of disease severity, including mild cases. All PCR+ individuals in the study became IgG-positive against pre-fusion trimers of the virus spike (S) glycoprotein, but titers varied greatly. Elevated IgA, IL-6 and neutralizing responses were present in intensive care patients. Additionally, blood donors and pregnant women (n=2,900) sampled throughout the first wave of the pandemic in Stockholm, Sweden, further demonstrated that anti-S IgG titers differed several orders of magnitude between individuals, with an increase of low titer values present in the population at later time points4,5. To improve upon current methods to identify low titers and extend the utility of individual measures6,7, we used our PCR+ individual data to train machine learning algorithms to assign likelihood of past infection. Using these tools that assigned probability to individual responses against S and the receptor binding domain (RBD), we report SARS-CoV-2-specific IgG in 13.7% of healthy donors five months after the peak of spring COVID-19 deaths, when mortality and ICU occupancy in the country due to the virus were at low levels. These data further our understanding of antibody responses to the virus and provide solutions to problems in serology data analysis. Significance statement Antibody testing provides critical clinical and epidemiological information during an emerging disease pandemic. We developed robust SARS-CoV-2 IgM, IgG and IgA antibody tests and profiled COVID-19 patients and exposed individuals throughout the outbreak in Stockholm, Sweden, where full societal lockdown was not employed. As well as elucidating several disease immunophenotypes, our data highlight the challenge of identifying low IgG titer individuals, who comprise a significant proportion of the population following mild/asymptomatic infection, especially as antibody titers wane following peak responses. To provide a solution to this, we used SARS-CoV-2 PCR+ individual data to develop machine learning approaches that assigned likelihood of past infection to blood donors and pregnant women, improving the accuracy and utility of individual and population-level Ab measures.