In silico typing maps the natural diversity of Escherichia coli transporter-dependent capsules.

Miravet-Verde S., Cacace E., Mores CR., Rutschmann C., Lin C-W., Ruscheweyh H-J., Cuénod A., Barazzone EC., Marrec E., Vershynina K., Schumann R., Bower DJ., Schubert M., Egli A., Fiebig T., Slack E., Sunagawa S., Keys TG.

Serotyping identifies bacterial variants based on surface antigens, traditionally using antibody-based assays, but has been increasingly replaced by in silico methods that infer serotypes from genomic sequences for faster, scalable and more reproducible analyses. However, traditional Escherichia coli capsule serotyping has largely fallen out of use since the 1990s, leaving gaps in our knowledge of capsule genetics, diversity, distribution and epidemiology. As capsules influence bacterial interactions with phages, host immune systems and the environment, this gap limits our understanding of E. coli ecology and pathogenicity as well as vaccine and diagnostic development. Here we established a definitive genotype-serotype map for 35 serologically identified and structurally characterized transporter-dependent capsules. We then surveyed 37,723 E. coli genomes, cataloguing 85 transporter-dependent capsule types (K-types), including 55 types that were not part of the reference collection. We leveraged this catalogue to develop a hidden Markov model-based in silico serotyping tool, kTYPr, and applied it to curated sets of 24,015 E. coli genomes and 2,762 metagenome-assembled genomes spanning diverse environmental and clinical sources. We found previously uncharacterized K-types enriched in undersampled environments and associated with E. coli disease. This study expands our understanding of E. coli surface structures, supporting efforts for precision targeting with phage therapy or vaccines.

DOI

10.1038/s41564-026-02323-5

Type

Journal article

Publication Date

2026-04-23T00:00:00+00:00

Permalink More information Close