The complexity and diversity of the Pathogenicity Locus in Clostridium difficile clade 5.
Elliott B., Dingle KE., Didelot X., Crook DW., Riley TV.
The symptoms of Clostridium difficile infection are caused by two closely related toxins, TcdA and TcdB, which are encoded by the 19.6 kb Pathogenicity Locus (PaLoc). The PaLoc is variably present among strains, and in this respect it resembles a mobile genetic element. The C. difficile population structure consists mainly of five phylogenetic clades designated 1-5. Certain genotypes of clade 5 are associated with recently emergent highly pathogenic strains causing human disease and animal infections. The aim of this study was to explore the evolutionary history of the PaLoc in C. difficile clade 5. Phylogenetic analyses and annotation of clade 5 PaLoc variants and adjoining genomic regions were undertaken using a representative collection of toxigenic and nontoxigenic strains. Comparison of the core genome and PaLoc phylogenies obtained for clade 5 and representatives of the other clades identified two distinct PaLoc acquisition events, one involving a toxin A(+)B(+) PaLoc variant and the other an A(-)B(+) variant. Although the exact mechanism of each PaLoc acquisition is unclear, evidence of possible homologous recombination with other clades and between clade 5 lineages was found within the PaLoc and adjacent regions. The generation of nontoxigenic variants by PaLoc loss via homologous recombination with PaLoc-negative members of other clades was suggested by analysis of cdu2, although none is likely to have occurred recently. A variant of the putative holin gene present in the clade 5 A(-)B(+) PaLoc was likely acquired via allelic exchange with an unknown element. Fine-scale phylogenetic analysis of C. difficile clade 5 revealed the extent of its genetic diversity, consistent with ancient evolutionary origins and a complex evolutionary history for the PaLoc.