Cookies on this website

We use cookies to ensure that we give you the best experience on our website. If you click 'Accept all cookies' we'll assume that you are happy to receive all cookies and you won't see this message again. If you click 'Reject all non-essential cookies' only necessary cookies providing core functionality such as security, network management, and accessibility will be enabled. Click 'Find out more' for information on how to change your cookie settings.

Coeliac disease (CeD) is a T-cell mediated enteropathy triggered by dietary gluten which remains substantially under-diagnosed around the world. The diagnostic gold-standard requires histological assessment of intestinal biopsies taken at endoscopy while consuming a gluten-containing diet. However, there is a lack of concordance between pathologists in histological assessment, and both endoscopy and gluten challenge are burdensome and unpleasant for patients. Identification of gluten-specific T-cell receptors (TCRs) in the TCR repertoire could provide a less subjective diagnostic test, and potentially remove the need to consume gluten. We review published gluten-specific TCR sequences, and develop an interpretable machine learning model to investigate their diagnostic potential. To investigate this, we sequenced the TCR repertoires of mucosal CD4+ T cells from 20 patients with and without CeD. These data were used as a training dataset to develop the model, then an independently published dataset of 20 patients was used as the testing dataset. We determined that this model has a training accuracy of 100% and testing accuracy of 80% for the diagnosis of CeD, including in patients on a gluten-free diet (GFD). We identified 20 CD4+ TCR sequences with the highest diagnostic potential for CeD. The sequences identified here have the potential to provide an objective diagnostic test for CeD, which does not require the consumption of gluten.

Original publication




Journal article



Publication Date





T-cell repertoire, coeliac disease, gluten-free diet, machine learning, next generation sequencing, Humans, Celiac Disease, Glutens, T-Lymphocytes, Receptors, Antigen, T-Cell, Diet