T cell activation is governed by T cell receptors (TCRs), heterodimers of two sequence-variable chains (often an alpha [α] and a beta [β] chain) that synergistically recognise antigen fragments presented on cell surfaces. Despite this, existing repositories collect only single-chain, not paired-chain, TCR sequence data. We have addressed this gap by creating the Observed T cell receptor Space (OTS) database, a source of consistently processed and annotated, full-length, paired-chain TCR sequences. Currently, OTS contains 5.35M redundant (1.63M non-redundant), predominantly human sequences from 50 studies and at least 75 individuals. Using OTS, we identify pairing biases, public TCRs, and distinct chain coherence patterns relative to antibodies. We also release a paired-chain TCR language model, which provides paired embedding representations and a method for residue in-filling conditional on the partner chain. OTS will be updated as a central community resource; it is freely downloadable and available as a web application at https://opig.stats.ox.ac.uk/webapps/ots.

Type

Journal article

Journal

Cell Reports

Publisher

Elsevier (Cell Press)

Publication Date

16/08/2024