Project overview

Exon skipping RNA therapy: Antisense oligonucleotide (ASO)-mediated exon skipping is a precision medicine strategy for genetic diseases. Exons containing pathogenic variants, like a truncating DNA variant, can be skipped (removed from the mRNA) while maintaining the reading frame and produce a truncated, but partially functional protein. This principle can be applied to develop custom-made treatments for patients with very rare disorders and unique pathogenic variants.

Challenges with current targets:Exon skipping therapies usually focus on truncating mutations (nonsense or frameshift), because removing a frameshifting variant can restore the protein’s reading frame. In contrast, pathogenic missense variants typically reside in crucial parts of a protein. Skipping an exon carrying such a variant often removes an important functional domain, yielding a nonfunctional protein. For this reason, missense variants or pathogenic variants in protein domains are generally considered poor candidates for exon skipping, and potentially useful targets may be overlooked.

Tandem repeat domains as an opportunity: Some proteins contain multiple adjacent, similar domains (tandem repeat domains). In such proteins, losing one repeat may not fully disrupt function, as the remaining repeats can potentially compensate. This makes tandem-repeat proteins potential exceptions: if a missense mutation affects one repeat, skipping the corresponding exon could still produce a functional protein, presenting an opportunity for exon skipping therapy.

Challenges with current targets:Some proteins contain multiple adjacent, similar domains (tandem repeat domains). In such proteins, losing one repeat may not fully disrupt function, as the remaining repeats can potentially compensate. This makes tandem-repeat proteins potential exceptions: if a missense mutation affects one repeat, skipping the corresponding exon could still produce a functional protein, presenting an opportunity for exon skipping therapy.

The TReXome DB: To leverage this idea, we created TReXome DB, a web-accessible database of human genes with tandem repeat protein domains. Users can search and filter these genes by exon frame, exon length, and repeat count, and even restrict to disease-associated genes curated for genetic therapy suitability. The database highlights which exons fall in repeat regions and provides metrics for each. It also integrates external tools – for example, AlphaFold protein structures with annotated repeats to visualize the different regions. In practice, researchers or clinicians can look up a gene of interest, see if it contains protein tandem repeats, and use the filters to find feasible skipping candidates that might restore function.

Conditions of Use and Data Availability

TReXome DB is intended for researchers and clinicians working on RNA therapies, genetic disease, and protein function. It can help guide lab and clinical investigations by suggesting non-obvious exon-skipping targets.

TReXome DB code is freely available via GitHub: https://github.com/DCRT-LUMC/Tandem-Repeat-Domain-Database under an AGPL-3.0 license.

This program is distributed in the hope that it will be useful, but WITHOUT ANY WARRANTY; without even the implied warranty of MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU Affero General Public License for more details.

Credits & Sources

This project was made possible through the use of the following resources and data sources:

Data Sources
Ensembl

Dyer, S. C. et al. Ensembl 2025. Nucleic Acids Research 53, D948–D957 (2024) (v113).

Visit Ensembl
UniProt

Bateman, A. et al. UniProt: the Universal Protein Knowledgebase in 2025. Nucleic Acids Research 53, D609–D617 (2024).

Visit UniProt
AlphaFold

Abramson, J. et al. Accurate structure prediction of biomolecular interactions with AlphaFold 3. Nature 630, 493–500 (2024).

Visit AlphaFold
Tools & Libraries
3Dmol.js

Rego, N. & Koes, D. 3Dmol.js: molecular visualization with WebGL. Bioinformatics 31, 1322–1324 (2014).

Visit 3Dmol.js
ensembldb R Package

Rainer, J., Gatto, L. & Weichenberger, C. X. ensembldb: an R package to create and use Ensembl-based annotation resources. Bioinformatics 35, 3151–3153 (2019).

Visit ensembldb
Bootstrap

The world's most popular front-end open source toolkit

MIT License

Visit Bootstrap
DataTables

Table plug-in for jQuery

MIT License

Visit DataTables
People & Institutions
Dutch Center for RNA Therapeutics

Leiden University Medical Center

https://rnatherapy.nl/

Okke Faber

MSc student

Jack Morgan

PhD candidate

Dr. Marlen Lauffer

Senior researcher

Project lead

Acknowledgments

The DCRT Leiden team for feedback!