Darwin Tree of Life: Sequencing All Complex Life in UK and Ireland
- The Darwin Tree of Life project is utilizing advanced genomic sequencing and computational tools to map the genetic blueprints of approximately 70,000 eukaryotic species found across Britain and...
- The project is a collaboration involving ten biodiversity, genomics, and analysis partners, including the Wellcome Sanger Institute and Genome Research Limited.
- The initiative focuses on the biota of Britain and Ireland because these regions represent some of the most deeply studied and well-documented biological environments in the world.
The Darwin Tree of Life project is utilizing advanced genomic sequencing and computational tools to map the genetic blueprints of approximately 70,000 eukaryotic species found across Britain and Ireland. By generating high-quality reference genomes for animals, plants, fungi, and protists, the initiative aims to transform the application of biotechnology, medicine, and conservation science.
The project is a collaboration involving ten biodiversity, genomics, and analysis partners, including the Wellcome Sanger Institute and Genome Research Limited. This effort is part of a larger global movement known as the Earth BioGenome Project, which seeks to sequence all complex life on Earth.
Technical Approach and Implementation
The initiative focuses on the biota of Britain and Ireland because these regions represent some of the most deeply studied and well-documented biological environments in the world. The project team collects representative specimens of each species and employs advanced DNA sequencing technologies to produce high-resolution genome sequences.

Once the sequences are generated, the project uses cutting-edge computational tools to analyze how DNA sequences translate into the observable diversity of life. The workflow involves extracting and sequencing both DNA and RNA, followed by the assembly and annotation of the genomes.
The project was structured with an initial two-year phase designed to establish necessary technical skills and prove the utility of the data. During this first phase, the project aimed to sequence 2,000 species, providing reference genomes for approximately one-third of the organism families present in the region.
Applications in Biotechnology and Conservation
The genomic data produced by the Darwin Tree of Life is intended for open release, allowing researchers and developers to use the information for various applied goals. The project identifies several key areas where this data provides value:
- Biotechnology and Medicine: Providing new tools and genetic insights for medical research and industrial biotech applications.
- Conservation: Aiding efforts to mitigate the effects of what is described as the
sixth great extinction
, an event threatening the biodiversity essential to human society. - Evolutionary Biology: Using genomic data to understand the evolution of life’s diversity and the biology of specific organisms and ecosystems.
Open Data and Community Integration
A central component of the project is the commitment to open science. The resulting genomic data is made available for re-use in biological research and beyond, ensuring that the high-quality reference genomes can be accessed by the wider scientific community.
To prioritize the sequencing process, the project incorporates community input to determine which species should be sequenced first. Information regarding species currently undergoing sequencing is tracked and shared via the Darwin Tree of Life Data Portal.
By linking all life through the common thread of DNA sequences, the project provides a scalable framework for understanding biodiversity. The integration of genomics into conservation efforts is intended to help reveal and understand biodiversity to better mitigate catastrophic environmental changes.
