Enabling Complex Scientific Research
Scientists everywhere can now access Evo 2, a powerful new foundation model that understands the genetic code for all domains of life. Unveiled today as the largest publicly available AI model for genomic data, it was built on the NVIDIA DGX Cloud platform in a collaboration led by nonprofit biomedical research organization Arc Institute and Stanford University.
Evo 2: A Major Milestone for Generative Genomics
Trained on an enormous dataset of nearly 9 trillion nucleotides, Evo 2 can be applied to biomolecular research applications, including predicting the form and function of proteins based on their genetic sequence, identifying novel molecules for healthcare and industrial applications, and evaluating how gene mutations affect their function.
"The Evo 2 represents a major milestone for generative genomics," said Patrick Hsu, Arc Institute cofounder and core investigator. "By advancing our understanding of these fundamental building blocks of life, we can pursue solutions in healthcare and environmental science that are unimaginable today."
NVIDIA NIM Microservice for Evo 2
The NVIDIA NIM microservice for Evo 2 enables users to generate a variety of biological sequences, with settings to adjust model parameters. Developers interested in fine-tuning Evo 2 on their proprietary datasets can download the model through the open-source NVIDIA BioNeMo Framework, a collection of accelerated computing tools for biomolecular research.
Applications Across Biomolecular Sciences
Evo 2 can provide insights into DNA, RNA, and proteins. Trained on a wide array of species across domains of life, including plants, animals, and bacteria, the model can be applied to scientific fields such as healthcare, agricultural biotechnology, and materials science.
Meet the Requirements of Complex Scientific Research
Established in 2021 with $650 million from its founding donors, Arc Institute empowers researchers to tackle long-term scientific challenges by providing scientists with multi-year funding, allowing them to focus on innovative research instead of grant writing.
Conclusion
Evo 2 is a powerful tool for scientists to understand the genetic code for all domains of life, enabling complex scientific research and applications across biomolecular sciences. With its ability to process lengthy sequences of genetic information, Evo 2 can unlock insights into the connection between distant parts of an organism’s genetic code and the mechanics of cell function, gene expression, and disease.
Frequently Asked Questions
Q: What is Evo 2?
A: Evo 2 is a powerful new foundation model that understands the genetic code for all domains of life.
Q: What is the purpose of Evo 2?
A: Evo 2 is designed to enable complex scientific research and applications across biomolecular sciences, including healthcare, agricultural biotechnology, and materials science.
Q: How was Evo 2 trained?
A: Evo 2 was trained on an enormous dataset of nearly 9 trillion nucleotides.
Q: What are the applications of Evo 2?
A: Evo 2 can be applied to biomolecular research applications, including predicting the form and function of proteins, identifying novel molecules for healthcare and industrial applications, and evaluating how gene mutations affect their function.
Q: How can I access Evo 2?
A: Evo 2 is available to global developers on the NVIDIA BioNeMo platform, including as an NVIDIA NIM microservice for easy, secure AI deployment.

