ColabFold: GPU-accelerated protein structure prediction comes to UseGalaxy.org
Optimized AlphaFold2-based protein structure prediction now available for all
The US server of Galaxy, UseGalaxy.org, is excited to announce the full release of the tool for ColabFold, offering accelerated prediction of protein structures and complexes by combining the fast homology search of MMseqs2 with AlphaFold2 or RoseTTAFold. The Galaxy version of this tool separates the run into two parts: the MSA step and the AlphaFold2 step, each completing runs on individual sequences within a few seconds to a few minutes. The output of the MSA step is intended as a direct input to the AlphaFold2 step, but this separation allows a more iterative approach than previous releases. Additionally, as the Galaxy version of ColabFold is run entirely on Galaxy servers, this significantly increases the number of proteins that can be analyzed using ColabFold globally, easing previous run limitations on this highly anticipated tool.
For more information about ColabFold, please visit the ColabFold repository on github and read the original publication, which announces its release and explains how it works in greater depth.
Please note some important details about the implementation of ColabFold on UseGalaxy.org:
- To ensure adequate resource availability, each Galaxy user is limited to two concurrent ColabFold MSA jobs and four concurrent ColabFold AlphaFold2 jobs.
- To ensure runtimes are manageable, a maximum of 20 sequences per FASTA/a3m file are allowed as inputs to the MSA tool.
- Because the AlphaFold2 tool runs on a limited shared resource, you may experience additional wait times before your jobs begin running - please leave any jobs you submit queued until they are complete. Other Galaxy jobs can be run while the ColabFold tools are queued. For additional help or to report jobs that do not run after a considerable time (more than 24 hours), please see the Galaxy Help forum.
Large Memory resources for the MSA step are supported by Advanced Research Computing at Hopkins. GPU resources for the AlphaFold2 step are provided by the Frontera computing project at the Texas Advanced Computing Center. Frontera is made possible by National Science Foundation award OAC-1818253. Additionally, we would like to thank Milot Mirdita (who also developed ColabFold) for his help with the Galaxy tool. This work was also supported by NHGRI award U24HG006620 and NSF award DBI-2419522.