Article

MAI-TargetFisher: A proteome-wide drug target prediction method synergetically enhanced by artificial intelligence and physical modeling

Shi-wei Li1, Peng-xuan Ren1, Lin Wang1, Qi-lei Han1, Feng-lei Li1,2, Hong-lin Li3, Fang Bai1,2,4
1 Shanghai Institute for Advanced Immunochemical Studies and School of Life Science and Technology, ShanghaiTech University, Shanghai 201210, China
2 School of Information Science and Technology, ShanghaiTech University, Shanghai 201210, China
3 Innovation Center for AI and Drug Discovery, East China Normal University, Shanghai 200062, China
4 Shanghai Clinical Research and Trial Center, Shanghai 201210, China
Correspondence to: Fang Bai: baifang@shanghaitech.edu.cn,
DOI: 10.1038/s41401-024-01444-z
Received: 4 October 2024
Accepted: 24 November 2024
Advance online: 27 January 2025

Abstract

Computational target identification plays a pivotal role in the drug development process. With the significant advancements of deep learning methods for protein structure prediction, the structural coverage of human proteome has increased substantially. This progress inspired the development of the first genome-wide small molecule targets scanning method. Our method aims to localize drug targets and detect potential off-target effects early in the drug discovery process, thereby improving the success rate of drug development. We have constructed a high-quality database of protein structures with annotated potential binding sites, covering 82% of the protein-coding genome. On the basis of this database, to enhance our search capabilities, we have integrated computational techniques, including both artificial intelligence-based and biophysical model-based methods. This integration led to the development of a target identification method called Multi-Algorithm Integrated Target Fisher (MAI-TargetFisher). MAI-TargetFisher leverages the complementary strengths of various methods while minimizing their weaknesses, enabling precise database navigation to generate a reliably ranked set of candidate targets for an active query molecule. Importantly, our work is the first comprehensive scan of protein surfaces across the entire human genome, aimed at evaluating potential small molecule binding sites on each protein. Through a series of evaluations on benchmark and a target identification task, the results demonstrate the high hit rates and good reliability of our method under the validation of wet experiments. We have also made available a freely accessible web server at https://bailab.siais.shanghaitech.edu.cn/mai-targetfisher for non-commercial use.

Keywords: target identification; deep-learning; binding site identification; ligand-based searching; reverse docking

Article Options

Download Citation

Cited times in Scopus