Research
Our lab focuses on the application of machine learning and deep learning to complex challenges in bioinformatics and cheminformatics. We specialize in developing novel computational methods for protein function prediction, drug-target interaction analysis, and de novo molecular design, aiming to accelerate the pace of scientific discovery.
Highlighted
      
      
      
        Target-specific de novo design of drug candidate molecules with graph-transformer-based generative adversarial networks
      
      
      
  
        Nature Machine Intelligence
         · 
        15 Sep 2025
         · 
        doi:10.1038/s42256-025-01082-y
      
      
        
        
        
      
    All
2025
      
      
      
        Molecular Contrastive Learning with Graph Attention Network (MoCL-GAT) for Enhanced Molecular Representation
      
      
      
  
        Cold Spring Harbor Laboratory
         · 
        19 Sep 2025
         · 
        doi:10.1101/2025.09.17.676724
      
      
        
        
        
      
    
      
      
      
        Target-specific de novo design of drug candidate molecules with graph-transformer-based generative adversarial networks
      
      
      
  
        Nature Machine Intelligence
         · 
        15 Sep 2025
         · 
        doi:10.1038/s42256-025-01082-y
      
      
        
        
        
      
    
      
      
      
        Mpox: disease manifestations and therapeutic development
      
      
      
  
        Journal of Virology
         · 
        11 Aug 2025
         · 
        doi:10.1128/jvi.00152-25
      
      
        
        
        
      
    
      
      
      
        ProtHGT: Heterogeneous Graph Transformers for Automated Protein Function Prediction Using Biological Knowledge Graphs and Language Models
      
      
      
  
        Cold Spring Harbor Laboratory
         · 
        24 Apr 2025
         · 
        doi:10.1101/2025.04.19.649272
      
      
        
        
        
      
    
      
      
      
        A Benchmarking Platform for Assessing Protein Language Models on Function-related Prediction Tasks
      
      
      
  
        Cold Spring Harbor Laboratory
         · 
        16 Apr 2025
         · 
        doi:10.1101/2025.04.10.648084
      
      
        
        
        
      
    
      
      
      
        Protein language models for predicting drug–target interactions: Novel approaches, emerging methods, and future directions
      
      
      
  
        Current Opinion in Structural Biology
         · 
        01 Apr 2025
         · 
        doi:10.1016/j.sbi.2025.103017
      
      
        
        
        
      
    
      
      
      
        A Benchmarking Platform for Assessing Protein Language Models on Function-Related Prediction Tasks
      
      
      
  
        Methods in Molecular Biology
         · 
        01 Jan 2025
         · 
        doi:10.1007/978-1-0716-4662-5_14
      
      
        
        
        
      
    2024
      
      
      
        Design, synthesis, and evaluation of novel Indole‐Based small molecules as sirtuin inhibitors with anticancer activities
      
      
      
  
        Drug Development Research
         · 
        21 Oct 2024
         · 
        doi:10.1002/ddr.70008
      
      
        
        
        
      
    
      
      
      
        Mutual annotation‐based prediction of protein domain functions with Domain2GO 
      
      
      
  
        Protein Science
         · 
        16 May 2024
         · 
        doi:10.1002/pro.4988
      
      
        
        
        
      
    2023
      
      
      
        Democratizing knowledge representation with BioCypher
      
      
      
  
        Nature Biotechnology
         · 
        19 Jun 2023
         · 
        doi:10.1038/s41587-023-01848-y
      
      
        
        
        
      
    
      
      
      
        Transfer learning for drug–target interaction prediction
      
      
      
  
        Bioinformatics
         · 
        01 Jun 2023
         · 
        doi:10.1093/bioinformatics/btad234
      
      
        
        
        
      
    
      
      
      
        SELFormer: molecular representation learning via SELFIES language models
      
      
      
  
        Machine Learning: Science and Technology
         · 
        01 Jun 2023
         · 
        doi:10.1088/2632-2153/acdb30
      
      
        
        
        
      
    
      
      
      
        How to approach machine learning-based prediction of drug/compound–target interactions
      
      
      
  
        Journal of Cheminformatics
         · 
        06 Feb 2023
         · 
        doi:10.1186/s13321-023-00689-w
      
      
        
        
        
      
    
      
      
      
        ProFAB—open protein functional annotation benchmark
      
      
      
  
        Briefings in Bioinformatics
         · 
        03 Feb 2023
         · 
        doi:10.1093/bib/bbac627
      
      
        
        
        
      
    
      
      
      
        Target Specific De Novo Design of Drug Candidate Molecules with Graph Transformer-based Generative Adversarial Networks
      
      
      
  
        arXiv
         · 
        01 Jan 2023
         · 
        doi:10.48550/ARXIV.2302.07868
      
      
        
        
        
      
    
      
      
      
        ASCARIS: Positional feature annotation and protein structure-based representation of single amino acid variations
      
      
      
  
        Computational and Structural Biotechnology Journal
         · 
        01 Jan 2023
         · 
        doi:10.1016/j.csbj.2023.09.017
      
      
        
        
        
      
    2022
      
      
      
        Machine learning-based prediction of drug approvals using molecular, physicochemical, clinical trial, and patent-related features
      
      
      
  
        Expert Opinion on Drug Discovery
         · 
        02 Dec 2022
         · 
        doi:10.1080/17460441.2023.2153830
      
      
        
        
        
      
    
      
      
      
        Mutual Annotation-Based Prediction of Protein Domain Functions with Domain2GO
      
      
      
  
        Cold Spring Harbor Laboratory
         · 
        04 Nov 2022
         · 
        doi:10.1101/2022.11.03.514980
      
      
        
        
        
      
    
      
      
      
        SLPred: a multi-view subcellular localization prediction tool for multi-location human proteins
      
      
      
  
        Bioinformatics
         · 
        08 Jul 2022
         · 
        doi:10.1093/bioinformatics/btac458
      
      
        
        
        
      
    
      
      
      
        How to Best Represent Proteins in Machine Learning-based Prediction of Drug/Compound-Target Interactions
      
      
      
  
        Cold Spring Harbor Laboratory
         · 
        01 May 2022
         · 
        doi:10.1101/2022.05.01.490207
      
      
        
        
        
      
    
      
      
      
        Learning functional properties of proteins with language models
      
      
      
  
        Nature Machine Intelligence
         · 
        21 Mar 2022
         · 
        doi:10.1038/s42256-022-00457-9
      
      
        
        
        
      
    2021
      
      
      
        Editorial: Machine Learning Methodologies to Study Molecular Interactions
      
      
      
  
        Frontiers in Molecular Biosciences
         · 
        03 Dec 2021
         · 
        doi:10.3389/fmolb.2021.806474
      
      
        
        
        
      
    
      
      
      
        Data Centric Molecular Analysis and Evaluation of Hepatocellular Carcinoma Therapeutics Using Machine Intelligence-Based Tools
      
      
      
  
        Journal of Gastrointestinal Cancer
         · 
        01 Dec 2021
         · 
        doi:10.1007/s12029-021-00768-x
      
      
        
        
        
      
    
      
      
      
        Protein domain-based prediction of drug/compound–target interactions and experimental validation on LIM kinases
      
      
      
  
        PLOS Computational Biology
         · 
        29 Nov 2021
         · 
        doi:10.1371/journal.pcbi.1009171
      
      
        
        
        
      
    
      
      
      
        CROssBAR: comprehensive resource of biomedical relations with knowledge graph representations
      
      
      
  
        Nucleic Acids Research
         · 
        28 Jun 2021
         · 
        doi:10.1093/nar/gkab543
      
      
        
        
        
      
    
      
      
      
        Protein Domain-Based Prediction of Compound–Target Interactions and Experimental Validation on LIM Kinases
      
      
      
  
        Cold Spring Harbor Laboratory
         · 
        14 Jun 2021
         · 
        doi:10.1101/2021.06.14.448307
      
      
        
        
        
      
    2020
      
      
      
        MDeePred: novel multi-channel protein featurization for deep learning-based binding affinity prediction in drug discovery
      
      
      
  
        Bioinformatics
         · 
        07 Dec 2020
         · 
        doi:10.1093/bioinformatics/btaa858
      
      
        
        
        
      
    
      
      
      
        Evaluation of Methods for Protein Representation Learning: A Quantitative Analysis
      
      
      
  
        Cold Spring Harbor Laboratory
         · 
        28 Oct 2020
         · 
        doi:10.1101/2020.10.28.359828
      
      
        
        
        
      
    
      
      
      
        CROssBAR: Comprehensive Resource of Biomedical Relations with Deep Learning Applications and Knowledge Graph Representations
      
      
      
  
        Cold Spring Harbor Laboratory
         · 
        15 Sep 2020
         · 
        doi:10.1101/2020.09.14.296889
      
      
        
        
        
      
    
      
      
      
        iBioProVis: interactive visualization and analysis of compound bioactivity space
      
      
      
  
        Bioinformatics
         · 
        14 May 2020
         · 
        doi:10.1093/bioinformatics/btaa496
      
      
        
        
        
      
    
      
      
      
        DEEPScreen: high performance drug–target interaction prediction with convolutional neural networks using 2-D structural compound representations
      
      
      
  
        Chemical Science
         · 
        01 Jan 2020
         · 
        doi:10.1039/c9sc03414e
      
      
        
        
        
      
    2019
      
      
      
        DEEPred: Automated Protein Function Prediction with Multi-task Feed-forward Deep Neural Networks
      
      
      
  
        Scientific Reports
         · 
        14 May 2019
         · 
        doi:10.1038/s41598-019-43708-3
      
      
        
        
        
      
    2018
      
      
      
        ECPred: a tool for the prediction of the enzymatic functions of protein sequences based on the EC nomenclature
      
      
      
  
        BMC Bioinformatics
         · 
        21 Sep 2018
         · 
        doi:10.1186/s12859-018-2368-y
      
      
        
        
        
      
    
      
      
      
        Recent applications of deep learning and machine intelligence on in silico drug discovery: methods, tools and databases
      
      
      
  
        Briefings in Bioinformatics
         · 
        31 Jul 2018
         · 
        doi:10.1093/bib/bby061
      
      
        
        
        
      
    
      
      
      
        HPO2GO: prediction of human phenotype ontology term associations using cross ontology annotation co-occurrences
      
      
      
  
        PeerJ
         · 
        29 Jul 2018
         · 
        doi:10.7287/peerj.preprints.26663v2
      
      
        
        
        
      
    
      
      
      
        A Structural Perspective on the Modulation of Protein-Protein Interactions with Small Molecules
      
      
      
  
        Current Topics in Medicinal Chemistry
         · 
        18 Jul 2018
         · 
        doi:10.2174/1568026618666180601080824
      
      
        
        
        
      
    
      
      
      
        Phylogenetic and Other Conservation-Based Approaches to Predict Protein Functional Sites
      
      
      
  
        Methods in Molecular Biology
         · 
        01 Jan 2018
         · 
        doi:10.1007/978-1-4939-7756-7_4
      
      
        
        
        
      
    2017
      
      
      
        Large‐scale automated function prediction of protein sequences and an experimental case study validation on PTEN transcript variants
      
      
      
  
        Proteins: Structure, Function, and Bioinformatics
         · 
        29 Nov 2017
         · 
        doi:10.1002/prot.25416
      
      
        
        
        
      
    2016
      
      
      
        UniProt-DAAC: domain architecture alignment and classification, a new method for automatic functional annotation in UniProtKB
      
      
      
  
        Bioinformatics
         · 
        07 Mar 2016
         · 
        doi:10.1093/bioinformatics/btw114