Tuan Le

prof_pic.jpg
📍 Berlin, Germany

I work on generative models for molecular design at Pfizer. My work sits at the intersection of probability, statistics, geometry, and chemistry. I’m interested in building systems that can generate molecules across several settings, including small-molecule drug discovery, protein sequence design, and other inverse problems in molecular design.

I earned a Ph.D. in computer science from Freie Universität Berlin, where I worked with Frank Noé on machine learning for molecular design. My Ph.D. research also grew out of a close collaboration with Djork-Arné Clevert. I’ve continued that collaboration at Bayer and Pfizer, working with teams in computational chemistry and machine learning.

Research interests

I’m interested in how generative models, especially diffusion models and flow matching, can be used for inverse design across molecular domains. That includes small-molecule drug discovery, where the goal may be to generate candidates for a target or property, as well as protein and sequence design.

I also care about how domain knowledge enters the model. In my experience, the math, physics, and chemistry behind the problem should shape the system itself. Respecting physical structure, using 3D geometry, conditioning on the right information, and building in domain-specific features usually leads to more realistic and more useful molecules.

Just as important to me is the step after the method paper: turning a model into something people can actually use. Strong benchmark results matter, but they are only part of the story. The work becomes more useful when it fits the way chemists and other domain experts actually work.

selected publications

  1. Coupled fragment-based generative modeling with stochastic interpolants
    Digital Discovery Mar 2026
  2. Diffusion Generative Modeling on Lie Group Representations
    In The Thirty-ninth Annual Conference on Neural Information Processing Systems Dec 2025
  3. Equivariant diffusion for structure-based de novo ligand generation with latent-conditioning
    Journal of Cheminformatics May 2025
  4. PILOT: equivariant diffusion for pocket-conditioned de novo ligand generation with multi-objective guidance via importance sampling
    Julian Cremer*Tuan Le*Frank NoéDjork-Arné Clevert, and 1 more author
    Chem. Sci. May 2024
  5. Navigating the Design Space of Equivariant Diffusion-Based Generative Models for De Novo 3D Molecule Generation
    Tuan Le*Julian Cremer*Frank NoéDjork-Arné Clevert, and 1 more author
    In The Twelfth International Conference on Learning Representations May 2024
  6. Representation Learning on Biomolecular Structures using
    Equivariant Graph Attention
    Tuan Le*Frank Noé, and Djork-Arné Clevert
    In Learning on Graphs Conference May 2022
  7. Parameterized Hypercomplex Graph Neural Networks for Graph Classification
    In Artificial Neural Networks and Machine Learning – ICANN 2021 May 2021
  8. Neuraldecipher – reverse-engineering extended-connectivity fingerprints (ECFPs) to their molecular structures
    Chem. Sci. May 2020