Gabriel Huang (he/him)

Research Scientist @ ServiceNow | LLM Agent Safety, Red-Teaming, Evaluation & Post-Training | PhD ML

gabriel huang profile picture

I build safety, evaluation, and post-training systems for LLM agents that use tools, APIs, browsers, and enterprise UIs. At ServiceNow Research, my work spans three areas:

I am strongest at the intersection of research and implementation: designing attack and evaluation harnesses, building annotator workflows, calibrating LLM-as-a-Judge systems, and turning research ideas into reusable tools. I am especially interested in making agentic AI systems useful, robust, and safe under realistic deployment conditions.

I did my PhD at Mila, Université de Montréal with Simon Lacoste-Julien. During my PhD I interned at Google Research, where I led multimodal pretraining for dense video captioning on YouTube-scale instructional video, processing terabyte-scale corpora and achieving state-of-the-art on dense video captioning (AACL-IJCNLP 2020). I hold two MSc. from École Normale Supérieure and CentraleSupélec (ranked 1st of 500 graduates). Canadian PR based in Montréal. Native French, bilingual English, fluent everyday Mandarin.

Skills & Stack

PyTorch · HuggingFace (transformers, PEFT, TRL) · DeepSpeed · Accelerate · LLaMA-Factory · SFT / RLHF · LLM-as-a-Judge · rubric-based human eval · synthetic data pipelines · Agent frameworks (TapeAgents, BrowserGym) · red-teaming / prompt-injection tooling · Distributed LLM training · multi-VM network simulation (KVM/QEMU)

Publications

See Google Scholar for my most recent publications.

paper Indirect Prompt Injections: Are Firewalls All You Need, or Stronger Benchmarks?
Rishika Bhagwatkar, Kevin Kasa, Abhay Puri, Gabriel Huang, Irina Rish, Graham W. Taylor, Krishnamurthy Dvijotham, Alexandre Lacoste
NeurIPS Workshop 2025.
paper DoomArena: A Framework for Testing AI Agents Against Evolving Security Threats
Léo Boisvert, Mihir Bansal, Chandra Kiran Reddy Evuru, Gabriel Huang, Abhay Puri, Avinandan Bose, Maryam Fazel, Quentin Cappart, Jason Stanley, Alexandre Lacoste, Alexandre Drouin, Krishnamurthy Dvijotham
COLM 2025.
arXiv TapeAgents: A Holistic Framework for Agent Development and Optimization
Dzmitry Bahdanau, Nicolas Gontier, Gabriel Huang, Ehsan Kamalloo, Rafael Pardinas, Alex Piché, Torsten Scholak, Oleh Shliazhko, Jordan Prince Tremblay, Karam Ghanem, Soham Parikh, Mitul Tiwari, Quaizar Vohra
Tech Report.
paper Geo-Bench: Toward Foundation Models for Earth Monitoring
Alexandre Lacoste, Nils Lehmann, Pau Rodriguez, Evan David Sherwin, Hannah Kerner, Björn Lütjens, Jeremy Andrew Irvin, David Dao, Hamed Alemohammad, Alexandre Drouin, Mehmet Gunturkun, Gabriel Huang, David Vazquez, Dava Newman, Yoshua Bengio, Stefano Ermon, Xiao Xiang Zhu
NeurIPS 2023 (Datasets & Benchmarks).
paper A Survey of Self-Supervised and Few-Shot Object Detection
Gabriel Huang, Issam Laradji, David Vazquez, Simon Lacoste-Julien, Pau Rodriguez
IEEE TPAMI 2022.
paper Repurposing Pretrained Models for Robust Out-of-domain Few-Shot Learning
Namyeong Kwon, Hwidong Na, Gabriel Huang, Simon Lacoste-Julien
ICLR'21.
paper Multimodal Pretraining for Dense Video Captioning
Gabriel Huang, Bo Pang, Zhenhai Zhu, Clara Rivera, Radu Soricut
Introduces the Video Timeline Tags dataset (ViTT).
AACL-IJCNLP 2020. Google Research.
arXiv Are Few-Shot Learning Benchmarks too Simple ? Solving them without Task Supervision at Test-Time
This paper introduces Centroid Networks for Few-shot Clustering and Unsupervised Few-shot Classification
Gabriel Huang, Hugo Larochelle, Simon Lacoste-Julien
ICLR'19 workshop.
paper Negative Momentum for Improved Game Dynamics
Gauthier Gidel, Reyhane Askari Hemmat, Mohammad Pezeshki, Gabriel Huang, Rémi Lepriol, Simon Lacoste-Julien, Ioannis Mitliagkas.
AISTATS 2019
arXiv Parametric Adversarial Divergences are Good Task Losses for Generative Modeling
Gabriel Huang, Hugo Berard, Ahmed Touati, Gauthier Gidel, Pascal Vincent, Simon Lacoste-Julien.
ICML'17 Workshop, ICLR'18 Workshop, Montreal AI Symposium 2018, Submitted to JMLR
paper Scattering Networks for Hybrid Representation Learning
Edouard Oyallon, Sergey Zagoruyko, Gabriel Huang, Nikos Komodakis, Simon Lacoste-Julien, Matthew Blaschko, Eugene Belilovsky.
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI) 2018

Negative Momentum

Below is an interactive visualization of our paper Negative Momentum for Improved Game Dynamics:
(a) Learning rate (lr) and momentum (beta) hyperparameters.
(b) Resulting eigenvalues in the complex plane for SGD and SGD+momentum.

There is convergence if and only if all eigenvalues are inside the convergence ball (green).
Try to find the hyperparameters for convergence.

(a) Hyperparameters.

(b) Eigenvalues in complex plane.

SGD without momentum: using , eigenvalues are the convergence ball →
SGD with momentum: using and momentum , eigenvalues are the convergence ball →

Thin-8 dataset

The Thin-8 dataset consists of 1585 grayscale handwritten images of the digit 8, with resolution 512x512.
16 people were asked to draw the digit 8 about 100 times using a pen on a tablet PC running Microsoft Windows.
It was collected in October 2017 at the University of Montreal.
Download Thin-8 dataset here

If you use the Thin-8 dataset, please cite our paper :

@article{huang2018parametric,
                    title={Parametric Adversarial Divergences are Good Task Losses for Generative Modeling},
                    author={Huang, Gabriel and Berard, Hugo and Touati, Ahmed and Gidel, Gauthier and Vincent, Pascal and Lacoste-Julien, Simon},
                    journal={arXiv preprint arXiv:1708.02511},
                    year={2017}
                  }

Thanks to Alex, Akram, Aristide, David, Dendi, Eugene, Jae, Joao, Liam, Rémi, Rosemary, Shawn, Sina, and Xing for scribbling all those samples!

Contact

Email: gbxhuang@gmail.com
Montreal, QC, Canada