Publications

A Provable Energy-Guided Test-Time Defense Boosting Adversarial Robustness of Large Vision-Language Models
Provable Unlearning with Gradient Ascent on Two-Layer ReLU Neural Networks
MALT Powers Up Adversarial Attacks
Explaining high-dimensional text classifiers
Adversarial Examples Exist in Two-Layer ReLU Networks for Low Dimensional Linear Subspaces
The dimpled manifold model of adversarial examples in machine learning