Publications
2024
-
A Safe Harbor for AI Evaluation and Red Teaming
Shayne Longpre*, Sayash Kapoor*, Kevin Klyman*, Ashwin Ramaswami, Rishi Bommasani, Borhane Blili-Hamelin, Yangsibo Huang, Aviya Skowron, Yong Zheng Xin, Suhas Kotha, Yi Zeng, Weiyan Shi, Xianjun Yang, Reid Southen, Alexander Robey, Patrick Chao, Diyi Yang, Ruoxi Jia, Daniel Kang, Sandy Pentland, Arvind Narayanan, Percy Liang, Peter Henderson
> Preprint Mar 5
-
Defending Large Language Models against Jailbreak Attacks via Semantic Smoothing
Jiabao Ji*, Bairu Hou*, Alexander Robey*, George J. Pappas, Hamed Hassani, Yang Zhang, Eric Wong, Shiyu Chang
> Preprint Feb 25
2023
-
Data-Driven Modeling and Verification of Perception-Based Autonomous Systems
Thomas Waite, Alexander Robey, Hamed Hassani, George J. Pappas, Radoslav Ivanov
> Preprint Dec 11
-
Jailbreaking Black Box Large Language Models in Twenty Queries
Patrick Chao, Alexander Robey, Edgar Dobriban, Hamed Hassani, George J. Pappas, Eric Wong
> Preprint Oct 12
-
SmoothLLM: Defending LLMs Against Jailbreaking Attacks
Alexander Robey, Eric Wong, Hamed Hassani, George J. Pappas
> Preprint Oct 7
-
Adversarial Training Should Be Cast As a Non-Zero-Sum Game
[Best paper award @ ICML 2023 AdvML workshop]
Alexander Robey, Fabian Latorre, Hamed Hassani, George J. Pappas, Volkan Cevher
> ICLR 2024 Jul 19
-
Toward Certified Robustness Against Real-World Distribution Shifts
Haoze Wu*, Teruhiro Tagomori*, Alexander Robey*, Fengjun Yang*, Nikolai Matni, George J. Pappas, Hamed Hassani, Corina Pasareanu, Clark Barrett
> IEEE Conference on Secure and Trustworthy Machine Learning Feb 8
2022
-
Provable tradeoffs in adversarially robust classification
Edgar Dobriban, Hamed Hassani, David Hong, Alexander Robey
> IEEE Transactions on Information Theory Sep 15
-
Probable Domain Generalization via Quantile Risk Minimization
Cian Eastwood*, Alexander Robey*, Shashank Singh, Julius von Kügelgen, Hamed Hassani, George J. Pappas, Bernhard Schölkopf
> NeurIPS 2022 Sep 15
-
On the Sample Complexity of Stability Constrained Imitation Learning
[Oral @ L4DC 2022]
Stephen Tu, Alexander Robey, Tingnan Zhang, Nikolai Matni
> L4DC 2022 Jun 23
-
Chordal Sparsity for Lipschitz Constant Estimation of Deep Neural Networks
Anton Xue, Lars Lindemann, Alexander Robey, Hamed Hassani, George J. Pappas, Rajeev Alur
> CDC 2022 Apr 2
-
Probabilistically Robust Learning: Balancing Average- and Worst-case Performance
Alexander Robey, Luiz F. O. Chamon, George J. Pappas, Hamed Hassani
> ICML 2022 Feb 15
-
Do deep networks transfer invariances across classes?
Allan Zhou*, Fahim Tajwar*, Alexander Robey, Tom Knowles, George J. Pappas, Hamed Hassani, Chelsea Finn
> ICLR 2022 Jan 20
2021
-
Model-Based Domain Generalization
Alexander Robey, George J. Pappas, Hamed Hassani
> NeurIPS 2021 Nov 26
-
Adversarial Robustness with Semi-Infinite Constrained Learning
Alexander Robey*, Luiz F. O. Chamon*, George J. Pappas, Hamed Hassani, Alejandro Ribeiro
> NeurIPS 2021 Nov 26
-
Learning Robust Output Control Barrier Functions from Safe Expert Demonstrations
Alexander Robey, Lars Lindemann, Lejun Jiang, Stephen Tu, Nikolai Matni
> Preprint Oct 22
-
Learning Robust Hybrid Control Barrier Functions for Uncertain Systems
Alexander Robey, Lars Lindemann, Stephen Tu, Nikolai Matni
> ADHS 2021 Jul 7
-
Optimal Algorithms for Submodular Maximization with Distributed Constraints
Alexander Robey, Arman Adibi, Brent Schlotfeldt, Hamed Hassani, George J. Pappas
> L4DC 2021 Jun 7
2020
-
Learning control barrier functions from expert demonstrations
Alexander Robey*, Haimin Hu*, Lars Lindemann, Hanwen Zhang, Dimos V. Dimarogonas, Stephen Tu, Nikolai Matni
> CDC 2020 Dec 14
-
Learning Robust Hybrid Control Barrier Functions from Data
Lars Lindemann, Haimin Hu, Alexander Robey, Hanwen Zhang, Dimos V. Dimarogonas, Stephen Tu, Nikolai Matni
> CoRL 2020 Oct 16
-
Model-Based Robust Deep Learning
Alexander Robey, Hamed Hassani, George J. Pappas
> Preprint May 20
2019