Hi, I am a Ph.D. candidate at UCLA in Computer Science, working with Prof. Cho-Jui Hsieh. I received my B.Eng. degree in Computer Science and Technology from Tsinghua University in June 2020. My research interest is machine learning, especially improving trustworthiness and generalization of machine learning models. I am currently supported by Amazon Fellowship.
For anyone interested in my research: Please feel free to email me if you are interested in a discussion on research or potential collaborations.
* indicates equal contribution.
-
Defending LLMs against Jailbreaking Attacks via Backtranslation
Yihan Wang*, Zhouxing Shi* , Andrew Bai , and Cho-Jui Hsieh
arXiv preprint arXiv:2402.16459, 2024
-
Red teaming language model detectors with language models
Zhouxing Shi* , Yihan Wang*, Fan Yin* , Xiangning Chen , Kai-Wei Chang , and Cho-Jui Hsieh
Transactions of the Association for Computational Linguistics, 2024
-
Universality and limitations of prompt tuning
Yihan Wang, Jatin Chauhan , Wei Wang , and Cho-Jui Hsieh
Advances in Neural Information Processing Systems, 2024
-
Two-stage LLM Fine-tuning with Less Specialization and More Generalization
Yihan Wang, Si Si , Daliang Li , Michal Lukasik , Felix Yu , Cho-Jui Hsieh , Inderjit S Dhillon , and Sanjiv Kumar
In The Twelfth International Conference on Learning Representations , 2023
-
On the Convergence of Certified Robust Training with Interval Bound Propagation
Yihan Wang*, Zhouxing Shi* , Quanquan Gu , and Cho-Jui Hsieh
In International Conference on Learning Representations , 2021
-
Fast certified robust training with short warmup
Zhouxing Shi* , Yihan Wang*, Huan Zhang , Jinfeng Yi , and Cho-Jui Hsieh
Advances in Neural Information Processing Systems, 2021
-
On lp-norm robustness of ensemble decision stumps and trees
Yihan Wang, Huan Zhang , Hongge Chen , Duane Boning , and Cho-Jui Hsieh
In International Conference on Machine Learning , 2020
-
Automatic perturbation analysis for scalable certified robustness and beyond
Kaidi Xu , Zhouxing Shi , Huan Zhang , Yihan Wang, Kai-Wei Chang , Minlie Huang , Bhavya Kailkhura , Xue Lin , and Cho-Jui Hsieh
Advances in Neural Information Processing Systems, 2020