Yihan Wang

UCLA Computer Science

prof_pic.jpg

Hi, I am a Ph.D. candidate at UCLA in Computer Science, working with Prof. Cho-Jui Hsieh. I received my B.Eng. degree in Computer Science and Technology from Tsinghua University in June 2020. My research interest is machine learning, especially improving trustworthiness and generalization of machine learning models. I am currently supported by Amazon Fellowship.

For anyone interested in my research: Please feel free to email me if you are interested in a discussion on research or potential collaborations.

selected publications

* indicates equal contribution.

  1. Arxiv Preprint
    Defending LLMs against Jailbreaking Attacks via Backtranslation
    Yihan Wang*, Zhouxing Shi* , Andrew Bai , and Cho-Jui Hsieh
    arXiv preprint arXiv:2402.16459, 2024
  2. TACL
    Red teaming language model detectors with language models
    Zhouxing Shi* , Yihan Wang*, Fan Yin* , Xiangning Chen , Kai-Wei Chang , and Cho-Jui Hsieh
    Transactions of the Association for Computational Linguistics, 2024
  3. NeurIPS 2023
    Universality and limitations of prompt tuning
    Yihan Wang, Jatin Chauhan , Wei Wang , and Cho-Jui Hsieh
    Advances in Neural Information Processing Systems, 2024
  4. ICLR 2024
    Two-stage LLM Fine-tuning with Less Specialization and More Generalization
    Yihan Wang, Si Si , Daliang Li , Michal Lukasik , Felix Yu , Cho-Jui Hsieh , Inderjit S Dhillon , and Sanjiv Kumar
    In The Twelfth International Conference on Learning Representations , 2023
  5. ICLR 2022
    On the Convergence of Certified Robust Training with Interval Bound Propagation
    Yihan Wang*, Zhouxing Shi* , Quanquan Gu , and Cho-Jui Hsieh
    In International Conference on Learning Representations , 2021
  6. NeurIPS 2021
    Fast certified robust training with short warmup
    Zhouxing Shi* , Yihan Wang*, Huan Zhang , Jinfeng Yi , and Cho-Jui Hsieh
    Advances in Neural Information Processing Systems, 2021
  7. ICML 2020
    On lp-norm robustness of ensemble decision stumps and trees
    Yihan Wang, Huan Zhang , Hongge Chen , Duane Boning , and Cho-Jui Hsieh
    In International Conference on Machine Learning , 2020
  8. NeurIPS 2020
    Automatic perturbation analysis for scalable certified robustness and beyond
    Kaidi Xu , Zhouxing Shi , Huan Zhang , Yihan Wang, Kai-Wei Chang , Minlie Huang , Bhavya Kailkhura , Xue Lin , and Cho-Jui Hsieh
    Advances in Neural Information Processing Systems, 2020