Jianwei Li

Hi! I am currently a PhD student at North Carolina State University, advised by Prof. Jung-Eun Kim. Previously, I worked as a machine learning scientist at Moffett AI under the guidance of its chief scientist and co-founder, Ian En-Hsu Yen. Before that, I received my master's degree from the CS department of San Jose State University, where I was advised by Prof. Mark Stamp. I earned my bachelor's degree from Shandong University.

Currently, I am a Research Scientist Intern on the TikTok NLP Trust & Safety (TNS) team for Summer 2026. Previously, in Summer 2025, I interned with the TikTok Responsible Recommendation Systems (RRS) team, where I worked on safety-aware recommendation for multimodal LLMs.

Google Scholar / GitHub / LinkedIn / Twitter

Shadow LLM Guardians

Address: Raleigh, North Carolina
Email: ljw040426 AT gmail DOT com | jli265 AT ncsu DOT edu
Research Interest

My research focuses primarily on AI Safety & Security and AI Efficiency, especially alignment and attacks/defenses.

News
  • [May, 2026] 🥇 Selected as Gold Reviewer of ICML 2026
  • [May, 2026] 💼 Started a research internship at TikTok
  • [April, 2026] 📝 Invited to serve as Reviewer for NeurIPS 2026
  • [April, 2026] 🎉 One paper is accepted by ICML 2026
  • [March, 2026] 📝 Served as Reviewer for ICML 2026
  • [Jan, 2026] 🎉 Two papers are accepted by ICLR 2026
  • [Oct, 2025] 🎓 Passed the PhD written preliminary exam at NCSU
  • [May–Aug, 2025] 💼 Started a research internship at TikTok
  • [May, 2025] 🎉 One paper is accepted by ICML 2025
  • [Feb, 2025] 🎉 One paper is accepted by CPAL 2025
  • [Jul, 2024] 🤝 Started research under the guidance of Prof. Jung-Eun Kim
  • ----------------------------------------
  • [Dec, 2023] 🎤 One paper selected as an Oral at FL@FM-NeurIPS 2023
  • [Nov, 2023] 🛡️ Initiated the Shadow-LLM-Guardians Group: https://github.com/Shadow-LLM
  • [Oct, 2023] 🎉 Two papers are accepted by EMNLP 2023
  • [Aug, 2023] 🎓 Started my PhD at NC State University; officially affiliated with New York University
  • ----------------------------------------
  • [Apr, 2023] 📢 Publicity Chair of the International Workshop on Resource-Efficient Learning for Knowledge Discovery @KDD 2023.
  • [Oct, 2022] 📢 Publicity Chair of the first workshop on DL-Hardware Co-Design for AI Acceleration @AAAI 2023.
  • [Sep, 2022] Participated in AI Hardware Summit 2022.
  • [Sep, 2022] 🏆 Moffett S30 Accelerator wins MLPerf V2.1
Preprints
Jianwei Li, Jung-Eun Kim
Security Before Safety: A Backdoor-Centric View of LLM Output Risks in the Private AI Era
PDF
Xingli Fang, Jianwei Li, Varun Mulchandani, Jung-Eun Kim
Trustworthy AI: Safety, Bias, and Privacy — A Survey
PDF
Selected Publications
Jianwei Li, Jung-Eun Kim
Position: Retire the “Positive Backdoor” Label—Secret Alignment Requires Strict and Systematic Evaluation
Position Paper | Secret Alignment Evaluation
ICML 2026
Jianwei Li, Jung-Eun Kim
Purifying Generative LLMs from Backdoors without Prior Knowledge or Clean Reference
Main Paper | Defense Backdoor Attack | PDF | Code | Project page
ICLR 2026
Jianwei Li, Jung-Eun Kim
Safety Alignment Can Be Not Superficial With Explicit Safety Signals
Main Paper | Defense Jailbreak Attack | PDF | Code | Project page
ICML 2025
Jianwei Li, Jung-Eun Kim
Superficial Safety Alignment Hypothesis
Main Paper | Defense Fine-tuning Attack | PDF | Code | Project page
ICLR 2026 (arXiv 2024)
Jianwei Li, Sheng Liu, Qi Lei
Beyond Gradient and Priors in Privacy Attacks: Leveraging Pooler Layer Inputs of Language Models in Federated Learning
Workshop Oral | Defense Privacy Leakage
FL@FM NeurIPS 2023
Teaching & Research Assistant
  • 2023.08-2024.05, NCSU, Teaching Assistant
  • 2024.06-present, NCSU, Research Assistant
© 2022 jianwei.li. All rights reserved.
(Last update: May 14, 2026.)