Jianwei Li's Personal Website

Hi! I am currently a PhD student at North Carolina State University, under the guidance of Prof. JUNG-EUN KIM. Previously, I worked as a machine learning scientist in Moffett AI, and was guided by the chief scientist and co-founder of Moffett.AI: Ian En-Hsu Yen. Before that, I got my master's degree from the CS department of San Jose State University and was advised by Prof. Mark Stamp. I earned my bachelor's degree from Shandong University.

Currently, I am a Research Scientist Intern at the TikTok NLP Trust & Safety (TNS) team for Summer 2026. Previously, in Summer 2025, I interned with TikTok Responsible Recommendation Systems (RRS) team, where I worked on safety-aware recommendation for multimodal LLM.

Address: Raleigh, North Carolina

Email: ljw040426 AT gmail DOT com | jli265 AT ncsu DOT edu

Research Interest

My researches primarily focus on the field of AI Safety & Security and AI Efficiency, especially in the topic of Alignment and Attacks/Defenses.

News

Scroll down for more news ↓

[May, 2026] 🥇 Selected as Gold Reviewer of ICML 2026
[May, 2026] 💼 Start a research internship program at TikTok
[April, 2026] 📝 Invited to serve as Reviewer for NeurIPS 2026
[April, 2026] 🎉 One paper is accepted by ICML 2026
[March, 2026] 📝 serve as Reviewer for ICML 2026
[Jan, 2026] 🎉 Two papers are accepted by ICLR 2026
[Oct, 2025] 🎓 Pass PhD written preliminary exam at NCSU
[May–Aug, 2025] 💼 Start a research internship program at TikTok
[May, 2025] 🎉 One paper is accepted by ICML 2025
[Feb, 2025] 🎉 One paper is accepted by CPAL 2025
[Jul, 2024] 🤝 Start research under the guidance of Prof. Jung-Eun Kim
[Dec, 2023] 🎤 One paper selected as Oral of FL@FM-NeurIPS 2023
[Nov, 2023] 🛡️ Initiated the Shadow-LLM-Guardians Group: https://github.com/Shadow-LLM
[Oct, 2023] 🎉 Two papers are accepted by EMNLP 2023
[Aug, 2023] 🎓 Start PhD student life at NC State University; Serve as official affiliation of New York University
[Apr, 2023] 📢 Publicity Chair of the International Workshop onResource-Efficient Learning for Knowledge Discovery @KDD 2023.
[Oct, 2022] 📢 Publicity Chair of the first workshop on DL-Hardware Co-Design for AI Acceleration @AAAI 2023.
[Sep, 2022] Participated in AI Hardware Summit 2022.
[Sep, 2022] 🏆 Moffett S30 Accelerator wins MLPerf V2.1

Preprints

Jianwei Li, Jung-Eun KimSecurity Before Safety: A Backdoor-Centric View of LLM Output Risks in the Private AI Era PDF

Xingli Fang, Jianwei Li, Varun Mulchandani, Jung-Eun KimTrustworthy AI: Safety, Bias, and Privacy — A Survey PDF

Selected Publications

Jianwei Li, Jung-Eun Kim

Position: Retire the “Positive Backdoor” Label—Secret Alignment Requires Strict and Systematic Evaluation

Position PaperSecret Alignment Evaluation

ICML 2026

Jianwei Li, Jung-Eun Kim

Purifying Generative LLMs from Backdoors without Prior Knowledge or Clean Reference

Main PaperDefense Backdoor Attack PDF Code Project page

ICLR 2026

Jianwei Li, Jung-Eun Kim

Safety Alignment Can Be Not Superficial With Explicit Safety Signals

Main PaperDefense Jailbreak Attack PDF Code Project page

ICML 2025

Jianwei Li, Jung-Eun Kim

Superficial Safety Alignment Hypothesis

Main PaperDefense Fine-tuning Attack PDF Code Project page

ICLR 2026 (arXiv 2024)

Jianwei Li, Sheng Liu, Qi Lei

Beyond Gradient and Priors in Privacy Attacks: Leveraging Pooler Layer Inputs of Language Models in Federated Learning

Workshop OralDefense Privacy Leakage

FL@FM NeurIPS 2023

Teaching & Research Assistant

(Last update: May 14, 2026.)