What? Papers and resources related to the security and privacy of LLMs.

Why? I am reading, skimming, and organizing these papers for my research in this nascent field anyway. So why not share it? I hope it helps anyone trying to look for quick references or getting into the game.

When? Updated whenever my willpower reaches a certain threshold (aka pretty frequent).

Where? GitHub and Notion. Notion is more up-to-date; I periodically transfer the updates to GitHub.

Who? Me and you (see Contribution below).


Overall Legend

Symbol Description
I personally like this paper! (not a measure of any paper’s quality; see interpretation at the end)
💽 Dataset, benchmark, or framework
📍 Position paper
🔭 Survey paper
👁️ Vision-language models
💸 Experiment with closed-source models

Vulnerabilities

Prompt Injection

Ignore the previous instructions…