2025-06-09 HSF: Defending against Jailbreak Attacks with Hidden State Filtering Cheng Qian et.al. 2409.03788 null 2024-11-29 Conversational Complexity for Assessing Risk in Large Language Models John ...
+ +## 动态 +- 2023/05/27 [CPM-Bee](https://github.com/OpenBMB/CPM-Bee) 发布了! +- 2023/04/12 CPM-Ant 可以在[HuggingFace Transformers](https://huggingface.co ...