Self-Destructing Models: Increasing the Costs of Harmful Dual Uses of Foundation Models & Tamper-Resistant Safeguards for Open-Weight LLMs
The SaferAI Roundup #3: Technical Efforts to Make Safe Open Model Weights Possible
The SaferAI Roundup #3: Technical Efforts to…
The SaferAI Roundup #3: Technical Efforts to Make Safe Open Model Weights Possible
Self-Destructing Models: Increasing the Costs of Harmful Dual Uses of Foundation Models & Tamper-Resistant Safeguards for Open-Weight LLMs