Navigating AI Risks
Subscribe
Sign in
Home
AI Policy Proposals
Governance Matters
SaferAI Roundup
Archive
About
Latest
Top
Discussions
The SaferAI Roundup #4: Capabilities Improvement and Safety Testing of GPT-4o and Claude 3.5
GPT-4o System Card & Claude 3.5 Sonnet Model Card Addendum
Aug 28
2
Share this post
The SaferAI Roundup #4: Capabilities Improvement and Safety Testing of GPT-4o and Claude 3.5
www.navigatingrisks.ai
Copy link
Facebook
Email
Note
Other
The SaferAI Roundup #3: Technical Efforts to Make Safe Open Model Weights Possible
Self-Destructing Models: Increasing the Costs of Harmful Dual Uses of Foundation Models & Tamper-Resistant Safeguards for Open-Weight LLMs
Aug 13
2
Share this post
The SaferAI Roundup #3: Technical Efforts to Make Safe Open Model Weights Possible
www.navigatingrisks.ai
Copy link
Facebook
Email
Note
Other
July 2024
The SaferAI Roundup #2
Observational Scaling Laws and the Predictability of Language Model Performance & Lessons from the Trenches on Reproducible Evaluation of Languageā¦
Jul 30
Share this post
The SaferAI Roundup #2
www.navigatingrisks.ai
Copy link
Facebook
Email
Note
Other
The SaferAI Roundup #1
Welcome to "The SaferAI Roundup", our new format.
Jul 16
4
Share this post
The SaferAI Roundup #1
www.navigatingrisks.ai
Copy link
Facebook
Email
Note
Other
December 2023
#16 - A Democratic "Cautious Coalition": What Grand Strategy for AI Safety? + Sycophancy
āExpected 1 unit of progress, got 2, remaining 998.ā Eliezer Yudkowsky, writer and researcher, reacting to a positive discovery in AI interpretabilityā¦
Dec 7, 2023
2
Share this post
#16 - A Democratic "Cautious Coalition": What Grand Strategy for AI Safety? + Sycophancy
www.navigatingrisks.ai
Copy link
Facebook
Email
Note
Other
November 2023
#15 - Altman, Jinping, Biden, and O
In this weekās newsletter: The OpenAI Debacle, Governance of AI with Chinese Characteristics, The White House Tightens AI Oversight, and the EU's AI Act
Nov 22, 2023
Share this post
#15 - Altman, Jinping, Biden, and O
www.navigatingrisks.ai
Copy link
Facebook
Email
Note
Other
#14 - Day 1 of Global AI Safety Governance
Last week, 28 countries met in Bletchley Park for the worldās first AI Safety Summit.
Nov 8, 2023
2
Share this post
#14 - Day 1 of Global AI Safety Governance
www.navigatingrisks.ai
Copy link
Facebook
Email
Note
Other
October 2023
#13: The UK's Multilateral AI Safety Institute + Open-sourcing Advanced AI + Biden's Executive Order
Welcome to Navigating AI Risks, where we explore how to govern the risks posed by transformative artificial intelligence.
Oct 18, 2023
4
Share this post
#13: The UK's Multilateral AI Safety Institute + Open-sourcing Advanced AI + Biden's Executive Order
www.navigatingrisks.ai
Copy link
Facebook
Email
Note
Other
#12: Global Summitry for AI Safety + Will Responsible Scaling Policies Secure the Future?
Welcome to Navigating AI Risks, where we explore how to govern the risks posed by transformative artificial intelligence. āWe cannot keep the U.Kā¦
Oct 3, 2023
3
Share this post
#12: Global Summitry for AI Safety + Will Responsible Scaling Policies Secure the Future?
www.navigatingrisks.ai
Copy link
Facebook
Email
Note
Other
September 2023
#11 - AI in the Public Eye + AI Governance is (mostly) Compute Governance
Welcome back to NAIR after the summer break. This will be a pivotal year for AI governance, and weāre all here for it. Donāt hesitate to send us tipsā¦
Sep 6, 2023
3
Share this post
#11 - AI in the Public Eye + AI Governance is (mostly) Compute Governance
www.navigatingrisks.ai
Copy link
Facebook
Email
Note
Other
August 2023
#10 - Voluntary Commitments for Safe AI Development + Inter-Lab Cooperation
On July 21, the White House announced a set of voluntary commitments made by 7 leading AI labs, as part of the Biden administrationās ongoing efforts toā¦
Aug 2, 2023
4
Share this post
#10 - Voluntary Commitments for Safe AI Development + Inter-Lab Cooperation
www.navigatingrisks.ai
Copy link
Facebook
Email
Note
Other
July 2023
#9: The World is Getting Worried + Corporate Structures + Regulatory Challenges in the US and China
Welcome to Navigating AI Risks, where we explore how to govern the risks posed by transformative artificial intelligence. In this 9th edition, youāllā¦
Jul 19, 2023
3
Share this post
#9: The World is Getting Worried + Corporate Structures + Regulatory Challenges in the US and China
www.navigatingrisks.ai
Copy link
Facebook
Email
Note
Other
Share
Copy link
Facebook
Email
Note
Other
This site requires JavaScript to run correctly. Please
turn on JavaScript
or unblock scripts