AI Alignment in Mitigating Risk: Frameworks for Benchmarking and Improvement

October 7, 2024

With AI Alignment in Mitigating Risk: Frameworks for Benchmarking and Improvement by Aileen Niu, we continue to share the insightful work of the Center for AI Policy’s summer 2024 policy fellows. The report is linked below.

It was wonderful to have the opportunity to work with Aileen and Vedant Patel this past summer. Each made valuable contributions to the CAIP community, and their work toward AI safety was very much appreciated.

Read the full report here.
