AI Alignment in Mitigating Risk: Frameworks for Benchmarking and Improvement

October 7, 2024

This post continues our series sharing the work of the Center for AI Policy’s summer 2024 policy fellows. The latest report, "AI Alignment in Mitigating Risk: Frameworks for Benchmarking and Improvement" by Aileen Niu, is linked below.

It was wonderful to have the opportunity to work with Aileen and Vedant Patel this past summer. Each made valuable contributions to the CAIP community, and their work toward AI safety was very much appreciated.

Read the full report here.
