AI Safety

16 Posts

Safety, Evaluations and Alignment Lab (SEAL) Leaderboards.

AI Safety

Private Benchmarks for Fairer Tests: Scale AI launches SEAL leaderboards to benchmark model performance

Scale AI offers new leaderboards based on its own benchmarks.

AI Safety

U.S. Restricts AI Robocalls: U.S. cracks down on AI-generated voice robocalls to combat election interference.

The United States outlawed unsolicited phone calls that use AI-generated voices.

AI Safety

New Leaderboards Rank Safety, More: Hugging Face introduces leaderboards to evaluate model performance and trustworthiness.

Hugging Face introduced four leaderboards to rank the performance and trustworthiness of large language models (LLMs). The open source AI repository now ranks performance on tests of workplace utility, trust and safety, tendency to generate falsehoods, and reasoning.

AI Safety

Standard for Media Watermarks: C2PA introduces watermark tech to combat media misinformation.

An alliance of major tech and media companies introduced a watermark designed to distinguish real from fake media starting with images. The Coalition for Content Provenance and Authenticity (C2PA) offers an open standard that marks media files with information about their creation and editing.

AI Safety

OpenAI Revamps Safety Protocol: Inside OpenAI's framework to evaluate and mitigate model risks

Retrenching after its November leadership shakeup, OpenAI unveiled a new framework for evaluating risks posed by its models and deciding whether to limit their use.

AI Safety

High Anx-AI-ety: A recap of 2023's battle between AI doomsday warnings and regulatory measures

Angst at the prospect of intelligent machines boiled over in moves to block or limit the technology. Fear of AI-related doomsday scenarios prompted proposals to delay research and soul searching by prominent researchers. Amid the doomsaying, lawmakers took dramatic regulatory steps.

AI Safety

Champion for Openness: Top companies launch the AI Alliance to ensure safe and open source AI.

A new consortium aims to support open source AI. Led by Meta and IBM, dozens of organizations from the software, hardware, nonprofit, public, and academic sectors formed the AI Alliance, which plans to develop tools and programs that aid open development.

AI Safety

Europe Clamps Down: The AI Act, Europe's biggest AI law, moves closer to approval.

Europe’s sweeping AI law moved decisively toward approval. After years of debate, representatives of the European Union’s legislative and executive branches agreed on a draft of the AI Act, a comprehensive approach to regulating AI.

Colorado flag with a neural network over it

AI Safety

Limits on AI in Life Insurance: All about the first law that regulates use of AI in life insurance in the U.S.

The U.S. state of Colorado started regulating the insurance industry’s use of AI. Colorado implemented the first law that regulates the use of AI in life insurance and proposed extending the limits to auto insurers.

Diagram showing how open source tool Giskard works

AI Safety

Testing for Large Language Models: Meet Giskard, an automated quality manager for LLMs.

An open source tool automatically tests language and tabular-data models for social biases and other common issues. Giskard is a software framework that evaluates models using a suite of heuristics and tests based on GPT-4.

AI Safety

The Politics of Generative AI: AI-generated imagery flooded Argentina's presidential race.

Argentina’s recent presidential race was a battleground of AI-generated imagery. Candidates Javier Milei and Sergio Massa flooded social media with generated images of themselves and each other, The New York Times reported. On Sunday, Milei won the election’s final round.

AI Safety

The CEO Is O̶u̶t̶ In: All about the leadership shakeup at OpenAI

OpenAI abruptly fired and rehired its CEO Sam Altman, capping five days of chaos within the company. On Friday, the OpenAI board of directors — whose membership since has changed — ousted CEO and co-founder Sam Altman from his leadership position and his seat on the board.

AI Safety

Cyberattack Strikes OpenAI: ChatGPT and API outages linked to DDoS attack by Anonymous Sudan

ChatGPT suffered a cyberattack apparently tied to the Kremlin. A ChatGPT outage on November 8 most likely was caused by a distributed denial of service (DDoS) attack, OpenAI revealed.

AI Safety

AI Safety Summit Mulls Risks: Countries and tech giants collaborate on global AI safety regulation.

An international conference of political leaders and tech executives agreed to regulate AI. 28 countries including China and the United States as well as the European Union signed a declaration aimed at mitigating AI risks.

AI Safety

White House Moves to Regulate AI: All about the U.S. executive order on AI use and development

U.S. President Biden announced directives that control AI based on his legal power to promote national defense and respond to national emergencies. The White House issued an executive order that requires AI companies and institutions...

AI Safety

Private Benchmarks for Fairer Tests: Scale AI launches SEAL leaderboards to benchmark model performance

U.S. Restricts AI Robocalls: U.S. cracks down on AI-generated voice robocalls to combat election interference.

New Leaderboards Rank Safety, More: Hugging Face introduces leaderboards to evaluate model performance and trustworthiness.

Standard for Media Watermarks: C2PA introduces watermark tech to combat media misinformation.

OpenAI Revamps Safety Protocol: Inside OpenAI's framework to evaluate and mitigate model risks

High Anx-AI-ety: A recap of 2023's battle between AI doomsday warnings and regulatory measures

Champion for Openness: Top companies launch the AI Alliance to ensure safe and open source AI.

Europe Clamps Down: The AI Act, Europe's biggest AI law, moves closer to approval.

Limits on AI in Life Insurance: All about the first law that regulates use of AI in life insurance in the U.S.

Testing for Large Language Models: Meet Giskard, an automated quality manager for LLMs.

The Politics of Generative AI: AI-generated imagery flooded Argentina's presidential race.

The CEO Is O̶u̶t̶ In: All about the leadership shakeup at OpenAI

Cyberattack Strikes OpenAI: ChatGPT and API outages linked to DDoS attack by Anonymous Sudan

AI Safety Summit Mulls Risks: Countries and tech giants collaborate on global AI safety regulation.

White House Moves to Regulate AI: All about the U.S. executive order on AI use and development

Subscribe to The Batch