Business Wire

AI Alignment Lab Achieves Major Milestone in Step Towards Agentic AI

Share

Aligned AI, a leader in artificial intelligence (AI) research, has announced a groundbreaking AI advancement in misgeneralization, a critical challenge in the field of AI. It is the first to surpass a key benchmark called CoinRun by teaching an AI to “think” in human-like concepts. The technology underpinning the achievement opens the door to more precise, reliable, and controllable AI for a wide variety of real world applications.

By teaching AI models to generalize in a manner more akin to agentic human cognition, Aligned AI’s innovation enables AI to correctly identify concepts across new situations and environments, reducing the need for prolonged production, testing, and retraining.

Misgeneralization occurs when AI systems learn incorrect patterns and behaviors from their training data, and are not able to correctly adapt when presented with new information. This leads to unexpected, and often harmful, outcomes. Today’s foundation models suffer from varying degrees of misgeneralization, as evidenced by users’ ability to “jailbreak” them, or there is a trade off between functionality and undesired behavior. The challenge of misgeneralization also prevents the industry as a whole from moving forward. For instance, generalization is required for truly autonomous vehicles and applying AI to critical applications. Otherwise, AIs cannot operate well enough in unfamiliar environments or discern the correct goals without human intervention.

To achieve this milestone, Aligned AI used the 2021 CoinRun misgeneralization benchmark, an Atari-style game released by researchers at Google DeepMind, the University of Cambridge, the University of Tubingen, and the University of Edinburgh. The goal of the benchmark is to test whether an AI can deduce a complex goal when that goal is spuriously correlated with a simpler goal in its training environment. The AI is rewarded for getting a coin, which is always placed at the end of the level during the training period, but is placed in a random location during the testing period, without additional reward information being provided.

Prior to Aligned AI’s innovation, AIs trained on CoinRun believed the best way to play the game was to go to the right, while avoiding monsters and holes. Because the coin was always at the end of the level during training, this strategy seemed effective. When the AI encountered a new level where the coin was placed elsewhere in the level but without being given new information, it would ignore the coin and either miss it or get it only by accident. ACE (which stands for “Algorithm for Concept Extrapolation”), the new AI developed by Aligned AI, notices the changes in the test environment and figures out to go for the coin, even without new reward information - just as a human would.

The key benefits of this breakthrough include:

  • Enhanced Safety: By reducing misgeneralization, AI systems become more reliable, ensuring they operate safely in a wide range of scenarios, from autonomous vehicles to robotics.
  • Improved Capabilities: It enables AI to better understand human intentions and make decisions that align with those intentions, significantly boosting its capabilities.
  • Ethical AI: It enhances the ethical aspects of AI by promoting fairness, transparency, and non-discrimination. AI systems that are precise, reliable, and interpretable are more likely to make ethical decisions by avoiding bias and aligning with human values.
  • Industry Impact: It’s poised to transform industries such as robotics, autonomous vehicles, and foundation models, making them more practical and applicable in various real-world settings.

“This isn't just a game-changer for the world of AI, it's a seismic shift for countless industries,” said Rebecca Gorman, Co-Founder and CEO of Aligned AI. “By significantly reducing misgeneralization and enhancing AI's ability to understand and adapt to unforeseen scenarios, we're opening doors to unparalleled opportunities across the board. From autonomous vehicles that can navigate from San Francisco to Phoenix on streets it's never seen before, to robots that can operate effectively in a range of changing and unforeseen environments, this benchmark is the linchpin that will make these futuristic visions a reality. It's not just about improving AI; it's about revolutionizing how industries operate, innovate, and serve humanity.”

Aligned AI’s innovation addresses a critical problem facing all AI systems. When confronted with new environments, current AIs tend to incorrectly extend the training data. This is why 70% of models don’t make it into production or face prolonged production and testing time, hindering scalability and often requiring retraining within the first year of release.

“As AI increases in power and widespread use, generalization remains a challenge,” said John Sviokla, a pioneering researcher in AI and current co-founder of GAI Insights, an advisory firm that helps companies achieve ROI with generative AI. “Aligned AI’s research is a critical step forward in the safe, ethical, and effective use of AI across industries.”

Since it was founded, Aligned AI has been at the forefront of addressing the critical challenges facing AI development and deployment. In 2022, Aligned AI was the leader in ChatGPT-jailbreak prevention, releasing the first prompt-evaluator as an open-source project. In September 2023, Aligned AI was awarded the CogX prize for the “Best Innovation in Mitigating Algorithm Bias” for EquitAI, an algorithm that constrains LLMs to output gender unbiased text, and faAIr, its algorithm for measuring and ranking gender bias in foundation models. Aligned AI’s previous work on concept extrapolation improves the performance of AI on out-of-distribution datasets and helps models behave safely while waiting for human feedback.

To learn more about Aligned AI and its misgeneralization breakthrough, please visit buildaligned.ai.

About Aligned AI:

Founded in Oxford by Rebecca Gorman and Dr. Stuart Armstrong, Aligned AI is a deep-tech startup that is enabling the next step change in AI by teaching AIs to understand and hold human-like concepts. Its core technology of “concept extrapolation” enables AIs to extend its trainers’ intent beyond its training data, meaning it operates as it should even in new scenarios. Aligned AI believes that safety and capability are not trade-offs, but rather an AI that is more precise and controllable is also more powerful.

To view this piece of content from cts.businesswire.com, please give your consent at the top of this page.

Contact information

Media:
Alana Bannan
Matter Communications
360-975-1812
AlignedAI@matternow.com

About Business Wire

For more than 50 years, Business Wire has been the global leader in press release distribution and regulatory disclosure.

Subscribe to releases from Business Wire

Subscribe to all the latest releases from Business Wire by registering your e-mail address below. You can unsubscribe at any time.

Latest releases from Business Wire

Everen Specialty Appoints Carla Greaves Chief Underwriting Officer25.4.2025 20:00:00 EEST | Press release

Everen Specialty, a Bermuda-based (re)insurer for energy markets worldwide, today announced the appointment of Carla Greaves as its new Chief Underwriting Officer (CUO). This press release features multimedia. View the full release here: https://www.businesswire.com/news/home/20250425273777/en/ Carla Greaves Ms. Greaves will join the Executive Leadership Team of the Everen Group, based in the Bermuda office, later this year. She succeeds Jane Peterson, Interim CUO, who will continue in a consultancy capacity to facilitate the transition. With more than 30 years of underwriting and leadership experience in the (re)insurance industry, Ms. Greaves brings a wealth of expertise and a proven track record of success in the Casualty market where she is recognized for building high-performing teams, driving profitable growth, and successfully navigating complex market environments. Prior to joining Everen Specialty, Ms. Greaves held increasingly senior leadership positions, most recently servin

Incyte to Highlight Early-Stage Oncology Data at American Association for Cancer Research Annual Meeting 202525.4.2025 15:00:00 EEST | Press release

Incyte (Nasdaq:INCY) today announced that the Company will present new early-stage data from its oncology portfolio at the American Association of Cancer Research (AACR) Annual Meeting 2025 in Chicago, IL, from April 25–30. “At AACR we will be presenting data from early-stage programs across our oncology portfolio, including for patients with myeloproliferative neoplasms, ovarian cancer and other solid tumors,” said Pablo J. Cagnoni, M.D., President and Head of Research and Development, Incyte. “These data will guide our approach as we advance our pipeline and seek to transform the treatment landscape for patients with cancer and myeloproliferative neoplasms.” Abstracts accepted for presentation at AACR include: Mini Symposium INCB177054 INCB177054: A Novel, Potent, Orally Bioavailable DGKα/ζ Dual Inhibitor Enhances T-Cell Function and Demonstrates Potent Antitumor Activity (Session Title: Novel Antitumor Agents. April 28, 4:50 p.m. – 5:05 p.m. ET (3:50 p.m. – 4:05 p.m. CT). Abstract #

SLB Announces First-Quarter 2025 Results; Remains Committed to Return a Minimum of $4 Billion to Shareholders in 202525.4.2025 13:50:00 EEST | Press release

SLB (NYSE: SLB) today announced results for the first-quarter 2025. This press release features multimedia. View the full release here: https://www.businesswire.com/news/home/20250423635499/en/ The exterior of the SLB headquarters in Houston, Texas. First-Quarter Results (Stated in millions, except per share amounts) Three Months EndedChange Mar. 31, 2025 Dec. 31, 2024 Mar. 31, 2024 Sequential Year-on-year Revenue $8,490 $9,284 $8,707 -9% -3% Income before taxes - GAAP basis $1,063 $1,387 $1,357 -23% -22% Income before taxes margin - GAAP basis 12.5% 14.9% 15.6% -241 bps -306 bps Net income attributable to SLB - GAAP basis $797 $1,095 $1,068 -27% -25% Diluted EPS - GAAP basis $0.58 $0.77 $0.74 -25% -22% Adjusted EBITDA* $2,020 $2,382 $2,057 -15% -2% Adjusted EBITDA margin* 23.8% 25.7% 23.6% -186 bps 18 bps Pretax segment operating income* $1,556 $1,918 $1,649 -19% -6% Pretax segment operating margin* 18.3% 20.7% 18.9% -232 bps -60 bps Net income attributable to SLB, excluding charges &

Corona, The World’s Most Valuable Beer Brand 1 , Announces Its 100-Year Anniversary with Global Celebration25.4.2025 11:00:00 EEST | Press release

Today, Corona proudly celebrates its 100-year anniversary, a remarkable milestone for the iconic brand that has been synonymous with the beach and enjoyed by consumers worldwide for the past century. Since 1925, Corona has cultivated a deep association with the beach; fully embodying a lifestyle connected to nature and relaxation. This press release features multimedia. View the full release here: https://www.businesswire.com/news/home/20250425804516/en/ Corona 100 This Is Living Since 1925 In honor of the occasion, Corona invites everyone to live their “beach side” – a.k.a. their best side – at top-tier beach locations across the globe. The Corona 100 platform includes a film highlighting 100 years of beach culture, a definitive list of the top 100 beaches in the world to visit, and a signed multi-year sponsorship of a renowned concert at Copacabana Beach in Rio de Janeiro — all offering people across the globe opportunities to connect with their beach side. “For 100 years, Corona has

Ant Group Unveils New Recruitment Initiative for Top AI Talents, Ramping Up AI Innovation Efforts25.4.2025 10:28:00 EEST | Press release

Ant Group today unveiled Plan A, a new recruitment initiative to attract top artificial intelligence researchers, reinforcing its commitment to accelerating AI research and development under the “AI First” corporate strategy. Operating within the framework of Ant Star—Ant Group’s year-round campus recruitment program—Plan A specifically targets AI talents who are ambitious, adaptable, altruistic, and analytical. Outstanding graduates from universities worldwide with STEM majors are encouraged to apply for Plan A. Relevant fields include computer science, software engineering, artificial intelligence, cybersecurity, information and telecommunication engineering, mathematics, statistics, and other emerging interdisciplinary areas. To better foster the development of technological innovators in this new AI era, Plan A offers candidates comprehensive support and resources, including unrestricted access to AI hardware and tailored career paths that allow for significant research freedom. Ad

In our pressroom you can read all our latest releases, find our press contacts, images, documents and other relevant information about us.

Visit our pressroom
World GlobeA line styled icon from Orion Icon Library.HiddenA line styled icon from Orion Icon Library.Eye