This page contains press release content distributed by XPR Media. Members of the editorial and news staff of the USA TODAY Network were not involved in the creation of this content.

AIM Intelligence and BMW Group Examine Gaps in Evaluating Enterprise AI Policy Compliance

Research reveals LLMs follow allowlist policies but systematically fail to enforce organizational prohibitions, exposing a critical gap in enterprise AI safety

SF, CA, UNITED STATES, February 12, 2026 /EINPresswire.com/ — Seoul, South Korea / Munich, Germany – January 2026 – BMW Group and AIM Intelligence, a leading AI safety startup, today announced the publication of COMPASS (Company/Organization Policy Alignment Assessment), the first systematic framework for evaluating whether large language models (LLMs) comply with organization-specific policies. The research, now available on arXiv, reveals a critical gap that remains under-measured in current evaluation practices: models that pass standard safety benchmarks often fail dramatically when enforcing the nuanced, context-dependent rules that govern real-world business operations.

Why Enterprise AI Policies Break Down in Practice

As organizations across healthcare, finance, automotive, and government sectors rapidly adopt LLMs for customer-facing applications, the research team discovered a fundamental asymmetry that poses significant risks for policy-critical deployments.
Key Findings:
Strong Allowlist Compliance: Models reliably handle legitimate requests with over 95% accuracy
Critical Denylist Failures: Models fail to correctly refuse prohibited requests in up to 97% of cases
Catastrophic Adversarial Vulnerability: Under adversarial conditions, some models refuse fewer than 5% of policy-violating requests
“Most AI safety tests focus on whether a model behaves safely in general,” said Dasol Choi, AI Safety Researcher at AIM Intelligence. “COMPASS looks at a more practical question: can an AI system reliably follow the specific rules of an organization? Our findings show that, in many real-world deployments today, the answer is often no.”

Why Generic AI Safety Isn’t Enough

The research addresses a critical disconnect between how AI systems are evaluated and how they are deployed. While existing safety benchmarks focus on universal harms such as toxicity and violence, real enterprises operate under complex internal policies—compliance manuals, operational playbooks, legal edge cases, and brand-specific constraints.
COMPASS evaluates models across four dimensions that typical benchmarks ignore:
1. Policy Selection: Can the model identify which policy applies to a given situation?
2. Policy Interpretation: Can it reason through conditionals, exceptions, and vague clauses?
3. Conflict Resolution: When rules collide, does the model resolve conflicts as the organization intends?
4. Justification: Can the model ground its decisions in actual policy text?

“Our evaluation revealed a striking asymmetry,” noted DongGeon Lee, AI Safety Researcher at AIM Intelligence. “While models achieve near-perfect accuracy on what they can do, they remain structurally vulnerable in enforcing what they must not do. This gap persists across model scales and architectures, indicating that scaling alone cannot solve the problem.”

Industry-Scale Validation

The research team applied COMPASS across eight diverse industry scenarios—Automotive, Government, Financial, Healthcare, Travel, Telecom, Education, and Recruiting—generating and validating 5,920 queries that test both routine compliance and adversarial robustness. Fifteen state-of-the-art models were evaluated, including leading proprietary and open-source systems.

Making Misalignment Measurable

Perhaps the most significant contribution of COMPASS is transforming alignment from a philosophical concern into an engineering problem. The framework and benchmark datasets are publicly available on GitHub and Hugging Face, enabling organizations to evaluate their AI systems against their own policies.

About the Research Collaboration

This research represents a collaboration between AIM Intelligence, BMW Group, Yonsei University, Pohang University of Science and Technology, and Seoul National University. The full paper, “COMPASS: A Framework for Evaluating Organization-Specific Policy Alignment in LLMs,” is available at https://arxiv.org/abs/2601.01836.

About AIM Intelligence

AIM Intelligence is a Seoul-based AI safety company specializing in automated red-teaming, real-time guardrails, and AI monitoring solutions. Founded in 2024, AIM Intelligence serves major enterprises and conducts research across large language models, multimodal systems, autonomous agents, and emerging physical AI. The company has published over 15 research papers at top-tier conferences including ICML, ACL, NeurIPS, and IEEE.

Team Cookie Official
Team Cookie
email us here
Visit us on social media:
LinkedIn
Facebook

Legal Disclaimer:

EIN Presswire provides this news content “as is” without warranty of any kind. We do not accept any responsibility or liability
for the accuracy, content, images, videos, licenses, completeness, legality, or reliability of the information contained in this
article. If you have any complaints or copyright issues related to this article, kindly contact the author above.

Information contained on this page is provided by an independent third-party content provider. XPRMedia and this Site make no warranties or representations in connection therewith. If you are affiliated with this page and would like it removed please contact pressreleases@xpr.media

RFA Seafarers to Strike in March

RFA Seafarers to Strike in March

LONDON, UNITED KINGDOM, February 21, 2026 /EINPresswire.com/ — Royal Fleet Auxiliary (RFA) seafarers will take strike

February 21, 2026

United Better Homes Saves Massachusetts Homeowners on Winter Heating Costs with Energy-Efficient Window Installation

United Better Homes Saves Massachusetts Homeowners on Winter Heating Costs with Energy-Efficient Window Installation

Seal out drafts and lower heating bills with energy-efficient window upgrades this winter. ATTLEBORO, MA, UNITED STATES, February 10, 2026 /EINPresswire.com/ — With temperatures dropping…

February 21, 2026

New Book Come Back to Love by Robyn Vogel Reveals a Science-Backed Path to Emotional Healing and Self-Love

New Book Come Back to Love by Robyn Vogel Reveals a Science-Backed Path to Emotional Healing and Self-Love

Integrating Trauma Therapy, Somatic Healing, and Relationship Science to Help Rebuild Trust and Connection The path back to love is not about becoming someone new….

February 21, 2026

MarieBelle New York Unveils Valentine’s Day 2026 Chocolate Collection With Limited Edition Packaging and Assortments

MarieBelle New York Unveils Valentine’s Day 2026 Chocolate Collection With Limited Edition Packaging and Assortments

NEW YORK, NY, UNITED STATES, February 10, 2026 /EINPresswire.com/ — Just in time for Valentine’s Day, MarieBelle New York, the chocolatier known for artisanal craftsmanship…

February 21, 2026

Crime Scene Cleaners Launches New Website Focused on Trust, Accessibility, and Compassionate Service

Crime Scene Cleaners Launches New Website Focused on Trust, Accessibility, and Compassionate Service

Crime Scene Cleaners Unveils New Website Designed to Better Serve Families, First Responders, and Property Professionals Across Missouri and Kansas For more than 25 years,…

February 21, 2026

Euphoria Institute Hosts Grand Reopening Celebration in Las Vegas on Feb. 12, 2026

Euphoria Institute Hosts Grand Reopening Celebration in Las Vegas on Feb. 12, 2026

The Las Vegas community is invited to attend Euphoria Institute’s reopening celebration and learn more about the school’s beauty career training programs. LAS VEGAS, NV,…

February 21, 2026

Origin Detector (OD) Surpasses 457,000 Viewers in Record-Breaking Product Awareness Campaign

Origin Detector (OD) Surpasses 457,000 Viewers in Record-Breaking Product Awareness Campaign

The OD, innovative consumer awareness platform powered by QR codes, today announced that its latest product awareness campaign has reached 457,000 viewers This level of…

February 21, 2026

Goin’ Postal Launches Weekly ‘Teacher Tuesday’ Initiative to Support Educators

Goin’ Postal Launches Weekly ‘Teacher Tuesday’ Initiative to Support Educators

Goin’ Postal of Jacksonville, NC has launched a weekly initiative to support local teachers and is also available at Sneads Ferry and Camp Lejeune locations….

February 21, 2026

Art Melanated Presents SAVAGE – Opening Feb 14 in Los Angeles

Art Melanated Presents SAVAGE – Opening Feb 14 in Los Angeles

A Global Emerging Artist Exhibition Spotlighting the Future of Contemporary Art. Opening Feb 14 in Los Angeles LOS ANGELES, CA, UNITED STATES, February 9, 2026…

February 21, 2026

SideHustlr.ai Reports Early Growth as Users Prioritize Modest Income Goals Over High-Risk Ambition

SideHustlr.ai Reports Early Growth as Users Prioritize Modest Income Goals Over High-Risk Ambition

New platform data shows most users seek financial breathing room rather than rapid wealth LOS ANGELES, CA, UNITED STATES, February 11, 2026 /EINPresswire.com/ — SideHustlr.ai,…

February 21, 2026

Municipal Waste Systems Leave a Gap in Residential Sanitation, Local Companies Are Stepping In

Municipal Waste Systems Leave a Gap in Residential Sanitation, Local Companies Are Stepping In

Communities are rethinking residential sanitation as local companies address gaps left by traditional waste systems. Communities are paying more attention to what happens inside the…

February 21, 2026

Houston Attorney Husein Hadi Reaffirms Trial-First Approach to Personal Injury Representation

Houston Attorney Husein Hadi Reaffirms Trial-First Approach to Personal Injury Representation

HOUSTON, TX, UNITED STATES, February 10, 2026 /EINPresswire.com/ — Houston personal injury attorney Husein Hadi is reaffirming his commitment to a trial-first approach to legal…

February 21, 2026

Senior Tech Executive Unveils ‘Media-SDN’ to Unleash Streaming Possibilities and Eliminate Betting Courtsiding

Senior Tech Executive Unveils ‘Media-SDN’ to Unleash Streaming Possibilities and Eliminate Betting Courtsiding

Media-SDN: A hardware-free protocol that synchronizes devices via audio to solve betting courtsiding and enable spoiler-free streaming globally. SãO PAULO, SãO PAULO, BRAZIL, February 10,…

February 21, 2026

Finland’s Health Authority Launches ‘2-4-2’ Gambling Risk Limits Ahead of Expected Advertising Boom

Finland’s Health Authority Launches ‘2-4-2’ Gambling Risk Limits Ahead of Expected Advertising Boom

THL is cautioning that gambling-related problems are increasing as Finland prepares for its 2027 licensing reform. To promote safer play, the institute has introduced new…

February 21, 2026

Amana Care Clinic Announces Enhanced Urgent Care Capabilities for Muscatine Residents

Amana Care Clinic Announces Enhanced Urgent Care Capabilities for Muscatine Residents

MUSCATINE, Iowa – February 21, 2026 – PRESSADVANTAGE – Amana Care Clinic – Muscatine has announced enhanced diagnostic

February 21, 2026

Families Invited to Make Traditional Bánh Tét Together at San Diego Lunar New Year Festival

Families Invited to Make Traditional Bánh Tét Together at San Diego Lunar New Year Festival

Families are invited to make traditional bánh tét together at the 2026 San Diego Lunar New Year Festival, celebrating culture, memory, and connection. SAN DIEGO,…

February 21, 2026

Reputation Pros Named Among Best Reputation Management Companies in London

Reputation Pros Named Among Best Reputation Management Companies in London

Leading U.S.-Based Firm Recognized by Both Manchester Digital and London Post for Excellence in Online Reputation Management LONDON, LONDON, UNITED KINGDOM, February 10, 2026 /EINPresswire.com/…

February 21, 2026

Beauty Brand Madame Gabriela Launches at Nordstrom.com

Beauty Brand Madame Gabriela Launches at Nordstrom.com

Non-Toxic Lipstick Line Created for Mature Women Now Available Through Premium Retailer I spent two years developing formulas that are for mature lips but don’t…

February 21, 2026

Anne Frank to be Honored Worldwide on SWAN Day 2026

Anne Frank to be Honored Worldwide on SWAN Day 2026

If we’re ever going to have peace in this world, we have to continue to make stories that join cultures.”— Lisa France

February 21, 2026

Human Touch Launches the Vesta ZG Chair for Everyday Relaxation and Comfort

Human Touch Launches the Vesta ZG Chair for Everyday Relaxation and Comfort

Featuring Zero Gravity Positioning, Soothing Heat, and Gentle Vibration, the Vesta ZG Chair Supports Circulation and Pressure Relief in a Refined Design LONG BEACH, CA,…

February 21, 2026

Quechan Casino Resort Announces Exciting Live Entertainment Lineup – Tickets Now On Sale

Quechan Casino Resort Announces Exciting Live Entertainment Lineup – Tickets Now On Sale

WINTERHAVEN, CA, UNITED STATES, February 10, 2026 /EINPresswire.com/ — Quechan Casino Resort is pleased to announce that tickets are now on sale for an exciting…

February 21, 2026

TNS Associates Expands Family Law Services for Divorce and Custody Matters

TNS Associates Expands Family Law Services for Divorce and Custody Matters

DENVER, CO – February 21, 2026 – PRESSADVANTAGE – Thomas N. Scheffel & Associates, P.C. has announced an expansion

February 21, 2026

In Stock Today Cabinets Expands Fairfax Showroom Amid Rising Wholesale Kitchen Cabinet Demand

In Stock Today Cabinets Expands Fairfax Showroom Amid Rising Wholesale Kitchen Cabinet Demand

Fairfax, VA – February 21, 2026 – PRESSADVANTAGE – In Stock Today Cabinets has relocated its Fairfax, Virginia showroom

February 21, 2026

Sasso Guerrero & Henderlite Launches Media Center for Family Law Resources

Sasso Guerrero & Henderlite Launches Media Center for Family Law Resources

JACKSONVILLE, FL – February 21, 2026 – PRESSADVANTAGE – Sasso Guerrero & Henderlite, a Jacksonville-based law firm

February 21, 2026

Zambuki Launches Specialized Lead Generation Service for Concrete Contractors

Zambuki Launches Specialized Lead Generation Service for Concrete Contractors

Saint Petersburg, Florida – February 21, 2026 – PRESSADVANTAGE – Zambuki, a digital marketing technology company based

February 21, 2026

Markhoff & Mittman, P.C. Announces Expansion Of Work Injury Law Firm Services

Markhoff & Mittman, P.C. Announces Expansion Of Work Injury Law Firm Services

City of White Plains, New York – February 21, 2026 – PRESSADVANTAGE – Markhoff & Mittman, P.C. has announced the

February 21, 2026

Kitchen and Bath Masters Design & Remodeling Expands Cabinet and Vanity Offerings for Arlington Area Homeowners

Kitchen and Bath Masters Design & Remodeling Expands Cabinet and Vanity Offerings for Arlington Area Homeowners

February 21, 2026 – PRESSADVANTAGE – Kitchen and Bath Masters Design & Remodeling has expanded its selection of

February 21, 2026

Newport Author Releases 239-Page Guided Grief Journal After 15 Years of Real-World Use

Newport Author Releases 239-Page Guided Grief Journal After 15 Years of Real-World Use

Grief doesn’t follow a timeline — and most people are wildly unprepared for it. Grief doesn’t follow a timeline. GOOD

February 21, 2026

Why Leather Jackets Maintain Strong Presence Across Evolving Fashion Markets

Why Leather Jackets Maintain Strong Presence Across Evolving Fashion Markets

Leather jackets maintain relevance in evolving fashion markets as changing design trends, craftsmanship standards, and consumer behavior shape demand. HOUSTON, TX, UNITED STATES, February 10,…

February 21, 2026

Pipe17 Turns Agentic Commerce Readiness Into a 3PL Differentiator

Pipe17 Turns Agentic Commerce Readiness Into a 3PL Differentiator

Agentic order operations, a live onX standard, and the Powered by Pipe17 program give 3PLs a faster path to AI-ready fulfillment. 3PLs powered by Pipe17…

February 21, 2026

YourMedPlan Introduces Expanded Health Insurance Options for Employers

YourMedPlan Introduces Expanded Health Insurance Options for Employers

Enhanced options include group and individual-based health insurance solutions together under one advisory model. CLEARWATER, FL, UNITED STATES, February 10, 2026 /EINPresswire.com/ — YourMedPlan has…

February 21, 2026

Understanding the Difference Between Basic and Deep Cleaning Services for Residential Homes

Understanding the Difference Between Basic and Deep Cleaning Services for Residential Homes

Routine cleaning maintains a home’s appearance, while deep cleaning addresses buildup that develops over time”—

February 21, 2026

Progress Trust Present ‘Meeting of the Griots’ for the 100th Anniversary of Black History Month

Progress Trust Present ‘Meeting of the Griots’ for the 100th Anniversary of Black History Month

A festival-style Black History Month event honoring legacy, modern-day griots, art, music, and storytelling at historic

February 21, 2026

Eddy Vera Walks Premios Lo Nuestro Magenta Carpet as Special Guest of Emilio Estefan

Eddy Vera Walks Premios Lo Nuestro Magenta Carpet as Special Guest of Emilio Estefan

Best-selling author and global speaker Eddy Vera attends as Estefan’s guest, spotlighting purpose, gratitude, and

February 21, 2026

Christine Hopkins Named a 2026 Enterprising Women of the Year Award Winner

Christine Hopkins Named a 2026 Enterprising Women of the Year Award Winner

Christine Hopkins, President & CEO of the ASCI, has been named a winner of the 2026 Enterprising Women of the Year

February 21, 2026

Investably amplía asesoría para ayudar a inversionistas inmobiliarios a transitar de gestión activa hacia la jubilación

Investably amplía asesoría para ayudar a inversionistas inmobiliarios a transitar de gestión activa hacia la jubilación

Investably amplía asesoría para ayudar a inversionistas inmobiliarios con transiciones fiscalmente eficientes de la gestión activa hacia la jubilación. Se trata de alinearse con tu…

February 21, 2026

Zion Health Launches Deep Cleansing Scalp & Hair Scrub Pheromone Infused Dark Orchid for Healthier, Revitalized Hair

Zion Health Launches Deep Cleansing Scalp & Hair Scrub Pheromone Infused Dark Orchid for Healthier, Revitalized Hair

Zion Health introduces Deep Cleansing Scalp & Hair Scrub in Pheromone-Infused Dark Orchid, a mineral-rich exfoliating treatment for scalp and hair renewal. SAN FRANCISCO, CA,…

February 21, 2026

DUI Law Firm Denver Addresses Unique Legal Challenges Facing Military Personnel Charged with DUI

DUI Law Firm Denver Addresses Unique Legal Challenges Facing Military Personnel Charged with DUI

DENVER, CO – February 20, 2026 – PRESSADVANTAGE – DUI Law Firm Denver has expanded its focus to address the specialized

February 21, 2026

Go Industries Expands Custom OEM Manufacturing Capabilities to Meet Growing Demand Across Multiple Industries

Go Industries Expands Custom OEM Manufacturing Capabilities to Meet Growing Demand Across Multiple Industries

Richardson, TX – February 20, 2026 – PRESSADVANTAGE – Go Industries has announced the expansion of its custom

February 21, 2026

Oekoboiler Swiss AG Advances Sustainable Hybrid Boiler System Technology for Swiss Homes

Oekoboiler Swiss AG Advances Sustainable Hybrid Boiler System Technology for Swiss Homes

Hildisrieden, LU – February 20, 2026 – PRESSADVANTAGE – Oekoboiler Swiss AG continues to expand its presence in the

February 21, 2026