
A new research paper introduces Annotator Policy Models (APMs), interpretable machine learning models that infer annotators' internal safety policies directly from their labeling behavior without requiring additional explanation. The approach achieves over 80% accuracy in modeling annotator policies and can distinguish between three sources of disagreement: operational failures, policy ambiguity, and value pluralism across demographic groups.
Scientists have developed BALAR (Bayesian Agentic Loop for Active Reasoning), an algorithm that enables large language models to engage in structured, multi-turn conversations by strategically asking clarifying questions rather than responding reactively. The system uses Bayesian reasoning to maintain beliefs about what information is missing and selects questions that maximize expected information gain, achieving 14–38% accuracy improvements over baselines across detective cases, logic puzzles, and clinical diagnosis tasks.
Ten major Asian economies have now enacted dedicated AI legislation or national strategies that emphasize government incentives and sovereign capability-building over punitive compliance, with Vietnam's new standalone AI law and South Korea's Framework Act standing as exceptions with enforcement teeth. China is pursuing open-source model release as industrial policy—committing $98B to AI development with over 100,000 Qwen derivatives now dominating open-weight ecosystems—while Japan's penalty-free AI Promotion Act focuses on closing a 9% individual adoption gap through ¥1 trillion in semiconductor and AI funding.
Intel, AMD, and Micron shares jumped double digits this week as investors shift focus from GPU leaders to processors and memory chips powering the next generation of AI infrastructure. The rally reflects growing conviction that CPU and memory manufacturers will capture significant value as AI workloads evolve beyond training toward inference and specialized applications.
Zyphra has introduced ZAYA1-8B, a mixture-of-experts reasoning model with only 700M active parameters that matches or exceeds DeepSeek-R1-0528 on mathematics and coding benchmarks despite being substantially smaller. The model was trained entirely on AMD infrastructure and incorporates a novel test-time compute method called Markovian RSA, which achieves 91.9% accuracy on AIME'25 and 89.6% on HMMT'25 while maintaining a compressed reasoning context of just 4K tokens.
A new benchmark called Partial Evidence Bench measures a critical failure mode in enterprise AI agents: producing answers that appear complete while withholding material evidence due to access controls. The benchmark includes 72 tasks across due diligence, compliance, and security scenarios, and reveals that silent filtering of restricted information is catastrophically unsafe, while explicit fail-and-report behaviors prevent the problem without making systems useless.
A cyberattack has disrupted Canvas, a widely-used learning management platform, causing schools and colleges across the country to postpone year-end exams and tests. The incident highlights vulnerabilities in critical education infrastructure that millions of students rely on during peak academic periods.
In a lecture on large language models, AI researcher Andrej Karpathy highlights a significant gap in AI understanding: while engineers know how LLMs are trained through iterative parameter updates, they cannot explain why specific neural circuits emerge or why parameters organize as they do. This interpretability challenge—how complex learning behaviors arise from optimization but remain unexplained—is a well-documented problem in the AI research community that raises concerns about deploying systems we don't fully understand.
Intel's stock has climbed 490% over the past year, driven by investor optimism about the chipmaker's revival strategy. However, analysts warn that Wall Street's bullish outlook may be getting ahead of the company's actual business progress and execution.
Cloudflare announced its first major layoff, eliminating approximately 1,100 positions primarily in support roles, citing AI-driven efficiency improvements. The move comes even as the company achieved record revenue, with CEO Matthew Prince attributing the reduction to automation capabilities that diminish the need for certain job functions.
Tech companies' massive expansion of AI data centers has sparked widespread conflict over their strain on power grids, rising electricity costs, and environmental impact, with 43% of Americans blaming data centers for higher utility bills. From community resistance to proposed construction moratoriums and congressional investigations, regulators and residents are increasingly pushing back against the energy-hungry infrastructure that underpins the AI revolution.