arXiv cs.AI AI News

AINews has published 367 articles sourced from arXiv cs.AI, tracking recurring developments and primary announcements. Key themes include conflict-aware guidance, diffusion models, flow models.

Overview

AINews has published 367 articles sourced from arXiv cs.AI, tracking recurring developments and primary announcements. Key themes include conflict-aware guidance, diffusion models, flow models.

Browse all source hubs Browse topic hubs Browse archive

Published articles

367

Latest update

May 22, 2026

Quality score

Topic diversity

1505

Primary host

arxiv.org

Latest coverage from arXiv cs.AI

Untitled

arXiv cs.AI 05/25, 03:08 AM

For years, inference-time guided sampling has faced a critical bottleneck: when a model must satisfy multiple constraints simultaneously—like a drug molecule needing high target af…

Source page May 2026

Untitled

arXiv cs.AI 05/25, 03:08 AM

The data engineering world has hit a wall. Traditional AI agents tasked with building data infrastructure rely on a brute-force loop: write code, run it, parse error logs, fix bugs…

Source page May 2026

Untitled

arXiv cs.AI 05/25, 03:08 AM

The industrial sector has been quietly suffering from a 'latency disaster' as AI agents, tasked with querying sensor data, work orders, fault libraries, and predictive models, repe…

Source page May 2026

Untitled

arXiv cs.AI 05/25, 03:08 AM

AINews has learned that Mahjax, a novel GPU-accelerated mahjong simulator, has been officially released. Built on Google's JAX framework, it is purpose-designed for reinforcement l…

Source page reinforcement learning May 2026

Untitled

arXiv cs.AI 05/25, 03:08 AM

For decades, negotiation theory has been a discipline built on case studies, intuition, and the hard-won wisdom of practitioners. The fundamental tension—balancing empathy for the …

Source page May 2026

Untitled

arXiv cs.AI 05/25, 03:08 AM

For the past two years, the AI agent ecosystem has been trapped in a single-metric arms race. Benchmarks like GAIA, SWE-bench, and ToolBench each measure a narrow slice of agent pe…

Source page May 2026

Untitled

arXiv cs.AI 05/25, 03:08 AM

A new research breakthrough has demonstrated that neural networks can be trained to produce high-quality numerical embeddings for logical statements using triplet loss, a technique…

Source page May 2026

Untitled

arXiv cs.AI 05/25, 03:08 AM

A groundbreaking research framework, OSCToM (Opponent-Structured Counterfactual Theory of Mind), is redefining how we measure AI's ability to understand others' mental states. Unli…

Source page reinforcement learning May 2026

Untitled

arXiv cs.AI 05/25, 03:08 AM

The industrial design world has long suffered from a 'semantic gap': the stress distributions, thermal fields, and flow streamlines output by CAE simulations must be manually trans…

Source page reinforcement learning May 2026

Untitled

arXiv cs.AI 05/25, 03:08 AM

The AI community is witnessing a fundamental shift in how intelligent agents can operate in dynamic, real-world environments. SOLAR, a novel autonomous agent architecture, directly…

Source page May 2026

Untitled

arXiv cs.AI 05/25, 03:08 AM

Natural language to SQL (NL2SQL) has long faced an awkward reality: while large language models can grasp human intent, their error rates on multi-table joins, aggregate functions,…

Source page May 2026

Untitled

arXiv cs.AI 05/25, 03:08 AM

Large language model training has become a high-stakes gamble. Aggressive learning rates, parameter stress at scale, and runtime anomalies routinely cause training runs to diverge …

Source page May 2026

Untitled

arXiv cs.AI 05/25, 03:08 AM

For years, the promise of Personal Health Records (PHRs) has been hollow: patients own their data but cannot understand it. A landmark study, analyzing 2,257 authentic user queries…

Source page large language models May 2026

Untitled

arXiv cs.AI 05/25, 03:08 AM

For years, the document intelligence field has suffered a glaring disconnect: academia releases ever-more-powerful understanding models, while production teams struggle to maintain…

Source page AI engineering May 2026

Untitled

arXiv cs.AI 05/25, 03:08 AM

The current state of large language model (LLM) development is plagued by a fundamental irony: we feed models terabytes of data but understand almost nothing about how individual d…

Source page May 2026

Untitled

arXiv cs.AI 05/25, 03:08 AM

PopuLoRA represents a paradigm shift in how large language models (LLMs) can autonomously improve their reasoning capabilities. Traditional self-play methods, where a single model …

Source page May 2026

Untitled

arXiv cs.AI 05/25, 03:08 AM

The fundamental limitation of current AI world models is their tendency to learn superficial semantic correlations—mapping inputs to outputs—rather than the underlying causal laws …

Source page May 2026

Untitled

arXiv cs.AI 05/25, 03:08 AM

GRID represents a paradigm shift in how security knowledge graphs are built. For years, the cybersecurity industry has struggled to transform the vast, unstructured flow of threat …

Source page May 2026

Untitled

arXiv cs.AI 05/25, 03:08 AM

The AI industry has been locked in a race to expand context windows, with models like GPT-4 Turbo boasting 128K tokens and Gemini 1.5 Pro reaching 1 million. Yet a deeper, more ins…

Source page AI memory May 2026

Untitled

arXiv cs.AI 05/25, 03:08 AM

MetaKGEnrich represents a fundamental shift in how AI systems handle their own limitations. Instead of relying on human-curated datasets or expensive retraining, this pipeline equi…

Source page May 2026

Untitled

arXiv cs.AI 05/25, 03:08 AM

LinAlg-Bench, a rigorous new benchmark for mathematical reasoning, has delivered a sobering verdict on the current generation of large language models. By testing 10 frontier model…

Source page May 2026

Untitled

arXiv cs.AI 05/25, 03:08 AM

Multimodal AI has long faced a fundamental trade-off: chain-of-thought (CoT) reasoning dramatically improves embedding quality, but the computational cost of generating full reason…

Source page May 2026

Untitled

arXiv cs.AI 05/25, 03:08 AM

The discovery of new materials has long been a bottleneck in industries from energy storage to microelectronics. Traditional approaches—high-throughput screening of known databases…

Source page May 2026

Untitled

arXiv cs.AI 05/25, 03:08 AM

A landmark study on LLM-based negotiation agents has uncovered a startling asymmetry: these models can infer an opponent's hidden preferences — such as whether they value price ove…

Source page LLM agents May 2026