Agentic Orchestration and the Turn Toward Principled Decision Theory

Today’s batch highlights a clear shift from heuristics-based LLM architectures toward formal decision-making frameworks. From Bayesian orchestration to uncertainty quantification in deep learning, the field is prioritizing robustness over brute-force scaling.

Position: agentic AI orchestration should be Bayes-consistent

Papamarkou et al. · [abs] [pdf]

This position paper argues that the control layers in agentic systems—specifically tool selection and resource allocation—should transition from heuristic prompting to Bayesian decision theory. By maintaining explicit beliefs over task-relevant latent variables, agents can handle uncertainty more gracefully than standard chain-of-thought methods.

↳ Moving away from black-box prompting toward formal decision-making is likely the only way to make agentic workflows production-ready.

agents decision theory bayesian

To Call or Not to Call: A Framework to Assess and Optimize LLM Tool Calling

Wu et al. · [abs] [pdf]

The authors introduce a decision-theoretic framework for determining whether a tool call is net-positive, specifically for web search. They provide a quantitative method to balance the cost of a call against the marginal utility of potentially noisy external information.

↳ Redundant tool calls are a common source of latency and error in agents; this provides a systematic way to filter them out.

agents tool use llms

Possibilistic Predictive Uncertainty for Deep Learning

Ni et al. · [abs] [pdf]

The authors introduce DAPPr, a framework for epistemic uncertainty quantification that bridges the gap between computationally expensive Bayesian methods and imprecise second-order predictors. It uses Dirichlet-approximated possibilistic posteriors to deliver reliable uncertainty estimates with significantly lower overhead.

↳ Reliable uncertainty quantification is the missing piece for high-stakes AI deployment, and this provides a rare balance of efficiency and rigor.

uncertainty deep learning robustness

Jailbreaking Vision-Language Models Through the Visual Modality

Azulay et al. · [abs] [pdf]

This paper demonstrates that the vision component of VLMs is a massive, under-audited attack surface. By using visual symbol substitution, benign object swapping, and visual analogies, the authors bypass standard text-based safety filters with high success rates.

↳ Multimodal safety alignment is failing; if your VLM relies on a text-based guardrail, it is effectively blind to these visual jailbreaks.

security multimodal vlms

Make Your LVLM KV Cache More Lightweight

Chen et al. · [abs] [pdf]

The authors propose LightKV, which exploits redundancy in vision-token embeddings to prune the KV cache in LVLMs. By using cross-modality message passing, they significantly reduce GPU memory consumption during prefill without sacrificing significant performance.

↳ As context windows grow and multimodal inputs become standard, KV cache optimization is becoming the primary bottleneck for serving throughput.

inference memory vlms

Fairness of Classifiers in the Presence of Constraints between Features

Cooper et al. · [abs] [pdf]

This research addresses the issue of hidden dependencies in fair classification where protected features are masked by correlations with other variables. They define fairness through ‘fair explanations’ based on prime-implicant logic, ensuring decisions don’t rely on protected attributes even in constrained feature spaces.

↳ It moves fairness metrics beyond simple statistical parity into the realm of causal/logical provenance, which is legally and ethically more robust.

fairness logic classification

Back to the terminal. The models are getting smarter, but the logic remains our responsibility.