16 hours ago

Lauri Lovén Alaa Saleh Reza Farahani Ilir Murturi Miguel Bordallo López Praveen Kumar Donta Schahram Dustdar

Abstract

Real-time AI services increasingly operate across the device-edge-cloud continuum, where autonomous AI agents generate latency-sensitive workloads, orchestrate multi-stage processing pipelines, and compete for shared resources under policy and governance constraints. This article shows that the structure of service-dependency graphs, modelled as DAGs whose nodes represent compute stages and whose edges encode execution ordering, is a primary determinant of whether decentralised, price-based resource allocation can work reliably at scale. When dependency graphs are hierarchical (tree or series-parallel), prices converge to stable equilibria, optimal allocations can be computed efficiently, and under appropriate mechanism design (with quasilinear utilities and discrete slice items), agents have no incentive to misreport their valuations within each decision epoch. When dependencies are more complex, with cross-cutting ties between pipeline stages, prices oscillate, allocation quality degrades, and the system becomes difficult to manage. To bridge this gap, we propose a hybrid management architecture in which cross-domain integrators encapsulate complex sub-graphs into resource slices that present a simpler, well-structured interface to the rest of the market. A systematic ablation study across six experiments (1,620 runs, 10 seeds each) confirms that (i) dependency-graph topology is a first-order determinant of price stability and scalability, (ii) the hybrid architecture reduces price volatility by up to 70-75% without sacrificing throughput, (iii) governance constraints create quantifiable efficiency-compliance trade-offs that depend jointly on topology and load, and (iv) under truthful bidding the decentralised market matches a centralised value-optimal baseline, confirming that decentralised coordination can replicate centralised allocation quality.

One-sentence Summary

Authors from the University of Oulu and other European institutions propose a hybrid management architecture that encapsulates complex service dependencies into polymatroidal slices, enabling stable, incentive-compatible decentralized resource allocation for real-time AI agents across the device-edge-cloud continuum.

Key Contributions

Real-time AI services across the device-edge-cloud continuum face instability when complex service-dependency graphs create cross-resource complementarities that prevent price convergence and efficient allocation.
The proposed hybrid management architecture encapsulates complex sub-graphs into resource slices to enforce hierarchical topologies, ensuring the feasible allocation set forms a polymatroid that guarantees market-clearing prices and truthful bidding.
Systematic ablation studies across 1,620 runs demonstrate that this approach reduces price volatility by up to 75% without sacrificing throughput while matching the value-optimal quality of centralized baselines under truthful bidding.

Introduction

Real-time AI services increasingly operate across device-edge-cloud environments where autonomous agents must coordinate latency-sensitive workloads under strict governance constraints. Prior approaches struggle because centralized orchestration is impractical across trust boundaries, while naive decentralized markets fail when complex service dependencies create resource complementarities that destabilize prices and make optimal allocation computationally intractable. The authors leverage service-dependency graph topology to identify stable regimes where tree or series-parallel structures guarantee market equilibrium and truthful bidding, then propose a hybrid architecture that encapsulates complex sub-graphs into simplified resource slices to restore stability without sacrificing throughput.

Method

The authors propose a framework for distributed service computing where autonomous AI agents generate tasks, compose services, and interact economically across a continuum of devices, edge platforms, and cloud infrastructure. The system operates in discrete time periods $t$, with agents conditioning their valuations on a commonly accepted system state $s_t$. The overall model integrates agentic behavior, resource dependencies, governance constraints, and mechanism design to facilitate efficient allocation.

Refer to the framework diagram for a high-level overview of the system components and their interactions. The process begins at the Agentic Layer, where agents issue tasks $T_i(t)$ and send messages $m_i(t)$. These inputs flow into the Valuation Layer, which computes latency-aware valuations defined as $V_{ik}(T_{ik}, q_{ik}) = v_{ik}(q_k) \delta_{ik}(T_{ik})$. Here, $v_{ik}(q)$ represents the base value of completing a task at quality $q$, while $\delta_{ik}$ captures latency decay. The valuation depends on the latency $T_{ik}$ and workload $q_k$ associated with the task.
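The multiplicative form of the valuation can be illustrated with a minimal sketch. The summary does not state the functional form of $\delta_{ik}$, so the half-life decay below, along with the function names and the millisecond units, is purely an assumption chosen for illustration.

```python
def latency_decay(latency_ms, half_life_ms):
    """Hypothetical decay factor: 1.0 at zero latency, halving every half_life_ms.

    This stands in for delta_ik; the real decay shape is not given in the text.
    """
    if latency_ms < 0 or half_life_ms <= 0:
        raise ValueError("latency must be >= 0 and half-life must be > 0")
    return 0.5 ** (latency_ms / half_life_ms)


def valuation(base_value, latency_ms, half_life_ms):
    """Latency-aware valuation: base value times the decay factor (V = v * delta)."""
    return base_value * latency_decay(latency_ms, half_life_ms)


if __name__ == "__main__":
    # The same task is worth less the later it completes.
    for latency in (0.0, 50.0, 100.0):
        print(latency, valuation(10.0, latency, half_life_ms=50.0))
```

Any non-increasing decay with value 1 at zero latency would fit the description equally well; the half-life form is used only because it is easy to read.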

Concurrently, the Service Layer defines available capacities $C(t)$, while the Dependency DAG models structural dependencies among resources as $G_{res} = (R, E)$. These factors converge at the Feasible Set, which is the intersection of resource constraints $X_{res}$ and governance constraints $X_{gov}$. Governance inputs, including trust scores $\phi(t)$ and policies $G(t)$, further restrict the feasible allocations. The Mechanism $M$ then maps the messages and current state to an allocation $x_t$ and payments $P_i(t)$. Finally, the State Update module evolves the system state according to $s_{t+1} = \Psi(s_t, x_t, P(t), \xi_t)$, incorporating exogenous events and realized allocations.

To ensure tractability in the presence of complex dependencies, the authors introduce a hybrid market architecture. As shown in the figure below, this architecture consists of three primary layers: Cross-Domain Integrators, Local Marketplaces, and AI Agents.

Cross-Domain Integrators form the agent-facing layer. Each integrator encapsulates a complex multi-resource service path into a governance-compliant slice. Internally, the integrator manages the dependency DAG of its sub-system and exposes a simplified, substitutable capacity interface. The capacity of this slice is set equal to the maximum flow of the internal sub-DAG. This encapsulation absorbs complementarities that would otherwise destabilize market-based coordination.
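As a rough illustration of how an integrator could derive a slice capacity, the sketch below computes the maximum flow of a small internal sub-DAG with the Edmonds-Karp algorithm. The algorithm choice, the node names, and the edge capacities are assumptions made for the example; the text only states that slice capacity equals the max flow of the internal sub-DAG.

```python
from collections import deque


def max_flow(capacity, source, sink):
    """Edmonds-Karp max flow. `capacity` maps (u, v) edges to capacities."""
    residual, adj = {}, {}
    for (u, v), c in capacity.items():
        residual[(u, v)] = residual.get((u, v), 0) + c
        residual.setdefault((v, u), 0)  # reverse edge for undoing flow
        adj.setdefault(u, set()).add(v)
        adj.setdefault(v, set()).add(u)

    flow = 0
    while True:
        # Breadth-first search for a shortest augmenting path.
        parent = {source: None}
        queue = deque([source])
        while queue and sink not in parent:
            u = queue.popleft()
            for v in adj.get(u, ()):
                if v not in parent and residual[(u, v)] > 0:
                    parent[v] = u
                    queue.append(v)
        if sink not in parent:
            return flow  # no augmenting path left

        # Find the bottleneck along the path, then push that much flow.
        bottleneck, v = float("inf"), sink
        while parent[v] is not None:
            bottleneck = min(bottleneck, residual[(parent[v], v)])
            v = parent[v]
        v = sink
        while parent[v] is not None:
            u = parent[v]
            residual[(u, v)] -= bottleneck
            residual[(v, u)] += bottleneck
            v = u
        flow += bottleneck


# Hypothetical internal sub-DAG with a cross-cutting gpu -> cpu tie.
SUB_DAG = {
    ("ingest", "gpu"): 4,
    ("ingest", "cpu"): 3,
    ("gpu", "cpu"): 2,
    ("gpu", "deliver"): 2,
    ("cpu", "deliver"): 4,
}

if __name__ == "__main__":
    print("slice capacity:", max_flow(SUB_DAG, "ingest", "deliver"))
```

Agents bidding on the slice would see only the single capacity number, not the internal edges, which is the sense in which the integrator hides the complementarities.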

Beneath the integrators, Local Marketplaces operate at the device or edge scope to coordinate fungible services and resources, such as compute cycles and bandwidth. These markets clear via lightweight auctions or posted prices and enforce local governance policies. For simple, single-domain services, agents may interact directly with a local marketplace. Inter-Market Coordination ensures consistency through the exchange of coarse-grained signals, such as aggregate demand and congestion indicators, without requiring full system-wide optimization.

The mechanism design relies on the structural properties of the feasible allocation set. The authors demonstrate that when the service-dependency DAG is a tree or series-parallel network, the capacity constraints form a polymatroid. By using architectural encapsulation, the integrators ensure that the quotient graph seen by agents maintains this tree or series-parallel structure, even if the underlying infrastructure DAG is arbitrary. This preserves the polymatroidal structure of the agent-facing feasible region.
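The polymatroid condition can be made concrete with a small check. A capacity function $f$ over sets of agents defines a polymatroid when $f(\emptyset) = 0$, $f$ is monotone, and $f$ is submodular. The sketch below brute-forces these three properties for a toy tree in which three leaf links share one uplink; the capacities, names, and the counter-example function are hypothetical and not taken from the paper.

```python
from itertools import combinations


def subsets(items):
    """Yield every subset of `items` as a frozenset."""
    items = list(items)
    for size in range(len(items) + 1):
        for combo in combinations(items, size):
            yield frozenset(combo)


def is_polymatroid_rank(f, ground):
    """Check that f is normalised, monotone and submodular on `ground`."""
    all_sets = list(subsets(ground))
    if f(frozenset()) != 0:
        return False
    for a in all_sets:
        for b in all_sets:
            if a <= b and f(a) > f(b):
                return False  # monotonicity violated
            if f(a) + f(b) < f(a | b) + f(a & b):
                return False  # submodularity violated
    return True


# Toy tree: each agent has its own leaf link, all traffic shares one uplink.
LEAF_CAP = {"a": 3, "b": 2, "c": 4}
UPLINK_CAP = 6


def tree_rank(agents):
    """Most capacity a set of agents can jointly obtain in the toy tree."""
    return min(sum(LEAF_CAP[i] for i in agents), UPLINK_CAP)


if __name__ == "__main__":
    print(is_polymatroid_rank(tree_rank, LEAF_CAP))                # tree case
    print(is_polymatroid_rank(lambda s: len(s) ** 2, LEAF_CAP))   # artificial violation
```

The brute-force check is exponential in the number of agents, so it is useful only as a sanity test on small examples.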

Furthermore, the latency-aware valuations satisfy the gross-substitutes (GS) condition under slice encapsulation. This is achieved because integrators expose discrete, indivisible slices with fixed internal routing and deterministic latency within each mechanism epoch. With a polymatroidal feasible set and GS valuations, the system admits a Walrasian equilibrium. Consequently, efficient allocation is computable in polynomial time via ascending auctions, and the outcome is implementable in a dominant-strategy incentive-compatible (DSIC) manner using mechanisms such as VCG or polymatroid clinching auctions.
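To illustrate the DSIC claim in its simplest special case, the sketch below runs VCG for identical, indivisible slices with unit-demand agents. In that setting VCG reduces to awarding the slices to the highest bidders and charging each winner the highest losing bid. This is a deliberately reduced setting, not the paper's mechanism, and the agent names and bids are invented.

```python
def vcg_identical_slices(bids, num_slices):
    """VCG for `num_slices` identical slices and unit-demand bidders.

    `bids` maps agent name to reported value. Returns (winners, payments).
    Each winner pays the highest losing bid, or 0 if nobody loses.
    """
    # Sort by bid descending; break ties by name so the result is deterministic.
    ranked = sorted(bids.items(), key=lambda kv: (-kv[1], kv[0]))
    winners = [agent for agent, _ in ranked[:num_slices]]
    if len(ranked) > num_slices:
        price = ranked[num_slices][1]
    else:
        price = 0.0
    return winners, {agent: price for agent in winners}


def utility(true_value, agent, winners, payments):
    """Quasilinear utility: value minus payment if served, else zero."""
    return true_value - payments[agent] if agent in winners else 0.0


if __name__ == "__main__":
    truthful = {"a1": 9.0, "a2": 7.0, "a3": 4.0, "a4": 2.0}
    winners, payments = vcg_identical_slices(truthful, num_slices=2)
    print(winners, payments)

    # a2 underbids below the clearing price and loses a profitable slice.
    shaded = dict(truthful, a2=3.0)
    w2, p2 = vcg_identical_slices(shaded, num_slices=2)
    print(utility(7.0, "a2", winners, payments), utility(7.0, "a2", w2, p2))
```

Because a winner's payment depends only on the other agents' bids, changing one's own report can alter whether one wins but never the price paid, which is why truthful reporting is a dominant strategy here.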

Experiment

Structural discipline experiments validate that polymatroidal topologies (tree and linear) ensure market stability with zero price volatility, whereas entangled dependency graphs cause severe degradation and market failure under high load.
Hybrid architecture experiments confirm that encapsulating complex services into slices significantly reduces price volatility, with EMA smoothing acting as the primary stabilizer and efficiency factors improving latency and welfare in congested regimes.
Governance experiments demonstrate that strict trust-gated capacity partitioning trades service coverage for quality by reducing latency, though it can induce price volatility in otherwise stable topologies due to smaller resource pools.
Interaction studies reveal that the hybrid architecture effectively mitigates the volatility penalties introduced by strict governance, with synergy effects varying by topology from additive to super-additive.
Market mechanism experiments show that under truthful bidding, price-based coordination yields welfare outcomes nearly identical to value-greedy allocation, indicating the mechanism's primary value lies in incentive alignment rather than informational superiority.
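The role of EMA smoothing as a price stabilizer, mentioned in the hybrid architecture results above, can be sketched as follows. The smoothing factor, the oscillating price series, and the use of population standard deviation as the volatility measure are all assumptions for illustration; the summary does not specify how the paper measures volatility.

```python
from statistics import pstdev


def ema(prices, alpha):
    """Exponential moving average: s_t = alpha * p_t + (1 - alpha) * s_{t-1}."""
    if not prices:
        raise ValueError("prices must be non-empty")
    if not 0.0 < alpha <= 1.0:
        raise ValueError("alpha must be in (0, 1]")
    smoothed = [prices[0]]
    for price in prices[1:]:
        smoothed.append(alpha * price + (1.0 - alpha) * smoothed[-1])
    return smoothed


def volatility(prices):
    """Hypothetical volatility measure: population standard deviation."""
    return pstdev(prices)


if __name__ == "__main__":
    raw = [1.0, 3.0, 1.0, 3.0, 1.0, 3.0]  # invented oscillating clearing prices
    smooth = ema(raw, alpha=0.2)
    print("raw volatility:     ", volatility(raw))
    print("smoothed volatility:", volatility(smooth))
```

A smaller `alpha` damps oscillations more strongly but also makes posted prices react more slowly to genuine shifts in demand.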

Real-Time AI Service Economy: A Framework for Agentic Computing Across the Continuum
