HyperAIHyperAI

Command Palette

Search for a command to run...

실시간 AI 서비스 경제: 연속체 전반에 걸친 Agentic Computing을 위한 프레임워크

Lauri Lovén Alaa Saleh Reza Farahani Ilir Murturi Miguel Bordallo López Praveen Kumar Donta Schahram Dustdar

초록

실시간 AI 서비스는 점차 디바이스-에지-클라우드 연속체 (continuum) 상에서 운영되며, 자율형 AI 에이전트들이 지연에 민감한 워크로드를 생성하고, 다단계 처리 파이프라인을 오케스트레이션하며, 정책 및 거버넌스 제약 하에서 공유 자원을 경쟁적으로 할당받습니다. 본 논문은 서비스 의존성 그래프의 구조가 분산형 가격 기반 자원 할당이 대규모 환경에서 신뢰성 있게 작동할 수 있는지 여부를 결정하는 주요 요인임을 보여줍니다. 여기서 의존성 그래프는 연산 단계를 노드로, 실행 순서를 간선으로 인코딩하는 방향성 비순환 그래프 (DAG) 로 모델링됩니다. 의존성 그래프가 계층적 (트리 또는 직렬 - 병렬 구조) 인 경우, 가격은 안정적인 균형으로 수렴하고 최적 할당을 효율적으로 계산할 수 있으며, 적절한 메커니즘 설계 (준선형 효용과 이산적 슬라이스 아이템을 전제) 하에서는 에이전트들이 각 의사결정 주기 내에서 자사 평가를 왜곡할 유인이 없습니다. 반면, 파이프라인 단계 간 교차 연결이 존재하는 더 복잡한 의존성 구조에서는 가격이 진동하고 할당 품질이 저하되며 시스템 관리가 어려워집니다. 이러한 격차를 해소하기 위해 본 연구는 크로스 도메인 통합기가 복잡한 서브그래프를 자원 슬라이스로 캡슐화하여 시장 나머지 부분에 더 단순하고 구조화된 인터페이스를 제공하는 하이브리드 관리 아키텍처를 제안합니다. 6 개 실험 (각 10 회 시드, 총 1,620 회 실행) 에 걸친 체계적인 제거 (ablation) 연구는 다음과 같은 결과를 확인했습니다: (i) 의존성 그래프 토폴로지는 가격 안정성과 확장성의 1 차 결정 요인이며, (ii) 하이브리드 아키텍처는 처리량을 희생하지 않으면서 가격 변동성을 최대 70~75% 까지 감소시키고, (iii) 거버넌스 제약은 토폴로지와 부하에 공동으로 의존하는 정량화 가능한 효율성 - 준수 트레이드오프를 생성하며, (iv) 진실된 입찰 하에서 분산형 시장은 중앙집중식 가치 최적 베이스라인과 일치하여 분산형 조정도 중앙집중식 할당 품질을 재현할 수 있음을 입증합니다.

One-sentence Summary

Authors from the University of Oulu and other European institutions propose a hybrid management architecture that encapsulates complex service dependencies into polymatroidal slices, enabling stable, incentive-compatible decentralized resource allocation for real-time AI agents across the device-edge-cloud continuum.

Key Contributions

  • Real-time AI services across the device-edge-cloud continuum face instability when complex service-dependency graphs create cross-resource complementarities that prevent price convergence and efficient allocation.
  • The proposed hybrid management architecture encapsulates complex sub-graphs into resource slices to enforce hierarchical topologies, ensuring the feasible allocation set forms a polymatroid that guarantees market-clearing prices and truthful bidding.
  • Systematic ablation studies across 1,620 runs demonstrate that this approach reduces price volatility by up to 75% without sacrificing throughput while matching the value-optimal quality of centralized baselines under truthful bidding.

Introduction

Real-time AI services increasingly operate across device-edge-cloud environments where autonomous agents must coordinate latency-sensitive workloads under strict governance constraints. Prior approaches struggle because centralized orchestration is impractical across trust boundaries, while naive decentralized markets fail when complex service dependencies create resource complementarities that destabilize prices and make optimal allocation computationally intractable. The authors leverage service-dependency graph topology to identify stable regimes where tree or series-parallel structures guarantee market equilibrium and truthful bidding, then propose a hybrid architecture that encapsulates complex sub-graphs into simplified resource slices to restore stability without sacrificing throughput.

Method

The authors propose a framework for distributed service computing where autonomous AI agents generate tasks, compose services, and interact economically across a continuum of devices, edge platforms, and cloud infrastructure. The system operates in discrete time periods ttt, with agents conditioning their valuations on a commonly accepted system state sts_tst. The overall model integrates agentic behavior, resource dependencies, governance constraints, and mechanism design to facilitate efficient allocation.

Refer to the framework diagram for a high-level overview of the system components and their interactions. The process begins at the Agentic Layer, where agents issue tasks Ti(t)T_i(t)Ti(t) and send messages mi(t)m_i(t)mi(t). These inputs flow into the Valuation Layer, which computes latency-aware valuations defined as Vik(Tik,qik)=vik(qk)δik(Tik)V_{ik}(T_{ik}, q_{ik}) = v_{ik}(q_k) \delta_{ik}(T_{ik})Vik(Tik,qik)=vik(qk)δik(Tik). Here, vik(q)v_{ik}(q)vik(q) represents the base value of completing a task at quality qqq, while δik\delta_{ik}δik captures latency decay. The valuation depends on the latency TikT_{ik}Tik and workload qkq_kqk associated with the task.

Concurrently, the Service Layer defines available capacities C(t)C(t)C(t), while the Dependency DAG models structural dependencies among resources as Gres=(R,E)G_{res} = (R, E)Gres=(R,E). These factors converge at the Feasible Set, which is the intersection of resource constraints XresX_{res}Xres and governance constraints XgovX_{gov}Xgov. Governance inputs, including trust scores ϕ(t)\phi(t)ϕ(t) and policies G(t)G(t)G(t), further restrict the feasible allocations. The Mechanism MMM then maps the messages and current state to an allocation xtx_txt and payments Pi(t)P_i(t)Pi(t). Finally, the State Update module evolves the system state according to st+1=Ψ(st,xt,P(t),ξt)s_{t+1} = \Psi(s_t, x_t, P(t), \xi_t)st+1=Ψ(st,xt,P(t),ξt), incorporating exogenous events and realized allocations.

To ensure tractability in the presence of complex dependencies, the authors introduce a hybrid market architecture. As shown in the figure below, this architecture consists of three primary layers: Cross-Domain Integrators, Local Marketplaces, and AI Agents.

Cross-Domain Integrators form the agent-facing layer. Each integrator encapsulates a complex multi-resource service path into a governance-compliant slice. Internally, the integrator manages the dependency DAG of its sub-system and exposes a simplified, substitutable capacity interface. The capacity of this slice is set equal to the maximum flow of the internal sub-DAG. This encapsulation absorbs complementarities that would otherwise destabilize market-based coordination.

Beneath the integrators, Local Marketplaces operate at the device or edge scope to coordinate fungible services and resources, such as compute cycles and bandwidth. These markets clear via lightweight auctions or posted prices and enforce local governance policies. For simple, single-domain services, agents may interact directly with a local marketplace. Inter-Market Coordination ensures consistency through the exchange of coarse-grained signals, such as aggregate demand and congestion indicators, without requiring full system-wide optimization.

The mechanism design relies on the structural properties of the feasible allocation set. The authors demonstrate that when the service-dependency DAG is a tree or series-parallel network, the capacity constraints form a polymatroid. By using architectural encapsulation, the integrators ensure that the quotient graph seen by agents maintains this tree or series-parallel structure, even if the underlying infrastructure DAG is arbitrary. This preserves the polymatroidal structure of the agent-facing feasible region.

Furthermore, the latency-aware valuations satisfy the gross-substitutes (GS) condition under slice encapsulation. This is achieved because integrators expose discrete, indivisible slices with fixed internal routing and deterministic latency within each mechanism epoch. With a polymatroidal feasible set and GS valuations, the system admits a Walrasian equilibrium. Consequently, efficient allocation is computable in polynomial time via ascending auctions, and the outcome is implementable in a dominant-strategy incentive-compatible (DSIC) manner using mechanisms such as VCG or polymatroid clinching auctions.

Experiment

  • Structural discipline experiments validate that polymatroidal topologies (tree and linear) ensure market stability with zero price volatility, whereas entangled dependency graphs cause severe degradation and market failure under high load.
  • Hybrid architecture experiments confirm that encapsulating complex services into slices significantly reduces price volatility, with EMA smoothing acting as the primary stabilizer and efficiency factors improving latency and welfare in congested regimes.
  • Governance experiments demonstrate that strict trust-gated capacity partitioning trades service coverage for quality by reducing latency, though it can induce price volatility in otherwise stable topologies due to smaller resource pools.
  • Interaction studies reveal that the hybrid architecture effectively mitigates the volatility penalties introduced by strict governance, with synergy effects varying by topology from additive to super-additive.
  • Market mechanism experiments show that under truthful bidding, price-based coordination yields welfare outcomes nearly identical to value-greedy allocation, indicating the mechanism's primary value lies in incentive alignment rather than informational superiority.

AI로 AI 구축

아이디어에서 출시까지 — 무료 AI 코코딩, 즉시 사용 가능한 환경, 최적의 GPU 가격으로 AI 개발을 가속화하세요.

AI 협업 코딩
바로 사용 가능한 GPU
최적의 가격

HyperAI Newsletters

최신 정보 구독하기
한국 시간 매주 월요일 오전 9시 에 이번 주의 최신 업데이트를 메일로 발송합니다
이메일 서비스 제공: MailChimp