منذ 16 ساعات

Zhenfeng Cao

جدول المحتويات

الملخص

لطالما عملت هندسة البرمجيات، على مدى أكثر من نصف قرن، على افتراض أساسي: يقوم المهندسون البشر بتحليل المشكلات، وتشفير منطق اتخاذ القرار داخل شيفرات برمجية ثابتة (static code)، ثم تعديل تلك الشيفرات يدوياً مع تطور المتطلبات. تفترض هذه الورقة أن ظهور وكلاء الذكاء الاصطناعي (AI Agents) – وهي أنظمة تتصرف فيها نماذج اللغة الكبيرة (LLM) كمحرك رئيسي للاستدلال، تولد وتلفظ الشيفرات البرمجية ديناميكياً بوصفها مواردًا وسيطة – لا يمثل تحسيناً تدريجياً فحسب، بل إعادة هيكلة جذرية لنموذج هندسة البرمجيات. بالاستناد إلى تحليل مبني على المبادئ الأولى (first-principles analysis) لتوسع التعقيد (complexity scaling)، نُرسّخ التمييز بين البرمجيات التقليدية (حيث تُعدّ الشيفرة حاملة لمنطق اتخاذ القرار) والأنظمة الوكيلة (Agentic Systems) (حيث تكون الشيفرة أداة مؤقتة تخدم حلقة استدلال يقودها LLM). كما نتتبع المسار التاريخي الممتد من البرمجيات المرخصة، إلى البرمجيات كخدمة (SaaS)، وصولاً إلى ما نسميه الخدمة الوكيلة (Agent-as-a-Service، AaaS)، مُبرهنين على أن كل انتقال قد نقل عبئاً إضافياً من التعقيد بعيداً عن المستخدمين النهائيين. ونقدّم مفهوم "الهندسة الوكيلة" (Agentic Engineering) كعلم ناشئ، متميّز عن هندسة البرمجيات في كائن الدراسة الأساسي، ونموذج التحكم، والدور البشري. ومن خلال تحليل أدلة القياس الحديثة، بما في ذلك SWE-bench Verified وEvoClaw ودراسات تنسيق الوكلاء المتعددين (multi-agent coordination) التابعة لـ LangChain، نُبَيّن كل من الإمكانات التحويلية للنموذج الوكيل وقيوده الحالية.

One-sentence Summary

This paper argues that AI agents fundamentally restructure the software paradigm by treating code as ephemeral tooling for LLM-driven reasoning loops rather than the carrier of decision logic, formalizing Agentic Engineering and Agent-as-a-Service (AaaS) through first-principles analysis of complexity scaling while demonstrating transformative potential and limitations via SWE-bench Verified, EvoClaw, and LangChain's multi-agent coordination studies.

Key Contributions

This work formalizes the distinction between traditional software and agentic systems through a first-principles analysis of complexity scaling, defining code as either a carrier of logic or ephemeral tooling.
The paper introduces Agentic Engineering as a distinct emergent discipline and proposes the term Agent-as-a-Service to characterize the historical shift from licensed software to SaaS.
Analysis of recent benchmark evidence including SWE-bench Verified and EvoClaw demonstrates the transformative potential of the agentic paradigm alongside its current limitations in sustained autonomous development.

Introduction

Traditional software engineering relies on human engineers encoding decision logic into static code, yet this model struggles with exponential complexity scaling as system interactions grow combinatorially. Current AI-augmented development approaches fail to remove the human bottleneck from design decisions and maintain the latency of traditional software lifecycles. The authors contend that AI agents constitute a fundamental restructuring of the software paradigm where code serves as ephemeral tooling for an LLM-driven reasoning loop instead of the system itself. They formalize this shift as Agent-as-a-Service and introduce Agentic Engineering as a distinct discipline focused on intent architecture and multi-agent coordination.

Method

The proposed agentic system operates on a dynamic architecture where decision logic is generated at runtime rather than being statically pre-programmed. As defined in the formal model, an AI agent system $A$ is characterized by the tuple $A = (M, \mathcal{T}, \mathcal{M}, \Pi)$ , where $M$ represents the large language model serving as the reasoning engine, $\mathcal{T}$ denotes the set of executable tools, $\mathcal{M}$ is the memory subsystem, and $\Pi$ is the planning mechanism.

The overall framework is illustrated in the diagram below, which depicts the central role of the LLM Reasoning Core in orchestrating interactions with the external environment.

The architecture consists of three primary functional modules branching from the core. The Perception module handles multi-modal input processing, translating raw environmental data into a format the reasoning engine can utilize. The Memory module manages semantic, episodic, and procedural information, allowing the system to maintain context and learn from past interactions. The Action module encompasses both internal reasoning processes and the invocation of external tools, enabling the agent to execute code, query databases, or call APIs.

The system operates through an iterative execution loop. At each time step $t$ , the model $M$ selects an action $a_t$ based on the current state $s_t$ and the memory subsystem $\mathcal{M}$ , formalized as $a_t \leftarrow M(s_t, \mathcal{M})$ . The system state is then updated by executing the chosen action, denoted as $s_{t+1} \leftarrow \text{exec}(a_t)$ . Unlike traditional software where decision rules $D$ are fixed, this agentic approach allows the LLM to dynamically produce code and adjust behavior based on intermediate results. This paradigm shifts the focus from delivering software artifacts to delivering outcomes, where the agent autonomously plans, executes, and validates tasks to fulfill user intent.

Experiment

Empirical evaluations utilizing benchmarks such as SWE-bench Verified and enterprise debugging workflows demonstrate that agentic engineering outperforms traditional paradigms through process-centric training and multi-agent orchestration. These studies validate that coordinated agents can reduce debugging time and autonomously evolve skills, yet the EvoClaw benchmark exposes significant limitations in continuous software evolution. Consequently, while current systems generalize across the software lifecycle, they face persistent challenges regarding context drift and error propagation during long-term maintenance tasks.

ملف PDF المصدر

جدول المحتويات

بناء الذكاء الاصطناعي بالذكاء الاصطناعي

من الفكرة إلى الإطلاق — سرّع تطوير الذكاء الاصطناعي الخاص بك مع المساعدة البرمجية المجانية بالذكاء الاصطناعي، وبيئة جاهزة للاستخدام، وأفضل أسعار لوحدات معالجة الرسومات.

البرمجة التعاونية باستخدام الذكاء الاصطناعي

وحدات GPU جاهزة للعمل

أفضل الأسعار

ابدأ عرض الأسعار

HyperAI Newsletters

اشترك في آخر تحديثاتنا

سنرسل لك أحدث التحديثات الأسبوعية إلى بريدك الإلكتروني في الساعة التاسعة من صباح كل يوم اثنين

مدعوم بواسطة MailChimp

منذ 16 ساعات

Zhenfeng Cao

جدول المحتويات

الملخص

One-sentence Summary

Key Contributions

This work formalizes the distinction between traditional software and agentic systems through a first-principles analysis of complexity scaling, defining code as either a carrier of logic or ephemeral tooling.
The paper introduces Agentic Engineering as a distinct emergent discipline and proposes the term Agent-as-a-Service to characterize the historical shift from licensed software to SaaS.
Analysis of recent benchmark evidence including SWE-bench Verified and EvoClaw demonstrates the transformative potential of the agentic paradigm alongside its current limitations in sustained autonomous development.

Introduction

Method

The overall framework is illustrated in the diagram below, which depicts the central role of the LLM Reasoning Core in orchestrating interactions with the external environment.

Experiment

ملف PDF المصدر

جدول المحتويات

بناء الذكاء الاصطناعي بالذكاء الاصطناعي

البرمجة التعاونية باستخدام الذكاء الاصطناعي

وحدات GPU جاهزة للعمل

أفضل الأسعار

ابدأ عرض الأسعار

HyperAI Newsletters

اشترك في آخر تحديثاتنا

سنرسل لك أحدث التحديثات الأسبوعية إلى بريدك الإلكتروني في الساعة التاسعة من صباح كل يوم اثنين

مدعوم بواسطة MailChimp

Command Palette

نهاية هندسة البرمجيات: كيف تعمل وكلاء الذكاء الاصطناعي على إعادة هيكلة النموذج البرمجي بشكل جذري

Zhenfeng Cao

الملخص

One-sentence Summary

Key Contributions

Introduction

Method

Experiment

بناء الذكاء الاصطناعي بالذكاء الاصطناعي

HyperAI Newsletters

Command Palette

نهاية هندسة البرمجيات: كيف تعمل وكلاء الذكاء الاصطناعي على إعادة هيكلة النموذج البرمجي بشكل جذري

Zhenfeng Cao

الملخص

One-sentence Summary

Key Contributions

Introduction

Method

Experiment

بناء الذكاء الاصطناعي بالذكاء الاصطناعي

HyperAI Newsletters

Command Palette

نهاية هندسة البرمجيات: كيف تعمل وكلاء الذكاء الاصطناعي على إعادة هيكلة النموذج البرمجي بشكل جذري

Zhenfeng Cao

الملخص

One-sentence Summary

Key Contributions

Introduction

Method

Experiment

بناء الذكاء الاصطناعي بالذكاء الاصطناعي

HyperAI Newsletters