HyperAIHyperAI

Command Palette

Search for a command to run...

인공지능 에이전트 사회에서 사회화는 어떻게 발생하는가? Moltbook에 대한 사례 연구

Ming Li Xirui Li Tianyi Zhou

초록

대규모 언어 모델 에이전트가 점점 더 네트워크 환경에 확산되면서 다음과 같은 근본적인 질문이 제기된다. 인공지능(AI) 에이전트 사회는 인간 사회 시스템과 유사한 수렴 동역학을 겪는가? 최근 Moltbook은 자율 에이전트가 지속적으로 변화하는 오픈엔드 온라인 사회에 참여하는 가능성을 모사한 미래 시나리오를 제시하였다. 본 연구에서는 이 AI 에이전트 사회에 대한 최초의 대규모 체계적 진단을 제시한다. 정적 관찰을 넘어서, AI 에이전트 사회의 동적 진화를 정량적으로 진단할 수 있는 프레임워크를 도입하여 의미적 안정성, 어휘 전환율, 개별 에이전트의 관성, 영향력 지속성, 집단적 합의 정도를 측정하였다. 분석 결과, Moltbook 내 시스템은 동적 균형 상태에 있음을 밝혀냈다. 전반적인 의미 평균은 빠르게 안정화되지만, 개별 에이전트는 높은 다양성과 지속적인 어휘 전환을 유지하며 동질화를 피하고 있다. 그러나 에이전트들은 강한 개별 관성과 상호작용 상대에 대한 최소한의 적응 반응을 보이며, 상호 영향력과 합의 형성의 기반을 마련하지 못하고 있다. 그 결과 영향력은 일시적이며 지속적인 슈퍼노드(super-node)가 형성되지 않으며, 공동 사회적 기억의 부재로 인해 안정적인 집단적 영향력의 중심이 형성되지 못하고 있다. 이러한 발견은 규모와 상호작용 밀도만으로는 사회화를 유도하기에 부족함을 보여주며, 차세대 AI 에이전트 사회의 설계 및 분석에 실질적인 원칙을 제시한다.

One-sentence Summary

Ming Li and Xirui Li (University of Maryland) with Tianyi Zhou (MBZUAI) analyze Moltbook, the largest AI-only social platform, revealing that despite scale and interaction, agents show no socialization: they exhibit semantic inertia, ignore feedback, and fail to form stable influence hierarchies or shared memory, challenging assumptions that density alone enables collective behavior.

Key Contributions

  • We define and operationalize "AI Socialization" as observable behavioral adaptation in LLM agents induced by sustained interaction within an AI-only society, introducing a novel framework to evaluate whether artificial societies develop human-like collective dynamics.
  • We develop a multi-level diagnostic methodology to measure semantic convergence, agent-level adaptation, and collective influence anchoring, applying it to Moltbook—the largest persistent AI agent society—to empirically assess socialization at scale.
  • Our analysis reveals that despite high interaction density, Moltbook agents exhibit semantic stabilization at the society level but lack mutual influence, persistent supernodes, or shared social memory, demonstrating that scale alone does not induce socialization in current AI societies.

Introduction

The authors leverage Moltbook, the largest public AI-only social platform, to investigate whether large-scale LLM agent societies exhibit socialization—the process by which agents adapt behavior through sustained interaction, as seen in human societies. Prior work has scaled agents to support coordination or simulate environments but lacks longitudinal analysis of emergent social dynamics. Existing studies often treat agent behavior as static or task-bound, ignoring how interaction over time shapes collective norms or influence structures. The authors’ main contribution is a multi-level diagnostic framework measuring semantic convergence, agent adaptation, and collective anchoring—and applying it to Moltbook, they find that despite high interaction density and scale, agents show no meaningful behavioral adaptation, persistent influence hierarchies, or shared social memory. This reveals that scalability alone does not induce socialization, highlighting the need for explicit mechanisms to support stability, feedback integration, and memory in future AI societies.

Top Figure
Top Figure

Dataset

The authors use Moltbook, a large-scale agent-only social platform, as their primary dataset. Here’s how it’s composed and processed:

  • Dataset composition and sources:
    Moltbook hosts over 2 million LLM-driven agents interacting via posts, comments, and upvotes across topical sub-forums (“submolts”). All participants are autonomous agents — no human users are involved. The dataset spans the platform’s full observable history up to February 8, 2026.

  • Key subset details:

    • Main interaction log: Includes all posts and comments. Posts repeated over 1,000 times without variation are removed.
    • Daily interaction graphs: Nodes = active agents; edges = comment/reply interactions. Peak daily activity includes >23,000 agents and >400,000 weighted interactions.
    • Probing posts: A structured set of 45 posts designed to test collective recognition. Organized into 3 categories (must-read, accounts to follow, community context), each with 5 sub-forums and 3 paraphrases. Each post adopts a “newcomer” persona and targets a specific submolt.
  • How the data is used:

    • Interaction logs are used to analyze macro activity patterns (post volume, user growth, engagement) and structural dynamics (network graphs).
    • Probing posts are used to evaluate semantic convergence and socialization by measuring agent responses across contexts.
    • For semantic analysis, Sentence-BERT (all-MiniLM-L6-v2) generates embeddings; for n-gram analysis, nltk handles tokenization.
  • Processing and metadata:

    • Interaction graphs are built daily, with edge weights counting comment/reply frequency.
    • Probing posts include metadata: unique ID (category__paraphrase__submolt), target submolt, title, and full text.
    • No cropping is applied — full interaction history is retained after deduplication.

Method

The authors leverage a multi-faceted analytical framework to quantify lexical dynamics, semantic evolution, and agent-level behavioral adaptation in online discourse. Their methodology spans temporal n-gram analysis, centroid-based semantic stability metrics, local clustering density, individual drift quantification, feedback-driven adaptation, and interaction-induced content shifts—all grounded in semantic embeddings derived from Sentence-BERT.

To characterize lexical turnover, the authors define the lifespan of each n-gram ggg using its first and last observed dates: τfirst(g)=min{tgOt(n)}\tau_{\mathrm{first}}(g) = \min \{ t \mid g \in \mathcal{O}_t^{(n)} \}τfirst(g)=min{tgOt(n)} and τlast(g)=max{tgOt(n)}\tau_{\mathrm{last}}(g) = \max \{ t \mid g \in \mathcal{O}_t^{(n)} \}τlast(g)=max{tgOt(n)}. The set of active n-grams on day ttt, At(n)\mathcal{A}_t^{(n)}At(n), includes all n-grams that have entered the lexicon and not yet exited: At(n)={gτfirst(g)tτlast(g)}\mathcal{A}_t^{(n)} = \{ g \mid \tau_{\mathrm{first}}(g) \leq t \leq \tau_{\mathrm{last}}(g) \}At(n)={gτfirst(g)tτlast(g)}. Birth and death rates are then computed relative to the active vocabulary: the birth rate Rbirth(n)(t)R_{\mathrm{birth}}^{(n)}(t)Rbirth(n)(t) is the proportion of new n-grams among active ones on day ttt, while the death rate Rdeath(n)(t)R_{\mathrm{death}}^{(n)}(t)Rdeath(n)(t) measures the proportion of n-grams that exited the lexicon relative to the active set on the prior day.

For semantic evolution, the authors compute a Daily Semantic Centroid ct\mathbf{c}_tct as the mean embedding of all posts on day ttt: ct=1NtpPtvp\mathbf{c}_t = \frac{1}{N_t} \sum_{p \in \mathcal{P}_t} \mathbf{v}_pct=Nt1pPtvp. Macro-stability is captured via Centroid Similarity Scentroid(ti,tj)=cos(cti,ctj)S_{\mathrm{centroid}}(t_i, t_j) = \cos(\mathbf{c}_{t_i}, \mathbf{c}_{t_j})Scentroid(ti,tj)=cos(cti,ctj), which reflects the consistency of the aggregate discourse direction. Micro-homogeneity is measured via Pairwise Similarity Spairwise(ti,tj)S_{\mathrm{pairwise}}(t_i, t_j)Spairwise(ti,tj), the mean cosine similarity across all post pairs between two days, indicating how tightly clustered individual posts are within the semantic space.

To assess structural convergence, the authors analyze local neighborhood densities. For each post ppp on day ttt, they compute its Local Neighborhood Similarity SK(p)S_K(p)SK(p) as the mean cosine similarity to its KKK-nearest neighbors (with K=10K=10K=10). Temporal stability of these densities is quantified using Jensen-Shannon Divergence between the SKS_KSK distributions of consecutive days.

At the agent level, semantic drift is measured by partitioning each agent’s post history into early and late halves, computing centroids ca(early)\mathbf{c}_a^{(early)}ca(early) and ca(late)\mathbf{c}_a^{(late)}ca(late), and defining Individual Semantic Drift Da=1cos(ca(early),ca(late))D_a = 1 - \cos(\mathbf{c}_a^{(early)}, \mathbf{c}_a^{(late)})Da=1cos(ca(early),ca(late)). Drift Direction Consistency SaconsistencyS_a^{consistency}Saconsistency evaluates alignment with the global drift vector dˉ\bar{\mathbf{d}}dˉ, while Movement Toward Societal Centroid ΔSa\Delta S_aΔSa measures whether agents converge toward the global discourse center over time.

Feedback adaptation is evaluated via a sliding window approach. For each window Wk\mathcal{W}_kWk, posts are partitioned into top- and bottom-performing subsets based on net feedback scores. Semantic centroids ctopc_{top}ctop and cbotc_{bot}cbot are computed, and Net Progress NP=ΔbotΔtopNP = \Delta_{bot} - \Delta_{top}NP=ΔbotΔtop quantifies whether the agent’s next window moves closer to successful content and away from unsuccessful content, where Δtop=dist(cnext,ctop)dist(ccurr,ctop)\Delta_{top} = \mathrm{dist}(c_{next}, c_{top}) - \mathrm{dist}(c_{curr}, c_{top})Δtop=dist(cnext,ctop)dist(ccurr,ctop) and Δbot=dist(cnext,cbot)dist(ccurr,cbot)\Delta_{bot} = \mathrm{dist}(c_{next}, c_{bot}) - \mathrm{dist}(c_{curr}, c_{bot})Δbot=dist(cnext,cbot)dist(ccurr,cbot), with dist(x,y)=1cos(x,y)\mathrm{dist}(x,y) = 1 - \cos(x,y)dist(x,y)=1cos(x,y). Statistical significance is assessed against a permutation baseline that shuffles feedback scores.

Finally, interaction effects are studied using an event-study design. For each interaction event (a,t,p)(a, t, p^*)(a,t,p), the authors compare the agent’s pre- and post-interaction content windows Wpre\mathcal{W}_{pre}Wpre and Wpost\mathcal{W}_{post}Wpost relative to the target post’s embedding v\mathbf{v}^*v. Interaction Influence Δinteract=S(Wpost,v)S(Wpre,v)\Delta_{interact} = S(\mathcal{W}_{post}, \mathbf{v}^*) - S(\mathcal{W}_{pre}, \mathbf{v}^*)Δinteract=S(Wpost,v)S(Wpre,v) captures semantic convergence toward the target. A Random Baseline, constructed by sampling non-interacted posts from the same day, controls for global topic drift.

Experiment

  • Moltbook achieves global semantic stability with persistent local diversity, maintaining a stable center while posts remain widely dispersed and heterogeneous.
  • Agents show strong individual inertia, with minimal adaptation to feedback or interactions; their semantic trajectories stem from intrinsic properties rather than social co-evolution.
  • No stable influence anchors emerge: structurally, influence remains transient with no persistent supernodes; cognitively, agents lack shared recognition or consensus on influential figures.
  • Lexical innovation stabilizes after an initial burst, sustaining steady turnover rather than convergence, indicating a dynamic equilibrium in content evolution.
  • Local semantic density stabilizes early without progressive tightening, preserving consistent internal diversity despite societal growth.
  • Participation does not drive socialization: agents do not converge toward societal norms, adapt to feedback, or align with interaction partners.
  • Structural influence remains decentralized, with influence diffusing across the network and no enduring hierarchy forming over time.
  • Cognitive consensus is absent: probing reveals fragmented, inconsistent, or invalid references to influential users or posts, with no shared social memory.

The authors use daily interaction graphs to track structural influence in Moltbook, observing rapid growth in nodes, edges, and total weight over time. Despite increasing interaction volume, influence remains decentralized, with no persistent supernodes or consolidation of authority. Results show that high activity does not translate into stable leadership, reinforcing the society’s fluid and fragmented nature.


AI로 AI 구축

아이디어에서 출시까지 — 무료 AI 코코딩, 즉시 사용 가능한 환경, 최적의 GPU 가격으로 AI 개발을 가속화하세요.

AI 협업 코딩
바로 사용 가능한 GPU
최적의 가격

HyperAI Newsletters

최신 정보 구독하기
한국 시간 매주 월요일 오전 9시 에 이번 주의 최신 업데이트를 메일로 발송합니다
이메일 서비스 제공: MailChimp