Command Palette
Search for a command to run...
실패에서 숙련으로: 도구 사용 에이전트를 위한 어려운 샘플 생성
실패에서 숙련으로: 도구 사용 에이전트를 위한 어려운 샘플 생성
초록
도구 사용 능력을 갖춘 대규모 언어 모델(Large Language Model, LLM) 에이전트의 발전을 위해서는 다양한 형태와 복잡한 구조를 가진 훈련 데이터 코퍼스가 필요하다. 기존의 데이터 생성 방법들은 주로 무작위 샘플링과 얕은 생성 방식을 따르기 때문에, 복잡하고 암묵적인 논리적 종속 관계를 포착하지 못하는 단순하고 동질적인 추적 경로를 생성하는 경향이 있다. 이 격차를 메우기 위해, 검증 가능한 추론을 수반하는 어려운 도구 사용 훈련 샘플을 자동으로 생성하는 ‘HardGen’이라는 에이전트 기반 파이프라인을 제안한다. 먼저, HardGen은 에이전트의 실패 사례를 기반으로 동적 API 그래프를 구축하고, 이를 통해 어려운 추적 경로를 샘플링하여 합성한다. 그 후, 이러한 추적 경로는 모듈화되고 추상화된 고급 도구를 생성할 때 조건부 사전 지식(conditional prior)으로 활용되며, 이를 통해 어려운 질의를 구성한다. 마지막으로, 고급 도구와 어려운 질의를 활용하여 검증 가능한 복잡한 사고 과정(Chain-of-Thought, CoT)을 생성하고, 폐쇄 루프 평가 피드백을 통해 과정의 지속적인 개선을 이끈다. 광범위한 평가 결과, 본 연구에서 구축한 데이터셋을 사용해 훈련한 40억 파라미터 모델이 여러 선도적인 오픈소스 및 클로즈드소스 모델(예: GPT-5.2, Gemini-3-Pro, Claude-Opus-4.5)과 비교해 우수한 성능을 보였다. 본 연구의 코드, 모델 및 데이터셋은 향후 연구를 지원하기 위해 공개될 예정이다.
One-sentence Summary
The author, affiliated with the University of Michigan, proposes a novel quasi-Newton method based on the parameterization framework to directly compute subharmonic periodic orbits (SPOs) and their Floquet vectors and multipliers in perturbed 4D symplectic maps, enabling efficient continuation in ε and Taylor parameterization of associated weak manifolds—offering significant improvements over multi-shooting methods and demonstrating utility in modeling resonant orbits within 2.5 DOF Hamiltonian systems, such as those arising in celestial mechanics.
Key Contributions
-
This paper addresses the challenge of computing long, unstable subharmonic periodic orbits (SPOs) in perturbed 2.5 degree-of-freedom Hamiltonian systems—such as stroboscopic maps of periodically perturbed 2 DOF flows—where these orbits arise from resonant tori in the unperturbed system and are critical for understanding the breakdown of invariant tori via resonance overlap.
-
The authors introduce a novel quasi-Newton method that simultaneously computes SPOs, their Floquet multipliers, and Floquet vectors by adapting the parameterization method framework to periodic orbits, enabling efficient continuation in the perturbation parameter ε while avoiding the costly O(q3) linear solves typical of traditional multiple-shooting approaches.
-
The method is demonstrated on real celestial mechanics problems, including resonant orbits in the Jupiter-Europa-Ganymede system, and enables the subsequent computation of Taylor parameterizations for the weak stable and unstable manifolds of these SPOs, providing a powerful tool for studying global dynamics in high-dimensional symplectic systems.
Introduction
The authors address the challenge of computing long, unstable periodic orbits—known as subharmonic periodic orbits (SPOs)—in 4D symplectic maps that arise from periodically perturbed 2 DOF Hamiltonian systems, such as the planar circular restricted 4-body problem in celestial mechanics. These orbits are critical for understanding the breakdown of invariant tori due to secondary resonances, especially when perturbations cause separatrices from long-period orbits to intersect, leading to chaotic dynamics. Prior methods rely on multiple-shooting algorithms that require solving large 4q×4q linear systems to find both the orbit points and their stability properties, resulting in O(q3) computational complexity—prohibitive for large q. The authors’ main contribution is a novel fast multiple-shooting method that simultaneously computes SPOs and their Floquet vectors and multipliers by adapting the parameterization method to periodic orbits. This approach avoids solving large linear systems by embedding the Floquet equations directly into the Newton iteration, reducing computational cost and enabling direct access to stability information. The method is applicable to a broad class of 4D symplectic maps and has been successfully applied to compute SPOs and their separatrices in real-world models of the Jupiter-Europa-Ganymede and Uranian systems, demonstrating its utility in space mission design and dynamical systems analysis.
Method
The authors develop a quasi-Newton method for the simultaneous computation of subharmonic periodic orbits (SPOs) and their associated Floquet vectors and multipliers within a family of 4D symplectic maps Fε. This framework is designed to study the persistence of SPOs from an unperturbed map F0 into a perturbed system, enabling numerical continuation with respect to the perturbation parameter ε. The method is an adaptation of the parameterization method for invariant tori, but is uniquely tailored to compute periodic orbits directly. The core of the algorithm involves solving a system of equations for the SPO points Xε(k), k=0,…,q−1, and the matrices Pε(k) and Λε(k) that represent the Floquet vectors and multipliers. The invariance equation for the SPO is Fε(Xε(k))=Xε(k+1modq), while the Floquet equation is DFε(Xε(k))Pε(k)=Pε(k+1modq)Λε(k). The matrix Λε(k) is constrained to a near-diagonal form, with the stable and unstable multipliers λs(k) and λu(k) on the diagonal, and the internal dynamics multipliers λ1 and λ2 in the top-left block, which may include an off-diagonal term T. This structure allows the system of equations to be decoupled into a series of 1D linear problems, which are solved efficiently using fixed-point iteration or explicit formulas, depending on the spectral properties of the multipliers. The method proceeds by first initializing the solution for ε=0 using the known unperturbed SPO and its monodromy matrix, and then numerically continuing this solution to ε>0 through a series of quasi-Newton steps. Each step involves correcting the SPO points X and the Floquet matrices P and Λ to reduce the residual errors E(k) and Ered(k), which measure the violation of the invariance and Floquet equations, respectively. The correction for X is derived from a linearized equation for the correction vector ξ(k), while the correction for P and Λ is derived from a similar equation for Q(k) and ΔΛ(k). After each correction, a Schur decomposition is applied to the updated Λc(k) to ensure it maintains the required near-diagonal form, which is crucial for numerical stability. The resulting SPO and Floquet vectors are then used to compute Taylor parameterizations of the SPO's weak stable and unstable manifolds, if they exist.

Experiment
- Computed secondary resonant periodic orbits (SPOs) and their separatrices in the CCR4BP models of Jupiter-Europa-Ganymede and Uranus-Titania-Oberon systems using numerical continuation and the parameterization method.
- Successfully continued SPOs with frequency ratios ω/(2π) = 11/34, 34/105, 23/71, 35/108, 12/37, 25/77, 37/114, and 45/139 in the Jupiter-Europa-Ganymede CCR4BP (ε = 2.5265 × 10⁻⁵), where invariant tori failed to persist due to resonance overlap.
- Detected intersections of separatrices from consecutive SPOs in both systems, confirming the dynamical mechanism of torus destruction in regions with ω < 2.04047 (Jovian) and similar low-frequency ranges (Uranian).
- Achieved accurate computation of separatrices using the parameterization method with Floquet directions, enabling visualization of nonlinear structures in action-angle-like coordinates.
- All computations performed in Julia with OrdinaryDiffEq.jl, TaylorSeries.jl, and TaylorIntegration.jl; numerical integration used DP8 adaptive step size method; SPOs computed with 10⁻⁷ tolerance and step sizes Δε = 5 × 10⁻⁷ to 10⁻⁶.