What did the CGSC use the Vantage AI platform for in this experiment?

Vantage was configured as a virtual corps-level Higher Command (HICON), providing 61 student staff officers with battle damage assessments, enemy intelligence updates, targeting recommendations, and synchronization reports across planning, FUOPS, and CUOPS cells—compensating for limited human HICON capacity during the two-week LSCO capstone exercise.

What were the main risks identified with using LLMs as virtual headquarters?

Faculty warned that LLMs can hallucinate plausible but incorrect outputs for BDA and threat assessments, tend toward escalatory postures compared to human experts, and may foster automation bias—causing staff officers to accept machine-generated products uncritically. Rigorous human oversight and validation protocols were deemed essential.

How did output quality change as the AI iterated during the exercise?

By day three, after prior summaries were fed back as reference material, the AI began predicting enemy order of march and recommending critical targets five days out. Intelligence and execution summaries that previously averaged fewer than three pages grew to an average of 20 pages, incorporating updated commander's intent and complex civil affairs guidance.

AI as Virtual Higher Command: The U.S. Army Command…

AI as Virtual Higher Command: The U.S. Army Command and General Staff College's LLM Training Experiment

The evolving nature of warfare continues to push the Army to seek decisive advantages through technology. During fiscal year 2025 (FY25), faculty at the Command and General Staff College (CGSC) explored how artificial intelligence (AI) could meet training objectives at every echelon. Integrating AI into the execution phase of the Military Decision-Making Process (MDMP) represents the next frontier of that pursuit. A real-world application within the CGSC Advanced Operations Course's capstone Large-Scale Combat Operations (LSCO) exercise provides a concrete blueprint for how large language models (LLMs) can revolutionize simulation exercises as virtual higher headquarters—enabling units at any echelon to build staff proficiency on their own timeline without imposing on adjacent or higher units.

This article synthesizes findings from a two-week, 61-student practical exercise in which students served as 25th Infantry Division staff during a simulated exercise. Our exploration demonstrates that, with proper resourcing, AI can serve as a powerful cognitive collaboration partner, bridging the gap between limited manpower and the complex demands of division-level simulation.

Building the Framework: Resourcing AI for the MDMP

The initial phase involved constructing a digital battlespace that allowed AI agents to reason from the same doctrinal and operational documents used by human staff. Using the Vantage platform, we built a pipeline utilizing two separate knowledge ontologies. A "tactical ontology" aggregated all division-level information—from mission receipt through post-order combined arms rehearsals and continuous estimates across warfighting functions. An "operational higher headquarters ontology" incorporated prior-year intelligence and operational summaries to provide contextual background.

To ensure doctrinal fidelity, we validated the model's ability to cite field manuals covering operations, the MDMP, and Chinese tactics. By converting field manuals into structured ontology objects, the AI was able to parse complex operational inputs and simulate the cognitive load of a full planning staff.

Solving Resource Constraints: AI as the Higher Command (HICON)

A significant challenge in the exercise was the limited capacity of the Higher Command (HICON) cell. Responsible for managing corps-level deliverables for a division wet-gap crossing and forward passage of lines operation, and for continuously injecting Master Scenario Events List (MSELs) into the operational framework, the cell consisted of a single Medical Service (MS) officer with limited experience in targeting or intelligence collection integration. This officer was not a subject matter expert (SME) in intelligence collection (IC), targeting, or the combined arms integration of attack aviation, fires, and M2—creating friction points as division staff required continuous feedback on battle damage assessment (BDA) and collection planning to maintain training tempo.

Employing Vantage as a virtual corps headquarters transformed this constraint into a capability. By building an AI platform with two separate ontologies, the system could provide division staff across the planning, future operations (FUOPS), and current operations (CUOPS) cells with the information needed to plan across time horizons. One ontology referenced Army doctrine and the original corps base operations order (OPORD); the other integrated division briefing slides containing map positions, collection plans, the High-Payoff Target List (HPTL), and the High-Value Target List (HVTL). The AI generated enemy updates and provided real-time BDA on staff "kill contracts," enabling faculty to control exercise tempo while the AI absorbed the substantial cognitive burden of processing data for the division.

Dynamic Adjudication: AI Deliverables Across Echelons

The virtual headquarters went beyond basic feedback, helping provide updates, constraints, and new challenges to the rear, main, and tactical command posts—highly beneficial for training an entire staff. This was reflected in several high-value deliverables:

Operational Synchronization: The model provided 24-hour updates for three simulated divisions, allowing staff to understand how execution synchronizes across time and space. In one instance, a simulated adjacent division falling 12 hours behind prompted the AI to generate two new Priority Intelligence Requirements (PIRs) focused on potential seams between unit boundaries. It also enabled division staff to coordinate cross-boundary fire support missions and confirm effects through BDA, illustrating how properly resourced assets can support an entire corps as a system.
Multi-Source Intelligence Updates: The system generated geospatial, open-source, and signals intelligence updates based on course parameters, refined through existing ontologies. In one case, the AI generated a large-scale population displacement that sparked civil unrest in the division rear area, prompting supporting units to integrate their skills into the capstone exercise and allowing division staff to understand the full area of operations (AO) and provide recommendations to command.
Threat Assessment and Targeting: The AI assessed when enemy field artillery could mass fires and identified a new rocket system appearing near crossing sites, forcing the division intelligence element to focus on specific areas of concern and improve targeting effectiveness. The AI was also tasked with generating daily BDA for adjacent divisions, demonstrating how a corps sets conditions across the AO.
Significant Activity Reports: The model generated detailed reports on adjacent unit actions and offered recommendations for engineer support or obstacle reduction, driving refinement in both current and future operations.

Iterative Advantages: Predictive Analytics and Output Depth

The most significant finding of the exercise was the compounding effect of AI iterative learning. By day three—after feeding generated summaries back into the system as reference material—the AI began providing enemy order of march, tempo assessments, and recommended critical targets projected five days out. This simultaneously drove future operations planning and gave staff situational awareness of enemy activity across deep, close, and rear terrain.

Output depth also underwent a fundamental shift. Intelligence and execution summaries that rarely exceeded three pages in the past averaged 20 pages when produced by the model, incorporating updated commander's intent, close air support reallocation guidance, and a broad range of complex civil affairs issues.

Implications for the Force

This AI-assisted staff training experiment offers several important takeaways for the Army:

Non-SME Force Multiplication: AI enables a single non-SME officer to produce high-quality, doctrinally sound guidance documents across multiple warfighting functions.
Cross-Echelon Repetition: The concept can be scaled down to the battalion level. Staff can conduct tabletop exercises at home station, using the model to provide daily guidance—increasing repetitions and deepening analysis without relying on an actual higher headquarters.
Training Autonomy: Units can schedule training on their own calendars, using AI to drive information flow, maintain a high training tempo, and refine plans in response to AI-simulated changes in the situation.

Risks of Over-Reliance on LLMs as Virtual Higher Headquarters

While LLMs functioning as virtual higher headquarters offer efficiency gains in resource-constrained training environments, they pose significant risks to doctrinal fidelity and professional military judgment. LLMs remain prone to hallucination—generating plausible but incorrect outputs for BDA, threat predictions, and PIRs even when supported by structured ontologies and doctrinal documents. Research into LLM use in wargaming consistently shows a tendency toward escalatory and aggressive postures that diverge from human expert judgment.

The "non-SME force multiplication" effect may foster automation bias, leading staff officers under time pressure to uncritically accept machine-generated deliverables—eroding the iterative human deliberation essential to developing adaptive operational art. Over-reliance on AI-assisted exercises therefore risks producing voluminous but superficial documentation that simulates analytical depth rather than genuinely cultivating the disciplined initiative and situational judgment demanded by multi-domain operations. Rigorous human oversight, validation protocols, and doctrinal safeguards are indispensable elements in ensuring AI augments rather than replaces the human element in staff training.

Conclusion

Integrating artificial intelligence into decision-making processes is today's reality and holds significant potential to increase the speed and depth of execution-phase simulation. By properly resourcing units with AI agents, the Army can ensure greater repetition and deeper analysis—leading to more rigorously tested plans and a staff more capable of handling the complexities of multi-domain operations.

The views expressed in this article are those of the authors and do not reflect the official policy or position of the Department of the Army, the Department of Defense, or the U.S. Government.

This article was originally published in Small Wars Journal, Arizona State University.

原文來源： 查看原文

AI as Virtual Higher Command: The U.S. Army Command and General Staff College's LLM Training Experiment

Highlights

AI as Virtual Higher Command: The U.S. Army Command and General Staff College's LLM Training Experiment

Building the Framework: Resourcing AI for the MDMP

Solving Resource Constraints: AI as the Higher Command (HICON)

Dynamic Adjudication: AI Deliverables Across Echelons

Iterative Advantages: Predictive Analytics and Output Depth

Implications for the Force

Risks of Over-Reliance on LLMs as Virtual Higher Headquarters

Conclusion

FAQ

Subscribe to our Low-Altitude Industry Newsletter

Ukraine's 40th Marine Brigade Deploys 'Barracuda' USV with 20,000 km Control Range

Wreckage of U.S.-Made FLM-136 LUCAS Attack Drone Reportedly Spotted Near Iran's Bandar Abbas Port

US Air Force Completes First Live-Fire Missile Test for Collaborative Combat Aircraft