RQ1
Characterization
How are language models being used for automated planning?
This chapter constructs a structured characterization of the role of language models in automated planning. The contribution is twofold: a manual taxonomy built from the literature through November 2023, and a semi-automated framework for extending that taxonomy and analyzing how the field changes over time.
Language models are not used in planning in a single uniform way; they occupy multiple functional roles, and those roles are shifting as the community moves from end-to-end plan generation toward more structured and tool-supported uses.
Category drift
Still dominant in D2, but its relative share decreases.
Declines as translation is treated as necessary but insufficient for planning.
Decreases as end-to-end interactive use remains difficult to scale reliably.
Grows and becomes the second-highest category in D2.
Maintains a stable presence across the two datasets.
Increases and reaches the third-highest share in D2.
Declines as work shifts toward concrete neuro-symbolic architectures.
Decreases as coordination reliability remains a major challenge.
Emerges in D2 with 4 papers focused on subgoal structuring.
Emerges in D2 with 1 paper centered on plan adaptation after failure.