Step 3: Plan Captions for the Outputs