-
Notifications
You must be signed in to change notification settings - Fork 1.1k
Pull requests: microsoft/agent-lightning
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
Support of trajectory aggregation for mrope multimodal model, and add multimodal prefix checks for trajectory merge
#469
opened Jan 29, 2026 by
jackhu-bme
Loading…
fix: improve error when extracted completion is not a list
ci-apo
#463
opened Jan 22, 2026 by
AdithyaKotian
Loading…
Add trajectory-level deduplication for GRPO advantage normalization
#462
opened Jan 21, 2026 by
zzjweb
Loading…
Validate input length in generate_id utility
ci-gpu
#460
opened Jan 21, 2026 by
DunuraWitharama
Loading…
Provide an OpenAI Client training example with reinforcement learning
#435
opened Dec 26, 2025 by
hzy46
Loading…
Handle server shutdown gracefully to prevent traceback spam
#408
opened Dec 12, 2025 by
Vasuk12
Loading…
Previous Next
ProTip!
What’s not been updated in a month: updated:<2026-01-01.