Free Reads
Sign in to view your remaining parses.
Tag Filter
Multistage Reinforcement Learning
Humanoid Whole-Body Badminton via Multi-Stage Reinforcement Learning
Published:11/14/2025
Humanoid Whole-Body ControlReinforcement Learning Training PipelineAction Generation in Dynamic EnvironmentsBadminton Motion ControlMultistage Reinforcement Learning
This paper presents a reinforcement learning training pipeline to develop a unified wholebody controller for humanoid badminton, enabling coordinated footwork and striking without reliance on motion priors or expert demonstrations. The training is validated in both simulated and
03
Qwen2.5 Technical Report
Published:12/20/2024
Qwen 2.5 Large Language ModelMultistage Reinforcement LearningSupervised Fine-Tuning MethodsHuman Preference EnhancementLarge-Scale Pre-Training Datasets
The Qwen2.5 technical report introduces a new large language model series, expanding the pretraining dataset to 18 trillion tokens. It employs over 1 million finetuned samples and multistage reinforcement learning to enhance human preferences, demonstrating superior performanc
05