Sample Actions
Scale by Advantage
Redirect the Flow
Flow Policy Optimization
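
The three steps named above (sample actions, scale by advantage, redirect the flow) can be read as a policy-gradient update applied to a flow-based policy. The sketch below is one plausible reading, not the method as stated here. It assumes a straight-line noise-to-action path for the flow matching loss, and a PPO-style clipped surrogate whose ratio is built from the change in that loss. The names `cfm_loss`, `clipped_flow_objective`, `velocity_fn`, and the `clip` default of 0.2 are all hypothetical and chosen only for illustration.

```python
import numpy as np


def cfm_loss(velocity_fn, actions, noise, t):
    """Per-sample conditional flow matching loss (assumed linear path).

    actions, noise: arrays of shape (batch, dim); t: array of shape (batch,).
    velocity_fn(x_t, t) must return an array of shape (batch, dim).
    """
    # Assumption: x_t moves in a straight line from noise (t=0) to the action (t=1).
    x_t = (1.0 - t)[:, None] * noise + t[:, None] * actions
    target = actions - noise  # constant velocity of that straight-line path
    pred = velocity_fn(x_t, t)
    return np.sum((pred - target) ** 2, axis=1)


def clipped_flow_objective(loss_new, loss_old, advantages, clip=0.2):
    """PPO-style clipped surrogate with a ratio built from flow matching losses.

    Assumption: the ratio exp(loss_old - loss_new) stands in for a likelihood
    ratio. It exceeds 1 when the updated flow fits a sampled action better.
    """
    ratio = np.exp(loss_old - loss_new)
    clipped = np.clip(ratio, 1.0 - clip, 1.0 + clip)
    # Scale by advantage: positive advantages reward a better fit to the action,
    # negative advantages reward a worse fit, which redirects the flow.
    return np.mean(np.minimum(ratio * advantages, clipped * advantages))
```

In this reading, "sample actions" supplies `actions` (with the `noise` and `t` draws used to score them), "scale by advantage" is the multiplication by `advantages`, and "redirect the flow" is the gradient step that maximizes the returned objective with respect to the parameters behind `velocity_fn`. That gradient step is not shown, since it depends on the autodiff framework in use.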