Scientific Computing Seminar

Date and Place: Thursdays and hybrid (live in 32-349/online via Zoom). For detailed dates see below!

Content

In the Scientific Computing Seminar we host talks of guests and members of the SciComp team as well as students of mathematics, computer science and engineering. Everybody interested in the topics is welcome.

List of Talks

Event Information:

  • Thu
    12
    Feb
    2026

    SC Seminar: Dominik Willrich

    10:15Hybrid (Room 32-349 and via Zoom)

    Dominik Willrich, RPTU University Kaiserslautern-Landau

    Title: Improving Stochastic Policy Gradients in Continuous Control with Deep Reinforcement Learning using the Beta Distribution

    Abstract:

    Deep reinforcement learning has achieved strong performance in continuous control tasks, yet most stochastic policy gradient methods rely on Gaussian action distributions, even when the true action space is bounded. This mismatch can introduce bias and negatively affect learning efficiency. In this seminar, we present the work of the authors on replacing the Gaussian policy with a Beta distribution, which naturally respects action bounds. The paper provides a theoretical analysis of the bias and variance of policy gradients under both distributions and shows that the Beta policy is bias-free in bounded action spaces. Empirically, the authors evaluate this approach using both on-policy and off-policy methods, across a range of continuous control benchmarks in OpenAI Gym and MuJoCo. The results demonstrate faster convergence and improved performance when using Beta-distributed policies, highlighting the importance of aligning policy parameterization with problem constraints in continuous control.

    How to join online

    You can join online via Zoom, using the following link:
    https://uni-kl-de.zoom-x.de/j/69269239534?pwd=Z9UOzMpkhMjrxVhll3d49sNHFe9Fd1.1