TalkRL: The Reinforcement Learning Podcast  By  cover art

TalkRL: The Reinforcement Learning Podcast

By: Robin Ranjit Singh Chauhan
  • Summary

  • TalkRL podcast is All Reinforcement Learning, All the Time. In-depth interviews with brilliant people at the forefront of RL research and practice. Guests from places like MILA, OpenAI, MIT, DeepMind, Berkeley, Amii, Oxford, Google Research, Brown, Waymo, Caltech, and Vector Institute. Hosted by Robin Ranjit Singh Chauhan.
    © 2024 Robin Ranjit Singh Chauhan
    Show more Show less
Episodes
  • Vincent Moens on TorchRL
    Apr 8 2024

    Dr. Vincent Moens is an Applied Machine Learning Research Scientist at Meta, and an author of TorchRL and TensorDict in pytorch.

    Featured References

    TorchRL: A data-driven decision-making library for PyTorch
    Albert Bou, Matteo Bettini, Sebastian Dittert, Vikash Kumar, Shagun Sodhani, Xiaomeng Yang, Gianni De Fabritiis, Vincent Moens


    Additional References

    • TorchRL on github
    • TensorDict Documentation


    Show more Show less
    40 mins
  • Arash Ahmadian on Rethinking RLHF
    Mar 25 2024

    Arash Ahmadian is a Researcher at Cohere and Cohere For AI focussed on Preference Training of large language models. He’s also a researcher at the Vector Institute of AI.

    Featured Reference

    Back to Basics: Revisiting REINFORCE Style Optimization for Learning from Human Feedback in LLMs

    Arash Ahmadian, Chris Cremer, Matthias Gallé, Marzieh Fadaee, Julia Kreutzer, Olivier Pietquin, Ahmet Üstün, Sara Hooker


    Additional References

    • Self-Rewarding Language Models, Yuan et al 2024
    • Reinforcement Learning: An Introduction, Sutton and Barto 1992
    • Learning from Delayed Rewards, Chris Watkins 1989
    • Simple Statistical Gradient-Following Algorithms for Connectionist Reinforcement Learning, Williams 1992
    Show more Show less
    34 mins
  • Glen Berseth on RL Conference
    Mar 11 2024

    Glen Berseth is an assistant professor at the Université de Montréal, a core academic member of the Mila - Quebec AI Institute, a Canada CIFAR AI chair, member l'Institute Courtios, and co-director of the Robotics and Embodied AI Lab (REAL).

    Featured Links

    Reinforcement Learning Conference

    Closing the Gap between TD Learning and Supervised Learning--A Generalisation Point of View
    Raj Ghugare, Matthieu Geist, Glen Berseth, Benjamin Eysenbach

    Show more Show less
    22 mins

What listeners say about TalkRL: The Reinforcement Learning Podcast

Average customer ratings

Reviews - Please select the tabs below to change the source of reviews.