Episodes

  • LoRA
    Sep 2 2023

    We talk about Low-Rank Adaptation (LoRA) for fine-tuning Transformers. We are also on YouTube now! Check out the video here: https://youtu.be/lLzHr0VFi3Y

    1 h 3 m
  • 15: InstructGPT
    Mar 28 2023

    In this episode we discuss the paper "Training language models to follow instructions with human feedback" by Ouyang et al. (2022). We cover the RLHF paradigm and how important RL is to tuning GPT.

    57 m
  • 14: Whisper
    Mar 17 2023
    This week we talk about Whisper, a weakly supervised speech recognition model.

    49 m
  • 13: AlphaTensor
    Mar 11 2023

    We talk about AlphaTensor, and how researchers were able to find a new algorithm for matrix multiplication.

    49 m
  • 12: SIRENs
    Oct 25 2022

    In this episode we talk about "Implicit Neural Representations with Periodic Activation Functions" and the strength of periodic non-linearities.

    54 m
  • 11: CVPR Workshop on Autonomous Driving Keynote by Ashok Elluswamy, a Tesla engineer
    Sep 30 2022

    In this episode we discuss this video: https://youtu.be/jPCV4GKX9Dw

    We cover how Tesla approaches collision detection with novel methods.

    49 m
  • 10: Outracing champion Gran Turismo drivers with deep reinforcement learning
    Aug 23 2022

    We discuss how Sony AI created a novel AI agent that can beat professional racers in Gran Turismo. Some topics include:
    - The crafting of rewards to make the agent behave nicely
    - What is QR-SAC?
    - How to deal with "rare" experiences in the replay buffer

    Link to paper: https://www.nature.com/articles/s41586-021-04357-7

    55 m
  • 9: Heads-Up Limit Hold'em Poker Is Solved
    Jul 29 2022

    Today we talk about recent AI advances in poker, specifically the use of counterfactual regret minimization to solve two-player (heads-up) Limit Texas Hold'em.

    48 m