Episodios

  • The Uncertain Art of Accelerating ML Models with Sylvain Gugger
    Oct 14 2024

    Sylvain Gugger is a former math teacher who fell into machine learning via a MOOC and became an expert in the low-level performance details of neural networks. He’s now on the ML infrastructure team at Jane Street, where he helps traders speed up their models. In this episode, Sylvain and Ron go deep on learning rate schedules; the subtle performance bugs PyTorch lets you write; how to keep a hungry GPU well-fed; and lots more, including the foremost importance of reproducibility in training runs. They also discuss some of the unique challenges of doing ML in the world of trading, like the unusual size and shape of market data and the need to do inference at shockingly low latencies.

    You can find the transcript for this episode on our website.

    Some links to topics that came up in the discussion:

    • “Practical Deep Learning for Coders,” a FastAI MOOC by Jeremy Howard, and the book, of which Sylvain is a co-author.
    • The Stanford DAWNBench competition that Sylvain participated in.
    • HuggingFace, and the Accelerate library that Sylvain wrote there.
    • Some of the languages/systems for expression ML models that were discussed: PyTorch, TensorFlow, Jax, Mojo, and Triton
    • CUDA graphs and streams
    • Hogwild concurrency
    Más Menos
    1 h y 6 m
  • Solving Puzzles in Production with Liora Friedberg
    Oct 7 2024

    Liora Friedberg is a Production Engineer at Jane Street with a background in economics and computer science. In this episode, Liora and Ron discuss how production engineering blends high-stakes puzzle solving with thoughtful software engineering, as the people doing support build tools to make that support less necessary. They also discuss how Jane Street uses both tabletop simulation and hands-on exercises to train Production Engineers; what skills effective Production Engineers have in common; and how to create a culture where people aren’t blamed for making costly mistakes.

    You can find the transcript for this episode on our website.

    Some links to topics that came up in the discussion:

    • More about production engineering at Jane Street, including how to apply.
    • Notes on Site reliability engineering in the wider world.
    • Alarm fatigue and desensitization.
    • Jane Street’s 1950’s era serialization-format of choice,
    • Some games that Streeters have used for training people to respond to incidents.
    Más Menos
    54 m
  • From the Lab to the Trading Floor with Erin Murphy
    Jul 12 2024

    Erin Murphy is Jane Street’s first UX designer, and before that, she worked at NASA’s Jet Propulsion Laboratory building user interfaces for space missions. She’s also an illustrator with her own quarterly journal. In this episode, Erin and Ron discuss the challenge of doing user-centered design in an organization where experts are used to building tools for themselves. How do you bring a command-line interface to the web without making it worse for power users? They also discuss how beauty in design is more about utility than aesthetics; what Jane Street looks for in UX candidates; and how to help engineers discover what their users really want.

    You can find the transcript for this episode on our website.

    Some links to topics that came up in the discussion:

    • Erin’s website that shows off her work.
    • Her quarterly journal of sketches and observations.
    • An article about Erin’s design work with NASA JPL.
    • A paper that among other things talks about the user study work that Erin did at JPL.
    • Jane Street’s current UX job opening.
    Más Menos
    1 h y 4 m
  • Performance Engineering on Hard Mode with Andrew Hunter
    Nov 28 2023

    Andrew Hunter makes code really, really fast. Before joining Jane Street, he worked for seven years at Google on multithreaded architecture, and was a tech lead for tcmalloc, Google’s world-class scalable malloc implementation. In this episode, Andrew and Ron discuss how, paradoxically, it can be easier to optimize systems at hyperscale because of the impact that even miniscule changes can have. Finding performance wins in trading systems—which operate at a smaller scale, but which have bursty, low-latency workloads—is often trickier. Andrew explains how he approaches the problem, including his favorite profiling techniques and tools for visualizing traces; the unique challenges of optimizing OCaml versus C++; and when you should and shouldn’t care about nanoseconds. They also touch on the joys of musical theater, and how to pass an interview when you’re sleep-deprived.

    You can find the transcript for this episode on our website.

    Some links to topics that came up in the discussion:

    • “Profiling a warehouse-scale computer”
    • Magic-trace
    • OODA loop
    Más Menos
    56 m
  • A Poet's Guide to Product Management with Peter Bogart-Johnson
    Aug 15 2023

    Peter Bogart-Johnson was one of Jane Street’s first program managers, and helped bring the art of PMing—where that “P” variously stands for “project,” “product,” or some blend of the two—to the company at large. He’s also a poet and the editor of a literary magazine. In this episode, Peter and Ron discuss the challenge of gaining trust as an outsider: how do you teach teams a new way of doing things while preserving what’s already working? The key, Peter says, is you listen; a good PM is an anthropologist. They also discuss how paying down technical debt isn’t something you do instead of serving customers; what Jane Street looks for in PM candidates; and how to help teams coordinate in times of great change.

    You can find the transcript for this episode on our website.

    Some links to topics that came up in the discussion:

    • LIT Magazine (more recently here)
    • How to be a PM that engineers don’t hate and How to be an engineer that PMs don’t hate
    Más Menos
    1 h y 2 m
  • The Future of Programming with Richard Eisenberg
    May 18 2023

    Richard Eisenberg is one of the core maintainers of Haskell. He recently joined Jane Street’s Tools and Compilers team, where he hacks on the OCaml compiler. He and Ron discuss the powerful language feature that got him into PL design in the first place—dependent types—and its role in a world where AIs can (somewhat) competently write your code for you. They also discuss the differences between Haskell and OCaml; the perils of trying to make a language that works for everybody; and how best a company like Jane Street can collaborate with the open source community.

    You can find the transcript for this episode on our website.

    Some links to topics that came up in the discussion:

    • Dependent types
    • GHC
    • Unboxed types in OCaml
    • Language extensions in Haskell
    Más Menos
    1 h
  • Swapping the Engine Out of a Moving Race Car with Ella Ehrlich
    Sep 12 2022

    Ella Ehrlich has been a developer at Jane Street for close to a decade. During much of that time, she’s worked on Gord, one of Jane Street’s oldest and most critical systems, which is responsible for normalizing and distributing the firm’s trading data. Ella and Ron talk about how to grow and modernize a legacy system without compromising uptime, why game developers are the “musicians of software,” and some of the work Jane Street has done to try to hire a more diverse set of software engineers.

    You can find the transcript for this episode on our website.

    Some links to topics that came up in the discussion:

    • EG, The League of Legends team that Ella is a huge fan of.
    • Apache Kafka, the message bus that Gord migrated to.
    • Some of the various sources of symbology you have to deal with when normalizing trading data. (Really, there are too many sources to list here!)
    • A list of Jane Street’s recruiting Programs and Events, including INSIGHT, which focuses on women, and IN FOCUS, which focuses on historically underrepresented ethnic or racial minorities.
    Más Menos
    1 h
  • State Machine Replication, and Why You Should Care with Doug Patti
    Apr 20 2022

    Doug Patti is a developer in Jane Street’s Client-Facing Tech team, where he works on a system called Concord that undergirds Jane Street’s client offerings. In this episode, Doug and Ron discuss how Concord, which has state-machine replication as its core abstraction, helps Jane Street achieve the reliability, scalability, and speed that the client business demands. They’ll also discuss Doug’s involvement in building a successor system called Aria, which is designed to deliver those same benefits to a much wider audience.

    You can find the transcript for this episode on our website.

    Some links to topics that came up in the discussion:

    • Jane Street’s client-facing trading platforms
    • A Signals and Threads episode on market data and multicast which discusses some of the history of state-machine replication in the markets.
    • The FIX protocol
    • UDP multicast
    • Reliable multicast
    • Kafka
    Más Menos
    1 h y 12 m