poolside is hiring for a

Member of Engineering (GPU)

Position
Full-time, indefinite contract
Location
Remote EMEA/East Coast or Paris
Apply for this role

About poolside

poolside is pursuing AGI. Only very few companies in the world are able to be in this race - we’re early in our journey which makes it a great time to join the team but we also have scale-up+ needs when it comes to product and engineering. We can tell you why over the phone...

About our team

We are a remote-first team that sits across Europe and North America. We come together once a month in-person for 3 days, always Monday-Wednesday. We also do longer off-sites twice a year.

Our team is a combination of more research and more engineering-oriented profiles, however, everyone deeply cares about the quality of the systems we build and has a strong underlying knowledge of software development. We believe that good engineering leads to faster development iterations, which allows us to compound our efforts.

About the role

Performance-oriented programming in CUDA, C++, Cython, Triton. You are going to accelerate high-level primitives used to train Large Language Models (LLMs) and optimize distributed communication over NVLink and InfiniBand. You will be working on poolside’s own implementation of distributed training for LLMs.

Your mission

Ensure cutting-edge performance of LLM pre-training and fine tuning on huge state-of-the-art GPU clusters.

Responsibilities

  • Profile CPU and CUDA code at several abstraction levels.
  • Debug and profile distributed applications.
  • Troubleshoot undocumented CUDA internals.
  • Hack the NCCL library used for GPU communication.
  • Tune vanilla CUDA, Triton, CUTLASS kernels for the latest NVIDIA GPUs.
  • Hack PyTorch internals.
  • Deep dive into InfiniBand RDMA, including SHARP.

Skills & Experience

  • Engineering background
  • Expert understanding of GPU hardware/architecture
  • Strong C/C++ programming skills
  • Fine-grained knowledge of CUDA programming
  • Strong algorithmic skills
  • System programming on Linux experience
  • Plus: knowledge of CPython internals and experience of native extension development
  • Plus: knowledge of InfiniBand internals
  • Plus: compiler development background

Process

  • Intro call with Eiso, our CTO & Co-Founder
  • Technical Interview(s) with one of our Founding Engineers
  • Team-fit call with Beatriz, our Head of People
  • Meet & greet call with the founders, Jason and Eiso, CEO and CTO respectively

Benefits

  • Fully remote work & flexible hours;
  • 37 days/year of vacation & holidays;
  • Health insurance allowance for you and dependents;
  • Company-provided equipment;
  • Wellbeing, always-be-learning and home office allowances;
  • Frequent team get togethers in Paris;
  • Great diverse & inclusive people-first culture.
Position
Full-time, indefinite contract
Location
Remote EMEA/East Coast or Paris
Apply for this role
Loading...