
GPU Systems Engineer – HPC / Parallel Computing
Vast.ai
Posted about 20 hours ago
We’re looking for a systems engineer with HPC or parallel programming experience to help scale AI inference. You’ll leverage your knowledge of high-performance systems to optimize GPU performance at the bleeding edge of AI. Tech stack: CUDA/C++, GPGPU, Python, Linux.
What You'll Do
Design and optimize GPU kernels and tensor libraries
Translate HPC techniques into scalable AI inference solutions
Evaluate emerging architectures and resource management approaches
Collaborate with technical leadership to improve GPU infrastructure efficiency
What We're Looking For
Advanced C++ (C++17/20 preferred)
Expertise with at least one parallel framework (CUDA, HIP, SYCL, OpenCL, OpenACC, or similar)
Strong background in systems optimization and HPC performance tooling
Nice to Have
Familiarity with distributed training/inference frameworks
Benefits
Comprehensive health, dental, vision, and life insurance
401(k) with company match
Meaningful early-stage equity
Job details
Jobr Assistant extension
Get the extension →