Senior Core Platform Engineer
Austin, TXDevOps / SREOnsite5+ YOE
Summary
Senior engineer designs and implements core framework and runtimes for self-driving software stack, optimizing GPU/CPU execution graphs for latency/reliability, building observability tools, and owning subsystems. Requires 5+ years C++, multithreading, Linux, and complex systems experience.
About the role
What You'll Do
- Implementation and design of Avride’s core framework - both developer APIs and runtimes - building and optimizing execution graph consisting of GPU and CPU computations, scheduling execution to guarantee latency and reliability
- Building internal instrumentation and tools to keep the system observable
- Participate in system performance evaluations and help with full-system debugging
- Take long-term ownership of subsystems owned by the team, tracking their requirements and improving them over time
- Design and automate testing of the team’s components to make validation and issue-detection loops shorter
- Advocate for simplicity and long-term maintainability. Write well-structured code, produce clear design documents, and help the team resist unnecessary complexity
What You'll Need
- 5+ years of experience with C++
- 3+ years of programming experience in multi-threaded environments - multiple processes, threads, timers, and interrupts
- Experience with Linux and understanding of OS fundamentals
- Experience in diagnosing and resolving performance issues
- Proven experience of designing and shipping complex systems including scoping the problem, evaluating trade-offs, and verifying outcomes after delivery
- Strong cross-team collaboration skills - being ready to communicate with multiple teams to validate your ideas and manage testing and rollout out of your solutions
Nice to Have
- Python experience for scripting and non-time-critical tooling
- Background in autonomous vehicles, robotics, or other safety- or performance- critical systems
- Experience implementing middleware systems - execution scheduling, inter-process/inter-host transport layer, issue detection
- Experience working with large and complex systems and codebases
Skills
C++LinuxMultithreadingGPUCPUPythonPerformance OptimizationMiddlewareExecution SchedulingSystem Debugging