Skip to content

Software Engineer, AI Inference / HPC

Develops and optimizes AI inference engine for image/video enhancement, focusing on performance, GPU/CPU optimization, model deployment, and hardware partnerships. Requires C/C++ expertise, 1+ years experience in performance optimization and image processing.

110k – 150kDallas, TXML EngineeringOnsite1+ YOE

About the role

Responsibilities

  • Improve performance, stability, availability of new features, and simplify/improve the API of the internal AI Engine framework.
  • Act as technical bridge between Deep Learning research team and Production products.
  • Prepare new & updated models for production.
  • Optimize GPU/CPU for inference.
  • Work with hardware partners (NVIDIA, AMD, Intel, Apple) to optimize inference on their hardware.

Requirements

  • Hands-on experience with performance optimization (concurrency, multithreading, memory, speed, benchmarking, reliability).
  • Experience architecting APIs for internal development.
  • Hands-on experience implementing image processing or computational photography algorithms.
  • Expert knowledge of C/C++.
  • At least 1+ years of professional working experience in a related field.

Preferred

  • Experience with video encoding/decoding and file formats.
  • Experience with OpenCV, ffmpeg, GPU programming.
  • Experience with raw image camera pipeline and image formats.
  • Experience with ONNX, CoreML, TensorRT runtime SDKs.
  • Interest in photography or videography.

Skills

C++CPerformance OptimizationMultithreadingConcurrencyAPI DesignImage ProcessingOpencvFfmpegGpu ProgrammingOnnxCoremlTensorRT

Similar roles

ML Engineering jobs

Machine Learning Engineer II/III (Applied Research & Model Development)

Designs, develops, and deploys ML models for research and product development in pathology AI, collaborating with scientists and engineers on biological/clinical applications. Requires Master's/PhD, Python/ML expertise, production deployment experience; leads projects at senior levels.

107k – 200kBoston, MA +1ML EngineeringHybrid2+ YOEMLOpsPython

Research Engineer, Asta

Research Engineer builds ML infrastructure and agentic systems to accelerate scientific discovery in biology, neuroscience, and more. Requires 2+ years experience with Python, PyTorch/JAX/TF, cloud resources, and deep learning for LLM/agent research at Ai2.

119k – 178kSeattle, WAML EngineeringOn-site2+ YOEJAXGCP

Machine Learning Engineer

Build and deploy reinforcement learning models to autonomously control mineral refining facilities, optimizing recovery rates, energy use, and uptime in real operating plants.

120k – 160kAnn Arbor, MI +2ML EngineeringOn-siteEntry levelPythonDeep Learning

Founding AI Engineer

Founding AI Engineer building and shipping production-grade computer vision, LLM, and multimodal models for real-world construction documents at an early-stage startup.

120k – 180kSan Francisco, CAML EngineeringOn-siteEntry levelLLMsPrototyping

Early Career Software Engineer – Applied AI

Early career software engineer builds and integrates AI agents and solutions using frameworks like LangChain and RAG pipelines to enhance childcare platform features. Requires bachelor's in CS/engineering, Python/JS proficiency, and AI/ML familiarity; hybrid onsite in SF office 3 days/week.

100k – 120kSan Francisco, CAML EngineeringHybridEntry levelGCPAWS