AI Inference Engineer

Sauron Systems

San Francisco | $175k–225k | On-site | 3mo ago
Employment
Full-time

About the role

  • Lead the development and optimization of low-latency inference engines using TensorRT and ONNX, including authoring custom plugins to support cutting-edge architectures.
  • Design and maintain multithreaded video processing and streaming pipelines (RTSP, RTP, HLS) using GStreamer and DeepStream.
  • Collaborate closely with embedded engineers to integrate perception software with Yocto platforms, ensuring seamless hardware-software synergy.
  • Work with raw data from cameras and LiDAR to enable real-time data capture, obstacle detection, and avoidance.
  • Write and optimize custom CUDA kernels and perform low-level GPU tuning to maximize throughput and minimize power consumption.
  • Productionize proven prototypes, moving them from JetPack to Yocto.
  • Apply advanced optimization techniques, including quantization (INT8/FP16), pruning, and distillation, to bring research-grade models to production-grade efficiency.

Requirements

  • Bachelor’s or Master’s degree in Computer Science, Electrical Engineering, Robotics, or a related field.
  • 3+ years of experience developing and deploying computer vision or machine learning applications on real-world robotic systems (not just in simulation).
  • High proficiency in C, C++, and Python, with a focus on real-time and embedded systems.
  • Expert-level knowledge of the NVIDIA Jetson ecosystem (JetPack SDK, DeepStream, TensorRT) and a deep understanding of CUDA/GPU architecture.
  • Hands-on experience with video streaming tools like FFmpeg and protocols such as RTSP, RTP, and HLS.
  • Proven track record of deploying AI systems that operate in the field, handling the unpredictability of real-world sensor data.

Nice to have

  • Familiarity with NVIDIA’s broader robotics stack.
  • Experience with ML compilers or compiler-level optimizations for GPU inference.
  • Specific background in sensor fusion and AI-driven obstacle avoidance for autonomous navigation.
  • Exposure to remote logging, log ingestion, and distributed telemetry aggregation.
  • Previous experience in early-stage startups or fast-paced hardware/software integration environments.

Our values

  • We celebrate as a team and troubleshoot as a team.
  • The goal is the mission, not the credit.
  • Be ruthless with problems, but kind to people.
  • Raise the bar, lower the shield.
  • Your perspective is a requirement, not a suggestion.
  • Speak the hard truths early so we can fix them fast.
  • Do what you say you’ll do.
  • If it breaks, fix it. If it works, make it better.
  • Earn trust through empathy and consistency.
  • Anticipate needs before they become requests.

Compensation

Perks & benefits

  • Equity Compensation
