AI Inference Engineer
Sauron Systems
San Francisco | $175k–225k | On-site | 3mo ago
- Employment: Full-time
About the role
- Lead the development and optimization of low-latency inference engines using TensorRT and ONNX, including authoring custom plugins to support cutting-edge architectures.
- Design and maintain multithreaded video processing and streaming pipelines (RTSP, RTP, HLS) using GStreamer and DeepStream.
- Collaborate closely with embedded engineers to integrate perception software with Yocto platforms, ensuring seamless hardware-software synergy.
- Work with raw data from cameras and LiDAR to enable real-time data capture, obstacle detection, and avoidance.
- Write and optimize custom CUDA kernels and perform low-level GPU tuning to maximize throughput and minimize power consumption.
- Productionize proven prototypes by moving them from JetPack to Yocto.
- Apply advanced optimization techniques, including quantization (INT8/FP16), pruning, and distillation, to bring research-grade models to production-grade efficiency.
Requirements
- Bachelor’s or Master’s degree in Computer Science, Electrical Engineering, Robotics, or a related field.
- 3+ years of experience developing and deploying computer vision or machine learning applications on real-world robotic systems (not just in simulation).
- High proficiency in C, C++, and Python, with a focus on real-time and embedded systems.
- Expert-level knowledge of the NVIDIA Jetson ecosystem (JetPack SDK, DeepStream, TensorRT) and a deep understanding of CUDA/GPU architecture.
- Hands-on experience with video streaming tools like FFmpeg and protocols such as RTSP, RTP, and HLS.
- Proven track record of deploying AI systems that operate in the field, handling the unpredictability of real-world sensor data.
Nice to have
- Familiarity with NVIDIA’s broader robotics stack.
- Experience with ML compilers or compiler-level optimizations for GPU inference.
- Specific background in sensor fusion and AI-driven obstacle avoidance for autonomous navigation.
- Exposure to remote logging, log ingestion, and distributed telemetry aggregation.
- Previous experience in early-stage startups or fast-paced hardware/software integration environments.
Our values
- We celebrate as a team and troubleshoot as a team.
- The goal is the mission, not the credit.
- Be ruthless with problems, but kind to people.
- Raise the bar, lower the shield.
- Your perspective is a requirement, not a suggestion.
- Speak the hard truths early so we can fix them fast.
- Do what you say you’ll do.
- If it breaks, fix it. If it works, make it better.
- Earn trust through empathy and consistency.
- Anticipate needs before they become requests.
Compensation
Perks & benefits
- Equity Compensation