AI Security - AI Platform Engineer

Cato Networks
Tel Aviv District · 3d ago

About the role

<div class="content-intro"><p><strong>Welcome to the future of cloud networking and security!</strong></p> <p><span data-olk-copy-source="MessageBody">Cato Networks is the first company to converge enterprise networking and security into one centralized, global service delivered from the cloud. It is led by Shlomo Kramer, a networking and security pioneer (Check Point, Imperva) and early investor (Palo Alto Networks, Exabeam, Trusteer, and more). Cato’s unique technology inspired a brand-new product category, later named “SASE” by Gartner, in a market expected to reach $28.5 billion by 2028.<br><br>This is your opportunity to get on the rocket ship and join a company that is building a cutting-edge enterprise network and secure cloud platform and is on a fast track to becoming the worldwide market leader. Don’t miss it!</span></p></div>
<div class="p-rich_text_section">
<div>Cato is building a real-time AI runtime platform for security algorithms that run inline across our global cloud and physical PoPs.<br>We are looking for an <strong>AI Platform Engineer</strong> to help build the infrastructure that powers high-throughput, low-latency AI security decisions in production.<br>You will work on a runtime engine that combines GPU-based models, from MMBERT-style models to LLMs, with CPU-based heuristics and security logic, optimized for scale, performance, reliability, and real-time execution. This versatile engineering role spans AI runtime infrastructure, high-performance backend development, GPU inference, and the model lifecycle, and involves close collaboration with research teams to bring AI security algorithms into production.<br><br><strong>Responsibilities</strong></div>
<ul> <li>Build Cato’s AI security runtime platform for high-throughput, low-latency production serving.</li> <li>Develop infrastructure for model serving, multi-model orchestration, and inline decision flows.</li> <li>Optimize inference performance: batching, caching, streaming, GPU utilization, memory usage, and runtime acceleration.</li> <li>Build backend orchestration and performance-critical services in Go.</li> <li>Support the model lifecycle: registry integration, packaging, versioning, deployment, monitoring, and operational health.</li> <li>Work closely with research and algorithm teams to productionize AI security models and algorithms at scale.</li> </ul>
<div><br><strong>Requirements</strong></div>
<ul> <li>3+ years of hands-on experience in AI inference, production ML infrastructure, model serving, or MLOps.</li> <li>Experience with production inference technologies such as <strong>Triton, vLLM, CUDA, Kubernetes, Docker, PyTorch, ONNX, TensorRT</strong>, or similar.</li> <li>Strong understanding of low-latency, high-throughput production systems.</li> <li>Experience with model lifecycle concepts: model registry, versioning, deployment, rollout, rollback, monitoring, and observability.</li> <li>3+ years of experience with <strong>Go</strong>, or strong experience with a similar high-performance backend language such as C++, Rust, or Java.</li> </ul>
</div>
