Back to all jobs
P

AI Engineer

prevalent

CochinOn-site3mo ago
Employment
Full-time

About the role

Role Purpose 


As an AI Engineer at Prevalent AI, you will independently design, build, optimize, and deploy production-grade Generative AI systems across our Exposure Management and Data Fabric platforms. You will own end-to-end AI components such as RAG pipelines, multi-agent workflows, LLM-backed APIs, guardrail-enforced inference flows, and cloud-native AI integrations, while collaborating closely with platform, backend, and product teams. 

This role is suited for engineers with hands-on, real-world experience building and operating GenAI systems in production, who can take ownership of design decisions, performance tuning, and reliability of AI-driven features. 


Key Accountabilities 


  • Design, build, and own production-ready GenAI systems, including RAG pipelines, embedding workflows, vector search architectures, tool-using agents, and LLM-integrated microservices. 
  • Create and manage MCP servers and associated tools, integrating and orchestrating them via AI agents 
  • Develop and maintain Fast API-based AI services integrated with LLMs, vector databases, cloud inference endpoints, and orchestration layers. 
  • Architect and implement agentic AI pipelines using frameworks such as Lang ChainLang Graph, ADK, Crew AI, or other relevant agent-based frameworks for multi-step reasoning, tool orchestration, autonomous agents, and structured LLM workflows. 
  • Integrate and operate cloud-based AI services using Google ADK (Gemini / Vertex AI), AWS Bedrock, or Azure OpenAI, including model selection, endpoint configuration, and cost-aware inference. 
  • Apply advanced prompt engineering strategies (structured prompting, ReactCoT, few-shot, tool-calling) and systematically reduce hallucinations and failure modes. 
  • Implement and contribute to LLM fine-tuning workflows (LoRaQLoRA, PEFT), including dataset preparation, training, evaluation, and deployment considerations. 
  • Design and enforce AI guardrails using frameworks such as NeMo Guardrails or Guardrails AI to ensure policy-compliant, safe, and explainable outputs. 
  • Lead model evaluation and optimization, focusing on latency, accuracy, robustness, hallucination mitigation, and cost efficiency. 
  • Own testing and deployment of AI services, including unit tests, integration tests, CI/CD pipelines, and environment-specific configurations (cloud/on-prem). 
  • Produce and maintain high-quality technical documentation covering prompts, workflows, vector schemas, architectural decisions, and API contracts. 
  • Collaborate with cross-functional teams to translate product requirements into scalable, reliable AI solutions and mentor junior engineers when needed. 

 

Skills & Experience 


Must have skills: 


  • Strong hands-on experience with LangChain and LangGraph for building and operating complex LLM workflows and agentic systems. 
  • Proven experience designing and deploying Retrieval-Augmented Generation (RAG) pipelines using embedding models and vector databases such as FAISS, Pinecone, Chroma, or equivalent. 
  • Solid backend engineering experience with FastAPI, including async APIs, dependency injection, authentication, and service observability. 
  • Practical experience with LLM fine-tuning approaches (LoRAQLoRA, PEFT) and understanding of when to fine-tune vs prompt vs retrieve. 
  • Advanced understanding of prompt engineering, including CoTReact, tool calling, schema-based prompting, and prompt versioning strategies. 
  • Experience implementing AI safety and guardrails, including output validation, policy enforcement, and prompt injection mitigation. 
  • Hands-on exposure to cloud AI platforms such as Google ADK / Vertex AI, AWS Bedrock, or Azure OpenAI in production environments. 
  • Strong Python skills with experience using Transformers, Hugging Face, embedding models, and inference optimization techniques. 

 

Good to have skills: 

 

  • Exposure to FastMCP or similar frameworks is an added advantage. 
  • Good understanding of LLM evaluation metrics, hallucination control strategies, and real-world failure patterns. 


731,000+ hidden jobs like this

prevalent and thousands of companies post here first — often days before LinkedIn or Indeed. Your first 5 applications are free; go Pro to apply without limits.

Everything Pro unlocks:

  • Unlimited applications — free stops at 5
  • Track every application in one place
  • Apply straight to the source, one click
  • Save & organize roles you love
  • Roles pulled from company boards before the big sites

Weekly

$9.99
$4.99/week

For an active search. Cancel anytime.

Most popular

Monthly

$24.99
$12.99/month

The smart pick. Save 35% vs weekly.

Lifetime

$99
$49.99once

Pay once. Every future feature, forever.