Generative AI Engineer

Company: Nextiva

Location: Hybrid

Posted on: April 26

Title: Generative AI Engineer

Location: Bangalore, Karnataka (Hybrid); Chennai, Tamil Nadu (Hybrid)

Job Description:

Redefine the future of customer experiences. One conversation at a time.

We're changing the game with a first-of-its-kind, conversation-centric platform that unifies team collaboration and customer experience in one place. Powered by AI, built by amazing humans.

Our culture is forward-thinking, customer-obsessed, and built on an unwavering belief that connection fuels business and life: connections to our customers through our signature Amazing Service, to our products and services, and, most importantly, to each other. Since 2008, 100,000+ companies and 1M+ users have relied on Nextiva for customer and team communication.

If you're ready to collaborate and create with amazing people, let your personality shine, and be on the frontlines of helping businesses deliver amazing experiences, you're in the right place.

Build Amazing - Deliver Amazing - Live Amazing - Be Amazing

We're looking for a highly skilled, hands-on RAG (Retrieval-Augmented Generation) & Prompt Engineer to join our applied AI team. You'll work with cutting-edge open-source and proprietary LLMs (such as LLaMA, Mistral, Claude, and GPT-4o) to build, prompt, and orchestrate intelligent agents that are capable, reliable, and production-ready.

This role is perfect for someone who has experience developing prompt chains, implementing tool-calling workflows, and debugging AI agents at scale.

Key Responsibilities

Design, develop, and iterate on prompt strategies tailored to downloadable models and major APIs (LLaMA, Mistral, Claude, GPT-4o, etc.).
Architect and implement RAG pipelines with a deep understanding of embedding models, retrievers, and context optimization techniques.
Create prompt chains and tool-calling workflows for dynamic agent behavior using Responses API and similar frameworks.
Design, test, and deploy foolproof agent architectures using OpenAI tool calling and agent protocol layers.
Write robust guardrails and control flows for agents to prevent unintended behaviors and ensure task compliance.
Debug and maintain agent codebases, ensuring reliability and scalability of deployed services.
Apply basic knowledge of OpenAI Operator and related orchestration tools to manage agent lifecycle.
Collaborate with researchers and infra teams to optimize prompt efficiency and latency.

Must-Have Qualifications

3-5 years of experience in AI engineering, prompt engineering, or applied ML roles.
Proven experience working with both downloadable open-source models and hosted APIs.
Strong knowledge of LLM prompt design patterns, prompt chaining, and failure handling.
Ability to build agent systems that are secure, auditable, and self-healing.
Good coding and debugging skills in Python (or relevant stack) with a focus on AI orchestration.
Familiarity with agent deployment pipelines, containerized environments, and CI/CD flows.

Tech Stack We Use

Python, FastAPI, LangChain / LlamaIndex.
OpenAI, Anthropic, HuggingFace.
Vector DBs (Weaviate, Pinecone, Qdrant).
Responses API, OpenAI Operator, A2A SDK.
Docker, GitHub Actions, GCP/AWS.

Bonus (Nice-to-Have Skills)

Experience building agents from scratch, especially with agent transfer logic and persistent memory.
Understanding of Model Context Protocols and how to integrate them into multi-agent LLM stacks.
Familiarity with A2A SDK for agent-to-agent communication and delegation.
Hands-on experience with LoRA / QLoRA techniques for fine-tuning GPT-style models on downstream or domain-specific tasks.
Experience with vector DBs, context compression, or multi-turn reasoning at scale.

