Lead our AI agent architecture and build the next generation of NEXUS AI OS. You'll design multi-agent systems, implement RAG pipelines, and optimize LLM inference for local deployment.
Full-timeRemote / India3+ years experience
What You'll Do
Architect and build multi-agent AI systems with Ollama and OpenAI
Design and implement RAG pipelines with ChromaDB and LangChain
Optimize LLM inference for local deployment using quantization and pruning
Build FastAPI backends for AI model serving
Implement active learning loops for continuous model improvement
Mentor junior engineers on ML best practices
What We're Looking For
3+ years of professional ML/AI experience
Strong Python skills (FastAPI, asyncio, Pydantic)
Experience with LLMs (fine-tuning, prompting, RAG)
Familiarity with vector databases (ChromaDB, Pinecone, Weaviate)
Understanding of ML model deployment and serving
Experience with Docker containerization
Nice to Have
Experience with Ollama or local LLM inference
Computer vision experience (YOLOv8, OpenCV)
NPU/GPU acceleration experience (OpenVINO, CUDA)
Contributions to open-source AI projects
Interested?
Send your resume, portfolio, and a note about what excites you about this role.