AI Engineer - RAG & Large Language Models

Weekday AI

1 day ago

Full-time

On-site

Bengaluru, Karnataka, India

$1,800,000 - $3,000,000 USD yearly

ML & AI Engineering

This role is for one of the Weekday's clients

Salary range: Rs 1800000 - Rs 3000000 (ie INR 18-30 LPA)

Min Experience: 2 years

Location: Bangalore

JobType: full-time

We are seeking a highly motivated AI Engineer with 2–6 years of experience to join a growing team working at the forefront of applied AI. In this role, you will design and build intelligent systems powered by Large Language Models (LLMs) and Retrieval-Augmented Generation (RAG). You will play a key role in developing scalable, production-grade AI solutions that enhance knowledge discovery, automation, and decision-making across various domains.

Key Responsibilities:

Design, develop, and deploy applications leveraging Large Language Models (LLMs), including both proprietary and open-source models
Build and optimize Retrieval-Augmented Generation (RAG) pipelines for accurate, context-aware responses
Implement document ingestion, embedding generation, vector search, and ranking systems using modern vector databases
Fine-tune and evaluate LLMs for domain-specific use cases, improving performance, accuracy, and relevance
Collaborate with cross-functional teams including product, data engineering, and backend teams to integrate AI solutions into production systems
Develop prompt engineering strategies and experiment with chaining techniques to enhance model outputs
Ensure scalability, reliability, and cost-efficiency of deployed AI systems
Stay updated with the latest advancements in generative AI, LLM architectures, and retrieval techniques

Required Skills & Qualifications:

2–6 years of hands-on experience in AI/ML, with a strong focus on NLP and generative AI
Solid understanding of Large Language Models (LLMs), transformers, and their real-world applications
Proven experience in building RAG-based systems, including knowledge retrieval, embeddings, and vector databases (e.g., FAISS, Pinecone, Weaviate)
Proficiency in Python and experience with ML frameworks such as PyTorch, TensorFlow, or Hugging Face Transformers
Experience with prompt engineering, model evaluation, and performance optimization techniques
Familiarity with APIs and deployment frameworks such as FastAPI, Docker, or cloud platforms (AWS, GCP, Azure)
Strong problem-solving skills and the ability to translate business requirements into technical solutions

Preferred Qualifications:

Experience with LLM orchestration frameworks such as LangChain or LlamaIndex
Understanding of data pipelines, ETL processes, and handling large-scale unstructured data
Exposure to fine-tuning techniques such as LoRA, PEFT, or instruction tuning
Knowledge of search systems, semantic search, and hybrid retrieval methods
Prior experience deploying AI systems in production environments

AI Engineer - RAG & Large Language Models

More jobs

AI / ML Engineer

Accenture

AI / ML Engineer

Accenture