KPMG India logo

AI Engineer - Acon - BLR

KPMG India
1 day ago
Full-time
On-site
India
ML & AI Engineering

Responsible for designing, developing, and deploying scalable Generative AI solutions using state-of-the-art large language models (LLMs) and transformer     architectures.

- Responsible for building out sophisticated AI agent workflows, establishing a scalable cloud infrastructure, and ensuring the application can handle large-     scale document ingestion and concurrent user requests seamlessly.

- Design and implement highly complex Retrieval-Augmented Generation (RAG) systems, specifically focusing on Agentic RAG (autonomous decision-making workflows) and Multimodal RAG (handling text, images, and other data types).

Backend Development & Orchestration: Build robust, high-performance APIs and backend services using Python and FastAPI. Orchestrate complex LLM workflows and agents utilizing LangChain and LangGraph.

- Search & Data Infrastructure: Develop data ingestion pipelines capable of handling exceptionally large documents. Configure, optimize, and maintain Vector Databases and Azure AI Search for low-latency, high-accuracy retrieval

- Frontend & UI Development: Develop rapid, functional, and interactive user interfaces using Streamlit. Utilize JavaScript to build custom frontend components, integrate web elements, and enhance the overall user experience.

- DevOps & Automation: Establish, maintain, and optimize CI/CD pipelines to ensure smooth, automated testing, building, and deployment of the GenAI application.

Document Processing: Handson in parsing complex financial documents, unstructured enterprise documents (PDFs, PPTs, tables, Excel, Word, txt, charts, images etc)

Required Skills

- Programming Languages: Deep expertise in Python; solid working knowledge of JavaScript.

- Strong experience with Azure OpenAI, Azure Machine Learning, Cognitive Search, and AI Studio.

- Solid understanding of LLMs, embeddings, RAG pipelines, vector search, prompt engineering.

- GenAI & LLM Frameworks: Extensive hands-on experience with LangChain and LangGraph/ Semantic Kernel / LlamaIndex. Proven ability to build beyond basic RAG into Agentic and Multimodal implementations.

- Backend & APIs: Strong experience building scalable RESTful APIs with FastAPI.

- Cloud & DevOps (Azure): Demonstrated experience deploying enterprise-grade applications on Azure. Deep understanding of Docker, containerization, and configuring automated CI/CD pipelines.

-  LLM Evaluation & Observability: Hands-on experience with evaluation frameworks (e.g., RAGAS, TruLens, DeepEval) and monitoring/tracing tools (e.g.,   LangSmith, Langfuse, Arize Phoenix).

- Databases & Search: Practical experience with Azure AI Search and leading Vector Databases (e.g., Pinecone, Milvus, Qdrant, or Azure Native).

- System Architecture: Proven track record of designing systems for parallelism, large-scale data ingestion, and multi-user environments (SaaS architecture).

- Frontend/Prototyping: Experience building responsive interfaces using Streamlit,HTML,CSS,Javascript.

- Docker, Kubernetes (for model deployment)

- Previous experience building commercial AI products or complex advisory/analytical AI agents from the ground up.

- Strong understanding of data security and tenant isolation in cloud environments.

- Understanding of responsible AI, data privacy, and application security, Guardrails.

- Fine-tune and customize foundation models using domain-specific datasets and techniques.