QU
Job Description
• Design, develop, and optimize applications powered by Large Language Models (LLMs)
• Implement and fine‑tune LLMs using modern techniques such as LoRA, instruction tuning, and PEFT
• Build and maintain Retrieval‑Augmented Generation (RAG) pipelines using vector databases and hybrid retrieval
• Develop scalable APIs and microservices using frameworks such as FastAPI
• Integrate AI solutions with existing systems and data pipelines
• Deploy, monitor, and maintain AI applications in production environments
• Optimize model performance for latency, cost, and accuracy
• Implement evaluation frameworks to measure model quality and reduce hallucinations
• Collaborate with cross‑functional teams including product, data, and engineering