Logo Taplio

Taplio

Sai Sandeep Kantareddy's Linkedin Analytics

Get the Linkedin stats of Sai Sandeep Kantareddy and many LinkedIn Influencers by Taplio.

Want detailed analytics of your Linkedin Account? Try Taplio for free.
Profile picture of undefined

open on linkedin

Senior Applied Machine Learning Engineer | Generative AI & LLM Researcher | Fine-Tuning, RAG, QLoRA, MoE | Retrieval, Evaluation & Inference | Agentic Systems | MLOps | Cloud-Agnostic With over 8 years of experience driving innovation in AI/ML, I specialize in fine-tuning large language models (LLMs), building retrieval-augmented generation (RAG) pipelines, and deploying enterprise-ready GenAI systems. I hold a Master’s in Artificial Intelligence from Arizona State University and have led impactful AI initiatives at 7-Eleven, Mayo Clinic, NXP Semiconductors, and Bayer. My recent work includes adapting open-source LLMs like Mistral 7B and LLaMA using QLoRA, PEFT, Unsloth, and MosaicML to enhance vector search (FAISS), classification accuracy, and multi-hop reasoning. I’ve developed Agentic AI systems using LangGraph to orchestrate tasks, improve chatbot grounding, and enable goal-directed automation in enterprise settings. Notable contributions: Fine-tuned Mistral using QLoRA + PEFT on Databricks to improve GL code classification across 50+ categories—boosting top-K retrieval precision in RAG systems Built and deployed a multimodal GenAI chatbot using LLava (text, images, tables) and dynamic prompting to reduce hallucinations and improve factual accuracy Developed scalable LLM inference pipelines using SetFit, LangChain, and Flask—saving millions monthly through automation and retrieval improvements Integrated Agentic AI frameworks for better decision flows in production chatbots Created an LLM for financial insights (10K/SEC documents) using Claude + LangChain + RAG Built real-time Microsoft Teams bots using AWS Lambda, API Gateway, and containerized LLM endpoints My toolbox includes: LLMs & GenAI: Open AI, Mistral, LLaMA, Claude, QLoRA, PEFT, LoRA, Hugging Face, Unsloth, LLava, SetFit Infra & Ops: Databricks, FAISS, LangChain, LangGraph, AWS, Azure, GCP, Docker, Kubernetes, Terraform Core AI: NLP, Deep Learning, Computer Vision, RL, Prompt Engineering, Agentic AI, Multi-Agent Systems MLOps: Flask, CI/CD, MLFlow, GitLab, MosaicML, PyTorch, TensorFlow, Streamlit I thrive at the intersection of applied research and production—delivering scalable systems, mentoring teams, and aligning LLM pipelines to tangible business outcomes. If you're hiring for roles in Agentic AI, RAG architecture, LLM fine-tuning, or embedding optimization, let’s connect. I'm open to new challenges and passionate about advancing AI across domains.

Check out 's verified LinkedIn stats (last 30 days)


Want to drive more opportunities from LinkedIn?

Content Inspiration, AI, scheduling, automation, analytics, CRM.

Get all of that and more in Taplio.

Try Taplio for free