Fine-tuning · Education AI

AI Feynman Kannada Tutor

Multi-stage fine-tuning pipeline creating a reasoning-first physics tutor in Kannada — combining SFT and RAG for intuitive, grounded explanations.

  • Multi-stage SFT: language → domain → grounding
  • LLM-as-judge evaluation on 0–5 scale
  • RAG with physics knowledge base
  • 4-model progression with measurable gains
  • Dataset and models on HuggingFace
Open case study →
Fine-tuning · Creative AI

AI Sitcom Scriptwriter

Teaching an open-source LLM to write The Office — reasoning-first screenplay generation with on-brand humor, character voice, and multi-step setups.

  • SFT on reasoning traces + screenplay pairs
  • Reinforcement fine-tuning (RFT) with PPO
  • LLM-as-judge with 8 weighted metrics
  • 3-model progression: Base → SFT → RFT
  • Dataset and models on HuggingFace
Open case study →