Kunj Rathod
CS Researcher & AI Engineer at the University of Utah.
Building AI systems from HIPAA-compliant hospital platforms to spatial memory for embodied agents and materials discovery pipelines.
Experience & Education
From hospital AI platforms and legal research tools to aerospace materials discovery and embodied agents
- Building scalable cloud solutions for distributed data systems on the Azure Data team.
- Focusing on full-stack software development and distributed systems within the Azure ecosystem.
- Built and deployed a HIPAA-compliant AI chat platform for 90+ hospital executives using React/TypeScript, Flask middleware, and AWS Bedrock microservices with event-driven Lambda orchestration.
- Shipped 6 full-stack features across 4 sprints; integrated AWS Bedrock Agents, Knowledge Bases, and Guardrails for production clinical workflows.
- Reduced inference latency by 40% and data query speed by 60% via Bedrock pipeline optimization, API caching, and a DynamoDB–RDS hybrid database strategy.
- Implemented token-streaming LLM responses (p95 <200ms TTFT) with resilient fallback handling and distributed session persistence for 1,000+ conversations.
- Integrated interactive data visualization tools into the LLM chat interface enabling real-time analytics on hospital data.
- Built a multi-agent, graph-augmented pipeline to extract and normalize material-property data from 1,000+ materials-science papers into a physics-aware graph for automated Ashby plot generation.
- Developed a constraint-based 'design region' engine (temperature, creep, pressure limits) and benchmarking suite to identify feasible materials for extreme aerospace environments.
- Explored LLMs and multi-agent AI to streamline knowledge sharing across interdisciplinary stakeholders including engineers, scientists, and DoD partners.
- Built Ref-RAG, a custom RAG chatbot using LangChain and Chainlit to extract structured information from large unorganized PDF datasets for materials researchers.
- Scaled hybrid legal-document retrieval to 10M+ indexed Indian legal documents (statutes, court orders), supporting 5,000+ daily queries.
- Improved retrieval accuracy by 28% and reduced hallucinations by 35% via hybrid RAG (dense vectors + BM25 + reranking) and context-grounding optimizations for Legal-NER tasks.
- Built production ETL ingesting 500k+ documents/week and benchmarked 8 LLM families on 4 legal benchmarks including LegalBench and NyayaAnumana.
- Analysis guided model routing decisions, reducing projected inference spend by $50k+/year.
- Co-authored a comparative analysis paper synthesizing insights from 15+ research papers on legal AI.
- Led development of BioGraphRAG: a Graph Retrieval-Augmented Generation platform combining biomedical knowledge graphs with LLMs for explainable biomedical Q&A.
- Engineered distributed GraphRAG system managing 1M+ biomedical entities (proteins, genes, diseases) integrating UniProt, AlphaFold, and RXNav with NebulaGraph.
- Improved factual accuracy by 40%; optimized graph traversal 3× through strategic caching and high-degree node pruning, achieving sub-500ms query latency at p95.
- Designed automated ETL pipelines processing 2M+ entity updates monthly with schema validation.
- Presented at an international AI panel attended by experts from India and the US — received commendation for technical leadership.
- Spearheaded campus-wide outreach programs to drive adoption of Perplexity's AI-powered search platform among students, faculty, and university clubs.
- Onboarded 150+ Perplexity Pro users, facilitating seamless onboarding and sustained long-term engagement.
- Ensured the safety and well-being of residential housing communities, providing conflict mediation, crisis response, and student support services for a 200+ resident community.
Relevant Coursework
Featured Projects
AI-driven systems, hackathon winners, and research tools at scale
- Full-stack deployment monitoring and incident response system tracking Vercel deployments, classifying build/runtime failures, and triggering Slack alerts with approval workflows.
- AI-assisted root-cause analysis with FastAPI and ChromaDB vector search over logs, generating structured fix suggestions for downstream coding agents.
- Real-time React/TypeScript dashboard for live metrics, incident status, and agent health; deployed on Vercel with CI/CD pipeline.
- iOS personal assistant with voice, chat, and image input integrating GPT-4o and Whisper APIs for context-aware responses with RAG-enhanced memory.
- Offline-first architecture with Firebase sync supporting real-time message streaming and persistent conversation history.
- Production-grade distributed GraphRAG system for healthcare professionals requiring trustworthy biomedical information retrieval.
- Integrated UniProt, AlphaFold, RXNav, and BioKG into a unified NebulaGraph store with automated ETL processing 2M+ entity updates monthly.
- Improved factual accuracy by 40%; optimized graph traversal 3× through caching and high-degree node pruning (sub-500ms at p95).
- Vehicle-to-Everything (V2X) traffic optimization platform combining V2V, V2I, and V2N communication for real-time adaptive traffic management.
- LSTM-based traffic flow prediction models with live SPaT signal data; full system stack from OBD-II hardware to cloud ML backend.
- AES-256 encrypted communication with rotating vehicle identifiers and edge-first architecture for privacy and ultra-low latency.
- Investment recommendation system combining DistillBERT-based sentiment analysis on financial news with DQN and PPO for portfolio optimization.
- Demonstrated measurable outperformance on backtested portfolio allocation tasks.
- Collaborative AI system with specialized agents (Analyst, Trader, Risk Advisor) using CrewAI and LangChain for real-time financial analysis.
- Designed inter-agent communication protocols enabling parallel analysis and consensus-driven output generation.
- Custom RAG chatbot for the STARS Lab to extract structured information from large, unorganized PDF corpora of materials-science research papers.
- Enabled researchers to query domain-specific knowledge across 1,000+ documents through a conversational interface.
Technical Skills
Languages, frameworks, databases, and tools I work with daily
Writing & Publications
Technical articles, research reports, and open knowledge sharing
Kunj's Substack
Co-authored with Niraj Kumar Singh (ML Engineer) · GMG Summer of Code
Full technical article presenting BioGraphRAG: system architecture, GraphRAG algorithm, node-degree performance analysis (low/mid/high-degree nodes), multi-stage answer enrichment pipeline integrating UniProt, AlphaFold, and RXNav, and future directions.
Kunj's Substack
Solo-authored technical article covering FlowVía, a V2X urban traffic optimization system. Details V2V, V2I, V2N protocols, DSRC and C-V2X standards, real-time speed recommendation algorithms, LSTM-based traffic flow prediction, data privacy/security design, and scalability challenges.
Comparative Analysis: LLM Families on Legal Benchmarks
Internal Technical Report · CourtEasy.ai / Nugen
Co-authored with team at CourtEasy.ai / Nugen
Co-authored comparative analysis of InLegalBERT, InLegalLLaMA, and GPT-4o-mini on LegalBench and NyayaAnumana benchmarks, synthesizing insights from 15+ research papers to inform production RAG workflow design and evaluation protocols.
Latest Blog Posts
Thoughts and insights on AI, cloud technologies, and software development
Lessons learned from developing BioGraphRAG and optimizing retrieval for complex medical knowledge graphs.
How to design collaborative AI systems with specialized agents for complex tasks like financial analysis.
Exploring DQN and PPO algorithms for portfolio optimization and investment risk assessment.
Get In Touch
Open to new opportunities and collaborations in AI and software engineering
Get in Touch
I'm currently open to new opportunities and collaborations. Feel free to reach out!