Incoming Software Engineer @ Microsoft Azure Data · Jan 2026

Kunj Rathod

CS Researcher & AI Engineer at the University of Utah. Building AI systems from HIPAA-compliant hospital platforms to spatial memory for embodied agents and materials discovery pipelines.

10M+Docs Indexed
40%Latency Reduced
1M+Entities Managed
Scroll
PythonTypeScriptReactNext.jsSwiftJavaC++AWS BedrockLambdaLangChainLlamaIndexCrewAINebulaGraphChromaDBPineconeDynamoDBPostgreSQLDockerFastAPIFlaskGraphRAGRAGMulti-Agent AIPyTorchTensorFlowSwiftUIFirebaseVercelKubernetesPythonTypeScriptReactNext.jsSwiftJavaC++AWS BedrockLambdaLangChainLlamaIndexCrewAINebulaGraphChromaDBPineconeDynamoDBPostgreSQLDockerFastAPIFlaskGraphRAGRAGMulti-Agent AIPyTorchTensorFlowSwiftUIFirebaseVercelKubernetes

Experience & Education

From hospital AI platforms and legal research tools to aerospace materials discovery and embodied agents

Experience_Log.json
Software Engineer Intern — Azure Data
Active
Microsoft
Jan 2026 – Present
Redmond, WA
industry
  • Building scalable cloud solutions for distributed data systems on the Azure Data team.
  • Focusing on full-stack software development and distributed systems within the Azure ecosystem.
Azure
Distributed Systems
Full Stack
Cloud
Software Development Intern, AI Services (SUDO Program)
Active
University of Utah Health
Jan 2025 – Present
Salt Lake City, UT
industry
  • Built and deployed a HIPAA-compliant AI chat platform for 90+ hospital executives using React/TypeScript, Flask middleware, and AWS Bedrock microservices with event-driven Lambda orchestration.
  • Shipped 6 full-stack features across 4 sprints; integrated AWS Bedrock Agents, Knowledge Bases, and Guardrails for production clinical workflows.
  • Reduced inference latency by 40% and data query speed by 60% via Bedrock pipeline optimization, API caching, and a DynamoDB–RDS hybrid database strategy.
  • Implemented token-streaming LLM responses (p95 <200ms TTFT) with resilient fallback handling and distributed session persistence for 1,000+ conversations.
  • Integrated interactive data visualization tools into the LLM chat interface enabling real-time analytics on hospital data.
React
TypeScript
AWS Bedrock
Lambda
DynamoDB
Flask
HIPAA
Undergraduate Researcher — LLMs & Computational Simulations
STARS Lab, University of Utah (Collaboration: NASA, Microsoft, U.S. DoD)
Aug 2025 – Feb 2026
Salt Lake City, UT
research
  • Built a multi-agent, graph-augmented pipeline to extract and normalize material-property data from 1,000+ materials-science papers into a physics-aware graph for automated Ashby plot generation.
  • Developed a constraint-based 'design region' engine (temperature, creep, pressure limits) and benchmarking suite to identify feasible materials for extreme aerospace environments.
  • Explored LLMs and multi-agent AI to streamline knowledge sharing across interdisciplinary stakeholders including engineers, scientists, and DoD partners.
  • Built Ref-RAG, a custom RAG chatbot using LangChain and Chainlit to extract structured information from large unorganized PDF datasets for materials researchers.
Python
LLMs
Multi-Agent
LangChain
Graph RAG
NASA
DoD
AI Engineering Intern
CourtEasy.ai / Nugen
Nov 2024 – Apr 2025
Remote
industry
  • Scaled hybrid legal-document retrieval to 10M+ indexed Indian legal documents (statutes, court orders), supporting 5,000+ daily queries.
  • Improved retrieval accuracy by 28% and reduced hallucinations by 35% via hybrid RAG (dense vectors + BM25 + reranking) and context-grounding optimizations for Legal-NER tasks.
  • Built production ETL ingesting 500k+ documents/week and benchmarked 8 LLM families on 4 legal benchmarks including LegalBench and NyayaAnumana.
  • Analysis guided model routing decisions, reducing projected inference spend by $50k+/year.
  • Co-authored a comparative analysis paper synthesizing insights from 15+ research papers on legal AI.
RAG
BM25
Legal-NER
LegalBench
Python
LLMs
AI Research Intern — BioGraphRAG
Garje Marathi Global (GMG Summer of Code)
May 2024 – Aug 2024
Salt Lake City, UT
research
  • Led development of BioGraphRAG: a Graph Retrieval-Augmented Generation platform combining biomedical knowledge graphs with LLMs for explainable biomedical Q&A.
  • Engineered distributed GraphRAG system managing 1M+ biomedical entities (proteins, genes, diseases) integrating UniProt, AlphaFold, and RXNav with NebulaGraph.
  • Improved factual accuracy by 40%; optimized graph traversal 3× through strategic caching and high-degree node pruning, achieving sub-500ms query latency at p95.
  • Designed automated ETL pipelines processing 2M+ entity updates monthly with schema validation.
  • Presented at an international AI panel attended by experts from India and the US — received commendation for technical leadership.
Python
NebulaGraph
LlamaIndex
GraphRAG
Docker
FastAPI
Campus Strategist
Perplexity AI
Jan 2025 – Apr 2025
Salt Lake City, UT
campus
  • Spearheaded campus-wide outreach programs to drive adoption of Perplexity's AI-powered search platform among students, faculty, and university clubs.
  • Onboarded 150+ Perplexity Pro users, facilitating seamless onboarding and sustained long-term engagement.
AI Advocacy
Community Building
Growth
Community Advisor
University of Utah Housing & Residential Education
Aug 2024 – Dec 2024
Salt Lake City, UT
campus
  • Ensured the safety and well-being of residential housing communities, providing conflict mediation, crisis response, and student support services for a 200+ resident community.
Leadership
Crisis Management
Community
Education_History
Bachelor of Science, Computer Science
University of Utah · Salt Lake City, UT
Aug 2023 – Dec 2026GPA: 3.7/4.0 (Dean's List)

Relevant Coursework

Machine Learning
Computer Vision
NLP
Distributed Systems
Algorithms & Data Structures
High School Diploma
Krishna Public School · Raipur, India
2017 – 2023

Featured Projects

AI-driven systems, hackathon winners, and research tools at scale

FIG-01
Hackathon WinnerApplication
Minute0
AI-Powered Deployment Monitor
  • Full-stack deployment monitoring and incident response system tracking Vercel deployments, classifying build/runtime failures, and triggering Slack alerts with approval workflows.
  • AI-assisted root-cause analysis with FastAPI and ChromaDB vector search over logs, generating structured fix suggestions for downstream coding agents.
  • Real-time React/TypeScript dashboard for live metrics, incident status, and agent health; deployed on Vercel with CI/CD pipeline.
ReactTypeScriptFastAPIChromaDBCerebrasSlack APIVercel
FIG-02
Application
Wingman.ai
Multi-Modal AI Personal Assistant (iOS)
  • iOS personal assistant with voice, chat, and image input integrating GPT-4o and Whisper APIs for context-aware responses with RAG-enhanced memory.
  • Offline-first architecture with Firebase sync supporting real-time message streaming and persistent conversation history.
SwiftUIGPT-4oWhisperFirebaseRAGMVVM
FIG-03
Research System
BioGraphRAG
Biomedical Knowledge Graph Retrieval
  • Production-grade distributed GraphRAG system for healthcare professionals requiring trustworthy biomedical information retrieval.
  • Integrated UniProt, AlphaFold, RXNav, and BioKG into a unified NebulaGraph store with automated ETL processing 2M+ entity updates monthly.
  • Improved factual accuracy by 40%; optimized graph traversal 3× through caching and high-degree node pruning (sub-500ms at p95).
PythonNebulaGraphLlamaIndexDockerFastAPIAWSGraphRAG
FIG-04
System Architecture
FlowVía
V2X Urban Mobility Optimization System
  • Vehicle-to-Everything (V2X) traffic optimization platform combining V2V, V2I, and V2N communication for real-time adaptive traffic management.
  • LSTM-based traffic flow prediction models with live SPaT signal data; full system stack from OBD-II hardware to cloud ML backend.
  • AES-256 encrypted communication with rotating vehicle identifiers and edge-first architecture for privacy and ultra-low latency.
PythonTensorFlowLSTMV2XDSRCC-V2XOBD-II
FIG-05
HackUSU 2025Algorithm
RL Investment Advisor
Reinforcement Learning Portfolio Optimizer
  • Investment recommendation system combining DistillBERT-based sentiment analysis on financial news with DQN and PPO for portfolio optimization.
  • Demonstrated measurable outperformance on backtested portfolio allocation tasks.
PythonDistillBERTDQNPPOFlaskReinforcement Learning
FIG-06
System Architecture
Financial Multi-Agent System
Collaborative AI Investment Analysis
  • Collaborative AI system with specialized agents (Analyst, Trader, Risk Advisor) using CrewAI and LangChain for real-time financial analysis.
  • Designed inter-agent communication protocols enabling parallel analysis and consensus-driven output generation.
CrewAILangChainPythonFlaskMulti-Agent
FIG-07
Research Tool
Ref-RAG
Research Literature Chatbot
  • Custom RAG chatbot for the STARS Lab to extract structured information from large, unorganized PDF corpora of materials-science research papers.
  • Enabled researchers to query domain-specific knowledge across 1,000+ documents through a conversational interface.
PythonLangChainChainlitFastAPIRAG

Technical Skills

Languages, frameworks, databases, and tools I work with daily

Languages
Python95%
TypeScript / JavaScript88%
Java85%
C++82%
Swift78%
SQL80%
AI & Machine Learning
RAG / GraphRAG95%
LLMs / Transformers92%
Multi-Agent Systems88%
LangChain / LlamaIndex90%
CrewAI85%
PyTorch / TensorFlow80%
Cloud & Infrastructure
AWS (Lambda, Bedrock, S3)88%
Docker / Kubernetes82%
Microservices / Event-Driven80%
Firebase78%
Web & Mobile
React / Next.js90%
SwiftUI (iOS)78%
Flask / FastAPI88%
Tailwind CSS92%
Databases
PostgreSQL / MySQL82%
DynamoDB80%
NebulaGraph78%
ChromaDB / Pinecone85%
Developer Tools
Git / GitHub95%
Docker85%
Vercel / CI/CD85%
VS Code / Postman90%

Writing & Publications

Technical articles, research reports, and open knowledge sharing

Kunj's Substack

Co-authored with Niraj Kumar Singh (ML Engineer) · GMG Summer of Code

Full technical article presenting BioGraphRAG: system architecture, GraphRAG algorithm, node-degree performance analysis (low/mid/high-degree nodes), multi-stage answer enrichment pipeline integrating UniProt, AlphaFold, and RXNav, and future directions.

NebulaGraph's marketing team requested republication on their official website (Jun 2025).

Kunj's Substack

Solo-authored technical article covering FlowVía, a V2X urban traffic optimization system. Details V2V, V2I, V2N protocols, DSRC and C-V2X standards, real-time speed recommendation algorithms, LSTM-based traffic flow prediction, data privacy/security design, and scalability challenges.

Comparative Analysis: LLM Families on Legal Benchmarks

2025

Internal Technical Report · CourtEasy.ai / Nugen

Co-authored with team at CourtEasy.ai / Nugen

Co-authored comparative analysis of InLegalBERT, InLegalLLaMA, and GPT-4o-mini on LegalBench and NyayaAnumana benchmarks, synthesizing insights from 15+ research papers to inform production RAG workflow design and evaluation protocols.

Latest Blog Posts

Thoughts and insights on AI, cloud technologies, and software development

Building Effective RAG Systems for Medical Data
March 1, 2025 · 8 min read

Lessons learned from developing BioGraphRAG and optimizing retrieval for complex medical knowledge graphs.

Multi-Agent AI Systems: Architecture and Implementation
February 20, 2025 · 10 min read

How to design collaborative AI systems with specialized agents for complex tasks like financial analysis.

Reinforcement Learning for Financial Applications
February 5, 2025 · 7 min read

Exploring DQN and PPO algorithms for portfolio optimization and investment risk assessment.

Get In Touch

Open to new opportunities and collaborations in AI and software engineering

Get in Touch

I'm currently open to new opportunities and collaborations. Feel free to reach out!

Send a Message
I'll get back to you as soon as possible.