Incoming Software Engineer @ Microsoft Azure Data · Jan 2026

Kunj Rathod

CS Researcher & AI Engineer at the University of Utah.
Building AI systems from HIPAA-compliant hospital platforms to spatial memory for embodied agents and materials discovery pipelines.

10M+Docs Indexed

40%Latency Reduced

1M+Entities Managed

Explore Work

Download CV

Get In Touch

Scroll

PythonTypeScriptReactNext.jsSwiftJavaC++AWS BedrockLambdaLangChainLlamaIndexCrewAINebulaGraphChromaDBPineconeDynamoDBPostgreSQLDockerFastAPIFlaskGraphRAGRAGMulti-Agent AIPyTorchTensorFlowSwiftUIFirebaseVercelKubernetesPythonTypeScriptReactNext.jsSwiftJavaC++AWS BedrockLambdaLangChainLlamaIndexCrewAINebulaGraphChromaDBPineconeDynamoDBPostgreSQLDockerFastAPIFlaskGraphRAGRAGMulti-Agent AIPyTorchTensorFlowSwiftUIFirebaseVercelKubernetes

Experience & Education

From hospital AI platforms and legal research tools to aerospace materials discovery and embodied agents

Experience_Log.json

Software Engineer Intern — Azure Data

Active

Microsoft

Jan 2026 – Present

Redmond, WA

industry

Building scalable cloud solutions for distributed data systems on the Azure Data team.
Focusing on full-stack software development and distributed systems within the Azure ecosystem.

Azure

Distributed Systems

Full Stack

Cloud

Software Development Intern, AI Services (SUDO Program)

Active

University of Utah Health

Jan 2025 – Present

Salt Lake City, UT

industry

Built and deployed a HIPAA-compliant AI chat platform for 90+ hospital executives using React/TypeScript, Flask middleware, and AWS Bedrock microservices with event-driven Lambda orchestration.
Shipped 6 full-stack features across 4 sprints; integrated AWS Bedrock Agents, Knowledge Bases, and Guardrails for production clinical workflows.
Reduced inference latency by 40% and data query speed by 60% via Bedrock pipeline optimization, API caching, and a DynamoDB–RDS hybrid database strategy.
Implemented token-streaming LLM responses (p95 <200ms TTFT) with resilient fallback handling and distributed session persistence for 1,000+ conversations.
Integrated interactive data visualization tools into the LLM chat interface enabling real-time analytics on hospital data.

React

TypeScript

AWS Bedrock

Lambda

DynamoDB

Flask

HIPAA

Undergraduate Researcher — LLMs & Computational Simulations

STARS Lab, University of Utah (Collaboration: NASA, Microsoft, U.S. DoD)

Aug 2025 – Feb 2026

Salt Lake City, UT

research

Built a multi-agent, graph-augmented pipeline to extract and normalize material-property data from 1,000+ materials-science papers into a physics-aware graph for automated Ashby plot generation.
Developed a constraint-based 'design region' engine (temperature, creep, pressure limits) and benchmarking suite to identify feasible materials for extreme aerospace environments.
Explored LLMs and multi-agent AI to streamline knowledge sharing across interdisciplinary stakeholders including engineers, scientists, and DoD partners.
Built Ref-RAG, a custom RAG chatbot using LangChain and Chainlit to extract structured information from large unorganized PDF datasets for materials researchers.

Python

LLMs

Multi-Agent

LangChain

Graph RAG

NASA

DoD

AI Engineering Intern

CourtEasy.ai / Nugen

Nov 2024 – Apr 2025

Remote

industry

Scaled hybrid legal-document retrieval to 10M+ indexed Indian legal documents (statutes, court orders), supporting 5,000+ daily queries.
Improved retrieval accuracy by 28% and reduced hallucinations by 35% via hybrid RAG (dense vectors + BM25 + reranking) and context-grounding optimizations for Legal-NER tasks.
Built production ETL ingesting 500k+ documents/week and benchmarked 8 LLM families on 4 legal benchmarks including LegalBench and NyayaAnumana.
Analysis guided model routing decisions, reducing projected inference spend by $50k+/year.
Co-authored a comparative analysis paper synthesizing insights from 15+ research papers on legal AI.

RAG

BM25

Legal-NER

LegalBench

Python

LLMs

AI Research Intern — BioGraphRAG

Garje Marathi Global (GMG Summer of Code)

May 2024 – Aug 2024

Salt Lake City, UT

research

Led development of BioGraphRAG: a Graph Retrieval-Augmented Generation platform combining biomedical knowledge graphs with LLMs for explainable biomedical Q&A.
Engineered distributed GraphRAG system managing 1M+ biomedical entities (proteins, genes, diseases) integrating UniProt, AlphaFold, and RXNav with NebulaGraph.
Improved factual accuracy by 40%; optimized graph traversal 3× through strategic caching and high-degree node pruning, achieving sub-500ms query latency at p95.
Designed automated ETL pipelines processing 2M+ entity updates monthly with schema validation.
Presented at an international AI panel attended by experts from India and the US — received commendation for technical leadership.

Python

NebulaGraph

LlamaIndex

GraphRAG

Docker

FastAPI

Campus Strategist

Perplexity AI

Jan 2025 – Apr 2025

Salt Lake City, UT

campus

Spearheaded campus-wide outreach programs to drive adoption of Perplexity's AI-powered search platform among students, faculty, and university clubs.
Onboarded 150+ Perplexity Pro users, facilitating seamless onboarding and sustained long-term engagement.

AI Advocacy

Community Building

Growth

Community Advisor

University of Utah Housing & Residential Education

Aug 2024 – Dec 2024

Salt Lake City, UT

campus

Ensured the safety and well-being of residential housing communities, providing conflict mediation, crisis response, and student support services for a 200+ resident community.

Leadership

Crisis Management

Community

Education_History

Bachelor of Science, Computer Science

University of Utah · Salt Lake City, UT

Aug 2023 – Dec 2026GPA: 3.7/4.0 (Dean's List)

Relevant Coursework

Machine Learning

Computer Vision

NLP

Distributed Systems

Algorithms & Data Structures

High School Diploma

Krishna Public School · Raipur, India

2017 – 2023

Featured Projects

AI-driven systems, hackathon winners, and research tools at scale

FIG-01

Hackathon WinnerApplication

Minute0

AI-Powered Deployment Monitor

Full-stack deployment monitoring and incident response system tracking Vercel deployments, classifying build/runtime failures, and triggering Slack alerts with approval workflows.
AI-assisted root-cause analysis with FastAPI and ChromaDB vector search over logs, generating structured fix suggestions for downstream coding agents.
Real-time React/TypeScript dashboard for live metrics, incident status, and agent health; deployed on Vercel with CI/CD pipeline.

ReactTypeScriptFastAPIChromaDBCerebrasSlack APIVercel

SOURCE_CODE LIVE_DEMO

FIG-02

Application

Wingman.ai

Multi-Modal AI Personal Assistant (iOS)

iOS personal assistant with voice, chat, and image input integrating GPT-4o and Whisper APIs for context-aware responses with RAG-enhanced memory.
Offline-first architecture with Firebase sync supporting real-time message streaming and persistent conversation history.

SwiftUIGPT-4oWhisperFirebaseRAGMVVM

SOURCE_CODE

FIG-03

Research System

BioGraphRAG

Biomedical Knowledge Graph Retrieval

Production-grade distributed GraphRAG system for healthcare professionals requiring trustworthy biomedical information retrieval.
Integrated UniProt, AlphaFold, RXNav, and BioKG into a unified NebulaGraph store with automated ETL processing 2M+ entity updates monthly.
Improved factual accuracy by 40%; optimized graph traversal 3× through caching and high-degree node pruning (sub-500ms at p95).

PythonNebulaGraphLlamaIndexDockerFastAPIAWSGraphRAG

SOURCE_CODE

FIG-04

System Architecture

FlowVía

V2X Urban Mobility Optimization System

Vehicle-to-Everything (V2X) traffic optimization platform combining V2V, V2I, and V2N communication for real-time adaptive traffic management.
LSTM-based traffic flow prediction models with live SPaT signal data; full system stack from OBD-II hardware to cloud ML backend.
AES-256 encrypted communication with rotating vehicle identifiers and edge-first architecture for privacy and ultra-low latency.

PythonTensorFlowLSTMV2XDSRCC-V2XOBD-II

SOURCE_CODE

FIG-05

HackUSU 2025Algorithm

RL Investment Advisor

Reinforcement Learning Portfolio Optimizer

Investment recommendation system combining DistillBERT-based sentiment analysis on financial news with DQN and PPO for portfolio optimization.
Demonstrated measurable outperformance on backtested portfolio allocation tasks.

PythonDistillBERTDQNPPOFlaskReinforcement Learning

SOURCE_CODE

FIG-06

System Architecture

Financial Multi-Agent System

Collaborative AI Investment Analysis

Collaborative AI system with specialized agents (Analyst, Trader, Risk Advisor) using CrewAI and LangChain for real-time financial analysis.
Designed inter-agent communication protocols enabling parallel analysis and consensus-driven output generation.

CrewAILangChainPythonFlaskMulti-Agent

SOURCE_CODE

FIG-07

Research Tool

Ref-RAG

Research Literature Chatbot

Custom RAG chatbot for the STARS Lab to extract structured information from large, unorganized PDF corpora of materials-science research papers.
Enabled researchers to query domain-specific knowledge across 1,000+ documents through a conversational interface.

PythonLangChainChainlitFastAPIRAG

SOURCE_CODE

Technical Skills

Languages, frameworks, databases, and tools I work with daily

Languages

Python95%

TypeScript / JavaScript88%

Java85%

C++82%

Swift78%

SQL80%

AI & Machine Learning

RAG / GraphRAG95%

LLMs / Transformers92%

Multi-Agent Systems88%

LangChain / LlamaIndex90%

CrewAI85%

PyTorch / TensorFlow80%

Cloud & Infrastructure

AWS (Lambda, Bedrock, S3)88%

Docker / Kubernetes82%

Microservices / Event-Driven80%

Firebase78%

Web & Mobile

React / Next.js90%

SwiftUI (iOS)78%

Flask / FastAPI88%

Tailwind CSS92%

Databases

PostgreSQL / MySQL82%

DynamoDB80%

NebulaGraph78%

ChromaDB / Pinecone85%

Developer Tools

Git / GitHub95%

Docker85%

Vercel / CI/CD85%

VS Code / Postman90%

Writing & Publications

Technical articles, research reports, and open knowledge sharing

BioGraphRAG — Biomedical Knowledge Graph Retrieval Augmented Generation

Oct 2024

Kunj's Substack

Co-authored with Niraj Kumar Singh (ML Engineer) · GMG Summer of Code

Full technical article presenting BioGraphRAG: system architecture, GraphRAG algorithm, node-degree performance analysis (low/mid/high-degree nodes), multi-stage answer enrichment pipeline integrating UniProt, AlphaFold, and RXNav, and future directions.

✦NebulaGraph's marketing team requested republication on their official website (Jun 2025).

FlowVía: A Technical Deep Dive into Next-Gen Urban Mobility

Apr 2024

Kunj's Substack

Solo-authored technical article covering FlowVía, a V2X urban traffic optimization system. Details V2V, V2I, V2N protocols, DSRC and C-V2X standards, real-time speed recommendation algorithms, LSTM-based traffic flow prediction, data privacy/security design, and scalability challenges.

Comparative Analysis: LLM Families on Legal Benchmarks

2025

Internal Technical Report · CourtEasy.ai / Nugen

Co-authored with team at CourtEasy.ai / Nugen

Co-authored comparative analysis of InLegalBERT, InLegalLLaMA, and GPT-4o-mini on LegalBench and NyayaAnumana benchmarks, synthesizing insights from 15+ research papers to inform production RAG workflow design and evaluation protocols.

Latest Blog Posts

Thoughts and insights on AI, cloud technologies, and software development

Building Effective RAG Systems for Medical Data

March 1, 2025 · 8 min read

Lessons learned from developing BioGraphRAG and optimizing retrieval for complex medical knowledge graphs.

Multi-Agent AI Systems: Architecture and Implementation

February 20, 2025 · 10 min read

How to design collaborative AI systems with specialized agents for complex tasks like financial analysis.

Reinforcement Learning for Financial Applications

February 5, 2025 · 7 min read

Exploring DQN and PPO algorithms for portfolio optimization and investment risk assessment.

View All Posts

Get In Touch

Open to new opportunities and collaborations in AI and software engineering

Get in Touch

I'm currently open to new opportunities and collaborations. Feel free to reach out!

kunj.rathod@utah.edu

linkedin.com/in/rathodkunj

GitHub

github.com/rathodkunj2005

Send a Message

I'll get back to you as soon as possible.

GitHub LinkedIn