Incoming Software Engineer @ Microsoft Azure Data · Jan 2026

🏆 Kahlert Impact Prize Recipient

Kunj Rathod

CS Researcher & AI Engineer at the University of Utah. Building AI systems from HIPAA-compliant hospital platforms to spatial memory for embodied agents and materials discovery pipelines.

10M+ Docs Indexed

40% Latency Reduced

1M+ Entities Managed



Python · TypeScript · React · Next.js · Swift · Java · C++ · AWS Bedrock · Lambda · LangChain · LlamaIndex · CrewAI · NebulaGraph · ChromaDB · Pinecone · DynamoDB · PostgreSQL · Docker · FastAPI · Flask · GraphRAG · RAG · Multi-Agent AI · PyTorch · TensorFlow · SwiftUI · Firebase · Vercel · Kubernetes

Experience & Education

From hospital AI platforms and legal research tools to aerospace materials discovery and embodied agents

layer_0: experience_activations

attn_0

Software Engineer Intern — Azure Data

Active

Microsoft

Jan 2026 – Present

Redmond, WA

industry

Building scalable cloud solutions for distributed data systems on the Azure Data team.
Focusing on full-stack software development and distributed systems within the Azure ecosystem.

Azure

Distributed Systems

Full Stack

Cloud

attn_1

Software Development Intern, AI Services (SUDO Program)

Active

University of Utah Health

Jan 2025 – Present

Salt Lake City, UT

industry

Built and deployed a HIPAA-compliant AI chat platform for 90+ hospital executives using React/TypeScript, Flask middleware, and AWS Bedrock microservices with event-driven Lambda orchestration.
Shipped 6 full-stack features across 4 sprints; integrated AWS Bedrock Agents, Knowledge Bases, and Guardrails for production clinical workflows.
Reduced inference latency by 40% and data query time by 60% via Bedrock pipeline optimization, API caching, and a DynamoDB–RDS hybrid database strategy.
Implemented token-streaming LLM responses (p95 <200ms TTFT) with resilient fallback handling and distributed session persistence for 1,000+ conversations.
Integrated interactive data visualization tools into the LLM chat interface enabling real-time analytics on hospital data.
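The token-streaming and fallback behaviour described above can be illustrated with a minimal sketch. It assumes two hypothetical token generators, `primary` and `fallback`, standing in for model endpoints; it is not the platform's actual Bedrock integration, and the choice to re-raise after partial output is an assumption made for the example.

```python
from typing import Callable, Iterable, Iterator


def stream_with_fallback(
    primary: Callable[[str], Iterable[str]],
    fallback: Callable[[str], Iterable[str]],
    prompt: str,
) -> Iterator[str]:
    """Yield tokens from `primary`; use `fallback` if it fails before emitting."""
    emitted = False
    try:
        for token in primary(prompt):
            emitted = True
            yield token
    except Exception:
        if emitted:
            # Partial output already reached the client, so surface the
            # error instead of splicing two different answers together.
            raise
        # Nothing was sent yet: it is safe to restart on the fallback.
        yield from fallback(prompt)
```

Each yielded token could then be forwarded to the client as it arrives, which is what keeps the time to first token low.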

React

TypeScript

AWS Bedrock

Lambda

DynamoDB

Flask

HIPAA

attn_2

Undergraduate Researcher — LLMs & Computational Simulations

STARS Lab, University of Utah (Collaboration: NASA, Microsoft, U.S. DoD)

Aug 2025 – Feb 2026

Salt Lake City, UT

research

Built a multi-agent, graph-augmented pipeline to extract and normalize material-property data from 1,000+ materials-science papers into a physics-aware graph for automated Ashby plot generation.
Developed a constraint-based 'design region' engine (temperature, creep, pressure limits) and benchmarking suite to identify feasible materials for extreme aerospace environments.
Explored LLMs and multi-agent AI to streamline knowledge sharing across interdisciplinary stakeholders including engineers, scientists, and DoD partners.
Built Ref-RAG, a custom RAG chatbot using LangChain and Chainlit to extract structured information from large unorganized PDF datasets for materials researchers.
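A constraint-based design region of the kind mentioned above can be sketched as a simple feasibility filter. The field names, units, and the "at least" interpretation of each limit are assumptions for illustration; the real engine's schema and constraint logic are not described in the source.

```python
from dataclasses import dataclass
from typing import Iterable, List


@dataclass(frozen=True)
class Material:
    name: str
    max_service_temp_c: float   # hypothetical field
    creep_limit_mpa: float      # hypothetical field
    max_pressure_mpa: float     # hypothetical field


@dataclass(frozen=True)
class DesignRegion:
    """Minimum requirements a material must meet to be feasible."""
    min_service_temp_c: float
    min_creep_limit_mpa: float
    min_pressure_mpa: float

    def admits(self, m: Material) -> bool:
        return (
            m.max_service_temp_c >= self.min_service_temp_c
            and m.creep_limit_mpa >= self.min_creep_limit_mpa
            and m.max_pressure_mpa >= self.min_pressure_mpa
        )


def feasible(materials: Iterable[Material], region: DesignRegion) -> List[Material]:
    """Return the materials that satisfy every constraint in the region."""
    return [m for m in materials if region.admits(m)]


# Placeholder entries with illustrative numbers, not measured properties.
candidates = [
    Material("alloy_a", 1100.0, 150.0, 30.0),
    Material("alloy_b", 700.0, 200.0, 50.0),
]
region = DesignRegion(900.0, 100.0, 20.0)
names = [m.name for m in feasible(candidates, region)]
```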

Python

LLMs

Multi-Agent

LangChain

Graph RAG

NASA

DoD

attn_3

AI Engineering Intern

CourtEasy.ai / Nugen

Nov 2024 – Apr 2025

Remote

industry

Scaled hybrid legal-document retrieval to 10M+ indexed Indian legal documents (statutes, court orders), supporting 5,000+ daily queries.
Improved retrieval accuracy by 28% and reduced hallucinations by 35% via hybrid RAG (dense vectors + BM25 + reranking) and context-grounding optimizations for Legal-NER tasks.
Built production ETL ingesting 500k+ documents/week and benchmarked 8 LLM families on 4 legal benchmarks including LegalBench and NyayaAnumana.
This benchmark analysis guided model-routing decisions, reducing projected inference spend by $50k+/year.
Co-authored a comparative analysis paper synthesizing insights from 15+ research papers on legal AI.
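One common way to merge a dense-vector ranking with a BM25 ranking before reranking is reciprocal rank fusion (RRF). The source does not say which fusion method the hybrid retriever used, so the sketch below is an assumption chosen for illustration, with `k=60` as the conventional default constant.

```python
from collections import defaultdict
from typing import Dict, List, Sequence


def reciprocal_rank_fusion(rankings: Sequence[Sequence[str]], k: int = 60) -> List[str]:
    """Fuse several ranked lists of document ids into one ranking.

    Each document scores sum(1 / (k + rank)) over the lists it appears in,
    with rank starting at 1. Ties are broken by document id for determinism.
    """
    scores: Dict[str, float] = defaultdict(float)
    for ranking in rankings:
        for rank, doc_id in enumerate(ranking, start=1):
            scores[doc_id] += 1.0 / (k + rank)
    return sorted(scores, key=lambda d: (-scores[d], d))


# Hypothetical document ids from a dense retriever and a BM25 retriever.
dense_hits = ["a", "b", "c"]
bm25_hits = ["b", "c", "a"]
fused = reciprocal_rank_fusion([dense_hits, bm25_hits])
```

The fused list would then be passed to a reranker, which only needs to score the top few candidates.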

RAG

BM25

Legal-NER

LegalBench

Python

LLMs

attn_4

AI Research Intern — BioGraphRAG

Garje Marathi Global (GMG Summer of Code)

May 2024 – Aug 2024

Salt Lake City, UT

research

Led development of BioGraphRAG: a Graph Retrieval-Augmented Generation platform combining biomedical knowledge graphs with LLMs for explainable biomedical Q&A.
Engineered distributed GraphRAG system managing 1M+ biomedical entities (proteins, genes, diseases) integrating UniProt, AlphaFold, and RXNav with NebulaGraph.
Improved factual accuracy by 40%; optimized graph traversal 3× through strategic caching and high-degree node pruning, achieving sub-500ms query latency at p95.
Designed automated ETL pipelines processing 2M+ entity updates monthly with schema validation.
Presented at an international AI panel attended by experts from India and the US — received commendation for technical leadership.
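The caching and high-degree node pruning mentioned above can be sketched on a toy in-memory adjacency map. This is an illustration only: the real system runs on NebulaGraph, and the `max_degree` threshold, the node names, and the rule of not expanding hub nodes are assumptions.

```python
from collections import deque
from functools import lru_cache
from typing import Callable, Dict, Sequence, Set, Tuple


def make_k_hop(adjacency: Dict[str, Sequence[str]], max_degree: int) -> Callable[[str, int], Set[str]]:
    """Build a k-hop traversal that caches lookups and skips hub nodes."""

    @lru_cache(maxsize=None)
    def neighbors(node: str) -> Tuple[str, ...]:
        nbrs = tuple(adjacency.get(node, ()))
        # Pruning: a hub is still reachable, but it is never expanded.
        return () if len(nbrs) > max_degree else nbrs

    def k_hop(start: str, hops: int) -> Set[str]:
        seen = {start}
        frontier = deque([(start, 0)])
        while frontier:
            node, depth = frontier.popleft()
            if depth == hops:
                continue
            for nxt in neighbors(node):
                if nxt not in seen:
                    seen.add(nxt)
                    frontier.append((nxt, depth + 1))
        return seen - {start}

    return k_hop


# Placeholder graph: "hub" has four neighbours and exceeds max_degree=3.
toy = {
    "gene_a": ["protein_b", "hub"],
    "protein_b": ["disease_c"],
    "hub": ["n1", "n2", "n3", "n4"],
}
k_hop = make_k_hop(toy, max_degree=3)
reached = k_hop("gene_a", 2)
```

Skipping hubs bounds the fan-out of each hop, and the cache avoids repeating neighbour lookups across queries that share nodes.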

Python

NebulaGraph

LlamaIndex

GraphRAG

Docker

FastAPI

attn_5

Campus Strategist

Perplexity AI

Jan 2025 – Apr 2025

Salt Lake City, UT

campus

Spearheaded campus-wide outreach programs to drive adoption of Perplexity's AI-powered search platform among students, faculty, and university clubs.
Onboarded 150+ Perplexity Pro users and supported their sustained long-term engagement.

AI Advocacy

Community Building

Growth

attn_6

Community Advisor

University of Utah Housing & Residential Education

Aug 2024 – Dec 2024

Salt Lake City, UT

campus

Ensured the safety and well-being of residential housing communities, providing conflict mediation, crisis response, and student support services for a 200+ resident community.

Leadership

Crisis Management

Community

layer_1: system_rewards

reward_0

Kahlert Impact Prize Recipient

Kahlert School of Computing, University of Utah

Mar 2026 · Amount: $1,000

Undergraduate scholarship awarded for societal impact through AI research and production systems in healthcare, legal-tech, and embodied AI. Funded by a $15M endowment from The Kahlert Foundation; recognizes students with a compelling track record of translating computing research into real-world societal benefit.

layer_2: pre_training_data

pretrain_0

Bachelor of Science, Computer Science

University of Utah · Salt Lake City, UT

Aug 2023 – Dec 2026 · GPA: 3.7/4.0 (Dean's List)

Relevant Coursework

Machine Learning

Computer Vision

NLP

Distributed Systems

Algorithms & Data Structures

pretrain_1

High School Diploma

Krishna Public School · Raipur, India

2017 – 2023

Featured Projects

AI-driven systems, hackathon winners, and research tools at scale

FIG-00

AI Platform

HirePilot

Autonomous AI Recruiting Agency

Built a fully autonomous recruiting backend with specialized AI agents (Enrichment, Scheduling, Interview, Evaluation) to manage the end-to-end hiring lifecycle, from GitHub sourcing to live candidate screening.
Engineered complex integrations with Twilio for real-time voice AI interviews, Google Calendar for automated slot scheduling, and Slack/Resend for manager approvals and multichannel outreach.

TypeScript · Node.js · Express · PostgreSQL · Anthropic API

SOURCE_CODE

FIG-01

New Initiative · Application

CloudCoder

Instant AI to AWS App Deployment

A full-stack, model-agnostic AI orchestrator embedded within my portfolio that lets users generate serverless AWS applications and deploy them directly to their live AWS account.
Generates highly structured React SPAs, Node.js Lambda functions, and SAM CloudFormation templates via Vercel AI SDK (OpenAI/Anthropic).
Uses the AWS SDK for JavaScript and JSZip to package Lambda code dynamically, create S3 artifacts, and execute CloudFormation templates, streaming logs to the user interface in real time over Server-Sent Events (SSE).
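The Server-Sent Events framing used for log streaming can be shown with a short sketch. The project itself is written in JavaScript; Python is used here only to illustrate the wire format, and the function name and the `log` event type are hypothetical.

```python
from typing import Optional


def sse_event(data: str, event: Optional[str] = None) -> str:
    """Format one Server-Sent Events frame.

    Each line of the payload gets its own `data:` field, and a blank
    line terminates the frame, as the SSE format requires.
    """
    lines = []
    if event is not None:
        lines.append(f"event: {event}")
    for part in data.split("\n"):
        lines.append(f"data: {part}")
    return "\n".join(lines) + "\n\n"


frame = sse_event("stack CREATE_IN_PROGRESS\nbucket created", event="log")
```

A server would write frames like this to a response with the `text/event-stream` content type, and the browser's `EventSource` would reassemble the multi-line payload.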

Next.js · GPT-4o / Claude 3.5 · AWS SDK v3 · CloudFormation · JSZip

SOURCE_CODE LIVE_DEMO

FIG-02

Hackathon Winner · Application

Minute0

AI-Powered Deployment Monitor

Full-stack deployment monitoring and incident response system tracking Vercel deployments, classifying build/runtime failures, and triggering Slack alerts with approval workflows.
AI-assisted root-cause analysis with FastAPI and ChromaDB vector search over logs, generating structured fix suggestions for downstream coding agents.
Real-time React/TypeScript dashboard for live metrics, incident status, and agent health; deployed on Vercel with CI/CD pipeline.

React · TypeScript · FastAPI · ChromaDB · Cerebras · Slack API · Vercel

SOURCE_CODE LIVE_DEMO

FIG-03

Local AI System

FNDR

Privacy-First Local AI Assistant for macOS

Engineered a high-performance macOS desktop application using Rust and Tauri, delivering a zero-trust, local-only memory assistant with full data sovereignty — no cloud, no telemetry.
Optimized on-device inference for LLMs (Llama 3.2) and VLMs (SmolVLM) with Metal-accelerated backends, achieving low-latency RAG on M-series Apple Silicon.
Architected a real-time screen extraction pipeline using Apple Vision Framework for high-speed OCR and CLIP-based visual embeddings to reconstruct temporal context from screen snapshots.
Designed a Graphiti-style Temporal Search Engine modeling semantic relationships across user activities, web sessions, and meeting transcripts, enabling proactive entity extraction and multi-hop reasoning.
Implemented automated meeting intelligence with local Whisper-based transcription (Parakeet) and segmented audio processing integrated into the global memory index.
Developed a Model Context Protocol (MCP) server for secure, local interoperability between the memory store and external AI agents or IDEs.

Rust · Tauri · Metal · ONNX · Llama 3.2 · Whisper · Apple Vision

SOURCE_CODE

FIG-04

AI Orchestrator

Omni

Everything. Everywhere. All at Once — AI Orchestrator

Unified AI intelligence layer integrating Gmail, Google Calendar, Slack, and FNDR private memory to eliminate context switching — generates Smart Todos, schedules meetings via natural language, and retrieves personal context on demand.
Deep integrations with GitHub and Apple Services; supports real-time voice interaction and autonomous multi-step workflow orchestration across the entire digital stack.
Designed as a single interface that shifts from passive assistant to proactive digital companion by anticipating user needs.

React · TypeScript · OpenAI · MCP · Slack API · Gmail API · Google Calendar · FNDR

SOURCE_CODE

FIG-05

Application

Wingman.ai

Multi-Modal AI Personal Assistant (iOS)

iOS personal assistant with voice, chat, and image input integrating GPT-4o and Whisper APIs for context-aware responses with RAG-enhanced memory.
Offline-first architecture with Firebase sync supporting real-time message streaming and persistent conversation history.

SwiftUI · GPT-4o · Whisper · Firebase · RAG · MVVM

SOURCE_CODE

FIG-06

Research System

BioGraphRAG

Biomedical Knowledge Graph Retrieval

Production-grade distributed GraphRAG system for healthcare professionals requiring trustworthy biomedical information retrieval.
Integrated UniProt, AlphaFold, RXNav, and BioKG into a unified NebulaGraph store with automated ETL processing 2M+ entity updates monthly.
Improved factual accuracy by 40%; optimized graph traversal 3× through caching and high-degree node pruning (sub-500ms at p95).

Python · NebulaGraph · LlamaIndex · Docker · FastAPI · AWS · GraphRAG

SOURCE_CODE

FIG-07

System Architecture

FlowVía

V2X Urban Mobility Optimization System

Vehicle-to-Everything (V2X) traffic optimization platform combining V2V, V2I, and V2N communication for real-time adaptive traffic management.
LSTM-based traffic flow prediction models with live SPaT signal data; full system stack from OBD-II hardware to cloud ML backend.
AES-256 encrypted communication with rotating vehicle identifiers and edge-first architecture for privacy and ultra-low latency.
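Rotating vehicle identifiers can be derived in several ways; the source does not describe FlowVía's scheme. The sketch below assumes one simple option: a pseudonym computed as an HMAC of the current time window under a per-vehicle secret, so the identifier changes every `period_s` seconds and cannot be linked across windows without the secret. The function name, the 300-second period, and the 16-character truncation are all illustrative choices.

```python
import hashlib
import hmac


def rotating_id(secret: bytes, unix_time: int, period_s: int = 300) -> str:
    """Return a pseudonymous identifier that is stable within one time window."""
    window = unix_time // period_s
    digest = hmac.new(secret, window.to_bytes(8, "big"), hashlib.sha256).hexdigest()
    # Truncated for compact broadcast; the length is an arbitrary choice here.
    return digest[:16]


vehicle_secret = b"example-secret"  # placeholder, not a real key
id_now = rotating_id(vehicle_secret, 1_000)
id_later = rotating_id(vehicle_secret, 1_000 + 300)
```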

Python · TensorFlow · LSTM · V2X · DSRC · C-V2X · OBD-II

SOURCE_CODE

FIG-08

HackUSU 2025 · Algorithm

RL Investment Advisor

Reinforcement Learning Portfolio Optimizer

Investment recommendation system combining DistilBERT-based sentiment analysis on financial news with DQN and PPO for portfolio optimization.
Demonstrated measurable outperformance on backtested portfolio allocation tasks.

Python · DistilBERT · DQN · PPO · Flask · Reinforcement Learning

SOURCE_CODE

FIG-09

System Architecture

Financial Multi-Agent System

Collaborative AI Investment Analysis

Collaborative AI system with specialized agents (Analyst, Trader, Risk Advisor) using CrewAI and LangChain for real-time financial analysis.
Designed inter-agent communication protocols enabling parallel analysis and consensus-driven output generation.
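Parallel analysis with a consensus step can be sketched in plain Python. This does not use the CrewAI or LangChain APIs; the agents are stand-in functions, and the majority-vote rule with a `hold` default is an assumption about what "consensus-driven" might mean here.

```python
from collections import Counter
from concurrent.futures import ThreadPoolExecutor
from typing import Callable, Dict, Tuple


def run_consensus(
    agents: Dict[str, Callable[[str], str]], ticker: str
) -> Tuple[str, Dict[str, str]]:
    """Run every agent on `ticker` in parallel and return the majority view.

    Falls back to "hold" when no recommendation has a strict majority.
    Expects at least one agent.
    """
    with ThreadPoolExecutor(max_workers=len(agents)) as pool:
        futures = {name: pool.submit(fn, ticker) for name, fn in agents.items()}
        votes = {name: fut.result() for name, fut in futures.items()}
    decision, count = Counter(votes.values()).most_common(1)[0]
    if count <= len(votes) / 2:
        decision = "hold"
    return decision, votes


# Stand-in agents returning fixed recommendations for illustration.
agents = {
    "analyst": lambda t: "buy",
    "trader": lambda t: "buy",
    "risk_advisor": lambda t: "hold",
}
decision, votes = run_consensus(agents, "EXAMPLE")
```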

CrewAI · LangChain · Python · Flask · Multi-Agent

SOURCE_CODE

FIG-10

Research Tool

Ref-RAG

Research Literature Chatbot

Custom RAG chatbot for the STARS Lab to extract structured information from large, unorganized PDF corpora of materials-science research papers.
Enabled researchers to query domain-specific knowledge across 1,000+ documents through a conversational interface.

Python · LangChain · Chainlit · FastAPI · RAG

SOURCE_CODE

Technical Skills

Languages, frameworks, databases, and tools I work with daily

Languages

Python · 95%

TypeScript / JavaScript · 88%

Java · 85%

C++ · 82%

Swift · 78%

SQL · 80%

AI & Machine Learning

RAG / GraphRAG · 95%

LLMs / Transformers · 92%

Multi-Agent Systems · 88%

LangChain / LlamaIndex · 90%

CrewAI · 85%

PyTorch / TensorFlow · 80%

Cloud & Infrastructure

AWS (Lambda, Bedrock, S3) · 88%

Docker / Kubernetes · 82%

Microservices / Event-Driven · 80%

Firebase · 78%

Web & Mobile

React / Next.js · 90%

SwiftUI (iOS) · 78%

Flask / FastAPI · 88%

Tailwind CSS · 92%

Databases

PostgreSQL / MySQL · 82%

DynamoDB · 80%

NebulaGraph · 78%

ChromaDB / Pinecone · 85%

Developer Tools

Git / GitHub · 95%

Docker · 85%

Vercel / CI/CD · 85%

VS Code / Postman · 90%

Writing & Publications

Technical articles, research reports, and open knowledge sharing

BioGraphRAG — Biomedical Knowledge Graph Retrieval Augmented Generation

Oct 2024

Kunj's Substack

Co-authored with Niraj Kumar Singh (ML Engineer) · GMG Summer of Code

Full technical article presenting BioGraphRAG: system architecture, GraphRAG algorithm, node-degree performance analysis (low/mid/high-degree nodes), multi-stage answer enrichment pipeline integrating UniProt, AlphaFold, and RXNav, and future directions.

✦ NebulaGraph's marketing team requested republication on their official website (Jun 2025).

FlowVía: A Technical Deep Dive into Next-Gen Urban Mobility

Apr 2024

Kunj's Substack

Solo-authored technical article covering FlowVía, a V2X urban traffic optimization system. Details V2V, V2I, V2N protocols, DSRC and C-V2X standards, real-time speed recommendation algorithms, LSTM-based traffic flow prediction, data privacy/security design, and scalability challenges.

Comparative Analysis: LLM Families on Legal Benchmarks

2025

Internal Technical Report · CourtEasy.ai / Nugen

Co-authored with team at CourtEasy.ai / Nugen

Co-authored comparative analysis of InLegalBERT, InLegalLLaMA, and GPT-4o-mini on LegalBench and NyayaAnumana benchmarks, synthesizing insights from 15+ research papers to inform production RAG workflow design and evaluation protocols.

Latest Blog Posts

Thoughts and insights on AI, cloud technologies, and software development

Building Effective RAG Systems for Medical Data

March 1, 2025 · 8 min read

Lessons learned from developing BioGraphRAG and optimizing retrieval for complex medical knowledge graphs.

Multi-Agent AI Systems: Architecture and Implementation

February 20, 2025 · 10 min read

How to design collaborative AI systems with specialized agents for complex tasks like financial analysis.

Reinforcement Learning for Financial Applications

February 5, 2025 · 7 min read

Exploring DQN and PPO algorithms for portfolio optimization and investment risk assessment.


Get In Touch

Open to new opportunities and collaborations in AI and software engineering

I'm currently open to new opportunities and collaborations. Feel free to reach out!

kunj.rathod@utah.edu

linkedin.com/in/rathodkunj

GitHub

github.com/rathodkunj2005

