Utkarsh
Mittal.
Software engineer with 5+ years building products end-to-end: microservices and data platforms at Dell, LLM pipelines at Locomex, network scheduling software at Nokia. M.S. Computer Engineering, NYU (May 2026).
//about
I got hooked on distributed systems the day a production outage at Dell took down 500K daily transactions and the RCA pointed to a single unguarded hot path. I spent a week fixing it properly. I never looked at code the same way again.
I'm a software engineer who cares about the full stack: from the data model and API contract, through the service layer and message queue, to the React UI consuming it. Five years across Dell, Locomex, and Nokia has meant writing C#/.NET microservices, LLM prompt chains, Spring Boot schedulers, and React frontends, sometimes in the same week.
The portfolio projects here are a sprint to build public proof-of-work across four domains: AI retrieval, distributed storage, real-time collaboration, and backend infrastructure. Each ships a deployed demo, a measured benchmark, and a public repo you can actually run.
M.S. Computer Engineering, NYU Tandon (May 2026) · GPA 3.963. B.Tech CS, Shiv Nadar University (2020).
- M.S. Computer Engineering · NYU · 20263.963
- B.Tech Computer Science · SNU · 2020
- Backend systems & API design
- Distributed systems & data pipelines
- AI/LLM integration & prompt engineering
- Frontend (React/Next.js)
- Cloud infra & DevOps (AWS, Docker, Terraform)
- SDE / Backend / Full-Stack roles
- Mid-level to senior
- New York area or remote
//skills
//projects
Off-the-shelf RAG systems hallucinate without warning. Users get confident-sounding wrong answers with no way to verify them. This system traces every claim back to an exact source span and quantifies how sure it is.
In regulated industries like legal, finance, and healthcare, a wrong answer with a citation is worse than no answer. Grounded retrieval with measurable precision lets companies actually deploy AI on internal documents.
Hybrid BM25 and vector retrieval fused via RRF, cross-encoder reranking, SOURCE_X citation injection into the LLM prompt, then claim-level confidence scoring via cosine similarity against cited chunks.
High-throughput event streaming is critical infrastructure for modern data pipelines, microservice communication, and real-time analytics. Most teams use Kafka as a black box with no visibility into what happens at the storage and replication layer.
When Kafka goes wrong at scale, engineers who understand segment files, ISR shrinkage, and high-watermark semantics fix it in hours. Engineers who only know the client API spend days. This project demonstrates that depth.
Segment files on disk with sparse index files for O(1) seeks, a custom binary TCP wire protocol, ISR replication with high-watermark tracking, and a consumer group coordinator with JoinGroup and SyncGroup state machine.
>100K msg/sec sustained · p99 <15ms · 3-broker cluster with ISR replication · TypeScript dashboard
Collaborative software requires multiple users to edit shared state simultaneously without conflicts, lost updates, or stale reads. Traditional locking breaks under network partitions and does not support offline editing.
CRDTs power the real-time layer in Notion, Figma, and Linear. Understanding the algorithm well enough to implement a WebSocket sync server and offline-first client is a meaningful signal for product-facing engineering roles.
Custom WebSocket server with Yjs binary sync protocol, IndexedDB offline persistence on the client, Postgres snapshots every 30 seconds for durability, and cursor presence via the Awareness protocol.
Every API needs rate limiting. Most implementations use a naive check-then-set pattern that has race conditions under concurrency: two simultaneous requests can both read under limit and both get through, breaking the contract.
Rate limiting is security-critical infrastructure in any multi-tenant SaaS product. Getting the algorithm and atomicity right under load, then proving it with benchmarks and IaC, is the kind of ownership companies look for in senior engineers.
Three algorithms as atomic Redis Lua scripts with zero race conditions, per-tenant config with 30-second in-memory cache and Redis pub/sub invalidation, Terraform-deployed to Fly.io, and load-tested to 8K+ RPS with k6.
>8K RPS · p50 <2ms · p99 <20ms · zero race conditions under concurrency
//experience
- Shipped a Bandwidth Scheduler for Nokia WaveSuite in Spring Boot, React/Redux, and Oracle DB, replacing a manual reservation workflow for advance allocation of optical network capacity.
- Modeled real-world optical constraints (path conflicts, wavelength contention, 1+1 protection rules) into a normalized data model and REST API, eliminating an entire class of double-booking errors caught in testing.
- Converted tribal domain rules from network engineers into deterministic backend validations and contract tests, unblocking sign-off from 5 stakeholder teams.
//contact
Open to SDE, backend, and full-stack engineering roles, mid-level to senior. I respond to every message, usually within a day.