Available · San Antonio, TX · Open to US opportunities

Shivam Prajapati

Full Stack AI Engineer with 4+ years of production experience. I build AI-powered systems that handle real traffic, real users, and real scale — from webhook-driven code review platforms to RAG pipelines serving semantic search at production.

M.S. Computer Science · Java + Python + TypeScript · Spring Boot + FastAPI + Next.js · Claude API + LangChain

Shivam Prajapati

What I build with

A production-tested stack spanning backend systems, AI/LLM integration, cloud infrastructure, and full-stack web development.

Languages
Primary languages across backend services, scripting, and frontend development.
Java Python TypeScript JavaScript
Backend & APIs
Production-grade frameworks for REST, GraphQL, event-driven, and microservice architectures.
Spring Boot FastAPI Node.js GraphQL REST APIs OAuth 2.0 Webhooks Microservices
Frontend
Modern component-based UI development with server-side rendering and responsive design.
React.js Next.js HTML5 CSS3 Tailwind
AI & LLMs
End-to-end AI application development — from embedding pipelines to agentic workflows and production LLM integration.
Claude API RAG Pipelines LangChain ChromaDB Vector Search Prompt Engineering SSE Streaming
Cloud & DevOps
Cloud-native deployment, containerization, and CI/CD pipeline automation at production scale.
AWS Docker Kubernetes Terraform Jenkins Git
Databases & Streaming
Relational, NoSQL, and event-streaming systems — optimized queries, indexing, and high-throughput pipelines.
PostgreSQL MS SQL Server MongoDB MySQL Kafka Supabase

Where I've shipped

4+ years building and owning production systems — from event-driven billing pipelines to full-stack AI platforms serving 50K+ users.

Aug 2025 — Nov 2025
Find Me LLC
New York, USA
● Remote · Internship
Software Developer Intern
  • Boosted page load 42% (2.4s → 1.4s) and cut payload size 40% by migrating RESTful APIs to GraphQL with field-level resolvers and depth limiting.
  • Reduced manual review workload by 30% (100 → 70 cases/day) by integrating ML classification models into Python backend services.
  • Shipped customer-requested features across frontend, backend, and AWS deployment pipelines within 2-week sprint cycles.
GraphQL Python ML Integration AWS Docker
Sep 2019 — Jul 2023
iSummation Technologies
India
● On-site · Full-time
Software Developer
  • Led full-stack platform development (React, Next.js, Spring Boot) serving 50K+ users with 99.8% uptime — achieved through load testing and circuit-breaker patterns.
  • Enforced OpenAPI contracts across 10+ microservices, cutting production incidents 42% and reducing cross-team integration time from 5 days to 2 days.
  • Diagnosed and resolved N+1 query patterns and added composite indexes in MS SQL Server, reducing API latency 33% (~900ms → 600ms) across production endpoints.
  • Mentored 2 junior engineers through structured code reviews and pair-programming, reducing their PR defect rate 40% in Q1.
Java Spring Boot React Next.js MS SQL Microservices
Dec 2018 — Mar 2019
Silverwing Technologies
India
● On-site · Internship
Software Developer Intern
  • Replaced synchronous billing with a Kafka + Spring Boot event-driven pipeline sustaining 10,000+ events/min with zero message loss and zero billing timeout errors.
  • Automated payment workflows via REST APIs with schema validation, cutting manual processing effort 40% and eliminating recurring data-entry errors.
Kafka Spring Boot Event-Driven REST APIs

Things I've built

Production-grade AI-powered applications — not tutorials, not clones. Each built end-to-end with real architecture decisions and working deployments.

01
May 2026
GitHub PR Reviewer

AI-powered code review system that automatically analyzes GitHub pull requests using Claude API. When a developer opens or updates a PR, GitHub webhooks trigger per-file analysis — generating severity-ranked findings (critical / warning / suggestion) with concrete fixes, posted directly as formatted PR comments.

Key Engineering Decisions
  • Webhook-driven flow — GitHub calls the server; no polling, no manual triggers
  • DB-first caching — reviews served from Supabase instantly on repeat requests; bypasses cache when PR is merged or closed
  • Per-file chunking — each changed file analyzed individually with language detection, preventing context overflow
  • Structured JSON output — Claude prompted to return strict JSON; parsed into severity-ranked entities stored in PostgreSQL
  • Force refresh — developers can re-trigger analysis after new commits without cache
Java Spring Boot Next.js TypeScript PostgreSQL Supabase Claude API GitHub Webhooks Docker
02
Apr 2026
Intelligent Document Retrieval Platform

Full-stack RAG system for semantic PDF retrieval and context-aware conversational Q&A. Users upload documents and ask questions in natural language — the system retrieves relevant chunks using vector similarity and generates grounded answers using Claude API with real-time token streaming.

Key Engineering Decisions
  • RAG pipeline — PDFs chunked, embedded, and stored in ChromaDB for semantic vector retrieval
  • SSE streaming — responses streamed token-by-token to the frontend, reducing perceived latency significantly
  • Page-level citations — every answer references the specific page it came from
  • Conversational memory — chat history maintained for multi-turn contextual Q&A
Python FastAPI Next.js ChromaDB Claude API SSE LangChain
03
Jan 2026
Auth / SSO Service

Production-grade multi-tenant authentication and SSO service built from the ground up. Supports multiple organizations with isolated permission models, JWT refresh-token rotation, and a complete audit trail — designed to handle real security requirements without off-the-shelf auth providers.

Key Engineering Decisions
  • Multi-tenant isolation — each tenant has independent RBAC with 5+ configurable permission levels
  • JWT refresh rotation — short-lived access tokens + rotating refresh tokens; zero token replay vulnerabilities across environments
  • Full audit logging — every auth event (login, token refresh, permission change) logged with timestamp and actor
  • Zero misconfigurations — access control incidents reduced to zero in QA and staging after rollout
Java Spring Boot Spring Security PostgreSQL JWT RBAC OAuth 2.0

The engineer behind the code

I don't just build features.
I own systems.

I'm a Full Stack AI Engineer with roots in enterprise Java backend development and a growing focus on production AI systems. My career started with event-driven pipelines handling 10K+ events/min at Silverwing, progressed to owning a 50K-user platform at iSummation, and now centers on building intelligent applications that combine LLM capabilities with real engineering depth.

What sets me apart is the combination: I understand distributed systems, database performance, and API design from years of backend work — and I can apply that same rigor to AI-powered applications. I don't just wrap an LLM in a chat interface. I build caching layers, webhook pipelines, structured output parsers, and production deployment configurations around them.

I hold an M.S. in Computer Science from Cleveland State University (GPA 3.7) and am actively looking for full-stack or AI engineering roles in the US. I'm open to relocating anywhere.

  • Systems thinking first — understand the full stack before writing line one
  • Metrics-driven — every improvement backed by before/after numbers
  • Production mindset — error handling, caching, monitoring, not just happy path
  • Fast learner — picked up LangChain, ChromaDB, webhooks, and Supabase while building
M.S. Computer Science
Cleveland State University, Ohio
Sep 2023 — May 2025 GPA 3.7/4.0
🐳
Docker Foundations Professional
Docker, Inc. · Jan 2026
🏦
Software Engineering Job Simulation
JPMorgan Chase · Jan 2026
🛒
Advanced Software Engineering Simulation
Walmart · Apr 2026

Let's
connect.

Looking for full-stack or AI engineering roles across the US. Available immediately and open to relocating.