Available for AI Engineer Roles — May 2026

Ganesh
Gopu

ML · LLM · AI Automation Engineer
Building production-grade AI systems — from fine-tuned LLMs and agentic RAG pipelines to full-stack automation platforms that ship real business impact.

LangGraph RAG Pipelines LLaMA · GPT-4o · Claude Vertex AI PyTorch · LoRA React · FastAPI
View Projects Get In Touch
Scroll to explore

Building AI that
ships to production.

I'm an MS Computer Science graduate from the University of Bridgeport (May 2026), specializing in building and deploying end-to-end AI systems that solve real problems.

My experience spans the full AI engineering stack — from fine-tuning LLaMA 3 with QLoRA and building stateful LangGraph agents to shipping a full-stack roasting automation platform at SoundCoffees that eliminated 100% of raw material waste.

I work across both open-source LLMs (LLaMA, Phi-3) and closed APIs (Claude, GPT-4o, Gemini), with hands-on experience in MLOps, guardrails, hybrid-search RAG, and cloud deployment. My cybersecurity background gives me a unique edge in building AI systems that are both powerful and safe.

5+
Production AI Systems Deployed
100%
Waste Eliminated at SoundCoffees
96%
RL Agent Win Rate
4+
LLM Providers (Open + Closed)

Technical Skills

🧠
LLM & Generative AI
LLaMA 3Phi-3 GPT-4oClaude 3 GeminiPEFT/LoRA QLoRAbitsandbytes Prompt EngineeringFew-shot
🤖
Agentic AI & RAG
LangChainLangGraph LlamaIndexCrewAI ReAct AgentsMCP Vertex AI AgentsHybrid Search Cross-encoder RerankingLangSmith
🗄️
Vector Databases
ChromaDBFAISS PineconeQdrant Embedding ModelsBM25
⚙️
MLOps & LLMOps
MLflowWeights & Biases DockerGitHub Actions FastAPIGradio GuardrailsLLM-as-a-Judge Eval-gated Deploys
☁️
Cloud & Infrastructure
GCP (Vertex AI)AWS (S3, EC2) Supabase CloudHugging Face Hub SageMakerGCS
🔐
AI Safety & Security
LLM Red TeamingAdversarial ML Prompt Injection DefensePII Detection Network SecurityResponsible AI

Featured Projects

01 — FLAGSHIP PROJECT
Multi-LLM Agentic RAG System

Production-grade agentic RAG system supporting 4 LLM backends (LLaMA 3, GPT-4o, Claude 3, Gemini). Features hybrid search with BM25 + vector retrieval, cross-encoder reranking, layered guardrails with PII masking and toxicity filtering, and stateful LangGraph agents with persistent memory.

✦ Live demo on Hugging Face Spaces — click to try
LangGraphQLoRA GPT-4o APIClaude API Hybrid SearchGuardrails MLflowFastAPI
02 — ENTERPRISE AI
SoundCoffees AI Automation Platform

Full-stack AI operations platform built from scratch during internship. Integrated live sales data via REST API, ML demand forecasting, real-time inventory monitoring, threshold-based alert systems, and a React + Supabase production dashboard used daily by operations staff.

✦ 100% waste eliminated · Zero stock-out events
ReactFastAPI SupabasePostgreSQL Demand ForecastingGitHub Actions
03 — VERTEX AI
Intelligent Agent — Google Vertex AI

Production AI agent built on Google Vertex AI Agent Builder using Gemini. Implemented multi-step agentic workflows with LangGraph, MCP for external tool integration, Qdrant for self-hosted vector retrieval with metadata filtering, and eval-gated deployments ensuring production reliability.

✦ Multi-tool orchestration with function calling
Vertex AIGemini API LangGraphMCP QdrantGCS
04 — DEEP LEARNING
Attention-Based Reward Training

Custom training framework applying reward signals directly through transformer attention blocks — using a -10 penalty gradient on incorrect predictions to reshape attention patterns. Bridges supervised learning and RLHF principles. Tracked with Weights & Biases, visualized via Gradio.

✦ Measurable accuracy improvement over baseline
PyTorchTransformers Custom LossW&BGradio
05 — REINFORCEMENT LEARNING
Autonomous Catch Game — RL Agent

Q-learning agent trained using experience replay buffer and custom reward shaping (+1 catch / -1 miss). Tuned epsilon-greedy exploration decay and replay buffer hyperparameters. Achieved a 96% win rate across 100 fully autonomous evaluation games.

✦ 96% win rate — fully autonomous gameplay
Q-LearningExperience Replay Reward ShapingW&B

Where I've Worked

Automation & Data Systems Intern
SoundCoffees · Bridgeport, CT
Jun 2025 – Dec 2025
  • Owned end-to-end design and deployment of an intelligent roasting automation platform, replacing all manual tracking with automated data pipelines and ML forecasting.
  • Connected platform to company website via REST API to ingest live sales data, feeding demand forecasting models that auto-generated daily roasting schedules.
  • Eliminated stock-out events entirely and reduced raw material waste by 100% through data-driven batch planning and threshold alert systems.
  • Built real-time React + Supabase dashboard; set up GitHub Actions CI/CD for zero-downtime production deployments.
Software Developer
F13 Technologies · New Delhi, India
Apr 2022 – Aug 2024
  • Built and maintained backend APIs and services in C# (.NET Core) and Python, focusing on distributed computing and data confidentiality across multi-tenant cloud platforms handling 200K+ daily transactions.
  • Achieved 28% latency reduction by optimizing threaded execution and memory management within API request pipelines.
  • Collaborated with DevOps teams to implement containerized security policies and role-based access controls (RBAC) using Docker and Azure Key Vault.
  • Integrated event-driven systems using message queues and background task workers to ensure fault-tolerant execution for high-volume data ingestion pipelines.
  • Designed cryptographic key rotation workflows and encrypted file storage modules using .NET cryptography libraries for data confidentiality compliance.

Academic Background

🎓
Master of Science in Computer Science
University of Bridgeport · Bridgeport, CT
Graduating May 2026
Deep LearningAdvanced Deep Learning Reinforcement LearningNLP CybersecurityComputer Vision Data MiningBig Data Systems Autonomous Vehicles
🎓
Bachelor of Technology in Information Technology
Sphoorthy Engineering College · Hyderabad, India
May 2022
AWS Certified Cloud Practitioner
Amazon Web Services · Certified
Salesforce Certified Platform Developer
Salesforce · Certified
🔄
AWS ML Specialty
In Progress — Expected 2026
🤗
Hugging Face — Active Model Contributor
Deployed fine-tuned LLaMA 3.4 publicly

Let's build something
interesting.

I'm actively looking for AI Engineer, ML Engineer, and AI Automation roles starting May 2026. Open to full-time positions, contract work, and interesting conversations.

ganeshreddy1811@gmail.com