Hello! I'm Naman, an AI Engineer at Ushur specializing in building intelligent systems from the ground up. I'm a graduate student (Masters) in Computer Science from University at Buffalo with a deep passion for AI agents, LLM evaluation, and cutting-edge research in the field of artificial intelligence.
I excel at AI agent development and evaluation, with extensive experience in fine-tuning both open-source and closed-source LLMs. My research focuses on pioneering evaluation methodologies using logit-based statistical methods, integrating techniques like LLM-as-a-Judge, Rank Consistency, and Response Entropy to benchmark hallucination rates and factual grounding.
I have 5+ years of industry experience developing intelligent systems, with expertise in Retrieval-Augmented Generation (RAG), Memory-Controlled Prompting (MCP), and building production-ready AI workflows. I'm comfortable programming in any language and believe the future of intelligent agents lies at the intersection of retrieval precision, prompt-free reasoning, and memory-aware computation.
Naman Khurpia
namankhurpia2@gmail.com
+1 716 259 5818
Read Blog (25k+ Views)
Say Hi on LinkedIn
AI Engineer • September 2024 - Present (FullTime)
• Engineered a Retrieval Model for AI Agents from scratch with a novel in-context chunking strategy — boosting relevance across long-context queries.
• Designed a custom algorithm to extract key–value pairs from unstructured documents, which now powers real-time quoting for Insurance contracts.
• Fine-tuned a LLaMA model to generate structured responses autonomously — no prompting required — using minimal supervision and internal feedback loops.
• Pioneered evaluation of LLMs using logit-based statistical methods, integrating techniques like LLM-as-a-Judge, Rank Consistency, and Response Entropy to benchmark hallucination rates and factual grounding.
• Always experimenting with tools like RAG, DeepEval, phi-3, and vector search hybrids to push the boundaries of intelligent agent capabilities.
Machine Learning Engineer • March 2024 - September 2024 (FullTime)
• Engineered a high-throughput OCR pipeline using LayoutLMV3 and PyTorch Lightning, optimized for multi-GPU training to process massive document datasets while managing memory constraints.
• Architected a scalable data ingestion system with Apache Kafka, setting up new topics and brokers to handle high-volume, real-time data streams for the OCR model.
• Fine-tuned a custom language model to perform post-OCR validation and correction, automatically identifying and rectifying inaccuracies to ensure high-fidelity data extraction.
Machine Learning Engineer Intern• Aug 2023 - February 2024 (Internship)
• Engineered a multi-modal deep learning framework to detect subtle instances of racial bias in text, speech, and video content. The model utilized a hybrid attention mechanism with Bi-LSTMs and Transformer encoders, achieving 87% accuracy while minimizing false positives on a highly nuanced, custom-built dataset.
• Integrated facial emotion recognition using OpenCV as a key feature for the multi-modal bias detection system, improving context understanding in video analysis.
• Implemented a few-shot learning pipeline with GPT-3 and multi-threaded calls to the OpenAI Moderation API to establish a robust baseline and augment training data for hard-to-classify examples of implicit bias.
Research Assistant • January 2023 - May 2023 (Cloud Developer)
• Collaborated at Transfer Scheduler, facilitating transfer of terabytes of data from Google Drive, S3using Kafka and RabbitMQ.
• Implemented PKI secrets engine using Vault, enabling token management with fine grained ACL policies..
Cloud Engineer - Spring Boot Microservices Angular Developer• Aug 2020 - July 2022 (Fulltime)
• Developed an NLP model to optimally convert code from one syntax to another, finding ambiguities and resolution, Achieved an accuracy of 88% for Cobol to Java for over 10 million lines of code.
• Invented a script to map and create all different variables, removed multiple duplicated variables in project code.
• Migrated Western Union's money transfer solution from Cobol to Spring Java Microservice, re-designed and designed architecture of WillCallStore (WCSTOX) program's solution by collaborating with 10+ teams, transactions from 50+ countries.
• Developed WCSTOX Program from scratch, validates and stores all money order records in postgres and connecting data-stream Kafka, managing configurations of multiple Kafka topics, Devised common encryption layer for all user sensitive data.
• Steered team of 10 new developers and resolved task of stitching code piece by piece, reporting to higher management.
Cloud Engineer Intern• 13 Jan 2020 to 15 May 2020 (Paid Internship)
Worked at CMA (Cloud Microservices Apification), building a large scale solution for a global enterprise, with server side load balancing
Android Developer• 4 April 2019 to 4 June 2020 (Paid Internship)
Masters of Science in Computer Science (specialization in AI)• Expected - Dec 2023
CGPA - 3.83/4
Subjects done :
Analysis of Algorithms, Natural Language Processing, Data Intensive Computing, Independent Study
,Reinforcement Learning
,Introduction to Machine Learning
,Modern Network Concepts
Bachelor in Technology • June 2020
CGPA - 8.49 out of 10
Top 5% of class in Objected Oriented Programming Courses and Java
Class 12th • April 2016
Heigher Secondary ( CBSE ) - 89%
Created an Admissions portal where we evaluated profiles to analyze applicant's documents like Resume, SOP, test scores and ranking them. Currently, being developed for School of Engineering and School of Management, processing application data in batches and gaining insights.
Currently guided by Prof Jinjun Xiong at University at Buffalo (August 2023 - present)
Designed Maven and Gradle build tools for Chatting, Moderation - OpenAI API, enabling asynchronous calling of all endpoints, Implemented multithreading support for Java. Featured APIs are - Chatting API (With my own documentation), Moderation API (with my own documentation) and Completion API. This Java library provides a convenient way to interact with OpenAI's API for both Moderation and Chat Completion and many more. The library encapsulates the necessary details, making it easy to integrate OpenAI's powerful models into your Java applications.
Launched and made code open sourced (March 2023 - present)
An open-sourced Android App that allows content creators to generate images using OpenAI's - Dall-E API and post on social media, developed within 2 days following the launch of APIs. OpenAI's Dall E and Moderation API to check if the input text violates the OpenAi policy and generate image.
Launched and made code open sourced (March 2023 - May 2023)
Chrome extension that allows LinkedIn users to list out important details about the job. (sponsorship availability, minimum experience, key skills required for job)
Launched and made code open sourced (Jan 2023 - May 2023)
Food order app in Restaurant, allows management of ordering food, from table to kitchen. QR based ordering system - for customers (User and restaurant owner) Developed an Android app for customers that allows placing order, make payments, reserve tables. Another android app for kitchen staff where they can see the items ordered and mark them as served.
Launched and made code open sourced (January 2022 - June 2022)
Coneato is an Advanced version of our traditional XO, Coneato works on Machine Learning prediction algorithm (regression) and using reinforcement learning techniques, we have evaluated the outcomes.
Launched and made code open sourced (January 2020 - June 2020)
Paper VIT is a question paper supplier app that allows student to view/ donate previous years question papers.
Launched and made code open sourced (January 2018 - June 2018)
Paper VIT is a question paper supplier app that allows student to view and donate previous years question papers.
Launched and made code open sourced (January 2017 - June 2017)
Hypercausal Game powered by ML model, play against your ML model or two player
Android App in beta
Restaurant QR based ordering system - Backend & Android App (User and restaurant owner)
Android App in beta
An Android-based application which enables visualizing various scientific concepts in 3-D by using augmented reality..
Android App in beta
An Event Management application that allows you to take attendance, conduct live quiz and allows participants to know about the event timeline.
Android App
An attendance management Android application which enables teachers to mark attendance.
Android app beta
over 1.5k downloads
Android App to supply question papers to entire VIT University.Running on a Django backend.
Photography
Proin gravida nibh vel velit auctor aliquet. Aenean sollicitudin, lorem quis bibendum auctor, nisi elit consequat ipsum, nec sagittis sem nibh id elit.
Branding, Illustration
Proin gravida nibh vel velit auctor aliquet. Aenean sollicitudin, lorem quis bibendum auctor, nisi elit consequat ipsum, nec sagittis sem nibh id elit.
Webdesign, PhotographyWould love to hear from you
email me at namankhurpia2@gmail.com