COMPUTER SCIENCE + MACHINE LEARNING @ UNIVERSITY OF MARYLAND, COLLEGE PARK
ABOUT ME
I am a student at the University of Maryland studying Computer Science with an emphasis in Machine Learning, alongside minors in Business and Statistics.
My work has focused on modernizing pipelines and building zero-to-one solutions. My research focus lies in reinforcement learning with the goal to empower human decision-making through AGI.
In my free time, I enjoy playing golf, going hiking, travelling to new places, or just being outdoors.
I'm always eager to create meaningful impact through my work. Feel free to reach out with any opportunities, volunteering, or just to chat!
TOP TECHNICAL SKILLS
PYTHONLANGUAGES
JAVALANGUAGES
JAVASCRIPTLANGUAGES
RLANGUAGES
TENSORFLOW/KERASML/AI
AWSCLOUD
EXCELBI
PROFESSIONAL JOURNEY
SOFTWARE ENGINEER INTERN
General Dynamics Information Technology
Jun 2025 - Present
Annapolis Junction, MD
CURRENT
Developing multimodal deep learning software for precision targeting, navigation intelligence, and situational awareness
Tensorflow
Keras
Scikit-Learn
Android Studio
TECHNICAL PROJECT LEAD (CONTRACT)
General Dynamics Information Technology
Jan 2025 - May 2025
College Park, MD
Engineered a scalable RAG-powered LLM in AWS Bedrock for email insight and AI analysis for global email database
AWS Bedrock
AWS EC2
LangChain
Neo4J
Qdrant
NLTK
Flask
Next.js
Tailwind CSS
TECHNICAL PROJECT MANAGER (CONTRACT)
Booz Allen Hamilton
Sep 2024 - Dec 2024
College Park, MD
Created Medicaid/CHIP web application for policy comparison, user annotation, and analysis for US Government contract
PyTorch
Scikit-Learn
Selenium
PostgreSQL
Express.js
Next.js
Tailwind CSS
SOFTWARE ENGINEER INTERN
Internal Revenue Service
Jan 2024 - Dec 2024
Washington, DC
Modernized post-ETL CI/CD pipeline for criminal auditing and idenity compliance tool and resolved defects in build system
Previous Position: Social Media Chair (Handicap: 4.4)
3rd National Ranking (2024)
2x National Bids
RESEARCH & PUBLICATIONS
EASY2HARD-BENCH: STANDARDIZED DIFFICULTY LABELS FOR PROFILING LLM PERFORMANCE AND GENERALIZATION
Contributed as an Undergraduate Researcher
NeurIPS•2024•CONFERENCE
Despite the abundance of datasets available for assessing large language models (LLMs), the scarcity of continuous and reliable difficulty labels for individual data points, in most cases, curtails their capacity to benchmark model generalization performance across different levels of complexity. Addressing this limitation, we present Easy2Hard, an innovative collection of 6 benchmark datasets featuring standardized difficulty labels spanning a wide range of domains, such as mathematics and programming problems, chess puzzles, and reasoning questions, providing a much-needed tool for those in demand of a dataset with varying degrees of difficulty for LLM assessment. We estimate the difficulty of individual problems by leveraging the performance data of many human subjects and LLMs on prominent leaderboards. Harnessing the rich human performance data, we employ widely recognized difficulty ranking systems, including the Item Response Theory (IRT) and Glicko-2 models, to uniformly assign difficulty scores to problems. The Easy2Hard datasets distinguish themselves from previous collections by incorporating a significantly higher proportion of challenging problems, presenting a novel and demanding test for state-of-the-art LLMs. Through extensive experiments conducted with six state-of-the-art LLMs on the Easy2Hard datasets, we offer valuable insights into their performance and generalization capabilities across varying degrees of difficulty, setting the stage for future research in LLM generalization.
SHOULD PGA TOUR PROFESSIONALS CONSIDER THEIR ADVERSARY'S STRATEGY?
This study explores strategic considerations in professional golf’s Match Play format. Leveraging Professional Golfers’ Association Tour data, we investigate the impact of factoring in an adversary’s strategy. Our findings suggest that while slight strategy adjustments can be advantageous in specific scenarios, the overall benefit of considering an opponent’s strategy remains modest. This confirms the common wisdom in golf, reinforcing the recommendation to adhere to optimal stroke-play strategies due to challenges in obtaining precise opponent statistics. The methodology employed here is generic and could offer valuable insights into whether opponents’ performances should also be considered in other two-player or team sports, such as tennis, darts, soccer, volleyball, etc. We hope that this research will pave the way for new avenues of study in these areas.
CAMPAIGNS TO OVERCOME GOLFERS’ LOSS-AVERSE COGNITIVE BIAS
Nishad Wajge and Krista Hill-Cummings
International Journal for High School Research•2024•JOURNAL
Loss-aversion is a type of conservative mindset that can negatively impact performance. In this study, we systematically surveyed a pool of golfers. We then analyzed, in detail, the relationships among participants’ attributes and the degree of loss-aversion. Based on these findings, we propose developing effective campaigns to help individuals or teams overcome their loss-aversion.
USING MODEL CLASSIFICATION TO DETECT BIAS IN HOSPITAL TRIAGING
Nishad Wajge, Phil Mui, et al.
Stanford JUST Health | Annals of Biomedical Science and Engineering•2022-2023•CONFERENCE & JOURNAL
In light of the COVID-19 pandemic and the health crisis left in its wake, our goal is to develop extensive machine-learning techniques to provide a clear picture of the treatment, and possible mistreatment, of specific patient demographics during hospital triaging. We aim to reveal whether a patient’s treatment and hospital disposition is related to the following attributes - Emergency Severity Index (ESI), gender, employment status, insurance status, race, or ethnicity which our 100 MB dataset included. Our work is separated into two parts - the classification task and data analysis. As part of the classification task, we used the k-Nearest-Neighbor classifier, the F1-score, and a random forest. We then analyze the data using SHapley Additive exPlanations (SHAP) values to determine the importance of each attribute. Our findings show that significance varies for each attribute. Notably, we found that patients with private insurance programs receive better treatment compared to patients with federal-run healthcare programs (e.g. Medicaid, Medicare). Furthermore, a patient’s ethnicity has a greater impact on treatment for patients under 40 years of age for any given ESI level. Surprisingly, our findings show language is not a barrier during treatment. Discussion and conclusion: We, therefore, conclude that although hospitals may not be doing so intentionally, there is a systemic bias in hospital triaging for specific patient demographics. For future works, we hope to aggregate additional patient data from hospitals to find whether specific demographics of patients receive better healthcare in different parts of the United States.