💙

Hello, I'm Benji Peng. I started my first research project 11 Years ago. I have gained experience in applying automation and ML to address complex biological and chemical problems. My focus is on Machine Learning, Product Development, and Biophysics, and statistical modeling for cutting-edge research.

About Me

I grew up helping my mother with her e-business for pet supplies, which naturally made me tech-forward and gave me an early understanding of digital workflows and customer-facing technology.

I come from a family passionate about life sciences, with two cousins holding Ph.D. in the related fields. That love for science is what motivated me to pursue my education, with an emphasis on applying computational techniques to solve complex problems.

After getting my Ph.D. candidacy, I pursued my passion for machine learning to enhance my research capabilities. My current work is mostly project-based, focus bring novel technology to the broader audiences.

Outside of my work, I enjoy traveling ✈️, art 🖼️, techno 🪩, and spending time with my dog 🐶.

My Research

Securing Large Language Models: Addressing Bias, Misinformation, and Prompt Attacks

This paper reviews security concerns in large language models, focusing on bias, misinformation, detection mechanisms, and vulnerabilities like prompt attacks.

LLMs
Security
Bias
Misinformation
Prompt Attacks
Detection
Mitigation

Large Language Models and Cognitive Science: A Comprehensive Review of Similarities, Differences, and Challenges

This review explores how Large Language Models (LLMs) relate to cognitive science, discussing their similarities to human cognition, evaluation methods, and potential applications. It addresses cognitive biases, limitations of LLMs, and suggests methods for enhancing their performance.

LLMs
Cognitive Science
Evaluation
Applications
Cognitive Biases
Artificial Intelligence

Emerging Techniques in Vision-Based Human Posture Detection: Machine Learning Methods and Applications

This article reviews machine learning methods for human posture detection, covering 2D and 3D detection methods, image and video-based approaches, and the use of open datasets for training. It explores applications in healthcare, surveillance, and human-computer interaction.

Human Posture Detection
Machine Learning
2D Detection
3D Detection
Healthcare
Surveillance
HCI

Optically Modulated and Delayed Fluorescent Proteins for Sensitive Cellular Imaging

This study introduces optically modulated and activated delayed fluorescent proteins for biological imaging. It details the engineering of YFP variants to create new emitters with unique modulation profiles and delayed fluorescence, enhancing sensitivity in cellular detection.

Fluorescent Proteins
YFP
Cellular Imaging
Delayed Fluorescence
Optical Modulation
Biological Detection

Measuring Unbiased Retention Factor vs. Temperature Relationships in Gas Chromatography

This paper introduces a methodology for accurately measuring gas chromatographic retention factor vs. temperature (k vs. T) relationships, overcoming the bias caused by non-ideal GC system behavior. It improves compound identification accuracy by leveraging back-calculated temperature profiles and free software for easy application.

Retention Factor
GC-MS
Retention Projection
Chromatography
Temperature Programs
Compound Identification

My Skills

Agile/Scrum
JIRA
Kanban
Confluence
JavaScript
TypeScript
Next.js
Deno
Python
PyTorch
JAX
SQL
PowerBI
AWS
Azure
Oracle Cloud
Chromatography
Spectroscopy
Fluorescence Microscopy
Digital Signal Processing
Machine Learning
Artificial Intelligence
Full-stack Development
RESTful APIs
GraphQL
Data Analytics

My Experience

AppCubic

Research Scientist & Technical Consultant

Miami, FL (Remote)

At AppCubic, I designed and implemented ML-driven digital infrastructures and worked closely with researchers worldwide on advanced machine learning projects. I co-authored multiple research articles, including work with NIH entrepreneurship partners. My key project, TradeGPT, involved the development of an LSTM and reinforcement learning model for equity and digital asset analysis, integrating language models for named entity recognition tasks. This research-driven approach helped attract institutional investors and led to licensing opportunities for the model.

2018 – Present

Kaseya

Software Development Engineer

Miami, FL

As part of Kaseya's leadership development program, I spearheaded the creation of a comprehensive API testing framework, reducing manual QA intervention by over 82%. My research-driven mindset led me to advocate for type safety and data cleanliness in network topology discovery, ensuring scalability and robustness. Additionally, I mentored team members in various testing methodologies, including AI-assisted testing and visual regression testing, which further honed my skills in analyzing and improving software performance.

2022 – 2023

Georgia Institute of Technology

Ph.D. Candidate in Physical Chemistry / Biophysics

Atlanta, GA

My doctoral research focused on applying unique statistical binning methods to scatter patterns in flow cytometry data for antibacterial susceptibility tests. I integrated machine learning algorithms to improve accuracy and predictive power, and used high-throughput pipelines for biomarker design and tissue imaging. Throughout my Ph.D., I employed 3D visualization tools and mathematical modeling, which were implemented using test-driven development principles (95% coverage). I also conducted customer discovery sessions with lab managers and pathologists to tailor our solution for real-world applications.

2017 – 2021

University of Minnesota

Undergraduate Research Scientist / Junior Scientist

Minneapolis, MN

During my time as an undergraduate researcher, I worked on gas chromatography automation and retention time prediction for building a plant metabolite library. I later contributed to an open-source project led by my advisor, where we wrote Java code for compound prediction based on retention times. I presented my research at multiple conferences and published my first scientific paper, honing my skills in data analysis, scientific communication, and coding for research applications.

2012 – 2014