Bin Feng

Logo

Business Intelligence Engineer at TopRX, LLC

Contact me: 901-206-8312 / binfengmemphis@gmail.com

My LinkedIn Profile /  My Kaggle Profile / My Tableau Profile

View My GitHub Profile

Portfolio

Professional Experience

Business Intelligence Engineer @ TopRx, LLC (Apr 2022 - Present)

Research Assistant @ University of Memphis (Jun 2020 - Dec 2021)


Education


Certifications

Machine Learning & Data Science

SQL

Cloud Services


Projects

Machine Learning

SQL-based QA bot with Gemini API

Develop a chatbot capable of generating and executing SQL queries to retrieve data from a supermarket sales database and provide users with precise, context-based answers.

Main Objectives

Skills: Python, Nature Language Processing, Large Language Model, Gemini API, Question Answering

Nature Language Processing: Build GPT Tokenizer from Scratch

Explore, compare, and implement the Byte-level Byte Pair Encoding (BBPE) tokenizer to demonstrate its efficiency in large language models (LLMs) by balancing vocabulary size and tokenization efficiency.

Main Objectives

Skills: Python, Nature Language Processing, Byte-level Byte Pair Encoding, Large Language Models

Computer Vision: YOLOv8 on Traffic Detection

In this project, we utilized the YOLO model for traffic detection, starting with familiarization and testing of a pre-trained model. Then, we have fine-tuned the model with our dataset. The retrained model shows a significantly improved performance. We have also analyzed different metrics and plots obtained during the training process. All of those plots confirm enhanced accuracy.

Skills: Python, Computer Vision, Model Finetuning, Metrics Evaluation

Regression Problem: House Price Prediction

Throughout the project, we have performed exploration data analysis to understand the dataset and the relationship between given features and the house price. Then, we cleaned and preprocessed the dataset. Lastly, We aimed to create a model that can predict house prices with only the provided features. As we can see from the final results, we can obtain a pretty decent performance using the random forest regression model on the unseen data.

Skills: Python, Regression Model, EDA, Data Cleaning, Feature Engineer


Data Analysis

Gait Analysis Performance Comparison

We have a running dataset that was collected using our proposed markerless gait analysis system, and another dataset was collected simultaneously by Qualisys(a marker-based gait analysis system that is the standard of the current market). We want to compare the accuracy of our system using the Qualisys system as the golden rule and find the areas of improvement.

Skills: Python, Data Analysis, Data Cleaning, Feature Engineer, Data Collection, Data Visualization


Data Visualization

Superstore Sales Overview

Create an interactive dashboard in Tableau to draw insights into the overview of superstore sales.

Skills: Tableau, Data Visualization, Reporting

Genative AI Search Trend in the US

Create an interactive dashboard in Tableau to draw insights into the search trend for different generative AI models in early 2024.

Skills: Tableau, Data Visualization, Reporting

New York Airbnb Investment by Location

By looking into the New York Airbnb records from 2008 to 2015, I built a dashboard in Tableau to enhance the decision-making process for choosing locations and properties for Airbnb investment in New York.

Skills: Tableau, Data Visualization, Reporting


Publications

Abstract:

Motion capture systems are widely used for measuring athletic performance and as a diagnostic tool in sports medicine. Standard motion capture systems record body movement using: (1) a set of cameras to localize body segments; or (2) specialized suits in which inertial measurement units are directly attached to body segments. Major drawbacks of these systems are limited portability, affordability, and accessibility. This contribution presents a markerless motion capture system using a commercially available sports camera and the OpenPose human pose estimation algorithm. We have validated the proposed markerless system by analyzing the human biometrics during running and jumping movements. The findings of this study demonstrate that pairing a low-cost sports camera with artificial intelligence allows for highquality analysis of human movement.