Yingzi (Zoe) Yuan

👩‍💻 About Me
💼 Seeking opportunities where I can apply data-driven thinking to real-world problems.
Driven by curiosity, guided by data, growing into a better analyst every day.
Values: Curiosity, Clarity, Empathy.
🌲 Skills
Python (NumPy, pandas), SQL (MySQL, PostgreSQL), Machine Learning, Tableau
📍 Burnaby, BC  


View My LinkedIn Profile

View My GitHub Profile

Portfolio


📈 Sentiment Meets Stock

A data-driven project that aims to predict the daily movement direction of SPY (an S&P 500 ETF) by analyzing financial news sentiment. The end-to-end pipeline integrates text processing, sentiment analysis, and machine learning, with the goal of improving market prediction accuracy.
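
As a rough illustration of the idea (not the project's actual code), the sketch below averages per-headline sentiment scores by day and pairs each day with a label for whether the next close is higher. The function names, dates, scores, and prices are all hypothetical; in practice the scores would come from a sentiment model such as FinBERT or VADER, and the labelled rows would feed a classifier.

```python
from collections import defaultdict
from statistics import mean


def daily_sentiment(scored_headlines):
    """Average per-headline sentiment scores for each date."""
    by_day = defaultdict(list)
    for day, score in scored_headlines:
        by_day[day].append(score)
    return {day: mean(scores) for day, scores in by_day.items()}


def direction_labels(closes):
    """Label a day 1 if the next trading day's close is higher, else 0."""
    days = sorted(closes)  # ISO date strings sort chronologically
    return {d: int(closes[nxt] > closes[d]) for d, nxt in zip(days, days[1:])}


def build_dataset(scored_headlines, closes):
    """Join daily sentiment with next-day direction: (date, sentiment, label)."""
    sentiment = daily_sentiment(scored_headlines)
    labels = direction_labels(closes)
    return [(d, sentiment[d], labels[d]) for d in sorted(labels) if d in sentiment]


# Hypothetical inputs: (date, sentiment score) pairs and closing prices.
headlines = [("2024-01-02", 0.5), ("2024-01-02", 0.25), ("2024-01-03", -0.5)]
closes = {"2024-01-02": 470.0, "2024-01-03": 468.5, "2024-01-04": 471.2}

for row in build_dataset(headlines, closes):
    print(row)
```

The last date has no label because its next-day close is not yet known, so it is left out of the dataset.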

🛠 Tools: Python, FinBERT, OpenAI GPT API, VADER, XGBoost, ARIMA, LSTM, scikit-learn, pandas, yfinance

✨ Highlights:


🤖 LLM-based PDF Chatbot Demo

An experimental AI chatbot designed to extract insights from PDF documents using state-of-the-art large language models. The system allows users to upload documents and interactively query their content in natural language.
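
The retrieval step behind this kind of chatbot can be sketched in a few lines. This is a toy stand-in, not the demo's implementation: it uses word-count vectors and a linear scan where the real stack would presumably use learned embeddings and a FAISS index, and every function name and sample sentence here is hypothetical.

```python
import math
import re
from collections import Counter


def embed(text):
    """Toy 'embedding': a bag-of-words count vector."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))


def cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(
        sum(v * v for v in b.values())
    )
    return dot / norm if norm else 0.0


def retrieve(question, chunks, k=2):
    """Return the k chunks most similar to the question."""
    q = embed(question)
    ranked = sorted(chunks, key=lambda c: cosine(q, embed(c)), reverse=True)
    return ranked[:k]


def build_prompt(question, chunks, k=2):
    """Assemble the text that would be sent to the language model."""
    context = "\n".join(retrieve(question, chunks, k))
    return f"Answer using only this context:\n{context}\n\nQuestion: {question}"


# Hypothetical chunks, standing in for text extracted from an uploaded PDF.
chunks = [
    "The warranty lasts two years.",
    "Shipping takes five days.",
    "Returns are accepted within thirty days.",
]
print(build_prompt("How long is the warranty?", chunks, k=1))
```

Only the retrieved chunks go into the prompt, which keeps the model's input small and tied to the document.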

🛠 Tools: Python, LangChain, OpenAI API, FAISS, Streamlit

Highlights:

🔍 More details? GitHub Repo


🍷 Iowa Liquor Sales Data Analysis

A data analysis project using public liquor sales data from the state of Iowa, with the goal of uncovering trends in consumer behavior, seasonal patterns, and store performance.

🛠 Tools: Python (scikit-learn, pandas, matplotlib), PySpark, Tableau

Highlights:

🔍 More details? Report | Tableau Dashboard | YouTube


🖥️ Intelligent Text Big Data Analysis Display Platform

A platform for collecting and analyzing large-scale campus news text from multiple sources and improving its quality, with a focus on text search, keyword extraction, sensitive content detection, and text management.

🛠 Tools: Python, NLP (TextRank, LDA), Flask, Elasticsearch, shell

Highlights:

🔍 More details: Project Summary Slides


🛣️ Highway Big Data Analytics and Visualization Dashboard

A dashboard for highway big data analytics and visualization that helps transportation authorities monitor traffic flow and transaction performance across regions.

🛠 Tools: SQL, shell, dashboard tool

Highlights:

🔍 More details: Project Website | Project Summary Slides