Yi (Zoe) Zhu

MS in Data Science

Duke University

Welcome

Nice to meet you! 你好~ ヾ(o´ω`o)ノ゙

My name is Yi Zhu (朱易) and I go by Zoe.

I am a candidate for the Master's in Data Science at Duke University, graduating in May 2022.

For more information, feel free to explore my website or check my resume.

Education
  • MS in Data Science

    Duke University, 2020 - 2022

  • BS in Statistics & Psychology

    UC Davis, 2016 - 2020

  • International Baccalaureate Diploma

    High School Affiliated to Nanjing Normal University, 2013 - 2016

Experience

IEEE
Capstone Data Scientist
Aug 2021 – Present
•   Leverage topic modeling and knowledge graphs to enable efficient cross-disciplinary research and new-concept discovery across a large-scale dataset (5.4M+ publications) hosted by the IEEE Xplore digital library.
•   Create automation to facilitate research collaboration across divisions and increase search efficiency.
•   Design end-to-end ML pipelines to extract scientific concepts and validate them through graph linkages; tag papers with concepts based on relevance, and establish the parent-child hierarchy between concepts.

Duke University
Graduate Teaching Assistant
May 2021 – Present
•   Mentor graduate-level projects related to data wrangling, modeling, analysis, and application.
•   Identify and solve coding problems in Python, R, Bash, and Git for interdisciplinary students.
Courses:
   •   Fuqua School of Business - Programming for Analysis and Visualization, Data Analytics and Applications
   •   MIDS (my program :)) - Practical Data Science
   •   Bootcamp - Computational Methods for Social Scientists

MorphoSource Repository
Data Scientist Intern
May 2021 – Aug 2021
•   Developed an interactive dashboard built on custom metrics to visualize the impact of 3D scans of specimens. Derived insights to encourage user contribution and optimize data storage allocation.
•   Remapped user information after database reconsolidation. Prototyped a data analysis pipeline covering cleaning and processing.
•   Collaborated across functions in a remote setting to drive product updates.

UC Davis FoxLab
Research Assistant
Nov 2018 – Jun 2020
•   Implemented DeepLabCut, a CNN-based pose-estimation toolkit, with manual annotations to identify key body-part coordinates in video data of research subjects.
•   Applied PCA and t-SNE to cluster research subjects' behavior and classify their time-stamped emotional states. Contributed the analysis to an NIH grant proposal that was approved.

Accomplishments

Understand the business problem, identify data to explore for analysis, and deliver actionable insights.
Execute graph algorithms that operate on the nodes and relationships in a graph.
Build, train, tune, and deploy ML models using the AWS Cloud.

Projects

A/B testing for Ads Tone

A/B Testing on the Effect of Different Advertisement Tones

Facial Expression Recognition

An application that evaluates facial expressions and engagement in real time to help presenters communicate more effectively.

Fake News Text Classification

An analysis and comparison of word2vec and GloVe embeddings for fake news text classification.

Flask Web App for Language Translation

A serverless web app for language translation built on AWS with CI/CD.

R package for Bag of Little Bootstraps

An R package that implements the Bag of Little Bootstraps resampling method.

Statistical Modeling Projects

A collection of statistical modeling projects covering causal inference, time series analysis, hierarchical modeling, and regression.