Eboni Lee, MSc

Data Analyst · Data Scientist · Neuroscientist

People do not become extraordinary. They decide to accomplish extraordinary things. — Sir Edmund Hillary


DC · MD · VA · WA email me!

I am a data scientist with a background in psychology, neuroscience, early childhood education, and media. As a data analyst, I have extensive experience writing Python scripts to clean and recode data, and building reports and dashboards that present that data to internal and external stakeholders. As a data scientist, I have run thematic analyses on large volumes of text data using the Python libraries scikit-learn (sklearn) and NLTK.


Projects

ACE website thumbnail

This project explores data from the CDC's Behavioral Risk Factor Surveillance System (BRFSS) Adverse Childhood Experiences (ACE) module for 2009 through 2012. According to the Children's Bureau, ACEs "are traumatic events occurring before the age of 18." The goal was to see whether ACEs could be used to predict future behavior and health outcomes. With almost 120,000 respondents, and using demographic features in addition to ACEs, I was able to get accuracy above 70% for each classifier. This topic is important to study because ACEs are preventable and can be mitigated when resources and information are distributed correctly. Unfortunately, I found that the percentage of people experiencing ACEs does not appear to be going down, and as a society we need to understand why as we move forward.


map with fire icons

Currently, there are no maps available that track both COVID-19 and fires in the US. My colleagues and I created a map that includes data for both. Using folium, we stacked a choropleth map of COVID-19 occurrences onto a heat map tracking fire occurrences over the course of 2020.


For a tutorial on building a heat map with time in folium, you can go to my Medium page.
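As a rough illustration of the data preparation a time-based heat map needs (a sketch, not the project's actual code), the snippet below groups point records by month into one list of coordinates per time step. The `fires` records and the `build_time_slices` helper are hypothetical and exist only for this example.

```python
from collections import defaultdict
from datetime import date


def build_time_slices(records):
    """Group (date, lat, lon) records by calendar month.

    Returns (index, frames): `index` is a sorted list of "YYYY-MM" labels and
    `frames[i]` is the list of [lat, lon] points that fall in `index[i]`.
    Months with no records are simply absent from both lists.
    """
    by_month = defaultdict(list)
    for day, lat, lon in records:
        by_month[day.strftime("%Y-%m")].append([lat, lon])
    index = sorted(by_month)
    frames = [by_month[label] for label in index]
    return index, frames


# Hypothetical fire records: (date, latitude, longitude).
fires = [
    (date(2020, 8, 17), 37.44, -121.30),
    (date(2020, 8, 19), 38.59, -122.24),
    (date(2020, 9, 4), 37.19, -119.26),
]

index, frames = build_time_slices(fires)

# Assumption: with folium installed, a nested list like `frames` and a label
# list like `index` can be handed to the HeatMapWithTime plugin, e.g.
#   folium.plugins.HeatMapWithTime(frames, index=index).add_to(some_map)
```

Grouping by month is a choice made for this sketch; a daily or weekly key would work the same way by changing the format string.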


Created a Natural Language Processing (NLP) classifier using TF-IDF and multiple modeling techniques to distinguish subreddit posts based on their titles. Achieved 89% accuracy in classifying whether a post was in the Science or Technology subreddit.
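To show the idea behind TF-IDF title classification, here is a minimal pure-Python sketch. It is not the project's code: the titles are invented, the helper names are hypothetical, and a simple nearest-centroid rule stands in for the modeling techniques used in the project. The smoothed IDF formula is assumed to match the common `ln((1 + n) / (1 + df)) + 1` form.

```python
import math
from collections import Counter


def tokenize(title):
    """Lowercase a title and keep only alphabetic word tokens."""
    cleaned = "".join(ch.lower() if ch.isalpha() else " " for ch in title)
    return cleaned.split()


def fit_idf(token_lists):
    """Smoothed inverse document frequency for every term seen in training."""
    n_docs = len(token_lists)
    doc_freq = Counter(term for tokens in token_lists for term in set(tokens))
    return {t: math.log((1 + n_docs) / (1 + df)) + 1 for t, df in doc_freq.items()}


def tfidf(tokens, idf):
    """Weight known terms by count * idf, then L2-normalize the vector."""
    counts = Counter(t for t in tokens if t in idf)
    weights = {t: c * idf[t] for t, c in counts.items()}
    norm = math.sqrt(sum(w * w for w in weights.values()))
    return {t: w / norm for t, w in weights.items()} if norm else {}


def train_centroids(labeled_titles):
    """Average the TF-IDF vectors of each label into one centroid per label."""
    token_lists = [tokenize(title) for title, _ in labeled_titles]
    idf = fit_idf(token_lists)
    sums, sizes = {}, Counter()
    for tokens, (_, label) in zip(token_lists, labeled_titles):
        sizes[label] += 1
        bucket = sums.setdefault(label, Counter())
        for term, weight in tfidf(tokens, idf).items():
            bucket[term] += weight
    centroids = {
        label: {t: w / sizes[label] for t, w in bucket.items()}
        for label, bucket in sums.items()
    }
    return idf, centroids


def classify(title, idf, centroids):
    """Pick the label whose centroid has the largest dot product with the title."""
    vec = tfidf(tokenize(title), idf)

    def score(label):
        return sum(w * centroids[label].get(t, 0.0) for t, w in vec.items())

    return max(centroids, key=score)


# Invented training titles, for illustration only.
training = [
    ("Study finds new species of deep sea fish", "science"),
    ("Researchers measure gravitational waves from merging stars", "science"),
    ("New phone ships with faster chip", "technology"),
    ("Company releases laptop with faster chip", "technology"),
]

idf, centroids = train_centroids(training)
print(classify("Faster chip in new laptop", idf, centroids))
```

Words that appear in many titles (such as "new") get a lower IDF weight than words unique to one title, so the distinctive vocabulary of each subreddit drives the decision.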


Skills

Programming Languages, Skills, & Tools

  •   MATLAB
  •   Tableau
  •   SPSS
  •   Qualtrics
  •   Alchemer
  •   Google Analytics
  •   Google Looker Studio
  •   Google Cloud
  •   Google BigQuery
  •   Machine Learning
  •   Natural Language Processing
  •   Web Scraping
  •   EEG
  •   SQL
  •   Regression Testing
  •   Predictive Analytics
  •   AWS
  •   CQI
  •   Visual Analytics
  •   Thematic Analysis

Interests

People like to put you into a box. I'm afraid I don't sit in a box. — Andrew Lloyd Webber


Apart from being a data scientist, I am an aspiring chef. I love trying new recipes and coming up with my own, all in an effort to make good and healthy food that makes people happy.

I also love traveling! With each place I visit, I hope to learn a regional dish that improves my cooking skills and teaches me more about the local culture. So far I've visited 9 countries, and I can't wait to travel again soon!


Lastly, I am an avid reader (and writer), having read hundreds of books. The author who has had the biggest impact on me is Tamora Pierce, who has given me strong female leads to look up to.


Awards, Certifications, & Accomplishments

  • Kaggle BIPOC Grant Program 2021
  • Career Karma #3 Project of the Week
  • Career Karma #15 Project of All Time
  • Former Community Outreach Chair for The Society for Black Brain and Behavioral Scientists (SB3S)
  • Undergraduate Straus Institute Certificate in Conflict Management
  • Posse Foundation Leadership Scholarship