I am a Data Scientist and Ph.D. Candidate in Computer Science with a passion for transforming complex data into impactful solutions. My career has spanned government, healthcare, and academic research, where I have analyzed nuclear physics data for U.S. Customs and Border Protection, supported federal technology acquisition at the Department of Veterans Affairs, and modeled complex biological systems. My current research focuses on leveraging big data for the automated detection of environmental change.
I thrive on building robust data pipelines, developing predictive models, and visualizing data to tell a compelling story. I am actively seeking remote Senior Data Scientist or Machine Learning Engineer roles.
| Languages | Machine Learning / Data Science | Big Data / Cloud | Databases | Tools & Platforms |
|---|---|---|---|---|
| Python | Scikit-learn | AWS | PostgreSQL | Jira |
| R | TensorFlow | Azure | SQL Server | Git / GitLab |
| SQL | PyTorch | GCP | NoSQL | Docker |
| Bash | Pandas / NumPy | Snowflake | | Tableau |
| Julia / Go | Predictive Analysis | Apache Spark / Hadoop | | Power BI |
This ongoing research analyzes large-scale geospatial and environmental datasets to develop models that automatically detect significant environmental change in the Tennessee and Flint River Basins.
- Tech: Python, R, Bash, Esri ArcGIS, Google Earth, Cesium, Autodesk
- Skills: Big Data, Machine Learning, Geospatial Analysis
🎓 Master’s Capstone: The Hidden Burden of Gallbladder Disease
A comprehensive data analytics project investigating the prevalence, costs, and diagnostic advancements related to gallbladder disease in the United States.
- Tech: Python, R, SPSS 29, RStudio, Tableau
- Skills: Data Analytics, Data Visualization, Statistical Analysis
I’m always interested in discussing new projects, innovative ideas, or opportunities. Feel free to connect with me!
- LinkedIn: linkedin.com/in/stefani-d-w-yates
- Portfolio: datascienceportfol.io/stefaniyates

