Data Innovation and Transformation
With Data Science and AI

I'm a passionate data scientist and analyst who is eager to take on new challenges.

I am an experienced and highly motivated Data Scientist and Analyst with a passion for Natural Language Understanding (NLU), Deep Learning (DL), and researching new technologies with innate curiosity and a love of learning.

With 5 years‘ experience in data analytics, data science, and machine learning, I have a proven track record of writing quality code and delivering projects on schedule as well as demonstrated ability to learn new tools quickly and develop innovative solutions to problems.

My leadership experience includes providing training and assisting colleagues on new technologies as well as being active in S&P Global’s Women In Technology. I have a passion not only to succeed but to help others succeed as well.
95%

Predictive Modeling and Analytics

95%

Data Visualization

93%

ETL Solutions

90%

Statistical Analysis

95%

Python

95%

Tableau, Plotly Dash

93%

Azure Databricks, PySpark, Kafka, Hadoop, Presto, Git

93%

SQL

90%

AWS, GCP

88%

Docker, Airflow

80%

R

  • Natural Language Processing (NLP) - Bidirectional Encoder Representations from Transformers (BERT) and BERTweet (A pre-trained language model for English Tweets)
  • Convolutional Neural Networks (CNN)
  • CNN with Transfer Learning:
    • VGG16, VGG19, ResNet50, ResNet152, ResNet152V2
    • DenseNet201, Xception, InceptionV3, EfficientNetB7, MobileNetV
  • Long-Short Term Memory (LSTM) Recurrent Neural Networks (RNN)
  • XGBoost
  • Decision Trees
  • Random Forest
  • Simple Linear/Multivariate Regression
  • Logistic Regression
  • General Linear Model (GLM)
  • K-Means Clustering
  • Seasonal ARIMA
  • STL Decomposition
  • Exponential Smoothing
  • Prophet

Data Analyst

September 2021 - Present

Develop 5 key accuracy metrics, which evaluate relative performance of 20 mainstream oil and gas price forecasts and forward curves compared to actual prices for 23 proprietary hedge funds. Build Tableau dashboard to visualize price forecasts and forecast accuracy metrics for 20 oil and gas market data, saving 5 hours per week of manual reporting work. Develop ETL solutions for market- and asset-level gas and oil data, which reduce average processing time from 7 days to under 5 minutes, utilizing Databricks, Pyspark, Python, SQL, AWS, and Airflow. Train and mentor team members on new technologies such as Databricks, PySpark, and AWS S3 to set up and manage end-to-end data pipelines, reducing average processing time to generate analysis-ready data from 6 weeks down to a few days. Active board member and speaker for various panel discussions in S&P Global’s Women in Technology.

About the company

Teaching Assistant

January 23, 2023 - January 27, 2023

Answered technical questions in Python training for Machine Learning (ML) and NLP for 75+ data practitioners at Center For Disease Control and Prevention (CDC) live stream training. Guided trainees in breakouts with code exercises in real time and to proceed ML- and NLP-driven work-related projects.

About the company

Senior Data Researcher

May 2018 - September 2021

Utilized machine learning and data science techniques to deliver actionable insights in global power generation in 10 geographical regions covering 190+ countries worldwide. Built an interactive dashboard using Dash Python to formulate energy market- and asset-level impacts on fossil fuel outlook. Developed high-performing ETL pipelines for power and utility data, decreasing average processing time from 10 days to under 10 minutes.

About the company

Research Contractor

May 2019 - July 2019

Collaborated with R&D team to develop RNN-based LSTM while applying denoising tools (i.e. Fast Fourier Transform) to detect anomalies in time series sensor data on athletes’ performance, improved detection performance by 55%.

About the company

Quantitative Methodology Analyst

May 2006 - September 2009

Collaborated with cross-functional teams to develop and manage methodologies and algorithms on the KPIs of mutual fund and fund. Led the quantitative analysis on the US side of individual fund, fund families, and asset class/industry to identify trends and insights for investment-decision making.

About the company

University of California, Berkeley

Master of Information and Data Science - May 2024 (Anticipated)

About the school

University of Colorado, Boulder

Studied Computer Science toward Bachelor of Science

About the school

University of Colorado, Denver

Pursued a Master of Business Administration (MBA) with an emphasis in Finance

About the school
  • Irvine, California, United States

Contact me if you're interested in learning more about more recent work.