Artificial Intelligence Solutions Architect • Experienced Machine Learning Researcher • Woman in AI • Educator • Developer
Dive into a selection of my work samples where innovation meets craftsmanship. Here, you’ll discover a curated collection that showcases not just technical proficiency but a vivid testament to what’s possible in the world of data science, data engineering and ai. Step into a space of endless curiosity and commitment to excellence, where each project narrates a unique journey in the technology landscape.
TechStack
Here are my techstack with their level of experience
Python
> 6 years
PySpark
> 4 years
MySQL
> 5
years
R
> 2 years
Scala
> 2 years
MatLab
> 2 years
PyCharm
>5 years
DataBricks
>4 years
Jupiter Notebooks
> 5
years
Git
> 2 years
OTEL
> 1 years
Tableau
> 2 years
Docker
> 1 years
Jira
> 4 years
Confluence
> 4 years
Azure
> 1 years
aws
> 4 years
GCP
> 3 years
Code Sample
Glimpse at my code snippets
BabyGPT: Pre-Training GPT from scratch with PyTorch in Python
Here is a snippet of Single Head Self Attention and Multi-Head Self Attention as part of entire model.
A/B Testing in Python
Data Analysis in Python, data wrangling, sorting, joining, generating random data from Binomial dist, 2-sample Z-test, p-values Product Data Science & Statistical Analysis
Training RNN with Tensorflow in Python
Training Recurrent Neural Networks (RNN) with Tensorflow in Python for Predicting Stock Prices Deep Learning
Adam Optimization in Python
Updating Neural Networks' Parameters using Adam optimization Deep Learning
Training ML Models in Python
Training Machine Learning models (Linear Regression, Lasso Regression, Decision Tree, Random Forest, XGBoost) with Scikit-learn, testing, hyper-parameter tuning with K-Fold Cross Validation, to predict Salaries
Machine Learning
Machine Learning in Python
Predicting House Prices using simple set-up with XGBoost Algorithm
Machine Learning
PySpark in DataBricks
Using PySpark to load, and do advanced preprocessing of highly nested complex JSON file, autoloader VMWare cloud data. In Databricks we do mounting of S3 bucket, then flatten and store in Delta Live Table.
Data Engineering
SQL Query to Get Active Users
Query to find monthly active users who made a purchase within 30 days of their last login. from three tables: "Users" with columns: UserID and UserName; "LoginHistory" with columns: UserID and LoginTime; and "Purchases" with columns: UserID and PurchaseTime.
Data Analytics
AWS Lambda for Decoding JSON files and store in S3
MLOps
Using AWS Lambda and Python, to develop Lambda Handler for decoding JSON files, collected with Docker/OTEL, and Partition it, store it in S3 MLOps
Want to See More?
Check out My Github for more Code Samples and complete Case Studies with Papers, Code, and Data