AVAILABLE FOR REMOTE WORK

EMMANUEL
JOHN LELO

Full-Stack Data Scientist · MLOps Engineer

🌍 Open to Remote — Europe & Worldwide

Building intelligent, production-ready ML systems — from exploratory analysis and model training to CI/CD pipelines, containerised APIs, and Kubernetes-scaled deployments.

🚀 View Projects ✉️ Get in Touch

Projects

99.9%

Best Accuracy

Certifications

MLOps Systems

Who I Am

About Me

Full-Stack Data Scientist and MLOps Engineer passionate about shipping production ML systems.

🧑‍💻

Emmanuel John Lelo

Data Scientist · MLOps Engineer

Python MLOps Kubernetes FastAPI Docker

📍 LocationDar-es-Salaam, Tanzania

📧 Emailleloemmanuel540@gmail.com

💼 Experience1 Year

🌍 Work ModeRemote · Europe & Worldwide

Hello, I'm Emmanuel 👋

I'm a Full-Stack Data Scientist and MLOps Engineer with 1 year of hands-on experience building real ML systems — not just notebooks. I specialise in the full ML lifecycle: from raw data and feature engineering through to containerised model serving, CI/CD automation, and production monitoring.

I've shipped two production MLOps pipelines (bank churn + credit card fraud), completed virtual experience programmes with British Airways and Lloyds Bank Group, and built a deep learning fraud detection system achieving 99.9% accuracy on 284k transactions.

My toolkit spans Python, Scikit-learn, TensorFlow, XGBoost, DVC, MLflow, Prefect, Docker, Kubernetes, FastAPI, AWS, and more. I'm passionate about turning data into reliable, scalable production systems.

🌍

Open to Remote Opportunities — Europe & Worldwide

Actively seeking remote roles as a Data Scientist or MLOps Engineer. Available for full-time, part-time, or contract engagements. Comfortable with async workflows and international time zones.

What I Work With

Technical Skills

A full-stack toolkit spanning data science, ML engineering, and production infrastructure.

🐍

Programming & Data

PythonSQLRPandasNumPy

🧠

Machine Learning & AI

Scikit-learnTensorFlowXGBoostLightGBMHuggingFace

⚙️

MLOps & DevOps

MLflowDVCPrefectGitHub ActionsZenML

☁️

Cloud & Infrastructure

AWS SageMakerAWS S3DockerKubernetesHelm

📊

Analysis & Visualisation

PostgreSQLMongoDBPlotly / DashPower BIStreamlit

🔍

Model Serving & Monitoring

FastAPIPrometheusGrafanaEvidently AIPytest

Skill Depth

My Proficiency

Honest self-assessment of core technical competencies.

Python & Data Science Libraries88%

SQL & Data Analysis85%

Machine Learning & Model Development82%

MLOps Pipelines & CI/CD75%

Docker & Kubernetes72%

AWS Cloud Platform70%

Model Monitoring & Observability70%

What I've Built

Work Experience

A track record of shipping real ML systems, not just prototypes.

🔐

MLOps Engineer — Production Churn Prediction System

2024 · Self-Initiated

Self-Initiated · bank-churn-mlops · github.com/emmanuelmassawe200

Architected a production-grade MLOps pipeline covering DVC + DagsHub data versioning, MLflow experiment tracking, Prefect orchestration, and a containerised FastAPI REST API serving real-time predictions.
Published Docker image (emmanuely2000/bank-churn-api), achieving 84% accuracy and 63.1% weighted F1 with a scikit-learn Pipeline bundling preprocessing and Random Forest as a single serialisable artifact.
Implemented full CI/CD via GitHub Actions — automated test → build → push to Docker Hub on every commit, reducing manual deployment steps to zero.
Resolved production challenges: scikit-learn version mismatch across training/serving, Docker secrets via runtime env vars, and FastAPI CORS for browser clients.
Roadmapped Evidently drift detection, Prefect scheduled retraining, SHAP explainability, GCP Cloud Run deployment as next-phase additions.

💳

MLOps Engineer — Real-Time Credit Card Fraud Detection

2024 · Self-Initiated

Self-Initiated · credit-card-fraud-detection · github.com/emmanuelmassawe

Built a production fraud detection system on 284,807 transactions (0.17% fraud rate), achieving 99.9% accuracy, 93.2% F1, 98.7% ROC-AUC with XGBoost + SMOTE; FastAPI serving with sub-100ms response time.
Deployed on Kubernetes (Minikube + GKE/EKS/AKS-ready) with HPA autoscaling, rolling updates, and health-check probes — scaling from 1 to 5+ replicas under load.
Implemented complete MLOps lifecycle: DVC versioning, MLflow + DagsHub tracking, Prefect orchestration, dual GitHub Actions workflows with 90%+ test coverage via Pytest.
Engineered a modular pipeline across 5 reproducible DVC stages (ingestion → preprocessing → training → evaluation → serving) with full dvc repro traceability.
Delivered browser UI with real-time confidence scores; roadmapped Prometheus/Grafana monitoring and SHAP explainability.

✈️

Data Scientist — Lounge Demand & Revenue Modelling

2024 · Virtual Experience

British Airways · Virtual Experience (Forage)

Analysed a 50,000+ flight schedule dataset to model lounge eligibility across 3 tiers — identified peak demand at ~1,280 users/hour at 07:00–08:00.
Built a revenue & cost model projecting £262.8M annual profit; recommended Concorde Room expansion with projected 2-month payback on £7–10M investment.
Engineered a 6-panel Matplotlib/Seaborn dashboard + multi-sheet Excel workbook; model predictions within 3% of actual BA data (long-haul 41% vs actual 43.9%).
Delivered a one-page executive summary with dynamic staffing recommendation saving ~£500k/year.

📈

ML Engineer — Customer Booking Conversion Prediction

2024 · Virtual Experience

British Airways · Virtual Experience (Forage)

Built a binary classification pipeline on 50,000 customer records predicting booking completion; achieved recall of 0.62 on a heavily imbalanced dataset (15% positive class).
Engineered 8 domain-specific features (booking lead, group size, long-haul flag, extras score); applied SMOTE + tuned Random Forest & XGBoost via GridSearchCV/RandomizedSearchCV.
Identified booking origin (34% importance) and trip duration (20%) as top predictors; recommended strategy projected to deliver 5–10% conversion lift.

🏦

Data Scientist — Bank Customer Churn Prediction

2024 · Self-Initiated

Lloyds Bank Group · Self-Initiated Project

Compared 4 classifiers (Logistic Regression, Random Forest, XGBoost, SVM) on 10,000+ customer records; engineered 4 behavioural features and applied SMOTE + 5-fold StratifiedKFold CV.
Tuned Random Forest (72 combinations) and XGBoost (24 combinations) via GridSearchCV; findings applicable to retention campaign targeting for high-risk segments.

🧠

ML Engineer — Financial Fraud Detection (Deep Learning)

2024 · Self-Initiated

Self-Initiated · PaySim Synthetic Dataset

Designed a TensorFlow/Keras ANN (128→64→1, dual Dropout 0.5) on large-scale synthetic payments data; implemented precision-recall threshold optimisation over default 0.5 cutoff.
Scaffolded sklearn Pipeline architectures for Logistic Regression, XGBoost, LightGBM with integrated SMOTE and RandomizedSearchCV for multi-model benchmarking.
Implemented EarlyStopping callbacks, SMOTE + MinMaxScaler, and joblib serialisation for deployment-ready inference.

Portfolio

Featured Projects

Production-ready ML systems with measurable impact and modern engineering practices.

MLOPS · KUBERNETES · FASTAPI

Credit Card Fraud Detection Pipeline

99.9% ACC · 93.2% F1 · 98.7% AUC

Production-grade fraud detection on 284k transactions — XGBoost + SMOTE, Kubernetes HPA autoscaling, dual GitHub Actions CI/CD, 90%+ test coverage via Pytest.

XGBoostKubernetesFastAPIDVCMLflowPrefect

MLOPS · DOCKER · CICD

Bank Churn MLOps Pipeline

84% ACC · 63.1% F1 · Zero-touch deploy

End-to-end MLOps system: DVC data versioning, Prefect orchestration, MLflow registry, FastAPI serving, Docker Hub distribution, GitHub Actions CI/CD.

Random ForestDockerMLflowDVCDagsHubFastAPI

DATA SCIENCE · REVENUE MODELLING

British Airways Lounge Analysis

£262.8M projected profit · 3% model error

Demand forecasting and revenue modelling for BA Terminal 3 lounge operations — 50k+ records, 6-panel dashboard, stakeholder executive summary.

PandasMatplotlibSeabornExcelNumPy

CLASSIFICATION · FEATURE ENGINEERING

BA Booking Conversion Prediction

0.62 Recall · 8 engineered features

Binary classification on 50k records to predict flight booking completion — SMOTE, 8 engineered features, Random Forest + XGBoost, stakeholder presentation.

XGBoostRandom ForestSMOTEscikit-learn

CLASSIFICATION · CHURN ANALYSIS

Lloyds Bank Churn Prediction

4 models compared · 72 param combos tuned

End-to-end churn prediction pipeline comparing LR, Random Forest, XGBoost, and SVM — 4 engineered features, SMOTE, 5-fold StratifiedKFold CV.

XGBoostSVMSMOTEGridSearchCV

DEEP LEARNING · ANN · FRAUD

Synthetic Financial Fraud Detection

ANN 128→64→1 · Threshold-tuned F1

TensorFlow/Keras ANN with Dropout regularisation, precision-recall threshold optimisation, SMOTE + MinMaxScaler pipeline, and joblib deployment-ready serialisation.

TensorFlowKerasXGBoostLightGBMSMOTE

Credentials

Certifications & Education

Continuous learning backed by industry-recognised credentials.

☁️

AWS Fundamentals

Amazon Web Services · 2024

⚙️

MLOps Specialization

DeepLearning.AI / Coursera · 2025

🧠

Deep Learning Specialization

DeepLearning.AI / Coursera · 2024

🐳

Docker & Kubernetes Fundamentals

Linux Foundation · 2024

📊

Data Science Professional Certificate

IBM / Coursera · 2023

🎓

B.Sc. Data Science

EASTC University · Tanzania

Let's Connect

Get In Touch

Open to remote roles worldwide. I reply within 24 hours.

I'm actively seeking remote opportunities in Europe and worldwide as a Full-Stack Data Scientist or MLOps Engineer. Whether you have a role, project, or just want to connect — drop me a message.

📧

leloemmanuel540@gmail.com

💼

LinkedIn Profile

🐙

github.com/emmanuelmassawe

🔬

DagsHub — emmanuelmassawe200

✉️ Send Me a Message

Your Name *

Your Email *

Subject *

Message *

Emmanuel John Lelo

Full-Stack Data Scientist & MLOps Engineer

📧 leloemmanuel540@gmail.com

📱 +255 696 393 526

💼 linkedin.com/in/emmanuel-john-61b343334

🐙 github.com/emmanuelmassawe

📍 Dar-es-Salaam, Tanzania

🌍 Open to Remote — Europe & Worldwide 1 Year Experience Python · ML · MLOps · Kubernetes

⚡ Core Skills

Python & Data Libraries88%

Machine Learning82%

SQL & Data Analysis85%

MLOps & CI/CD75%

Docker & Kubernetes72%

AWS Cloud70%

🛠️ Tools & Frameworks

Scikit-learnTensorFlowXGBoost LightGBMHuggingFaceMLflow DVCDagsHubGitHub Actions PrefectFastAPIEvidently AI PrometheusGrafanaDocker KubernetesAWS SageMakerPytest

🎓 Education

B.Sc. Data Science

EASTC · Dar-es-Salaam

📜 Certifications

▸ AWS Fundamentals

▸ MLOps Specialization — DeepLearning.AI

▸ Deep Learning Specialization — DeepLearning.AI

▸ Docker & Kubernetes — Linux Foundation

▸ Data Science Certificate — IBM

🗣️ Languages

▸ English — Fluent

▸ Swahili — Native

👤 Professional Summary

Results-driven Full-Stack Data Scientist & MLOps Engineer with 1 year of hands-on experience across data analysis, ML model development, and production deployment pipelines. Shipped 2 end-to-end MLOps systems (bank churn + credit card fraud) achieving up to 99.9% accuracy. Skilled in DVC, MLflow, Docker, Kubernetes, FastAPI, and GitHub Actions CI/CD. Actively seeking remote opportunities in Europe and worldwide.

💼 Experience & Projects

MLOps Engineer — Credit Card Fraud Detection

2024

Self-Initiated · Kubernetes · XGBoost · FastAPI

284,807 transactions; 99.9% accuracy, 93.2% F1, 98.7% ROC-AUC. Sub-100ms FastAPI inference.
Full Kubernetes deployment (HPA autoscaling, rolling updates) + dual GitHub Actions CI/CD, 90%+ test coverage.
Complete MLOps: DVC · MLflow · DagsHub · Prefect · 5-stage reproducible pipeline.

MLOps Engineer — Bank Churn Prediction System

2024

Self-Initiated · Docker · MLflow · Prefect

84% accuracy; Docker image published to Docker Hub; zero-touch CI/CD via GitHub Actions.
DVC + DagsHub data versioning; Prefect orchestration; FastAPI REST serving.

Data Scientist — British Airways (Forage)

2024

Lounge Demand Modelling + Booking Conversion Prediction

£262.8M profit model from 50k+ records; predictions within 3% of BA actual data.
Booking conversion: 0.62 recall on 50k records; 8 engineered features; Random Forest + XGBoost.

Data Scientist — Lloyds Bank & Financial Fraud (Deep Learning)

2024

Churn Prediction · ANN Fraud Detection · scikit-learn · TensorFlow

4 classifiers compared (LR, RF, XGBoost, SVM); 72+24 hyperparameter combinations tuned via GridSearchCV.
TensorFlow ANN (128→64→1, Dropout 0.5); precision-recall threshold optimisation; joblib serialisation.

🚀 Key Projects

🔐 Fraud Detection (K8s)

99.9% acc · Kubernetes HPA · sub-100ms API

XGBoostKubernetesFastAPI

🏭 Bank Churn MLOps

84% acc · Docker · Zero-touch CI/CD

MLflowDVCPrefect

✈️ BA Lounge Analysis

£262.8M model · 3% prediction error

PandasSeaborn

🧠 ANN Fraud Detection

TF/Keras · Threshold-tuned F1 · joblib

TensorFlowLightGBM

Emmanuel John Lelo

Full-Stack Data Scientist & MLOps Engineer

🌍 Open to Remote — Europe & Worldwide 1 Year Experience Python · ML · MLOps · Kubernetes

📧 leloemmanuel540@gmail.com

📱 +255 696 393 526

📍 Dar-es-Salaam, Tanzania

💼 linkedin.com/in/emmanuel-john-61b343334

🐙 github.com/emmanuelmassawe

🔬 dagshub.com/emmanuelmassawe200

👤 Professional Profile

I'm a dedicated Full-Stack Data Scientist and MLOps Engineer with 1 year of professional experience spanning data analysis, machine learning, and production ML systems. I've shipped 2 end-to-end MLOps pipelines achieving up to 99.9% accuracy, deployed on Kubernetes with full CI/CD, experiment tracking, and data versioning. I cover the complete ML lifecycle — EDA, feature engineering, model training, containerised serving, and monitoring. Actively seeking remote opportunities in Europe and worldwide.

💼 Experience & Projects

MLOps Engineer — Credit Card Fraud Detection System

2024 · Self-Initiated

github.com/emmanuelmassawe/credit-card-fraud-detection

Production fraud detection on 284,807 transactions: 99.9% accuracy, 93.2% F1, 98.7% ROC-AUC with XGBoost + SMOTE. Sub-100ms FastAPI inference.
Kubernetes deployment (Minikube + GKE/EKS/AKS) with HPA autoscaling, rolling updates, health probes.
Full MLOps: DVC versioning, MLflow + DagsHub tracking, Prefect orchestration, dual GitHub Actions workflows, 90%+ Pytest coverage.
5-stage reproducible DVC pipeline: ingestion → preprocessing → training → evaluation → serving.

MLOps Engineer — Bank Churn Prediction Pipeline

2024 · Self-Initiated

emmanuely2000/bank-churn-api · Docker Hub

Production MLOps pipeline: DVC + DagsHub data versioning, MLflow experiment tracking, Prefect orchestration, FastAPI serving.
84% accuracy, 63.1% weighted F1. Docker image published; GitHub Actions CI/CD (zero manual deployment).
Resolved production challenges: sklearn version mismatch, Docker secrets management, FastAPI CORS.

Data Scientist — British Airways Virtual Experience (Forage)

2024

Lounge Analysis: Modelled demand on 50k+ records; £262.8M revenue model, predictions within 3% of BA actual data. 6-panel Seaborn dashboard + Excel workbook.
Booking Prediction: 0.62 recall on 50k records; 8 engineered features; SMOTE + RF/XGBoost; booking origin = 34% feature importance.

Data Scientist — Lloyds Bank Churn + Financial Fraud (Deep Learning)

2024

Lloyds Churn: 4 classifiers on 10k+ records; 4 engineered features; 72+24 GridSearchCV combinations; StratifiedKFold CV.
ANN Fraud Detection: TF/Keras ANN (128→64→1, Dropout 0.5) on PaySim dataset; precision-recall threshold optimisation; sklearn Pipelines for LR, XGBoost, LightGBM.

⚡ Technical Skills

🐍 Programming & Data

PythonSQLRPandasNumPy

🧠 ML & Deep Learning

Scikit-learnTensorFlowXGBoostLightGBMHuggingFace

⚙️ MLOps & DevOps

MLflowDVCDagsHubPrefectGitHub Actions

☁️ Cloud & Infrastructure

AWS SageMakerAWS S3DockerKubernetesHelm

📊 Analysis & Visualisation

PostgreSQLMongoDBPlotlyPower BIStreamlit

🔍 Serving & Monitoring

FastAPIEvidently AIPrometheusGrafanaPytest

🚀 Featured Projects

🔐

Fraud Detection (Kubernetes)

99.9% acc · 93.2% F1 · HPA autoscaling · 90%+ test coverage · sub-100ms API

XGBoostKubernetesFastAPIMLflow

🏭

Bank Churn MLOps Pipeline

84% acc · Docker Hub · Zero-touch CI/CD · DVC + MLflow + Prefect

MLflowDVCDockerPrefect

✈️

BA Lounge Revenue Model

£262.8M projection · 50k+ records · 3% model error · 6-panel dashboard

PandasSeabornMatplotlib

📈

BA Booking Conversion

0.62 recall · 8 features engineered · 50k records · SMOTE

XGBoostSMOTEscikit-learn

🏦

Lloyds Bank Churn

4 models · 72 GridSearch combos · StratifiedKFold · SMOTE

XGBoostSVMGridSearchCV

🧠

ANN Fraud Detection

TF/Keras 128→64→1 · Dropout · Threshold-tuned · joblib deploy

TensorFlowLightGBMSMOTE

🎓 Education

🎓

B.Sc. Data Science

EASTC University · Tanzania

ML · Algorithms · Database Systems · Statistics

📚

Self-Directed Learning

Coursera · DeepLearning.AI · Kaggle · AWS

2023 – 2025

📜 Certifications

☁️

AWS Fundamentals

Amazon Web Services · 2024

⚙️

MLOps Specialization

DeepLearning.AI · 2025

🧠

Deep Learning Specialization

DeepLearning.AI · 2024

🐳

Docker & Kubernetes

Linux Foundation · 2024

📊

Data Science Certificate

IBM / Coursera · 2023

🏅

Machine Learning with Python

freeCodeCamp · 2023

🏆 Key Achievements

99.9%

Accuracy on credit card fraud detection with 284,807 transactions; 93.2% F1 on 0.17% imbalanced class.

End-to-end MLOps systems shipped and containerised, with full CI/CD, experiment tracking, and Kubernetes deployment.

90%+

Automated test coverage via Pytest on fraud detection system; dual GitHub Actions CI/CD workflows.

£262.8M

Revenue projection model for British Airways lounge operations, within 3% of actual BA data.

🗣️ Languages & Soft Skills

Languages

English

Swahili

Soft Skills

Analytical ThinkingProblem SolvingTeam CollaborationFast LearnerRemote-ReadyClear Communication

🌍 Availability

EMMANUELJOHN LELO

About Me

Hello, I'm Emmanuel 👋

Open to Remote Opportunities — Europe & Worldwide

Technical Skills

My Proficiency

Work Experience

Featured Projects

Certifications & Education

Get In Touch

✉️ Send Me a Message

Open to Remote Opportunities — Europe & Worldwide

EMMANUEL
JOHN LELO