Resume / About Me
Sr. Data Engineer • Machine Learning Engineer • Consultant • Mentor • Technical Blogger
Subhadip Mukherjee
Senior Data Engineer | Machine Learning Engineer | Consultant | Mentor
📍 Toronto, Ontario, Canada
📍 Kolkata, West Bengal, India
☎️ US/Canada: +1 647 217 1511
☎️ India: +91 9163916803
✉️ [email protected]
🔗 LinkedIn
🌐 subhadip.ca
👨🏫 Codementor Profile
Professional Summary
With over 15 years of experience across Data Engineering, Cloud, and Machine Learning, I specialize in building scalable, high-performance data platforms and analytics solutions for enterprise clients across Canada, the US, and India.
- 15+ years in Data Warehouse, ETL, and Analytics projects
- 8+ years in Python programming
- 5+ years in Google Cloud Platform (GCP) & dbt
- 3+ years in Machine Learning (Python, RapidMiner, IBM Watson Studio)
- 7+ years with ETL tools like Informatica and DataStage
- 15 years working with SQL and databases (Teradata, Oracle, Netezza, MSSQL, Snowflake)
- 15 years experience in Unix / Linux Shell Scripting
- 9+ years in Data Modeling (PowerDesigner and similar tools)
- Web technologies: HTML, CSS, JS, PHP, FastAPI, VB.NET
- Strong foundation in Cloud-native data solutions, API integration, and automation
Work Experience
- Individual Consultant — Apr 2023 – Present
Advising clients on GCP, Data Pipelines, ML, and Cloud Modernization - Wipro Canada — Technical Lead (Data Engineer) | Oct 2021 – Apr 2023
Best Buy US (via TechM & Wipro) – Built large-scale data pipelines and Snowflake Clean Room integrations - Cubert Inc. — Data Engineer | Nov 2020 – Sep 2021
Developed automated data ingestion workflows and dbt models - CGI / Bell Canada — Machine Learning Engineer | Apr 2018 – Sep 2020
ML pipeline for telecom fraud detection, feature engineering and model serving - IBM Canada — ETL Data Specialist | Feb 2016 – Mar 2018
- IBM India — Application Developer | May 2013 – Feb 2016
- Cognizant Technology Solutions — Programmer Analyst | Jul 2010 – Apr 2013
Technical Skills
- Data Engineering: Airflow, dbt, GCP Dataflow, BigQuery, Composer
- Cloud Platforms: Google Cloud Platform, Snowflake, IBM Bluemix
- ETL Tools: Informatica, DataStage
- Programming: Python (FastAPI, Pandas, NumPy), SQL, Shell Script
- Machine Learning: TensorFlow, scikit-learn, OpenCV, RapidMiner
- Databases: Teradata, Oracle, Netezza, PostgreSQL, MySQL, Snowflake
- Web Development: HTML, CSS, JS, PHP, FastAPI integration
- Version Control: Git, Bitbucket
Education
Bachelor of Technology in Computer Science and Engineering
Jalpaiguri Government Engineering College (2006–2010)
Highlighted Machine Learning Projects
- Mobility Subscription Fraud Detection — Telecom activation data, binary classification using Python, scikit-learn, and GCP (Bell Canada)
- Face Recognition System — Python, OpenCV, Deep Learning (CNN/DNN)
- Social Engineering Toolkit (SET) — Image-based search program over social media for identity verification
- Object Detection in Video — Video stream detection using deep learning and OpenCV
- OCR Decoder — Receipt and handwriting recognition using Tesseract and Python
- Facemask Detection — Real-time mask detection using deep learning and OpenCV
Let’s Build Something Exceptional Together
Open for Consulting • Mentorship • Speaking Engagements • Project Collaborations
