Skip to main content
HomeRoadmapsData Scientist
Production AI & Analytics Path

Data Scientist Roadmap

A complete project-based data science journey across math, statistics, Python, analytics, machine learning, deep learning, MLOps, big data, generative AI, and research.

32

Roadmap nodes

82-112 weeks

Total duration

145+

Data skills

16

Portfolio projects

Data Science Architecture

Project-Based Learning Track

Every phase ends with practical projects that combine datasets, notebooks, dashboards, models, experiments, deployment, and stakeholder communication.

Beginner
  • Data Cleaning Project
  • Sales Analysis Dashboard
  • COVID Data Analysis
  • Student Performance Analysis
Intermediate
  • House Price Prediction
  • Customer Churn Prediction
  • Recommendation System
  • Sentiment Analysis
Advanced
  • Image Classification System
  • Object Detection Model
  • Chatbot using NLP
  • AI-Powered Analytics Platform
Expert
  • Production ML Platform
  • RAG-Based AI Assistant
  • Recommendation Engine at Scale
  • End-to-End MLOps Pipeline

Vertical Data Science Journey

Expand each node for prerequisites, learning outcomes, resources, practice exercises, mini projects, quizzes, bookmarks, and notes.

32/32
01

Core Topics

What is Data ScienceData Science LifecycleTypes of DataStructured vs Unstructured DataData-Driven Decision Making

Skills Gained

  • Apply What is Data Science in notebook and production workflows
  • Use Data Science Lifecycle to make data-driven decisions
  • Communicate uncertainty, tradeoffs, and model behavior
  • Build portfolio-ready analytical proof of skill

Learning Outcomes

  • Explain how Data Science Fundamentals fits into the data science lifecycle
  • Complete a dataset challenge using What is Data Science
  • Evaluate risks, assumptions, and next experiments

Practice Exercises

  • Run an interactive notebook using the Iris dataset.
  • Write an analysis note comparing What is Data Science and Data Science Lifecycle.
  • Create a reproducible experiment or dashboard for this topic.
02
03
04
05
06
07
08
09
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32

Data Science Resource Matrix

Data Science Career Paths

Choose a specialization across analytics, machine learning, AI engineering, MLOps, research, and data leadership.

Data Analyst

Turn messy datasets into clear dashboards, insights, SQL reports, and stakeholder-ready narratives.

EDA portfolioBI dashboardSQL case study

Data Scientist

Own analysis, modeling, evaluation, experimentation, and business impact from idea to recommendation.

Prediction modelExperiment designImpact memo

Machine Learning Engineer

Productionize features, training pipelines, model serving, monitoring, and scalable inference.

Model APIFeature pipelineMonitoring dashboard

AI Engineer

Build LLM, RAG, agentic, and generative AI systems with evaluation and guardrails.

RAG assistantPrompt evalsAgent workflow

Research Scientist

Read papers, design reproducible experiments, test novel methods, and communicate evidence.

Paper replicationAblation studyResearch report

Analytics Engineer

Bridge data engineering and analytics with dbt models, metrics layers, and trusted datasets.

dbt projectData martMetrics layer

Data Science Certifications Tracker

Use certifications as optional validation for analytics, ML, cloud AI, and data engineering competency.

Google Data Analytics

Analytics lifecycle, cleaning, visualization, SQL, dashboards, and stakeholder communication.

Case studySQL portfolioDashboard project

IBM Data Science

Python, notebooks, statistics, machine learning, and applied capstone projects.

Notebook workflowML capstoneModel report

Microsoft Data Scientist

Azure ML, data preparation, model training, deployment, and responsible AI workflows.

Azure ML workspaceExperiment trackingDeployment lab

AWS Machine Learning Engineer

AWS ML services, feature workflows, training jobs, model deployment, and monitoring.

SageMaker labModel endpointMonitoring setup

TensorFlow Developer Certificate

TensorFlow modeling, computer vision, NLP, time series, and deployment basics.

TF modelsCV/NLP practiceExam prep notebooks

Learning Features

Portfolio Milestones

Track notebooks, dashboards, ML systems, GenAI demos, and production data products.

Achievement Badges

Unlock visible badges for statistics, Python, EDA, ML, deep learning, GenAI, MLOps, and research mastery.

Research Notes

Use notes and quizzes to capture assumptions, experiment results, and model tradeoffs.