avatarNaina Chaturvedi

Summary

The website content outlines a comprehensive educational series on Machine Learning Operations (MLOps), Data Analytics, System Design, and foundational Data Structures and Algorithms, complete with projects, examples, and in-depth discussions, aimed at equipping readers with practical skills in the tech industry.

Abstract

The provided web content introduces a multi-faceted learning journey encompassing 30 days of MLOps, 30 days of Data Analytics, a Complete Data Structures and Algorithms series, and a Complete System Design series, all accompanied by practical projects and examples. The series is designed to cover essential topics such as MLOps principles, data preprocessing, SQL basics, modeling, developing, testing, production, and research papers in MLOps. It also includes system design case studies and a hands-on approach to learning Python and Tableau for data analysis. The content emphasizes the importance of clear objectives, feasibility checks, and the application of theoretical knowledge to real-world projects, with the ultimate goal of enabling readers to become proficient in tech-focused roles, from data science to ML engineering.

Opinions

  • The author believes in the practical application of knowledge, emphasizing the importance of projects and examples in learning.
  • There is a strong focus on the intersection of Data Engineering, Machine Learning, and DevOps, suggesting a holistic approach to MLOps.
  • The content suggests that MLOps is experimental and crucial for streamlining and deploying ML processes effectively.
  • The author advocates for a depth of understanding in key areas of MLOps rather than a superficial coverage of all topics.
  • There is an opinion that learning should be structured and progressive, starting with foundational concepts and moving towards more complex topics.
  • The inclusion of system design case studies indicates the author's view that system design is a critical skill for professionals in the tech industry.
  • The author values the use of Git and Docker for versioning and containerization

Day 1 of 30 days of Machine Learning Ops

With examples and projects…

Pic credits :gridai

Welcome back peeps! Hope all is going well. So, after receiving a great response ( and some really good feedback and inputs) for 60 days of Data Science and ML with projects series, I’m excited to share that I’m starting a new Series — 30 days of Machine Learning Ops with (amazing) projects. PS: I’ll be writing as and when I’m free out of my busy work schedule.

Advanced SQL Series

Day 1 : SQL Basics and Kick start of Advanced SQL Series

Day 2 : SQL Basics, Query Structure, Built In functions Conditions

Day 3 : Most Important Commands, Joins and Filters

Day 4 : Set Theory Operations, Stored Procedures and CASE statements in SQL

Day 5 : Wildcards, Aggregation and Sequences in SQL

Day 6 : Subqueries, Group by, order by and Having clauses in SQL and Analytical Functions

Day 7 : Window Functions, Grouping Sets and Constraints in SQL

Day 8 : BigQuery Basics, SELECT, FROM, WHERE and Date and Extract in BigQuery

Day 9 : Common Expression Table, UNNEST Clause, SQL vs NoSQL Databases

Day 10 : Triggers, Pivot and Cursors in SQL

Day 11 : Views, Indexes and Auto Increment in SQL

Day 12 : Query optimizations, Performance tuning in SQL

Day 13 : Introduction to MySQL, PostgreSQL and Mongo DB, Comparison between MySQL and PostgreSQL and Mongo DB, Introduction to SQL and NoSQL Databases

Day 14 : MySQL in Depth

Day 15 : PostgreSQL inDepth

Complete Data Structures and Algorithm Series

Complexity Analysis

Backtracking

Sliding Window

Greedy Technique

Two pointer Technique

Arrays

Linked List

Strings

Stack

Queues

1- D Dynamic Programming

Divide and Conquer Technique

Recursion

Github —

Solved System Design Case Studies

Design Instagram

Design Netflix

Design Reddit

Design Amazon

Design Messenger App

Design Twitter

Design URL Shortener

Design Dropbox

Design Youtube

Design API Rate Limiter

Design Web Crawler

Design Amazon Prime Video

Design Facebook’s Newsfeed

Design Yelp

Design Uber

Design Tinder

Design Tiktok

Design Whatsapp

Most Popular System Design Questions

Mega Compilation : Solved System Design Case studies

Day 2 of 30 days of MLOps —

Let me get to the point — the prerequisite of this series is 60 days of Data Science and ML Series ( complete it before you jump the ship). You must have a basic knowledge of the Data Science and ML and terms that I’ll be using in this MLOps series —

The main aim of 30 days of Machine Learning Ops with (amazing) projects series to understand MLOps from a practical perspective and get hands on practice by implementing projects (without falling in the rabbit hole of too much theory)

Projects Videos —

All the projects, data structures, SQL, algorithms, system design, Data Science and ML , Data Analytics, Data Engineering, , Implemented Data Science and ML projects, Implemented Data Engineering Projects, Implemented Deep Learning Projects, Implemented Machine Learning Ops Projects, Implemented Time Series Analysis and Forecasting Projects, Implemented Applied Machine Learning Projects, Implemented Tensorflow and Keras Projects, Implemented PyTorch Projects, Implemented Scikit Learn Projects, Implemented Big Data Projects, Implemented Cloud Machine Learning Projects, Implemented Neural Networks Projects, Implemented OpenCV Projects,Complete ML Research Papers Summarized, Implemented Data Analytics projects, Implemented Data Visualization Projects, Implemented Data Mining Projects, Implemented Natural Leaning Processing Projects, MLOps and Deep Learning, Applied Machine Learning with Projects Series, PyTorch with Projects Series, Tensorflow and Keras with Projects Series, Scikit Learn Series with Projects, Time Series Analysis and Forecasting with Projects Series, ML System Design Case Studies Series videos will be published on our youtube channel ( just launched).

Subscribe today!

Tech Newsletter —

If you are interested, you can join my newsletter through which I send tech interview tips, techniques, patterns, hacks — Software Development, ML, Data Science, Startups and Technology projects to more than 30K readers. You can subscribe to Tech Brew :

Let’s get started!

I’l be covering only the most important topics in MLOps ( written below)—

1.MLOps Basics and Principles

What is MLOps?

Purpose

What’s important?

2. Data

Complete Python with projects

Pandas and Numpy

Exploratory Data Analysis

Data preprocessing ( Collecting, Labeling and Validating data)

Data Labelling and Advanced Data Labeling Methods

Data Splitting

Feature Engineering

Data Augmentation

SQL Basics

Structured Query Language

Query Structure

Conditions

Joins

Stored Procedures

Aggregations

Wild cards

Grouping Data

Aggregation Functions

Filtering

Sequences

Group By, Order By

Having Clause

Write Sub queries

Grouping Sets

Analytical Functions

Window Functions

Row Numbering

Percentile

Advanced windowing techniques

BigQuery

BigQuery Basics

SELECT, FROM, WHERE and Date and Extract in BigQuery

Common Expression Table

UNNEST Clause

SQL vs NoSQL Database

Advanced Functions

Triggers

Pivot

Cursors

Views

Indexes

Auto Increment

Performance Tuning SQL Queries

Query Optimizations in. SQL

Performance Tuning in SQL

MySQL, PostgreSQL and MongoDB

Introduction to MySQL

Introduction to PostgreSQL

Introduction to Mongo DB

Comparison between MySQL and PostgreSQL and Mongo DB

Introduction to SQL and NoSQL Databases

MySQL in Depth

PostgreSQL inDepth

3.Modeling

Model Training and Evaluation

Model Baselines

Model Tuning and Optimization

Model Review and governance

Automated Model retraining

Model Deployment and monitoring

Model Inference and Serving

Model Resource Management Techniques

Model Analysis

High-Performance Modeling

4.Developing

End — to — End ML Workflow Cycle

ML workflows

MLOps Logging and Documentation

MLOps Makefile

ML Lake

ML Pipelines and toolkits

MLOps tools and Frameworks

5. Testing and Reproducibility

Git

Versioning

Docker

6. Production

Continuous Integration

Continuous Delivery and Deployment

Monitoring and Logging

Feature Stores

MLOps architecture and Infrastructure Stack

Model Serving Patterns and Infrastructures

Model fairness, Explainability issues, and Mitigate bottlenecks

7. MLOps (Amazing) Papers

Some amazing MLOps research papers that I have read over the years to help you boot up to the industry standards and what’s next in this field.

So, Let’s dive in!

What is MLOps?

To put it in simple words, MLOps is the heart of ML engineering and processes. It lies at the intersection of Data Engineering, Machine Learning and DevOps.

MLOps is the process which is used to efficiently write, deploy, run the ML applications, establish collaboration and communication between data scientists and ML engineers team.

Pic credits : nassco

Why MLOps ?

  1. To automate the deployment of ML and DL models in large-scale production environments
  2. Robust testing of ML artifacts
  3. Enables agile principles to be used in ML projects
  4. Helps in Scaling and deploying ML models
  5. To improve the quality of production ML
  6. MLOps is more experimental

and many more things that you will learn as we proceed further in this series.

Stages of MLOps —

Pic credits :gridai

Data Collection and Labeling

Data analysis

Data pre-processing and transformation

Model training and validation

Model serving

Model monitoring and Logging

Model Retraining and experimentation

Data Feedback Looping

Purpose

To put it straight, the purpose here is to explore in depth how MLOps can help us streamline, automate and deploy the crucial stages of ML processes which is covered in the ML canvas below.

Pic credits : indatalabs

Following above template can help us answer transparently what and why, how and when of ML processes and product.

What’s Important?

While MLOps is a wide topic and honestly to be able to cover all of it isn’t the goal of this series, instead the goal is to cover the most important topics and build a depth which can boot you at your job as a data scientist/ML engineer. So, to summarize 4 things are important —

Clear ( and practical) Objective — Laying down a clear objective is important — i.e the key objectives that we want to focus on.

Checking the feasibility of proposed solution — core functionalities, constraints, integration, production deployment etc

Data — Collecting and validation of data, labeling the data well and preprocessing it

Evaluation and deployment — Laying down evaluation metrics and deployment processes.

That’s it for now!

Day 2 —

System Design Case Studies — In Depth

Design Instagram

Design Messenger App

Design Twitter

Design URL Shortener

Design Dropbox

Design Youtube

Design API Rate Limiter

Design Web Crawler

Design Facebook’s Newsfeed

Most Popular System Design Questions

Mega Compilation : Solved System Design Case studies

Complete Data Structures and Algorithm Series

Complexity Analysis

Sliding Window

Backtracking

Greedy Technique

Two pointer Technique

1- D Dynamic Programming

Divide and Conquer Technique

Recursion

Github —

Complete System Design Series Parts —

1. System design basics

2. Horizontal and vertical scaling

3. Load balancing and Message queues

4. High level design and low level design, Consistent Hashing, Monolithic and Microservices architecture

5. Caching, Indexing, Proxies

6. Networking, How Browsers work, Content Network Delivery ( CDN)

7. Database Sharding, CAP Theorem, Database schema Design

8. Concurrency, API, Components + OOP + Abstraction

9. Estimation and Planning, Performance

10. Map Reduce, Patterns and Microservices

11. SQL vs NoSQL and Cloud

12. Most Popular System Design Questions

Github —

Keep learning and coding :)

30 days of Data Analytics Series —

Day 1 : Data Analytics basics and kickstart of Data analytics with projects series

Day 2: Business Understanding — Data Driven Decision Making, Descriptive Analysis, Predictive Analysis, Diagnostic Analysis, Prescriptive Analysis

Day 3 : Data Analytics Ecosystem — Data Life Cycle, Data Analysis complete process ( most important things)

Day 4 : Probability, Conditional Probability, Binomial Distribution, Probability Density Function, Sampling Distribution

Day 5 : Statistics

Day 6 : Basic and Advanced SQL

Day 7 : Data Collection, Data Cleaning and Python

Day 8 : Pandas and Numpy

Day 9 : Data Manipulation

Day 10 : Data Visualization — Part 1

Day 11 : Project 1 : Data Visualization — Part 2

Day 12 : Data Visualization — Part 3

Day 13: Tableau — Part 1

Day 14: Tableau — Part 2

Day 15: Tableau — Part 3

Tableau Project

Day 16 : Data Analysis Project 2

Day 17 : Data Analysis Project 3

Day 18: Data Analysis Project 4

Day 19: Data Analysis Project 5

Day 20 : Data Analysis Project 6

Categorical and Numerical Features

Missing Value Analysis

Fill the missing Values

Unique Value Analysis

Univariate Analysis

Bivariate Analysis

Multivariate Analysis

Correlation Analysis

Day 21 : Data Analysis Project 7

Data Profiling

Feature Engineering

GroupBy Features

Categorical and Numerical Features

Missing Value Analysis

Fill the missing Values

Unique Value Analysis

Univariate Analysis

Bivariate Analysis

Multivariate Analysis

Correlation Analysis

Day 22 : Data analysis Project 8

Linear Regression

Data Profiling

Feature Engineering

Sort Values

Categorical and Numerical Features

Missing Value Analysis

Unique Value Analysis

Univariate Analysis

Bivariate Analysis

Multivariate Analysis

Correlation Analysis

Correlation Coefficients

Take Complete Hands On Tableau Course : Link

For Python Projects —

For complete 60 days of Data Science and ML : Day 1 — Day 60 : Quick Recap of 60 days of Data Science and ML

Follow for more updates. Stay tuned and keep coding!

For other projects, tune to —

Build Machine Learning Pipelines( With Code)

Recurrent Neural Network with Keras

Clustering Geolocation Data in Python using DBSCAN and K-Means

Facial Expression Recognition using Keras

Hyperparameter Tuning with Keras Tuner

Custom Layers in Keras

Machine Learning
Data Science
Tech
Programming
Artificial Intelligence
Recommended from ReadMedium