li></ul><p id="0d38">Listed below are the individual courses contained within the Nanodegree. The estimated timeline for graduation is 378 hours.</p><ul><li><a href="https://www.udacity.com/course/intro-to-inferential-statistics--ud201">Intro to Inferential Statistics</a></li><li><a href="https://www.udacity.com/course/intro-to-descriptive-statistics--ud827">Intro to Descriptive Statistics</a></li><li><a href="https://www.udacity.com/course/intro-to-data-analysis--ud170">Intro to Data Analysis (Using NumPy and Pandas)</a></li><li><a href="https://www.udacity.com/course/data-wrangling-with-mongodb--ud032">Data Wrangling</a></li><li><a href="https://www.udacity.com/course/sql-for-data-analysis--ud198">SQL for Data Analysis</a></li><li><a href="https://www.udacity.com/course/data-wrangling-with-mongodb--ud032">MongoDB for Data Analysis</a></li><li><a href="https://www.udacity.com/course/data-analysis-with-r--ud651">Data Analysis with R</a></li><li><a href="https://www.udacity.com/course/intro-to-machine-learning--ud120">Intro to Machine Learning</a></li><li><a href="https://www.udacity.com/course/data-visualization-and-d3js--ud507">Data Visualization and D3.js</a></li><li><a href="https://www.udacity.com/course/ab-testing--ud257">A/B Testing</a></li></ul><figure id="ff41"><img src="https://cdn-images-1.readmedium.com/v2/resize:fit:800/1*t1CaHkoeuB0SFQWJVPFHpw.png"><figcaption></figcaption></figure><figure id="575d"><img src="https://cdn-images-1.readmedium.com/v2/resize:fit:800/1*0jo6IM7v1TZuUwZKmvGMNQ.png"><figcaption></figcaption></figure><figure id="f4b5"><img src="https://cdn-images-1.readmedium.com/v2/resize:fit:800/1*OOedgXYjvoXrAP2mXzZ0lA.png"><figcaption>Three courses from the <a href="https://www.udacity.com/course/data-analyst-nanodegree--nd002">Udacity Data Analyst Nanodegree</a></figcaption></figure><h2 id="ae3c">Why the Udacity Data Analyst Nanodegree?</h2><p id="1e92">First and foremost, it received <a 
href="https://www.class-central.com/certificate/data-analyst-nanodegree--nd002">stellar reviews</a>. Second, I wanted a consistent learning experience for my introduction to the field. The Data Analyst Nanodegree offered <b>a combination of breadth, depth, and cohesiveness</b> that a combination of content from various providers would be hard pressed to provide. I am also a fan of their “less passive listening (no long lectures) and more active doing” approach to education.</p><figure id="7c5d"><img src="https://cdn-images-1.readmedium.com/v2/resize:fit:800/1*_RSk3RlOdHErS3JrT9LSvw.png"><figcaption>What is a <a href="https://www.udacity.com/nanodegree">Udacity Nanodegree</a>?</figcaption></figure><h1 id="8012">Machine Learning</h1><h2 id="afd8">Learning from data</h2><ul><li><a href="https://www.class-central.com/mooc/835/coursera-machine-learning">Machine Learning</a> (Stanford University/Coursera)</li><li><a href="https://www.class-central.com/mooc/6679/kadenze-creative-applications-of-deep-learning-with-tensorflow">Creative Applications of Deep Learning with TensorFlow</a> (Kadenze) <b>(IN PROGRESS)</b></li><li><a href="https://www.class-central.com/mooc/2965/edx-cs190-1x-scalable-machine-learning">Distributed Machine Learning with Apache Spark</a> (University of California, Berkeley/edX)</li></ul><figure id="b0ec"><img src="https://cdn-images-1.readmedium.com/v2/resize:fit:800/1*SPzfIf6dWPyiDFlI7sWqBg.png"><figcaption></figcaption></figure><figure id="614a"><img src="https://cdn-images-1.readmedium.com/v2/resize:fit:800/1*G5cn4DEe3mI_uhVBqg0BYQ.jpeg"><figcaption></figcaption></figure><figure id="023a"><img src="https://cdn-images-1.readmedium.com/v2/resize:fit:800/1*s8esBEwDZ67cV0v5S7V28g.png"><figcaption>Stanford University, <a href="https://en.wikipedia.org/wiki/TensorFlow">TensorFlow</a> (Google’s open source software library for machine learning), and The University of California, Berkeley</figcaption></figure><h1 id="ac35">Software Engineering</h1><h2 
id="0bae">Best practices</h2><ul><li><a href="https://www.udacity.com/course/software-testing--cs258">Software Testing</a> (Udacity)</li><li><a href="https://www.udacity.com/course/software-debugging--cs259">Software Debugging</a> (Udacity)</li><li><a href="https://www.udacity.com/course/how-to-use-git-and-github--ud775">How to Use Git & GitHub: Version Control for Code</a> (Udacity)</li><li><a href="https://www.coursera.org/specializations/r">Mastering Software Development in R Specialization</a> (Johns Hopkins University/Coursera) <b>(IN PROGRESS)</b></li></ul><p id="cd1e">Listed below are the individual courses contained within Johns Hopkins University’s “Mastering Software Development in R Specialization” on Coursera:</p><ul><li><a href="https://www.coursera.org/learn/r-programming-environment">The R Programming Environment</a></li><li><a href="https://www.coursera.org/learn/advanced-r">Advanced R Programming</a></li><li><a href="https://www.coursera.org/learn/r-packages">Building R Packages</a></li><li><a href="https://www.coursera.org/learn/r-data-visualization">Building Data Visualization Tools</a></li></ul><figure id="b494"><img src="https://cdn-images-1.readmedium.com/v2/resize:fit:800/1*E2KwUM0g78J

kwV_wTvAPEQ.png"><figcaption></figcaption></figure><figure id="da13"><img src="https://cdn-images-1.readmedium.com/v2/resize:fit:800/1*4Pd-AJen5dV8sUt1vkIOIA.png"><figcaption>Johns Hopkins University’s “<a href="https://www.coursera.org/specializations/r">Mastering Software Development in R Specialization</a>” on Coursera</figcaption></figure><h2 id="ea48">Why software engineering?</h2><p id="be80">The role of software engineering in data science is covered in great detail <a href="https://www.experfy.com/blog/how-to-become-a-data-scientist-part-1-3">here</a> by Alec Smith (a data science recruiter) and <a href="http://simplystatistics.org/2016/05/18/software-engineering-data-science/">here</a> by Roger Peng (Johns Hopkins University professor and “Mastering Software Development in R Specialization” creator). A quote from the former:</p><blockquote id="45e8"><p>A lot of data science work is software engineering. Not always in the sense of designing robust systems, but simply writing software. A lot of tasks you can automate and if you want to run experiments, you have to write code, and if you can do it fast, it makes a huge difference.</p></blockquote><p id="f165">And from the <a href="https://www.coursera.org/specializations/r">Mastering Software Development in R Specialization</a> page:</p><blockquote id="c1f4"><p>As the field of data science evolves, it has become clear that software development skills are essential for producing useful data science results and products. 
You will learn modern software development practices to build tools that are highly reusable, modular, and suitable for use in a team-based environment or a community of developers.</p></blockquote><h1 id="0334">Back End Development</h1><h2 id="9a6a">Storing and manipulating data</h2><ul><li><a href="https://www.udacity.com/courses/intro-to-backend--ud171">Intro to Backend</a> (Udacity)</li><li><a href="https://www.udacity.com/courses/developing-scalable-apps-in-python--ud858">Developing Scalable Apps in Python</a> (Google/Udacity)</li><li><a href="https://www.udacity.com/courses/configuring-linux-web-servers--ud299">Configuring Linux Web Servers</a> (Udacity)</li><li><a href="https://www.udacity.com/courses/linux-command-line-basics--ud595">Linux Command Line Basics</a> (Udacity)</li><li><a href="https://www.class-central.com/mooc/1580/stanford-openedx-db-introduction-to-databases">Introduction to Databases</a> (Stanford University)</li></ul><h2 id="31d6">Why back end development?</h2><p id="d11f">This <a href="https://www.quora.com/Do-you-use-both-backend-development-and-data-science-in-your-career-If-so-what-is-your-career-and-how-do-you-use-these-skills-together">Quora page</a> and this <a href="http://blog.udacity.com/2014/12/front-end-vs-back-end-vs-full-stack-web-developers.html">Udacity article</a> suggest that back end development and data science can be a useful combination. These Udacity courses, which are the back end courses in their <a href="https://www.udacity.com/course/full-stack-web-developer-nanodegree--nd004">Full Stack Web Developer Nanodegree</a>, along with Stanford’s top-ranked databases course, add an aspect of data engineering to the curriculum.</p><h1 id="cd25">Additional Resources</h1><h2 id="4cc5">Filling in the gaps. 
Suggestions welcome!</h2><ul><li><a href="https://www.udacity.com/course/intro-to-hadoop-and-mapreduce--ud617">Intro to Hadoop and MapReduce</a> (Cloudera/Udacity)</li><li><a href="https://www.class-central.com/mooc/4343/coursera-using-python-to-access-web-data">Using Python to Access Web Data</a> (University of Michigan/Coursera)</li><li><a href="https://www.coursera.org/learn/build-data-science-team">Building a Data Science Team</a> (Johns Hopkins University/Coursera)</li></ul><p id="650a"><b>This section is fluid.</b> Additional resources will be added as I progress through the curriculum.</p><h1 id="ebdd">That’s it!</h1><p id="248b">Many thanks to <a href="undefined">Dhawal Shah</a> of <a href="https://www.class-central.com/">Class Central</a>, as the ratings and reviews from his online course search engine (plus a few insider tips) helped guide the above curriculum choices.</p><p id="8971">If you have any recommendations for the curriculum, the above subject material in general, or would like to chat about your own educational goals, please don’t hesitate to <a href="https://davidanalyst.com/">contact me</a>.</p><p id="b4e1"><i>Originally published at <a href="http://davidventuri.com/blog/my-data-science-masters">davidventuri.com</a>.</i></p><div id="c6a3" class="link-block"> <a href="https://twitter.com/venturidb"> <div> <div> <h2>David Venturi (@venturidb) | Twitter</h2> <div><h3>The latest Tweets from David Venturi (@venturidb). Creating my own data science master’s degree. @queensu chem eng/econ…</h3></div> <div><p>twitter.com</p></div> </div> <div> <div style="background-image: url(https://miro.readmedium.com/v2/resize:fit:320/0*7Anb0UMOlf3JkaUE.)"></div> </div> </div> </a> </div></article></body>

By DAVID VENTURI

I Dropped Out of School to Create My Own Data Science Master’s — Here’s My Curriculum

With the recent advances in affordable, reputable online education, going back to college/university seems irresponsible

Note: I wrote this article in 2016, so the curriculum is now outdated. Find my latest course recommendations for the aspiring data pro here (updated for 2024).

I dropped out of a top computer science program to teach myself data science using online resources like Udacity, edX, and Coursera. The decision was not difficult. I could learn the content I wanted faster, more efficiently, and at a fraction of the cost. I already had a university degree and, perhaps more importantly, I already had the university experience. Paying $30K+ to go back to school seemed irresponsible.

Here are my curriculum choices and the rationale behind them. Using thousands of course ratings and reviews from Class Central, I selected the best computer science, data science, and machine learning courses from world-class institutions like Harvard, Stanford, MIT, Berkeley, Google, and Facebook. You can read my detailed reviews for most of these courses here on Medium or on my personal website — davidventuri.com.

My curriculum covers both Python and R, which are the two most popular programming languages for data science.

Note: In May 2017, I paused my progress in this program because I joined Udacity as a Content Developer. Another benefit of personalized online education — flexibility!

Bridging Module

A solid computer science foundation

Why a bridging module?

I wanted a solid computer science foundation before I started learning data science. My engineering background gave me a head start on the math and stats. Completing these three courses means I will have completed a standard first-year computer science curriculum, plus the full mathematical and statistical core.

The following courses from my undergrad chemical engineering program are also core computer science courses:

  • ✔ Linear Algebra
  • ✔ Calculus
  • ✔ Multivariable Calculus
  • ✔ Statistics I
  • ✔ Statistics II

Data Science Core

The fundamentals

Listed below are the individual courses contained within the Nanodegree. The estimated timeline for graduation is 378 hours.

  • Intro to Inferential Statistics
  • Intro to Descriptive Statistics
  • Intro to Data Analysis (Using NumPy and Pandas)
  • Data Wrangling
  • SQL for Data Analysis
  • MongoDB for Data Analysis
  • Data Analysis with R
  • Intro to Machine Learning
  • Data Visualization and D3.js
  • A/B Testing

Three courses from the Udacity Data Analyst Nanodegree

Why the Udacity Data Analyst Nanodegree?

First and foremost, it received stellar reviews. Second, I wanted a consistent learning experience for my introduction to the field. The Data Analyst Nanodegree offered a combination of breadth, depth, and cohesiveness that a patchwork of content from various providers would be hard-pressed to match. I am also a fan of their “less passive listening (no long lectures) and more active doing” approach to education.

What is a Udacity Nanodegree?

Machine Learning

Learning from data

  • Machine Learning (Stanford University/Coursera)
  • Creative Applications of Deep Learning with TensorFlow (Kadenze) (IN PROGRESS)
  • Distributed Machine Learning with Apache Spark (University of California, Berkeley/edX)

Stanford University, TensorFlow (Google’s open source software library for machine learning), and The University of California, Berkeley

Software Engineering

Best practices

  • Software Testing (Udacity)
  • Software Debugging (Udacity)
  • How to Use Git & GitHub: Version Control for Code (Udacity)
  • Mastering Software Development in R Specialization (Johns Hopkins University/Coursera) (IN PROGRESS)

Listed below are the individual courses contained within Johns Hopkins University’s “Mastering Software Development in R Specialization” on Coursera:

  • The R Programming Environment
  • Advanced R Programming
  • Building R Packages
  • Building Data Visualization Tools

Johns Hopkins University’s “Mastering Software Development in R Specialization” on Coursera

Why software engineering?

The role of software engineering in data science is covered in great detail here by Alec Smith (a data science recruiter) and here by Roger Peng (Johns Hopkins University professor and “Mastering Software Development in R Specialization” creator). A quote from the former:

A lot of data science work is software engineering. Not always in the sense of designing robust systems, but simply writing software. A lot of tasks you can automate and if you want to run experiments, you have to write code, and if you can do it fast, it makes a huge difference.

And from the Mastering Software Development in R Specialization page:

As the field of data science evolves, it has become clear that software development skills are essential for producing useful data science results and products. You will learn modern software development practices to build tools that are highly reusable, modular, and suitable for use in a team-based environment or a community of developers.

Back End Development

Storing and manipulating data

  • Intro to Backend (Udacity)
  • Developing Scalable Apps in Python (Google/Udacity)
  • Configuring Linux Web Servers (Udacity)
  • Linux Command Line Basics (Udacity)
  • Introduction to Databases (Stanford University)

Why back end development?

This Quora page and this Udacity article suggest that back end development and data science can be a useful combination. These Udacity courses, which are the back end courses in their Full Stack Web Developer Nanodegree, along with Stanford’s top-ranked databases course, add an aspect of data engineering to the curriculum.

Additional Resources

Filling in the gaps. Suggestions welcome!

  • Intro to Hadoop and MapReduce (Cloudera/Udacity)
  • Using Python to Access Web Data (University of Michigan/Coursera)
  • Building a Data Science Team (Johns Hopkins University/Coursera)

This section is fluid. Additional resources will be added as I progress through the curriculum.

That’s it!

Many thanks to Dhawal Shah of Class Central, as the ratings and reviews from his online course search engine (plus a few insider tips) helped guide the above curriculum choices.

If you have any recommendations for the curriculum or the above subject material in general, or would like to chat about your own educational goals, please don’t hesitate to contact me.

Originally published at davidventuri.com.
