avatarSahiti Kappagantula

Free AI web copilot to create summaries, insights and extended knowledge, download it at here

4649

Abstract

p between the target (dependent variable) and predictor variable (independent variable) is termed as linear regression. It allows us to study and summarize a relation between two variables.</p><ul><li>X = Independent variable</li><li>Y = Dependent variable</li><li>c = y-intercept</li><li>m = Slop of line</li></ul><p id="7012">If we consider the equation we have values for X which is an independent variable, so all we have to do is calculate values for m and c in order to predict the value of Y.</p><p id="628e">So how do we find these variables?</p><p id="4326">In order to find these variables, we can try a bunch of values and try to find out a line which intersects the maximum number of data points. But, how can we find the best fit line?</p><p id="2587">So in order to find the best-fit line, we can use least squares error function that will find the error between the real value of y and the predicted value y`.</p><p id="fe67">The least-squares error function can be represented using the following equation:</p><figure id="8d17"><img src="https://cdn-images-1.readmedium.com/v2/resize:fit:800/1*y_Ro4ifHTHmEXs9h0jKjfQ.png"><figcaption></figcaption></figure><p id="f1b9">Using this function we can find out the error for each predicted data point by comparing it with the actual value of the data point. You then take the summation of all these errors and square them to find out the deviation in the prediction.</p><p id="11b6">If we add the third axis to our graph containing all possible error values and plot it in 3-Dimensional space, it will look like this:</p><figure id="2198"><img src="https://cdn-images-1.readmedium.com/v2/resize:fit:800/1*O8VRFHnKgpHVssq2wnoDOw.png"><figcaption></figcaption></figure><p id="da07">In the above image, the ideal values would be in the bottom black part which will predict the prices close to the actual data point. The next step is to find the best possible values for m and c. This can be done by using the optimization technique called gradient descent.</p><p id="310d">Gradient descent is an iterative method, where we start with initializing some set of values for our variables and improve them slowly by minimizing the error between the actual value and the predicted value.</p><p id="7b09">Now if we think practically the prices of the apartment don’t really depend only on price per square feet, there are many factors such as the number of bedrooms, bathrooms, etc. If we consider those features as well then the equation will look something like this</p><p id="c855"><b><i>Y = b0 + b1x1 + b2x2 + …..+bnxn + c</i></b></p><p id="b27e">This is multilinear regression this belongs to linear algebra, here we can use matrices of size mxn where m are features and n are data points.</p><p id="2c65">Let’s consider another situation where we can use probability to find the condition of the house in order to classify a house based on whether it is in good condition or bad condition. For this, to work we will have to use a technique called Logistic Regression which works on the probability of occurrences represented by a sigmoid function.</p><p id="28c2">In this article, we covered the prerequisites of machine learning and how they are applied in machine learning. So basically, it consists of statistics, calculus, linear algebra, and probability theory. Calculus has techniques used for optimization, linear algebra has algorithms which can work on huge data sets, with probability we can predict the likelihood of occurrences and statistics helps us infer useful insights from the sample of data sets.</p><p id="3cc9">If you wish to check out more articles on the market’s most trending technologies like Python, DevOps, Ethical Hacking, then you can refer to <a href="https://www.edureka.co/blog/?utm_source=medium&amp;utm_medium=content-link&amp;utm_campaign=decision-trees">Edureka’s official site.</a></p><p id="5c9b">Do look out for other articles in this series which will explain the various other aspects of Data Science.</p><blockquote id="20a7"><p><i>1.<a href="https://readmedium.com/data-science-tutorial-484da1ff952b">Data Science Tutorial</a></i></p></blockquote><blockquote id="d7f0"><p><i>2.<a href="https://readmedium.com/math-and-statistics-for-data-science-1152e30cee73">Math And Statistics For Data Science</a></i></p></blockquote><blockquote id="fc96"><p><i>3.<a href="https://readmedium.com/linear-regression-in-r-da3e42f16dd3">Linear Regression in R</a></i></p></blockquote><blockquote id="3a35"><p><i>4.<a href="https://readmedium.com/machine-learning-algorithms-29eea8b69a54">Machine Learning Algorithms</a></i></p></blockquote><blockquote id="e637"><p><

Options

i>5.<a href="https://readmedium.com/logistic-regression-in-r-2d08ac51cd4f">Logistic Regression In R</a></i></p></blockquote><blockquote id="13a3"><p><i>6.<a href="https://readmedium.com/classification-algorithms-ba27044f28f1">Classification Algorithms</a></i></p></blockquote><blockquote id="c80b"><p><i>7.<a href="https://readmedium.com/random-forest-classifier-92123fd2b5f9">Random Forest In R</a></i></p></blockquote><blockquote id="22fa"><p><i>8.<a href="https://readmedium.com/a-complete-guide-on-decision-tree-algorithm-3245e269ece">Decision Tree in R</a></i></p></blockquote><blockquote id="761a"><p><i>9.<a href="https://readmedium.com/introduction-to-machine-learning-97973c43e776">Introduction To Machine Learning</a></i></p></blockquote><blockquote id="9aa4"><p><i>10.<a href="https://readmedium.com/naive-bayes-in-r-37ca73f3e85c">Naive Bayes in R</a></i></p></blockquote><blockquote id="17b9"><p><i>11.<a href="https://readmedium.com/statistics-and-probability-cf736d703703">Statistics and Probability</a></i></p></blockquote><blockquote id="0d11"><p><i>12.<a href="https://readmedium.com/decision-trees-b00348e0ac89">How To Create A Perfect Decision Tree?</a></i></p></blockquote><blockquote id="1658"><p><i>13.<a href="https://readmedium.com/data-scientists-myths-14acade1f6f7">Top 10 Myths Regarding Data Scientists Roles</a></i></p></blockquote><blockquote id="886b"><p><i>14.<a href="https://readmedium.com/data-science-projects-b32f1328eed8">Top Data Science Projects</a></i></p></blockquote><blockquote id="b32b"><p><i>15.<a href="https://readmedium.com/data-analyst-vs-data-engineer-vs-data-scientist-27aacdcaffa5">Data Analyst vs Data Engineer vs Data Scientist</a></i></p></blockquote><blockquote id="14f9"><p><i>16.<a href="https://readmedium.com/types-of-artificial-intelligence-4c40a35f784">Types Of Artificial Intelligence</a></i></p></blockquote><blockquote id="f094"><p><i>17.<a href="https://readmedium.com/r-vs-python-48eb86b7b40f">R vs Python</a></i></p></blockquote><blockquote id="9b00"><p><i>18.<a href="https://readmedium.com/ai-vs-machine-learning-vs-deep-learning-1725e8b30b2e">Artificial Intelligence vs Machine Learning vs Deep Learning</a></i></p></blockquote><blockquote id="9014"><p><i>19.<a href="https://readmedium.com/machine-learning-projects-cb0130d0606f">Machine Learning Projects</a></i></p></blockquote><blockquote id="2f89"><p><i>20.<a href="https://readmedium.com/data-analyst-interview-questions-867756f37e3d">Data Analyst Interview Questions And Answers</a></i></p></blockquote><blockquote id="5fa3"><p><i>21.<a href="https://readmedium.com/data-science-and-machine-learning-for-non-programmers-c9366f4ac3fb">Data Science And Machine Learning Tools For Non-Programmers</a></i></p></blockquote><blockquote id="5296"><p><i>22.<a href="https://readmedium.com/top-10-machine-learning-frameworks-72459e902ebb">Top 10 Machine Learning Frameworks</a></i></p></blockquote><blockquote id="d267"><p><i>23.<a href="https://readmedium.com/statistics-for-machine-learning-c8bc158bb3c8">Statistics for Machine Learning</a></i></p></blockquote><blockquote id="b6d5"><p><i>24.<a href="https://readmedium.com/random-forest-classifier-92123fd2b5f9">Random Forest In R</a></i></p></blockquote><blockquote id="f8c3"><p><i>25.<a href="https://readmedium.com/breadth-first-search-algorithm-17d2c72f0eaa">Breadth-First Search Algorithm</a></i></p></blockquote><blockquote id="9450"><p><i>26.<a href="https://readmedium.com/linear-discriminant-analysis-88fa8ad59d0f">Linear Discriminant Analysis in R</a></i></p></blockquote><blockquote id="30b2"><p><i>27.<a href="https://readmedium.com/supervised-learning-5a72987484d0">Supervised Learning</a></i></p></blockquote><blockquote id="2378"><p><i>28.<a href="https://readmedium.com/r-shiny-tutorial-47b050927bd2">Interactive WebApps using R Shiny</a></i></p></blockquote><blockquote id="e590"><p><i>29.<a href="https://readmedium.com/top-10-machine-learning-books-541f011d824e">Top 10 Books for Machine Learning</a></i></p></blockquote><blockquote id="bf93"><p><i>30.<a href="https://readmedium.com/unsupervised-learning-40a82b0bac64">Unsupervised Learning</a></i></p></blockquote><blockquote id="5acf"><p><i>31.<a href="https://readmedium.com/10-best-books-data-science-9161f8e82aca">10 Best Books for Data Science</a></i></p></blockquote><blockquote id="24bd"><p><i>32.<a href="https://readmedium.com/machine-learning-with-r-c7d3edf1f7b">Machine Learning using R</a></i></p></blockquote><p id="7662"><i>Originally published at <a href="https://www.edureka.co/blog/prerequisites-for-machine-learning/">https://www.edureka.co</a> on August 6, 2019.</i></p></article></body>

What are the Prerequisites for Machine Learning?

Prerequisites for Machine Learning — Edureka

Machine Learning is undoubtedly the most in-demand technology of the era! If you’re a beginner who’s getting started with Machine Learning, it’s important you know the prerequisites for Machine Learning. This blog will help you understand the different concepts that you need to know before you get started with Machine Learning.

Here’s a list of topics covered in this blog:

  1. Prerequisites for Machine Learning
  2. Understanding Machine Learning with a use case

Prerequisites For Machine Learning

To get started with Machine Learning you must be familiar with the following concepts:

  1. Statistics
  2. Linear Algebra
  3. Calculus
  4. Probability
  5. Programming Languages

Statistics

Statistics contain tools that can be used to get some outcome from the data. There is descriptive statistics which is used to transform raw data in some important information. Also, inferential statistics can be used to get important information from a sample of data instead of using complete dataset.

Linear Algebra

Linear algebra deals with vectors, matrices, and linear transformations. It is very important in machine learning as it can be used to transform and perform operations on the dataset.

Calculus

Calculus is an important field in mathematics and it plays an integral role in many machine learning algorithms. Data set having multiple features are used to build machine learning models as features are multiple multivariable calculi plays an important role to build a machine learning model. Integrations and Differentiations are a must.

Probability

Probability helps predict the likelihood of the occurrences, It helps us to reason the situation may or may not happen again. For machine learning, probability is a foundation.

Programming language

It is essential to know programming languages like R and Python in order to implement the whole Machine Learning process. Python and R both provide in-built libraries that make it very easy to implement Machine Learning algorithms.

Apart from having basic programming knowledge, it is also important that you know how to extract, process and analyze data. This is one of the most important skills that is needed for Machine Learning.

Machine Learning Use Case

Machine learning is all about creating an algorithm that can learn from data to make a prediction like what kinds of objects are there in the picture, or recommendation engine, the best combination of drugs to cure the certain disease or spam filtering.

Machine learning is built on mathematical prerequisites and if you know why maths is used in machine learning it will make it fun. You need to know the maths behind the functions you will be using and which model is suitable for the data and why.

So let’s start with an interesting problem of predicting house prices, having a dataset containing a history of different features and prices, for now, we will consider the area of the living space in square feet and the prices.

Now we have a data set containing two columns as shown below:

There must be some correlation between these two variables to find out we will need to build a model which can predict the price of houses, how can we do it?

Let’s graph this data and see how it looks like:

Here we are trying to create a line which is termed as:

Y=mX + c

This method of predicting the linear relationship between the target (dependent variable) and predictor variable (independent variable) is termed as linear regression. It allows us to study and summarize a relation between two variables.

  • X = Independent variable
  • Y = Dependent variable
  • c = y-intercept
  • m = Slop of line

If we consider the equation we have values for X which is an independent variable, so all we have to do is calculate values for m and c in order to predict the value of Y.

So how do we find these variables?

In order to find these variables, we can try a bunch of values and try to find out a line which intersects the maximum number of data points. But, how can we find the best fit line?

So in order to find the best-fit line, we can use least squares error function that will find the error between the real value of y and the predicted value y`.

The least-squares error function can be represented using the following equation:

Using this function we can find out the error for each predicted data point by comparing it with the actual value of the data point. You then take the summation of all these errors and square them to find out the deviation in the prediction.

If we add the third axis to our graph containing all possible error values and plot it in 3-Dimensional space, it will look like this:

In the above image, the ideal values would be in the bottom black part which will predict the prices close to the actual data point. The next step is to find the best possible values for m and c. This can be done by using the optimization technique called gradient descent.

Gradient descent is an iterative method, where we start with initializing some set of values for our variables and improve them slowly by minimizing the error between the actual value and the predicted value.

Now if we think practically the prices of the apartment don’t really depend only on price per square feet, there are many factors such as the number of bedrooms, bathrooms, etc. If we consider those features as well then the equation will look something like this

Y = b0 + b1x1 + b2x2 + …..+bnxn + c

This is multilinear regression this belongs to linear algebra, here we can use matrices of size mxn where m are features and n are data points.

Let’s consider another situation where we can use probability to find the condition of the house in order to classify a house based on whether it is in good condition or bad condition. For this, to work we will have to use a technique called Logistic Regression which works on the probability of occurrences represented by a sigmoid function.

In this article, we covered the prerequisites of machine learning and how they are applied in machine learning. So basically, it consists of statistics, calculus, linear algebra, and probability theory. Calculus has techniques used for optimization, linear algebra has algorithms which can work on huge data sets, with probability we can predict the likelihood of occurrences and statistics helps us infer useful insights from the sample of data sets.

If you wish to check out more articles on the market’s most trending technologies like Python, DevOps, Ethical Hacking, then you can refer to Edureka’s official site.

Do look out for other articles in this series which will explain the various other aspects of Data Science.

1.Data Science Tutorial

2.Math And Statistics For Data Science

3.Linear Regression in R

4.Machine Learning Algorithms

5.Logistic Regression In R

6.Classification Algorithms

7.Random Forest In R

8.Decision Tree in R

9.Introduction To Machine Learning

10.Naive Bayes in R

11.Statistics and Probability

12.How To Create A Perfect Decision Tree?

13.Top 10 Myths Regarding Data Scientists Roles

14.Top Data Science Projects

15.Data Analyst vs Data Engineer vs Data Scientist

16.Types Of Artificial Intelligence

17.R vs Python

18.Artificial Intelligence vs Machine Learning vs Deep Learning

19.Machine Learning Projects

20.Data Analyst Interview Questions And Answers

21.Data Science And Machine Learning Tools For Non-Programmers

22.Top 10 Machine Learning Frameworks

23.Statistics for Machine Learning

24.Random Forest In R

25.Breadth-First Search Algorithm

26.Linear Discriminant Analysis in R

27.Supervised Learning

28.Interactive WebApps using R Shiny

29.Top 10 Books for Machine Learning

30.Unsupervised Learning

31.10 Best Books for Data Science

32.Machine Learning using R

Originally published at https://www.edureka.co on August 6, 2019.

Machine Learning
Prerequisites For Ml
Machine Learning Ai
Python
Data Science
Recommended from ReadMedium