IE6400_Day23

html

School

Northeastern University *

*We aren’t endorsed by this school

Course

6400

Subject

Industrial Engineering

Date

Feb 20, 2024

Type

html

Pages

Uploaded by ColonelStraw13148

IE6400 Foundations of Data Analytics Engineering ¶ Fall 2023 ¶ Module 4: Introduction to Machine Learning ¶ Machine Learning Overview ¶ Machine learning (ML) is a branch of artificial intelligence (AI) that focuses on building systems that can learn from data. Rather than being explicitly programmed to perform a task, a machine learning algorithm uses statistical techniques to learn patterns in data and make predictions or decisions based on it. Key Aspects of Machine Learning ¶ 1. Supervised Learning ¶ • Description : An algorithm is trained on a labeled dataset, where the data comes with the correct answers. The algorithm makes predictions and is corrected if those predictions are wrong, leading it to learn over time. • Common Tasks : Classification (categorizing items) and regression (predicting numerical values). 2. Unsupervised Learning ¶ • Description : The algorithm is given data without any explicit instructions on what to do with it. It tries to learn patterns and the structure from the data. • Common Tasks : Clustering (grouping similar items) and association (finding rules that describe data). 3. Reinforcement Learning ¶ • Description : An agent learns how to behave in an environment by performing actions and receiving rewards or penalties. • Analogy : Teaching a dog new tricks. The dog is the agent, the environment is where the dog can perform tricks, and rewards (or penalties) are treats (or lack of treats). 4. Semi-Supervised and Active Learning ¶ • Description : These methods use both labeled and unlabeled data for training. Typically, a small amount of labeled data and a large amount of unlabeled data are used. 5. Deep Learning ¶ • Description : A subset of ML, deep learning models data with deep neural networks, which are algorithms inspired by the structure of the brain. Particularly powerful for tasks like image and speech recognition. Applications of Machine Learning ¶ Machine learning has a myriad of applications, including: • Web search engines • Recommendation systems (e.g., Netflix, Amazon) • Image and speech recognition • Medical diagnosis • Financial forecasting The core idea behind machine learning is that machines take data and "learn" from it, thereby improving their performance over time without being explicitly programmed for the task at hand. Popular Machine Learning Algorithms ¶ Machine learning encompasses a wide range of algorithms used for various tasks. Here's an overview of some of the most popular ones:

1. Linear Regression ¶ • Type : Supervised • Use Case : Predicting a continuous target variable based on one or more input features. • Description : Assumes a linear relationship between the inputs and the target. It tries to find the best-fit straight line that accurately predict the output values within a range. 2. Logistic Regression ¶ • Type : Supervised • Use Case : Binary classification problems. • Description : Estimates the probability that a given instance belongs to a particular category. Despite its name, it's used for classification, not regression. 3. Decision Trees ¶ • Type : Supervised • Use Case : Classification and regression tasks. • Description : Splits the data into subsets based on the value of input features. This process is repeated recursively, resulting in a tree-like model of decisions. 4. Random Forest ¶ • Type : Supervised • Use Case : Classification and regression. • Description : An ensemble method that creates a 'forest' of decision trees. Each tree is trained on a random subset of the data and makes its own predictions. The random forest algorithm then aggregates these predictions to produce a final result. 5. Support Vector Machines (SVM) ¶ • Type : Supervised • Use Case : Classification and regression. • Description : Tries to find a hyperplane that best separates the classes of data. It's particularly useful for classifying complex but small- or medium-sized datasets. 6. K-Means Clustering ¶ • Type : Unsupervised • Use Case : Clustering similar data points together. • Description : Partitions the data into 'K' number of clusters where each data point belongs to the cluster with the nearest mean. 7. Neural Networks (Deep Learning) ¶ • Type : Supervised, Unsupervised • Use Case : Complex tasks like image and speech recognition. • Description : Composed of layers of nodes or 'neurons'. Can automatically learn and extract features from raw data. 8. Naive Bayes ¶ • Type : Supervised • Use Case : Classification tasks, often used for text data. • Description : Based on Bayes' theorem with the 'naive' assumption of conditional independence between every pair of features. 9. Principal Component Analysis (PCA) ¶ • Type : Unsupervised • Use Case : Dimensionality reduction. • Description : Transforms the original variables into a new set of variables (the principal components) which are orthogonal (and linearly independent) and which reflect the maximum variance in the data.

10. Gradient Boosting Machines (GBM) ¶ • Type : Supervised • Use Case : Classification and regression. • Description : Builds an additive model in a forward stage-wise fashion. It allows for the optimization of arbitrary differentiable loss functions. Each of these algorithms has its strengths and weaknesses and is suitable for different types of tasks. The choice of algorithm often depends on the size, quality, and nature of data, the task to be performed, and the available computational resources. Linear Regression Model ¶ Linear Regression is one of the simplest and most commonly used statistical techniques for predictive modeling. It is used to model the relationship between a dependent variable and one or more independent variables by fitting a linear equation to observed data. Key Concepts of Linear Regression ¶ Dependent Variable ¶ This is the target variable that we are trying to predict or explain. In the context of the Boston Housing dataset, it is the median value of owner-occupied homes ( MEDV ). Independent Variables ¶ These are the features or predictors that we use to predict the dependent variable. In the Boston Housing dataset, these could be features like CRIM (crime rate), RM (average number of rooms per dwelling), etc. Linear Relationship ¶ Linear Regression assumes that there is a linear relationship between the independent variables and the dependent variable. This means that if you plot the independent variable(s) on the x-axis and the dependent variable on the y-axis, the data points should fall around a straight line. Equation of a Line ¶ The equation for a line in a simple linear regression (one independent variable) is: $y = \beta_0 + \beta_1x + \epsilon$ where: • ( $y$ ) is the dependent variable, • ( $\beta_0$ ) is the y-intercept, • ( $\beta_1$ ) is the slope of the line, • ( $x$ ) is the independent variable, • ( $\epsilon$ ) is the error term. Least Squares Method ¶ The parameters ( $\beta_0$ ) and ( $\beta_1$ ) are chosen such that they minimize the sum of the squared differences between the observed values and the values predicted by the model. This method is known as the Least Squares Method. Evaluation Metrics ¶ To evaluate the performance of a linear regression model, we commonly use metrics such as Mean Squared Error (MSE) and R-squared (( $R^2$ )). • MSE measures the average of the squares of the errors, i.e., the average squared difference between the estimated values and the actual value. • ( $R^2$ ) is a statistical measure of how close the data are to the fitted regression line. It is also known as the coefficient of determination. Application in Python ¶ In Python's sklearn library, the LinearRegression class is used to perform linear regression and make predictions. The model is trained using the .fit() method and predictions are made with the .predict() method. Conclusion ¶ Linear Regression is a good starting point for regression tasks. It works best when the

Your preview ends here