
Concept explainers
Explanation of Solution
Data Mining:
Data Mining is the extraction of knowledge and data patterns from various raw data sets by examining patterns, trends and other Business Intelligence reports using intelligent methods for classification and prediction.
- Data mining techniques differ from reporting applications, as they are very sophisticated and complex, hence difficult to use.
Difference of factors for reporting and data mining:
Factors | Reporting | Data mining |
Type of objective | Assessment | Prediction |
Company | Target | Netflix |
Analysis | Simple-summing, totaling | Advance statistics |
Types | Noninteractive – RFM, Interactive - OLAP |
Cluster Regression Market basket Decision tree Others |
Artificial Intelligence (AI) is the ability of machines to perform activities that require human intelligence. In AI, machines can have vision, and can perform communication, recognition and learning. In AI, machines also have the ability to make decisions.
Benefits:
- Dealing with heavy and mundane tasks become easier with the help of machines.
- In order to gather and analyze Big Data, AI is extremely useful to improve efficiency.
- AI will potential increase cyber security and improve the security of Internet of Things (IOT).
- The accuracy of working on a thing increases a lot with AI.
- Using AI the use of digital assistants will increase which in turn will decrease the need for human resources.
Difference between Data Mining and Machine Learning:
Data Mining | Machine Learning |
Data Mining is the extraction of knowledge and data patterns from various raw data sets by examining patterns, trends and other Business Intelligence reports using intelligent methods for classification and prediction |
Machine Learning uses various data mining techniques to extract knowledge from data based on |
In order to find patterns among data, Statistics and other | Based on the previously known training data, one can predict the outcome using Machine learning. |
Data Mining uses both Math and programming methods but inclination toward maths is more. | Machine Learning uses Data Mining techniques to build models that mostly use programming more than maths. |
Data mining techniques are difficult to use:
Curse of Dimensionality:
The Curse of Dimensionality is the observation that is observed that problem arises when one analyses and organizes the data in high dimensional spaces. Working with data becomes more demanding with increase with increase in dimensions.
- With the increase in number of attributes, there is more chance to build easily a model to fit all the sample data but as a predictor it is useless.
- In data mining analyses, having too many attributes is problematic as one of the major activities in Data Mining concerns efficient and effective ways of selecting attributes.
- The amount of data used for Data Mining is huge and one needs to reduce the volume the data in order to meaningfully analyse the data.
Difference between Supervised and Unsupervised Data Mining:
Unsupervised Data Mining | Supervised Data Mining |
In Unsupervised Data Mining, before running the analysis, analysts do not create a model or hypothesis. | In Supervised Data Mining, before running the analysis, data miners create a model and apply statistical techniques to the data. |
Cluster analysis is a technique that uses Unsupervised Data Mining | Regression Analysis is a technique that uses Supervised Data Mining. |
Cluster Analysis:
- Cluster Analysis is a way of arranging data such that data having similar properties are grouped together in a cluster. It is also known as clustering.
Example:
- Using Cluster Analysis, one can find patients with similar diseases from medicine history and demographic data.
Regression Analysis:
Data mining analysis which processes the consequence of a set of variables on other variables is called a regression analysis...
Want to see the full answer?
Check out a sample textbook solution
Chapter 9 Solutions
Using MIS (10th Edition)
- (3e) Test-and-Set. The Test-and-Set instruction is used in hardware to achieve synchronization. It can be defined in the following way: function Test-and-Set (var target: boolean): boolean; begin Test-and-Set:= target; target := true; end; Now show how the Test-and-Set instruction can be used to protect a critical region and hence achieve mutual exclusion (do not worry about satisfying the bounded waiting condition). (8 points) repeatarrow_forwardConsider a system with three smoker processes and one agent process. Each smoker continuously rolls a cigarette and then smokes it. But to roll and smoke a cigarette, the smoker needs three ingredients: tobacco, paper and matches. One of the smoker processes has paper, another has tobacco and the third has the matches. The agent has an infinite supply of all three materials. The agent places two of the ingredients on the table. The smoker who has the remaining ingredient then makes and smokes a cigarette, signaling the agent on completion. The agent then puts out another two of the three ingredients, and the cycle repeats. Given below is a solution to the Cigarette-Smokers Problem. Give initial conditions for the semaphores as well as plausible values for the variables i & j and r & s, such that the agent and smokers are synchronized. Write a couple of sentences on why these initial conditions are necessary and sufficient. Solution: var a: array [0..2] of semaphore (initial condition =…arrow_forwardLevel-0 Diagram for this: A customer sends in an order form containing details of their order and their membership number. A check is made to verify that they are a member. When their order is verified, a check is made to validate that the items ordered are produced by the company. Next, the valid order is used to update the daily order file, and then used to create a shipping list and invoice, which are sent on to the Order Fulfilment System.arrow_forward
- In this assignment, you will use all of the graphics commands you have learned to create an animated scene. Your program should have a clear theme and tell a story. You may pick any school-appropriate theme that you like. The program must include a minimum of: 5 circles 5 polygons 5 line commands 2 for loops 1 global variable You may wish to use the standard code for simplegui graphics below: import simplegui def draw_handler(canvas): frame = simplegui.create_frame('Testing', 600, 600) frame.set_canvas_background("Black") frame.set_draw_handler(draw_handler) frame.start() Submit Your Code After you write your code here in the programming environment, you will check it and submit it as usual. However, the grader will only perform basic checks against some requirements. If your code passes, you should submit your work, and your teacher will manually grade your submitted work using a rubric.arrow_forward1. What is the difference between a relative cell reference and an absolute cell reference and give an example of when you would use each.arrow_forwardWhat is the goal of using a chart in excel, and how is a chart useful and what is the goal of using sparklines in excel, and how are sparklines useful?arrow_forward
- Prove for each pair of expression f(n) and g(n) whether f(n) is big O, little o Ω,ω or Θ of g(n). Use limits to find these. For each case it is possible that more than one of these conditions is satisfied:1. f(n) =log(n2^n), g(n) = log(sqrt(n)2^(n^2))2. f(n) =nsqrt(n) +log(n^n), g(n) =n + sqrt(n)lognarrow_forwardNeed this expression solved for mu. This can be done using a symbolic toolbox, however it needs to end up being mu = function (theta, m, L, g). If using MATLAB or something similar, run the code to make sure it works.arrow_forwardA business case scenario and asked to formulate an appropriate software design solution. Theyshould complete the case and upload the solution. will be required to read the case,identify and document the key issues, problems, and opportunities presented, and then design,and develop an appropriate integrated design solution to the problem. mustdemonstrate good spreadsheet, database, analytical, and word-processing skills whendeveloping solutions. Additionally, must be creative and demonstrate synthesising andapplying Database Management and Data Analytics Principles learned in the course. They willalso need to research some aspects of the assessment. CASE BACKGROUNDMGMT SS STATS, an umbrella body that facilitates and serves various Social SecurityOrganizations/Departments within the Caribbean territories, stoodpoised to meet the needs of its stakeholders by launching an onlinedatabase at www.SSDCI.gov. The database will provide membersand the public access to the complete set of…arrow_forward
- A business case scenario and asked to formulate an appropriate software design solution. Theyshould complete the case and upload the solution. will be required to read the case,identify and document the key issues, problems, and opportunities presented, and then design,and develop an appropriate integrated design solution to the problem. mustdemonstrate good spreadsheet, database, analytical, and word-processing skills whendeveloping solutions. Additionally, must be creative and demonstrate synthesising andapplying Database Management and Data Analytics Principles learned in the course. They willalso need to research some aspects of the assessment. CASE BACKGROUNDMGMT SS STATS, an umbrella body that facilitates and serves various Social SecurityOrganizations/Departments within the Caribbean territories, stoodpoised to meet the needs of its stakeholders by launching an onlinedatabase at www.SSDCI.gov. The database will provide membersand the public access to the complete set of…arrow_forwardUsing MATLAB symbolic toolbox, given these 3 equations, how would you solve for mu = function(theta), making sure that there are no mu's on the right hand side, making sure theta-dot-dot, theta-dot-squared- and N aren't in the final answer either.arrow_forwardAfter playing our giving implementation, your task is to implement Dinning Philosophers with semaphore in C, by including and Your implementation will require creating five philosophers, each identified by a number 0.4. Each philosopher will run as a separate thread. Create threads using Pthreads as discussed in the Lecture slides on Chapter 4 and Practice Lab on Threads. Your solution needs to accomplish the following: Implement in C (15 points) 1. dp1.c - You are to provide your solution to this assignment as a single C program named 'dp1.c using semaphore. Explain in you code (as comments) that the dead lock will happen or not. If there is a possible deadlock, you can simply solve the deadlock by pick the fork in order like the first solution in our slides. Solve Deadlock by Footman (15 points) 1. Here is a new solution to overcome the deadlock. The Dining Philosophers decide to hire a footman whose task to allow only four philosophers to sit on the table. When entering and…arrow_forward
- Fundamentals of Information SystemsComputer ScienceISBN:9781337097536Author:Ralph Stair, George ReynoldsPublisher:Cengage LearningPrinciples of Information Systems (MindTap Course...Computer ScienceISBN:9781305971776Author:Ralph Stair, George ReynoldsPublisher:Cengage LearningFundamentals of Information SystemsComputer ScienceISBN:9781305082168Author:Ralph Stair, George ReynoldsPublisher:Cengage Learning
- Principles of Information Systems (MindTap Course...Computer ScienceISBN:9781285867168Author:Ralph Stair, George ReynoldsPublisher:Cengage LearningNp Ms Office 365/Excel 2016 I NtermedComputer ScienceISBN:9781337508841Author:CareyPublisher:Cengage



