
Concept explainers
Explanation of Solution
Data Mining:
Data Mining is the extraction of knowledge and data patterns from various raw data sets by examining patterns, trends and other Business Intelligence reports using intelligent methods for classification and prediction.
- Data mining techniques differ from reporting applications, as they are very sophisticated and complex, hence difficult to use.
Difference of factors for reporting and data mining:
Factors | Reporting | Data mining |
Type of objective | Assessment | Prediction |
Company | Target | Netflix |
Analysis | Simple-summing, totaling | Advance statistics |
Types | Noninteractive – RFM, Interactive - OLAP |
Cluster Regression Market basket Decision tree Others |
Artificial Intelligence (AI) is the ability of machines to perform activities that require human intelligence. In AI, machines can have vision, and can perform communication, recognition and learning. In AI, machines also have the ability to make decisions.
Benefits:
- Dealing with heavy and mundane tasks become easier with the help of machines.
- In order to gather and analyze Big Data, AI is extremely useful to improve efficiency.
- AI will potential increase cyber security and improve the security of Internet of Things (IOT).
- The accuracy of working on a thing increases a lot with AI.
- Using AI the use of digital assistants will increase which in turn will decrease the need for human resources.
Difference between Data Mining and Machine Learning:
Data Mining | Machine Learning |
Data Mining is the extraction of knowledge and data patterns from various raw data sets by examining patterns, trends and other Business Intelligence reports using intelligent methods for classification and prediction |
Machine Learning uses various data mining techniques to extract knowledge from data based on |
In order to find patterns among data, Statistics and other | Based on the previously known training data, one can predict the outcome using Machine learning. |
Data Mining uses both Math and programming methods but inclination toward maths is more. | Machine Learning uses Data Mining techniques to build models that mostly use programming more than maths. |
Data mining techniques are difficult to use:
Curse of Dimensionality:
The Curse of Dimensionality is the observation that is observed that problem arises when one analyses and organizes the data in high dimensional spaces. Working with data becomes more demanding with increase with increase in dimensions.
- With the increase in number of attributes, there is more chance to build easily a model to fit all the sample data but as a predictor it is useless.
- In data mining analyses, having too many attributes is problematic as one of the major activities in Data Mining concerns efficient and effective ways of selecting attributes.
- The amount of data used for Data Mining is huge and one needs to reduce the volume the data in order to meaningfully analyse the data.
Difference between Supervised and Unsupervised Data Mining:
Unsupervised Data Mining | Supervised Data Mining |
In Unsupervised Data Mining, before running the analysis, analysts do not create a model or hypothesis. | In Supervised Data Mining, before running the analysis, data miners create a model and apply statistical techniques to the data. |
Cluster analysis is a technique that uses Unsupervised Data Mining | Regression Analysis is a technique that uses Supervised Data Mining. |
Cluster Analysis:
- Cluster Analysis is a way of arranging data such that data having similar properties are grouped together in a cluster. It is also known as clustering.
Example:
- Using Cluster Analysis, one can find patients with similar diseases from medicine history and demographic data.
Regression Analysis:
Data mining analysis which processes the consequence of a set of variables on other variables is called a regression analysis...

Want to see the full answer?
Check out a sample textbook solution
Chapter 9 Solutions
Using MIS (10th Edition)
- also provide the number of moves(actions) made at state A and moves(actions) made state B. INCLUDE Java program required(this question is not graded)arrow_forwardYou are given a class that processes purchases for an online store. The class receives calls to: • Retrieve the prices for items from a database • Record the sold items • Update the database • Refresh the webpage a. What architectural pattern is suitable for this scenario? Illustrate your answer by drawing a model for the solution, showing the method calls/events. b. Comment on how applying this pattern will impact the modifiability of the system. c. Draw a sequence diagram for the update operation.arrow_forward2. The memory management has contiguous memory allocation, dynamic partitions, and paging. Compare the internal fragmentation and external fragmentation for these three approaches. [2 marks] 3. Suppose we have Logical address space = 24 = 16 (m = 4), Page size=2² =4 (n = 2), Physical address space = 26 = 64 (r = 6). Answer the following questions: [4 marks] 1) Total # of pages ? 2) Total # of frames ? 3) Number of bits to represent logical address? 4) Number of bits to represent offset ? 5) Number of bits to represent physical address? 6) Number of bits to represent a page number? 7) Number of bits to represent a frame number / 4. What is translation look-aside buffers (TLBS)? Why we need them to implement the page table? [2 marks] 5. Why we need shared pages for multiple processes? Give one example to show the benefits. [2 marks] 6. How to implement the virtual memory by using page out and page in? Explain with an example. [2 marks] 7. We have a reference string of referenced page…arrow_forward
- Good morning, please solve this trying to follow this criteria. (use Keil) Abstract describing the requirements and goals of the assignment. List file with no errors or warnings. Brief description of your implementation design and code. Debugging screen shots for different scenarios with your reference and comments. Conclusion (and please give me the code without errors, make sure it is working)arrow_forwardGood mrng, please solve this trying to follow this criteria. (use Keil) Abstract describing the requirements and goals of the assignment. List file with no errors or warnings. Brief description of your implementation design and code. Debugging screen shots for different scenarios with your reference and comments. Conclusion (and please give me the code without errors, make sure it is working)arrow_forward#include <stdio.h> #include <stdlib.h> #include <unistd.h> int global_var = 42; // int* function(int *a) { int local_var = 10; // *a = *a + local_var; int *local_pointer = (int *)malloc (size of (int) * 64); // Allocated array with 64 integers return local_pinter; } int main() { int local_main[1024*1024*1024*1024*1024] = {0}; // initialize an array and set all items as 0 int *heap_var = (int *)malloc(size of(int) * 16); // Allocated array with 16 integers *heap_var = 100; function(heap_var); printf(“the value is %d\n”, *heap_var); free(heap_var); // release the memory return 0; } 1) draw the memory layout of the created process, which should include text, data, heap and stack [2 marks]. 2) Indicate which section the variables are allocated [2 marks]: global_var local_var, local_pointer local_main heap_var, *heap_var (the data that it points to) 3) Does this code have memory leaking (heap memory is not released)? [2 marks] 4) The…arrow_forward
- 8. List three HDD scheduling algorithms. [2 marks] 9. True or False? The NVM has the same scheduling algorithms with HDD. Explain why? [2 marks] 10. Why the modern mouses use polling to detect movements instead of interrupts? [2 marks] 11. What is thrashing? How does it happen? [2 marks] 12. Given a reference string of page numbers 7, 0, 1, 2, 0, 3, 0, 4, 2, 3, 0, 3, 0, 3, 2, 1, 2, 0, 1, 7, 0, 1 and 4 frames show how the page replacement algorithms work, and how many page faults? [6 marks], 1) FIFO algorithm? [2 marks] 2) Optimal algorithm? [2 marks] 3) LRU algorithm? [2 marks] 13. List at least three file systems that you know. [2 marks] 14. In C programming, how the seek to a specific position in the file by offset? [2 marks]arrow_forwardA Personal Address Book application program allows the user to add, delete, search, save, and load contact information. The program separates the user interface (command-line) and the internal processing subsystem. The internal processing system consists of the following classes: • • Contact Manager (responsible for add and delete operations) Contact Finder (responsible for the search operation) pataManager (responsible for save and load operations) a. What design pattern can be used to implement the user interface? Explain your answer using a class diagram for the entire system. b. Draw a UML sequence diagram to show the behavioral view of the Personal Address Book program, demonstrating what happens when a user enters new contact information.arrow_forwardA system comprises three components: A, B, and C. Calling A requires calling B, and calling B requires calling A. Component C is responsible for dissimilar tasks T#1, #2, and T#3. a. Comment on the modifiability of this system. What problems do you see in this system, and how would you solve them? b. Suppose that T#1 is performed by both component A and C. What does this indicate about A and C? How would you solve this problem?arrow_forward
- Please answer questions from number 1 to 3 if these questions in the image provided below(NOTE: THESE QUESTIONS ARE NOT GRADED!)arrow_forwardIn a client-server system for a hypothetical banking application, the clients are ATM machines, and the server is the bank server. Suppose the client has two operations: withdraw and deposit. Use the broker architectural pattern to document an initial architecture design for this system: a. Draw a class diagram for this system. Use a client-side proxy to encrypt the data using an encrypt operation and a server-side proxy to decrypt the data. b. Discuss how you plan to apply redundancy tactics on the server. Additionally, identify the quality attribute that this tactic will achieve and discuss any potential side effects of applying it.arrow_forwarda. Comment on how you would achieve higher performance for a hypothetical Trent Course Registration System, assuming it utilizes a client-server architecture. b. Suppose we want greater availability of the server. Discuss what kind of tactics should be used to achieve that.arrow_forward
- Fundamentals of Information SystemsComputer ScienceISBN:9781337097536Author:Ralph Stair, George ReynoldsPublisher:Cengage LearningPrinciples of Information Systems (MindTap Course...Computer ScienceISBN:9781305971776Author:Ralph Stair, George ReynoldsPublisher:Cengage LearningFundamentals of Information SystemsComputer ScienceISBN:9781305082168Author:Ralph Stair, George ReynoldsPublisher:Cengage Learning
- Principles of Information Systems (MindTap Course...Computer ScienceISBN:9781285867168Author:Ralph Stair, George ReynoldsPublisher:Cengage LearningNp Ms Office 365/Excel 2016 I NtermedComputer ScienceISBN:9781337508841Author:CareyPublisher:Cengage



