Option #1
School: National University College
Course: 460
Subject: Information Systems
Date: Jan 9, 2024
Type: docx
Pages: 4
Uploaded by praveshfiji
Name: Pravesh Charan
MIS450
Grid-Based Clustering with Uniform Density
First, carefully read Section 8.3 in Introduction to Data Mining. In this assignment, you will write SAS code to examine the grid-based clustering algorithm. This algorithm is given on page 645 in Introduction to Data Mining, and it is also provided below.
Algorithm 8.4 Basic grid-based clustering algorithm.
1. Define a set of grid cells.
2. Assign objects to the appropriate cells and compute the density of each cell.
3. Eliminate cells having a density below a specified threshold.
4. Form clusters from contiguous (adjacent) groups of dense cells.
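The four steps above can be sketched in a few lines. This is only an illustration, written in Python rather than SAS, and it rests on several assumptions that are not in the textbook statement: square cells of side `cell_size`, density measured as a raw point count, and "contiguous" taken to mean the 8 surrounding cells (diagonals included). The function name `grid_cluster` and the toy points are hypothetical.

```python
from collections import Counter, deque
from math import floor


def grid_cluster(points, cell_size=1.0, threshold=1):
    """Sketch of Algorithm 8.4 for 2-D points given as (x, y) tuples."""
    # Steps 1-2: map each point to a square cell and count points per cell.
    density = Counter(
        (floor(x / cell_size), floor(y / cell_size)) for x, y in points
    )
    # Step 3: keep only cells whose count reaches the threshold.
    dense = {cell for cell, count in density.items() if count >= threshold}
    # Step 4: group dense cells that touch (8-neighbor adjacency, an assumption).
    clusters, seen = [], set()
    for start in sorted(dense):
        if start in seen:
            continue
        seen.add(start)
        queue, cluster = deque([start]), set()
        while queue:
            cx, cy = queue.popleft()
            cluster.add((cx, cy))
            for dx in (-1, 0, 1):
                for dy in (-1, 0, 1):
                    neighbor = (cx + dx, cy + dy)
                    if neighbor in dense and neighbor not in seen:
                        seen.add(neighbor)
                        queue.append(neighbor)
        clusters.append(cluster)
    return clusters


# Toy input (made up for illustration, not the assignment data).
toy_points = [(0.2, 0.3), (0.7, 0.9), (1.1, 0.5), (1.6, 0.4),
              (5.2, 5.1), (5.8, 5.5), (3.5, 3.5)]
toy_clusters = grid_cluster(toy_points, cell_size=1.0, threshold=2)
print(toy_clusters)
```

With a threshold of 2, the lone point in cell (3, 3) is dropped, the two neighboring cells (0, 0) and (1, 0) merge into one cluster, and cell (5, 5) forms a second cluster on its own.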
Complete the following tasks:
1. Upload the dataset file simulation-uniform.csv from Module 6 to SAS Studio. The first 200 observations in the dataset were generated from a uniform distribution over a circle centered at (2,3) of radius 2, and the next 100 observations were generated from a uniform distribution over a circle centered at (6,3) of radius 1.
The data was uploaded to SAS Studio.
2. By writing appropriate SAS code, produce a scatterplot of the data. Your plot should look similar to the one shown in Figure 8.10 (page 646) in Introduction to Data Mining.
3. By writing appropriate SAS code, generate the point counts for grid cells. It should have 49 cells using a 7-by-7 grid. Refer to Table 8.2 (page 646) in Introduction to Data Mining.
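To make the data description in task 1 and the grid counts in task 3 concrete, here is a rough sketch in Python (not the SAS code the assignment asks for). It simulates points with the stated shape instead of reading simulation-uniform.csv, so its counts will not match the actual file or Table 8.2. The helper names `sample_disk` and `grid_counts`, the seed, and the choice of unit cells with edges at the integers 0 through 7 are all assumptions made for illustration.

```python
import math
import random


def sample_disk(rng, cx, cy, radius, n):
    """Draw n points uniformly from a disk (sqrt keeps the area density even)."""
    points = []
    for _ in range(n):
        r = radius * math.sqrt(rng.random())
        theta = 2.0 * math.pi * rng.random()
        points.append((cx + r * math.cos(theta), cy + r * math.sin(theta)))
    return points


def grid_counts(points, n_cells=7, cell_size=1.0):
    """Count points per cell; counts[row][col], row indexes y, col indexes x."""
    counts = [[0] * n_cells for _ in range(n_cells)]
    for x, y in points:
        # Clamp so a point on the outer edge still lands in the last cell.
        col = min(n_cells - 1, max(0, math.floor(x / cell_size)))
        row = min(n_cells - 1, max(0, math.floor(y / cell_size)))
        counts[row][col] += 1
    return counts


rng = random.Random(0)  # arbitrary seed, only for repeatability
points = sample_disk(rng, 2, 3, 2, 200) + sample_disk(rng, 6, 3, 1, 100)
counts = grid_counts(points)

# Print with the largest y at the top, like reading a plot.
for row in reversed(counts):
    print(" ".join(f"{c:3d}" for c in row))
```

The printed table has 49 entries that sum to 300. Cells covering the larger circle hold fewer points each than cells covering the smaller one, because 200 points are spread over four times the area that holds the other 100.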
For each part, take screenshots of the SAS code and SAS output and paste them into a Word document.
The screenshots are pasted above.
Include all relevant calculations and your answers to all assignment items and submit the document to Canvas for grading. Clearly label all elements in your submission.
The maximum X value was close to 7 and the maximum Y value was close to 5, so a step of one unit per cell worked well for building the 7-by-7 grid.
In addition, provide a short description of any challenge(s) you faced during this assignment.
None of our text or materials gives any clue other than "The point counts for grid cells can be calculated using a conditional statement such as an IF-THEN statement." This was very confusing.
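One possible reading of that hint is that each coordinate is assigned to a grid column or row by a chain of conditions, one per cell boundary. The sketch below shows the idea in Python rather than SAS; the function name `bin_with_conditions` and the boundaries at 1, 2, ..., 6 are assumptions for illustration. In SAS, the same chain would be written with IF ... THEN ... ELSE IF statements in a DATA step.

```python
import math


def bin_with_conditions(value):
    """Map a coordinate to a bin index 0..6 using explicit conditions."""
    if value < 1:
        return 0
    elif value < 2:
        return 1
    elif value < 3:
        return 2
    elif value < 4:
        return 3
    elif value < 5:
        return 4
    elif value < 6:
        return 5
    else:
        return 6


# For coordinates in [0, 7) the chain gives the same index as rounding down.
samples = [0.0, 0.4, 1.0, 2.75, 3.99, 4.0, 5.5, 6.9]
for v in samples:
    print(v, bin_with_conditions(v), math.floor(v))
```

Applying the chain once to X and once to Y gives a (column, row) pair for every observation, and counting the observations per pair yields the 49 cell counts.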