SaiAlekhya_CIS602_LabHomework1

docx

School

University of Massachusetts, Boston *

*We aren’t endorsed by this school

Course

200

Subject

Computer Science

Date

Dec 6, 2023

Type

docx

Pages

14

Uploaded by HighnessQuail3665

Report
CIS 602 Big Data Analytics Lab Homework 1 Accessing and Analyzing data by using Amazon S3 Sai Alekhya Ravi
Task 1: Creating a CloudFormation template and stack To create a CloudFormation template, we need to create a Cloud9 Instance. Environment type: New EC2 instance Instance type: t2.micro (1 Gib RAM + 1 vCPU) Network settings: Secure Shell (SSH) After creating a Cloud9 Instance, open IDE. Create a new file, paste the code for CloudFormation and save the file. Now we have created a S3 bucket with blocked public access. To validate the template, run a command in the terminal. The output shown in the screenshot below proves that the template is valid.
Creating a stack. To verify the stack is created.
Your preview ends here
Eager to read complete document? Join bartleby learn and gain access to the full version
  • Access to all documents
  • Unlimited textbook solutions
  • 24/7 expert homework help
To delete the stack. Confirm the stack is deleted.
Stack status: DELETE_COMPLETE
Task 2: Uploading a sample data to an S3 bucket In this task, I downloaded a dataset file. After downloading the file, I unzipped the file.
Your preview ends here
Eager to read complete document? Join bartleby learn and gain access to the full version
  • Access to all documents
  • Unlimited textbook solutions
  • 24/7 expert homework help
To view the contents of the dataset file. After copying dataset file into S3 bucket.
To confirm file is added to the bucket.
Task 3: Querying the data Using S3 to run SQL query on the uploaded data. The query results the columns of the first five data. To display the dataset in tabular format select the formatted tab.
Your preview ends here
Eager to read complete document? Join bartleby learn and gain access to the full version
  • Access to all documents
  • Unlimited textbook solutions
  • 24/7 expert homework help
After changing the query to see only the first names.
Task 4: Modifying an object’s encryption properties and storage type Encrypting and modifying the storage class data object in S3. In the S3 bucket, edit the storage clss by choosinf the intelligent tiering and save the changes.
Task 5: Compressing and querying the dataset Compressing the file to ZIP format. Compressing the file to GZIP format. Listing the objects in directory.
Your preview ends here
Eager to read complete document? Join bartleby learn and gain access to the full version
  • Access to all documents
  • Unlimited textbook solutions
  • 24/7 expert homework help
Uploading the GZIP format file to the S3 bucket.
Using the S3 select to query the compressed file. Conclusion While preforming this guide lab, I have faced few problems related to task 6. The link Paulo was no where to be found in the IAM users and I wasn’t able to find any DataScienceGroup link to perform the managing and testing restricted access for a team member task. Other than that, I have done the previous tasks with ease and was able to understand the reason behind each step that has been performed. Although I have faced a problem during the last task, the previous task were very informative and reasonable and can be done with ease in the following lab sessions.