Bonus_project_Winter23(1)

docx

School

Seneca College *

*We aren’t endorsed by this school

Course

130

Subject

Statistics

Date

Feb 20, 2024

Type

docx

Pages

5

Uploaded by SuperEnergyTarsier27

Report
Academic Year 2022 – 2023 Semester Fall Winter Summer Course Code - Name BAN130 Instructor Assessment Projects Due Date Sunday, April 09, 2023 Student ID Student Name Role Group Lead Member Member Member
Bonus Project This project is completely optional. You are required to choose a project from the list of the projects specified in this document and complete it by yourself. The project is expected to test your technical skills in Python programming. The detailed requirements for each project are available in this document, so please go through the details and fulfil all the requirements to avoid missing any marks. Finally, follow the below mentioned instructions carefully. Instructions: In order to obtain maximum marks in this assessment, please ensure the followings: Don’t forget to write your name and ID on the first page of this document. Submit the project by writing your solution in this document under the Solution heading below. Do not use a separate document. Everything related to the project should be included in this document, e.g., code, screenshots and etc. This project has a weightage of 10% marks of the course as a bonus. The project deadline is April 9, 2023 . Submissions after the deadline will not be accepted. Rubric: Your assessment will be graded based on the following rubric: Excellent (7 - 10) Average (4 – 6.9) Poor (<4) Project Completion and Code (10) The project was completed without any errors and output is as expected. Fulfills all/most of the requirements for the project. The project was completed with few errors. Fulfills some of the requirements for the project. The project is incomplete. Does not fulfill all/most of the requirements. Detailed Explanation (10) The student has a good contribution to the project. Knows ins and outs of the project. The student has written his/her part of the project very well. Knows everything / most of his/her part. The student has average contribution to the project. Does not know the whole project. The student has averagely written his/her part of the project. Knows few of the things about his/her part. The student has no contribution to the project. Does not know anything / most about the project. The student has poorly written the project. Does not know much about the project. Report Student has Student has Student has not Page | 1
(10) contributed well in preparing the project report and knows all the aspects of the report. contributed partially in preparing the project report and knows some aspects of the report. contributed in preparing the report. Project 1 Project Name: Adventure Works Product Sales Analysis Dataset: AdventureWorks.xlsx (Available on Blackboard) Requirements: Below are bare minimum requirements for this project, however, you are free to add more features to your project: (hint: pandas library) 1. Data Import This phase requires you to import the data from the provided excel file into Python. Product sheet in excel file should be imported as Product dataset. SalesOrderDetail sheet in excel file should be imported as SalesOrderDetail dataset. 2. Data Cleaning This phase requires you to clean your data before data analysis phase. Product_Clean: Create a Product_Clean dataset from Product dataset by bringing in only ProductID, Name, ProductNumber, Color and ListPrice All the missing values in Color column should be replaced by ‘NA’ ListPrice column should be float (final column name should be ListPrice) with 2 decimal places SalesOrderDetail_Clean: Create SalesOrderDetail_Clean dataset from SalesOrderDetail dataset by bringing in only SalesOrderID SalesOrderDetailID OrderQty ProductID UnitPrice LineTotal and ModifiedDate ModifiedDate should be date with column name ModifiedDate UnitPrice should be float with column name UnitPrice LineTotal should be float with column name LineTotal OrderQty should be integer with column name OrderQty Include date for year 2013 and 2014 in ModifiedDate only ModifiedDate should be date format UnitPrice and LineTotal are float with 2 decimal places Page | 2
Your preview ends here
Eager to read complete document? Join bartleby learn and gain access to the full version
  • Access to all documents
  • Unlimited textbook solutions
  • 24/7 expert homework help
Project 2 Project Name: Adventure Works Territory Sales Analysis Dataset: AdventureWorks.xlsx (Available on Blackboard) Requirements: Below are bare minimum requirements for this project, however, you are free to add more features to your project: (hint: pandas library) 1. Data Import This phase requires you to import the data from the provided excel file into Python. SalesTerritory sheet in excel file should be imported as SalesTerritory dataset. SalesOrderHeader sheet in excel file should be imported as SalesOrderHeader dataset. 2. Data Cleaning This phase requires you to clean your data before data analysis phase. SalesOrderHeader_Clean: Create a SalesOrderHeader_Clean dataset from SalesOrderHeader dataset by bringing in only SalesOrderID OrderDate OnlineOrderFlag TerritoryID TotalDue TotalDue column should be float (final column name should be TotalDue) with 2 decimal places OnlineOrderFlag column should be integer (final column name should be OnlineOrderFlag) OrderDate column should be date (final column name should be OrderDate). TerritoryID column should be integer (final column name should be TerritoryID) No un-necessary columns should be part of the SalesOrderHeader_Clean dataset. Territory_Clean: Create Territory_Clean dataset from SalesTerritory dataset by bringing in only TerritoryID Name CountryRegionCode Group SalesYTD SalesYTD column should be floar (final column name should be SalesYTD) with 2 decimal places Page | 3
TerritoryID column should be integer (final column name should be TerritoryID) Page | 4