Quiz3 1- First, you will create and assign your own dataset. In this dataset, you are free to use any name of row and columns. I expect you to create 4 columns and 6 row in the dataset. For example: You can create a student dataset. In this dataset, you can give column names as student id, gender, height, average grade and department. 2- Secondly, you need to put some null values for one string value and one integer value in the dataset. Then you will show the count of null values. Instead of null values, you will create random values. When the null values are string value, you will get unique values in that column and you will choose one of them randomly. When the null value is integer, you will find average of the column and put this average value, instead of null. For example: The height of the student1 is null. You will find average of height column for all student. If the average is 163, you will change the null value as 163. If the string value is country, you will get all country as a unique and you will choose one of them randomly. You will change the null value with your choice in random country. 3- You will show the shape, head of your dataset. You will show the is there any null value in the data set. 4- You will describe and find the mean value for one of the integer column in your dataset. I want you to use pandas mean function in this step. 5- You will use group by function for one column. For example, you will use groupby for genders, and you will find the average age of each gender. Age Gender Female 42.54 Male 37.84
SQL
SQL stands for Structured Query Language, is a form of communication that uses queries structured in a specific format to store, manage & retrieve data from a relational database.
Queries
A query is a type of computer programming language that is used to retrieve data from a database. Databases are useful in a variety of ways. They enable the retrieval of records or parts of records, as well as the performance of various calculations prior to displaying the results. A search query is one type of query that many people perform several times per day. A search query is executed every time you use a search engine to find something. When you press the Enter key, the keywords are sent to the search engine, where they are processed by an algorithm that retrieves related results from the search index. Your query's results are displayed on a search engine results page, or SER.
Quiz3
1- First, you will create and assign your own dataset. In this dataset, you are free to use any name of row and columns. I expect you to create 4 columns and 6 row in the dataset. For example:
You can create a student dataset. In this dataset, you can give column names as student id, gender, height, average grade and department.
2- Secondly, you need to put some null values for one string value and one integer value in the dataset. Then you will show the count of null values. Instead of null values, you will create random values. When the null values are string value, you will get unique values in that column and you will choose one of them randomly. When the null value is integer, you will find average of the column and put this average value, instead of null. For example: The height of the student1 is null. You will find average of height column for all student. If the average is 163, you will change the null value as 163. If the string value is country, you will get all country as a unique and you will choose one of them randomly. You will change the null value with your choice in random country.
3- You will show the shape, head of your dataset. You will show the is there any null value in the data set.
4- You will describe and find the mean value for one of the integer column in your dataset. I want you to use pandas mean function in this step.
5- You will use group by function for one column. For example, you will use groupby for genders, and you will find the average age of each gender.
Age
Gender
Female 42.54
Male
37.84
Step by step
Solved in 2 steps with 1 images