For the dataset adultsData.csv (in the assignment folder), analyze the data to answer the following questions. Follow the data analysis process and highlight the different phases you go through in your analysis. Incorporate visualisation in your analysis and add comments that summarize your findings. Then, upload your final program as a single Jupyter Notebook file.
SQL
SQL, which stands for Structured Query Language, is a language for storing, managing, and retrieving data in a relational database by means of queries written in a specific format.
Queries
A query is a request, written in a query language such as SQL, to retrieve data from a database. Databases are useful in a variety of ways. They enable the retrieval of records or parts of records, as well as the performance of various calculations prior to displaying the results. A search query is one type of query that many people perform several times per day. A search query is executed every time you use a search engine to find something. When you press the Enter key, the keywords are sent to the search engine, where they are processed by an algorithm that retrieves related results from the search index. Your query's results are displayed on a search engine results page, or SERP.
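As a minimal sketch of the two uses mentioned above (retrieving parts of records and calculating before displaying results), the snippet below runs SQL queries through Python's built-in sqlite3 module. The table name `adults`, its columns, and the three rows are made up for illustration and are not taken from adultsData.csv.

```python
import sqlite3

# In-memory database with a hypothetical "adults" table (illustrative names and rows).
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE adults (age INTEGER, sex TEXT, country TEXT)")
conn.executemany(
    "INSERT INTO adults VALUES (?, ?, ?)",
    [(39, "Male", "Canada"), (28, "Female", "Germany"), (44, "Female", "Canada")],
)

# Retrieve parts of records: only the age column, only for Canadian rows.
canadian_ages = [
    row[0]
    for row in conn.execute(
        "SELECT age FROM adults WHERE country = ? ORDER BY age", ("Canada",)
    )
]

# Perform a calculation before returning results: average age per sex.
avg_age_by_sex = dict(conn.execute("SELECT sex, AVG(age) FROM adults GROUP BY sex"))

conn.close()
print(canadian_ages)
print(avg_age_by_sex)
```

The `?` placeholders pass values separately from the SQL text, which is the usual way to avoid building queries by string concatenation.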
adultsData.csv file: https://onq.queensu.ca/d2l/common/viewFile.d2lfile/
1. How many men and women are represented in this dataset?
2. What is the average age of women?
3. What is the percentage of German citizens and Canadian citizens?
4. What are the mean and standard deviation of the age for those who earn more than 50K per year and those who earn less than 50K per year?
5. What is the education level of people who earn more than 50K? Is it true that they have at least high school education?
6. Display age statistics for each race and each gender. Use groupby() and describe(). Find the maximum age of men and women in each race group.
7. Among whom is the proportion of those who earn more than 50K greater: married or single people? Consider as married those who have a marital-status starting with Married (e.g., Married-civ-spouse, Married-spouse-absent, Married-AF-spouse, etc.).
8. What is the maximum number of hours a person works per week? How many people work such a number of hours, and what is the percentage of those who earn more than 50K among them?
9. Compute the average working time (hours-per-week) for those who earn less than 50K and more than 50K for each country. Compare these averages for Japan and Canada.
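A few of these questions can be sketched with pandas as follows. This is only an illustration under stated assumptions: the small DataFrame stands in for adultsData.csv, and the column names `age`, `sex`, `race`, and `salary`, as well as the labels `Female` and `>50K`, are guesses that may not match the real file (only `marital-status` and `hours-per-week` appear in the assignment text).

```python
import pandas as pd

# Tiny stand-in for adultsData.csv; real column names and labels may differ.
df = pd.DataFrame({
    "age": [39, 50, 28, 44, 30, 60],
    "sex": ["Male", "Male", "Female", "Female", "Female", "Male"],
    "race": ["White", "Black", "White", "Black", "White", "White"],
    "marital-status": [
        "Never-married", "Married-civ-spouse", "Married-civ-spouse",
        "Divorced", "Married-spouse-absent", "Married-AF-spouse",
    ],
    "hours-per-week": [40, 60, 40, 35, 60, 20],
    "salary": ["<=50K", ">50K", ">50K", "<=50K", "<=50K", ">50K"],
})

# Question 1: number of men and women.
gender_counts = df["sex"].value_counts()

# Question 2: average age of women.
mean_age_women = df.loc[df["sex"] == "Female", "age"].mean()

# Question 6: age statistics per race and gender, then the maximum age per group.
age_stats = df.groupby(["race", "sex"])["age"].describe()
max_age = age_stats["max"]

# Question 7: share of >50K earners among married vs. single people.
is_married = df["marital-status"].str.startswith("Married")
status = is_married.map({True: "married", False: "single"})
high_earner = df["salary"] == ">50K"
share_by_status = high_earner.groupby(status).mean()

# Question 8: maximum weekly hours, how many work that much, and the >50K share among them.
max_hours = df["hours-per-week"].max()
longest = df[df["hours-per-week"] == max_hours]
pct_high_among_longest = (longest["salary"] == ">50K").mean() * 100

print(gender_counts, mean_age_women, max_age, share_by_status, sep="\n\n")
print(max_hours, len(longest), pct_high_among_longest)
```

With the real file, the DataFrame would instead come from `pd.read_csv("adultsData.csv")`, and the column names and labels would need to be checked against `df.columns` and `df["salary"].unique()` before reusing any of these expressions.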