Checkpoint 9

pdf

School

Purdue University *

*We aren’t endorsed by this school

Course

301

Subject

Statistics

Date

Jan 9, 2024

Type

pdf

Pages

3

Uploaded by UltraRoseMonkey31

Report
Letter to myself Hi, Rafael This is past you from stats 301. I hope that at this point you are a cybersecurity intern at any company. I am writing to you because I want to remind you of the importance of statistics and what can be learned from them. While everything that I say might not help, it is important for you to understand the lessons and learn from the mistakes that were made in this class. Firstly, I want to remind you of all the statistical learnings and concepts that we learned through that data project we completed. I understand that we will not be using all the knowledge that we learned but I want to at least remind you of the different surveys that can be used and when to use them and remember some of the different types of tests that we used. The different types of random samples that I hope you keep track of are simple random, stratified random, clustered, longitudinal, and a census. Simple random samples are where every individual in a population has an equal chance of being selected, and this provides an unbiased representation of the entire population. Stratified random samples are where the population is split into groups and random samples are taken from each group, this ensures representation from subgroups in the population. Clustered samples are where you divide the population into groups, randomly select a couple, and then survey all groups in each cluster. This is efficient when its not practical to sample individuals directly. Longitudinal samples are where you survey the same individuals over a period, this is to track the change over time. Censuses are where you survey the entire population, this is inefficient but makes sure everyone is heard and gets rid of the chance that random samples have of being wrong. The tests you should remember are anova, two-way anova, chi- square, linear regression, Hypothesis tests, and the Z T and F tests. Both anova and two-way anova are used to analyze the differences in a group, but two way extends the test to study the influence of two categorical variables and asses the interactions between the two variables. Chi-square is used for categorical data and to determine if there is an association between two variables. Linear regression and the F test are used to model a line after data, but the F test finds out how good of a fit the line is. Hypothesis tests are procedures used to make inferences from sample data and help evaluate if observed differences are likely to impact the population. Z-tests and T-tests are both used in hypothesis testing, Z-tests are used when there is a large sample size and you know the population standard deviation, on the other hand, T-Tests are used with small sample sizes or when the population standard deviation is unknown. As a cybersecurity intern, the types of surveys and when to use them are particularly important when you will be checking on security risks that may happen. You need to understand when you can use simple random versus stratified random versus longitudinal, and which ones are the most technically and economically feasible. One major connection between this course and my goals of becoming a cybersecurity expert is being able to use statistical analysis for predictive modeling for threat detection and analyzing what threats people are most likely to fall for. Predictive modeling could help with anticipating potential security threats and being able to analyze if people are falling for different threats, phishing scams for example, to prioritize what to deal with. One significant challenge that I faced in the states was understanding how to use the statistical software (spss). I had problems with this because of all the different options available. This software
looks sort of old and has a lot of different options to be lost in. It took me forever to find out how to manipulate the software to perform the different types of tests that I needed and return the right information. I solved this by asking around for help, both my friends and the TAs in the class helped me understand what to do. This probably has no meaning but what I’m saying is make sure that we have learned that when we need help most of the time just asking will at least help with it. My advice to you is to make sure to know when and how to use surveys, make sure to communicate better, and get better with time management. I think that understanding surveys will legitimately help with your job. You already know that you do not communicate well, but you need to work on it, if you did then this class would have been much better. Time management has also been an issue, but it is one of the most important things that we both will probably still have to work on. Part 2 Jesse Dylan Elena Fraya Strength He worked hard, completed what ever was asked of him and helped others when needed. He stepped up to be the leader of the group, helped make sure everyone knew what they had to do and helped tie everything together She understood spss and how to use it very well, was very good at writing reports, and helped everyone whenever they needed help with it. She made sure that everyone was on schedule and helped complete whatever anyone had problems with Weakness He showed up late to meetings occasionally when we needed him. He could learn how to understand software, just like me it took him longer to understand spss than the others and he could grow on that. She could have been more communicative when she needed help She was a little too much of a perfectionist and while it helped us get good grades, it also impaired our decision making a little. Score 4/5 Amazing quality work, contribution, communication, but could have had better attendance 5/5 Amazing quality work, contribution, communication, and showed up on time and to every meeting 5/5 Amazing quality work, contribution, communication, and showed up on time and to every meeting 5/5 Amazing quality work, contribution, communication, and showed up on time and to every meeting
Part 3 Aspects of the data project that were effective. - This project helped us understand the different tests much better than just the labs. - It also taught us how to draw conclusions from all the data and tests that we did. - The timeline for all the due dates was very good, it gave us enough time to submit quality work while also keeping us from saving it all till the last minute. - The examples that were given for each were very helpful on how to format everything and what our checkpoints should look like. Opportunities for improvement - I wish that there was some application of surveying people ourselves. I understand that it saved time on both parts and gave more data than we would have gathered on our own, but for real world applications I think that learning how to survey ourselves is important. - I know this doesn’t have to do with the project but while the spss manual was helpful, there were some instances in the data project that I did not know how to query what I was looking for. We did just ask the TAs for help, but I feel like it would be more efficient to just go into more detail on the spss manual. Real world relevance - This project is going to help us be able to make inferences from data in the future. We will be able to make our own questions on what to find out, figure out what tests we might need, and conduct them ourselves. We also now can understand that just because a couple of studies are done about something doesn’t mean that they are right because of something like lurking variables, bias, or imbalance in representation of groups in a population. Overall rating - 4.5/5 o I have no complaints, it was a good project but I feel like incorporating surveying into it might be a nice touch.
Your preview ends here
Eager to read complete document? Join bartleby learn and gain access to the full version
  • Access to all documents
  • Unlimited textbook solutions
  • 24/7 expert homework help