![Research Methods for the Behavioral Sciences (MindTap Course List)](https://www.bartleby.com/isbn_cover_images/9781337613316/9781337613316_largeCoverImage.gif)
Concept explainers
In addition to the key words, you should also be able to define each of the following terms:
frequency distribution
histogram
bar graph
degrees of freedom, df
line graph
Pearson
Spearman correlation
regression equation
slope constant
Y- intercept
multiple-regression equation
null hypothesis
effect size
Cohen’s d
percentage of variance accounted for r2 or n2
ratio scale
interval scale
ordinal scale
nominal scale
chi-square test for independence
independent-measures t-test
repeated-measures t-test
single-factor analysis of variance or one-way ANOVA
two-factor analysis of variance or two-way ANOVA
split-half reliability
Spearman-Brown formula
Kuder-Richardson formula 20, or K-R 20
Cronbach’s alpha
Cohen’s kappa
![Check Mark](/static/check-mark.png)
To define:
The terms given in the question
Explanation of Solution
Frequency distribution:
it is a table where all the data points are mentioned as x and their corresponding frequencies are also provided. It actually summarizes the complete dataset and presents it is a more easily understandable and various statistical operations can be performed on this data.
Histogram:
It is a chart generally used for continuous datasets. These datasets are converted to class intervals, and then the number of data points in each interval is termed as its frequency. In the histogram the class intervals come on the horizontal axis, and the frequencies are labeled on the vertical axis. Each class interval is represented by a bar on the graph and the height of the bar represents the frequency.
Polygon:
Once the histogram is formed, the midpoints of all the bars are connected by straight line. It results in a frequency polygon. A frequency polygon is a very useful tool to depict about the distribution of the data set. If the polygon is more of an inverted U shape this implies that the data is normally distributed.
Bar graph:
This chart is specifically used for representing the categorical data set. Now suppose we got the information about boys and girls. There are 57 boys and 43 girls. So each category is presented on the x axis, and its frequency is presented on the Y axis. Like this bars on the graph are formed for each category. These bars are formed at a gap from each other.
Degree of freedom:
this term is used in Statistics to define the number of independent values that one is free to assign to a distribution. For example if the independent variable is temperature. This implies you are free to choose any value of temperature for the experiment. This number of values that can be selected here is called degree of freedom.
Line graph:
This is chart tool is used to represent the data that is generally dependent on time. In this only a line is displayed for a particular variable on the graph. The x axis represents the time and the Y axis represents the variable under study. Generally stock prices are studies with help of line chart.
Scatter plot:
This chart is used when there are two variables under the study x and y. The values of y and x are plotted on the chart, with x value on the horizontal line and the y value on the vertical line. Generally x variable is the one that is independent and Y variable denotes the dependent variable. The pattern of the scatter plot tells us about the relation of the two variables.
Pearson correlation:
it is a statistical tool to find out the magnitude and the direction of relation between two variables. It helps us find out if there exists any linear relation between the variables under study.
Spearman correlation:
it is a statistical tool to find out the magnitude and the direction of relation between two ranked variables. It helps to find out if there exists any relation between the two categorical variables under study.
Regression equation:
An equation that defines the relation of the dependent and independent variable is called regression equation. It consists of two constant values (a and b) they are called regression parameters.
Slope constant:
The regression parameter b that multiplies to the independent variable in the regression equation is called the slope constant. It lets us know how the dependent variable changes with one unit change in the independent variable.
Y-intercept:
This is the value of regression parameter that is a constant value denoted by a. This value tells us what is the minimum value of the dependent variable when the independent variable has a value equal to Zero.
Multiple regression equation:
When there are more independent variables that influence one dependent variable we get an equation that relates all the independent variables to that dependent variable. This is called multiple regression equation.
Null hypothesis:
In the process of hypothesis testing the main aim of the researcher is to test the claim he has about the research variables. When the hypothesis testing process is started, it starts with an assumption totally opposite to the claim of the researcher. This assumption with which the hypothesis testing process is started is called Null Hypothesis.
Effect size:
it is a method that simply tells us about difference between any two groups. It actually quantifies that difference in a standard manner. It is an effective statistical tool because it is not influenced by sample size.
Cohen's d:
this is a special method of calculating effect size. This method is employed to know the difference between two mean values. When specifically the difference between two mean values is to be found Cohen's d is the ideal measure to provide the effect size.
Percentage of variance accounted for:
In the correlation analysis, the output provides with a value R square. This value of R square is called coefficient of determination. It lets us know the percent of the variation explained by the regression model. Thus if we get a R square value of say 40%, this would imply that the regression model is able to explain on 40% of the variation in the response variable. Higher the value of R square the better the regression model is considered to be.
Ratio scale:
in this scale the variables are quantitative in nature, with fixed zero, which makes comparison possible and it helps understand the variables in terms of ratio and proportion as well.
Interval scale:
In this scale the variables are quantitative in nature. It helps in comparing the two values. But they cannot be expressed as ratios or there is no fixed zero for these variables.
Ordinal scale:
in this scale the variables are non-quantitative, but they are comparable. Thus it is easier to arrange the variables in a specific order or rank them, thus making comparison possible.
Nominal scale:
in this scale the variables are non-quantitative. They are either labeled or numbered just to create different categories. These categories can neither be compared nor ranked. They just identify different groups.
Chi-square test of independence:
The main purpose of this test is to verify if there exists any association between the dependent and the independent variable. Purpose-wise it is similar to the correlation test, the only major difference is that for this test both the variables must be categorical in nature. For example: If the research question is: does the maturity level of students get influenced by their gender. In this case gender is the independent variable, and it is categorical in nature (male and female). Maturity level is the dependent variable, and again it is categorical in nature.
Independent measures T test:
This test is specifically applied only when there are two independent samples. The aim of this test is to compare the means of the two groups and decide upon if the two groups have different mean values or not.
Repeated measures T test:
In a 2 sample T test, at times there is a situation when the same group of sampling units are measured twice For example: if a set of people are sample before the training and then they are measured after the training to note the effect of the training, in this scenario the same sample is measured twice. In this case a Repeated measures T test is used.
One way ANOVA:
There are multiple groups, the aim of this test it to compare the mean value of all the groups. This test applies when there are 2 or more groups and all the groups relate to same variable. In this case the independent variable is categorical and dependent variable is continuous. Example: Suppose there are different colleges, and we compare the performance of students from each college. Thus the independent variable here is "college" it is categorical in nature. The dependent variable is "performance scores". It is continuous in nature.
Two way ANOVA:
When the multiple groups are compared on the basis of two different factors, then the Two Way Anova test is used. Again in this scenario the two independent variables and one dependent variable all are continuous in nature. For example: we can consider a scenario when the dependent variable is salary, and the independent variables can be age of the employee and experience in form of number of years. All these factors are continuous in nature, and since there are two independent factors, this calls for a Two Way Anova test.
Split half reliability:
in this method the sample is split in two equal parts to test the same knowledge. This helps us get two sets of items. It is expected that the statistical values from both the sets comes out to be same. This ensures that there is reliability in the measurement system. This method of testing the reliability of measurement system is called split half reliability.
Spearman Brown formula:
it is a specific formula related to psychometric reliability. This formula is used to tell how reliable a test is after changing the test length. It helps us understand how the relation between test length and its reliability is not a linear one.
Kuder-Richardson formula 20:
this formula applies only to the situations that have only two choices or are dichotomous choices. This formula helps check that the measurement are in tune and almost similar every time. It lets us know about the consistency of the measurement.
Cronbach's alpha:
Again, this method is designed to let us know if the measurements are consistent or not. It is a coefficient of consistency or as we consider it as reliability. It tells us if the measurements can be relied upon based on how the measurements are obtained for similar situations. If there is consensus in the measurements this implies there is good consistency, else we cannot rely on the measurements obtained.
Cohen' kappa:
It is a tool that helps the research measure the inter-rater agreement. It relates to qualitative dataset only. This measure is considered to be more powerful because it also considers the likelihood of agreement of the raters just by chance. It is a simpler tool to understand conceptually. This measure deals with only two raters. The problem that Cohen's Kappa addresses is the issue of "inter-rater reliability"
Want to see more full solutions like this?
Chapter 15 Solutions
Research Methods for the Behavioral Sciences (MindTap Course List)
- Suppose a random sample of 459 married couples found that 307 had two or more personality preferences in common. In another random sample of 471 married couples, it was found that only 31 had no preferences in common. Let p1 be the population proportion of all married couples who have two or more personality preferences in common. Let p2 be the population proportion of all married couples who have no personality preferences in common. Find a95% confidence interval for . Round your answer to three decimal places.arrow_forwardA history teacher interviewed a random sample of 80 students about their preferences in learning activities outside of school and whether they are considering watching a historical movie at the cinema. 69 answered that they would like to go to the cinema. Let p represent the proportion of students who want to watch a historical movie. Determine the maximal margin of error. Use α = 0.05. Round your answer to three decimal places. arrow_forwardA random sample of medical files is used to estimate the proportion p of all people who have blood type B. If you have no preliminary estimate for p, how many medical files should you include in a random sample in order to be 99% sure that the point estimate will be within a distance of 0.07 from p? Round your answer to the next higher whole number.arrow_forward
- A clinical study is designed to assess the average length of hospital stay of patients who underwent surgery. A preliminary study of a random sample of 70 surgery patients’ records showed that the standard deviation of the lengths of stay of all surgery patients is 7.5 days. How large should a sample to estimate the desired mean to within 1 day at 95% confidence? Round your answer to the whole number.arrow_forwardA clinical study is designed to assess the average length of hospital stay of patients who underwent surgery. A preliminary study of a random sample of 70 surgery patients’ records showed that the standard deviation of the lengths of stay of all surgery patients is 7.5 days. How large should a sample to estimate the desired mean to within 1 day at 95% confidence? Round your answer to the whole number.arrow_forwardIn the experiment a sample of subjects is drawn of people who have an elbow surgery. Each of the people included in the sample was interviewed about their health status and measurements were taken before and after surgery. Are the measurements before and after the operation independent or dependent samples?arrow_forward
- iid 1. The CLT provides an approximate sampling distribution for the arithmetic average Ỹ of a random sample Y₁, . . ., Yn f(y). The parameters of the approximate sampling distribution depend on the mean and variance of the underlying random variables (i.e., the population mean and variance). The approximation can be written to emphasize this, using the expec- tation and variance of one of the random variables in the sample instead of the parameters μ, 02: YNEY, · (1 (EY,, varyi n For the following population distributions f, write the approximate distribution of the sample mean. (a) Exponential with rate ẞ: f(y) = ß exp{−ßy} 1 (b) Chi-square with degrees of freedom: f(y) = ( 4 ) 2 y = exp { — ½/ } г( (c) Poisson with rate λ: P(Y = y) = exp(-\} > y! y²arrow_forward2. Let Y₁,……., Y be a random sample with common mean μ and common variance σ². Use the CLT to write an expression approximating the CDF P(Ỹ ≤ x) in terms of µ, σ² and n, and the standard normal CDF Fz(·).arrow_forwardmatharrow_forward
- Compute the median of the following data. 32, 41, 36, 42, 29, 30, 40, 22, 25, 37arrow_forwardTask Description: Read the following case study and answer the questions that follow. Ella is a 9-year-old third-grade student in an inclusive classroom. She has been diagnosed with Emotional and Behavioural Disorder (EBD). She has been struggling academically and socially due to challenges related to self-regulation, impulsivity, and emotional outbursts. Ella's behaviour includes frequent tantrums, defiance toward authority figures, and difficulty forming positive relationships with peers. Despite her challenges, Ella shows an interest in art and creative activities and demonstrates strong verbal skills when calm. Describe 2 strategies that could be implemented that could help Ella regulate her emotions in class (4 marks) Explain 2 strategies that could improve Ella’s social skills (4 marks) Identify 2 accommodations that could be implemented to support Ella academic progress and provide a rationale for your recommendation.(6 marks) Provide a detailed explanation of 2 ways…arrow_forwardQuestion 2: When John started his first job, his first end-of-year salary was $82,500. In the following years, he received salary raises as shown in the following table. Fill the Table: Fill the following table showing his end-of-year salary for each year. I have already provided the end-of-year salaries for the first three years. Calculate the end-of-year salaries for the remaining years using Excel. (If you Excel answer for the top 3 cells is not the same as the one in the following table, your formula / approach is incorrect) (2 points) Geometric Mean of Salary Raises: Calculate the geometric mean of the salary raises using the percentage figures provided in the second column named “% Raise”. (The geometric mean for this calculation should be nearly identical to the arithmetic mean. If your answer deviates significantly from the mean, it's likely incorrect. 2 points) Starting salary % Raise Raise Salary after raise 75000 10% 7500 82500 82500 4% 3300…arrow_forward
- Glencoe Algebra 1, Student Edition, 9780079039897...AlgebraISBN:9780079039897Author:CarterPublisher:McGraw HillBig Ideas Math A Bridge To Success Algebra 1: Stu...AlgebraISBN:9781680331141Author:HOUGHTON MIFFLIN HARCOURTPublisher:Houghton Mifflin HarcourtCollege AlgebraAlgebraISBN:9781305115545Author:James Stewart, Lothar Redlin, Saleem WatsonPublisher:Cengage Learning
- Holt Mcdougal Larson Pre-algebra: Student Edition...AlgebraISBN:9780547587776Author:HOLT MCDOUGALPublisher:HOLT MCDOUGALAlgebra and Trigonometry (MindTap Course List)AlgebraISBN:9781305071742Author:James Stewart, Lothar Redlin, Saleem WatsonPublisher:Cengage Learning
![Text book image](https://www.bartleby.com/isbn_cover_images/9780079039897/9780079039897_smallCoverImage.jpg)
![Text book image](https://www.bartleby.com/isbn_cover_images/9781680331141/9781680331141_smallCoverImage.jpg)
![Text book image](https://www.bartleby.com/isbn_cover_images/9781305115545/9781305115545_smallCoverImage.gif)
![Text book image](https://www.bartleby.com/isbn_cover_images/9780547587776/9780547587776_smallCoverImage.jpg)
![Text book image](https://www.bartleby.com/isbn_cover_images/9781305071742/9781305071742_smallCoverImage.gif)