Quantitative Methods- Assessment
POPULATION AND SAMPLE
A
population is an entire group or complete set of individuals or
elements about which some specific information is required or in other words
something you want to draw conclusions about.
So
far researchers are interested in answering general questions of interest such
as:
· Architects
are interested in understanding the design preferences of all homeowners in a
specific urban area.
· Computer
scientists are interested in exploring the efficiency of all algorithms used in
a specific category of software applications.
· Economists
are interested in studying the spending habits of all households in a
particular country.
· Environmental
scientists are interested in monitoring the pollution levels of all water
bodies within a designated region.
· Psychologists
are looking for general rules about all people (how do people learn a
language?)
· Statisticians
are interested in analyzing the salary distribution of all employed
statisticians in the field of academia.
Whatever
it is you want to make generalizations about, you have to make a large
collection of data either individuals or objects that you are interested in.
This is known as population. Reaching the whole population might be cumbersome
due to time or resources, so researchers often make use of samples drawn
from the population to make a conclusion about the larger population. This is
known as a sample.
A
sample is a subset of a population in which true inferences
about the population can be made. Note that a sample can be used to make a good
guess about the population. Therefore, the larger the sample size the better
our confidence level.
There
are 2 types of sampling methods
1. Probability
sampling- This is a sapling technique where every member of the
population and a known and equal chance of being selected for a sample.
Types of Probability Sampling Techniques
· Random
Sampling
· Stratified
Sampling
· Systematic
Sampling
· Cluster Sampling
2. Non-probability
sampling- This is a sapling technique where not every member of
the population has a known and equal chance of being selected for a sample.
Types of non-probability sampling technique
· Convenient
Sampling
· Purposive
Sampling
· Snowball
Sampling
· Quota Sampling
All
statistical techniques are divided into two broad categories: descriptive and
inferential statistics.
https://www.blogger.com/blog/post/edit/1781268278613007246/5860578026225684807
Experimental
Design
Experimental
designs represent systematic research methods conducted objectively and under
controlled conditions. These approaches involve manipulating one or more
independent variables to examine their impact on a dependent variable. By
carefully controlling variables and ensuring objectivity, experimental designs
aim to maximize precision, facilitating the drawing of conclusions regarding
the stated hypotheses.
Questionnaire
Design
Designing
an effective questionnaire means creating reliable, meaningful, and valid questions in a research study capable of achieving the research objectives.
An important element in writing good questions
·
Question clarity and short- Keep
language simple if explaining anything. Ensure jargon, ambiguity, and complex
terms that are capable of confusing the respondents are avoided.
·
Avoid a leading question- Ensure
questions are not leading the respondents to a particular answer.
·
Question structure- Ensure
questions are organized logically. Non-threatened and simple questions can be
begun with, this is capable of building the respondents' confidence before
bringing in more sensitive and complex topics.
·
Avoid bias questions- The
questions should be neutral and unbiased. Objectivity must be maintained
throughout the questions.
·
Avoid lengthy questions- The questionnaire must maintain a reasonable length to ensure respondent engagement. Too lengthy
questions can lead to fatigue or receiving inadequate responses from the
respondents.
·
Avoid phrasing in the negative- Negative
phrasing can lead to confusion and increase the likelihood of respondents
misinterpreting the intended meaning.
DESCRIPTIVE
STATISTICS
Descriptive
statistics are informational and help to summarise and describe the actual
characteristics of the data set. Descriptive statistics provides an initial
understanding of the data distribution.
Descriptive statistics has three (3) basic categories.
1. Measure
of central tendency (mean, mode, median)
2. Measure
of variability i.e. spread of the data set (variance, standard deviation)
3. Measures
of frequency distribution, count of occurrence of each value (count)
To obtain descriptive statistics on your variables, the first step is to
gather all relevant background information before conducting any statistical
analyses. These descriptive statistics include the range, mean, standard
deviation, skewness, and kurtosis.
https://www.blogger.com/blog/post/edit/1781268278613007246/4839363183018201007
Let's
interpret each statistic in the table one by one:
1. N
(Sample Size):
· The
number of observations in the dataset for the variable "age" is 439.
2. Range:
· The
difference between maximum and minimum values is referred to as range.
· The
range of “age” is 80 (82 - 2).
3. Minimum:
· In
the dataset, the lowest value for the variable "age" is 2.
4. Maximum:
· The
largest value in the dataset for the variable "age" is 82.
5. Mean
(Average):
· The
average age in the dataset is 37.39.
6. Standard
Deviation:
· Standard
deviation measures the amount of dispersion in the dataset.
· For
the variable "age," the standard deviation is 13.293.
7. Variance:
· The
square of the standard deviation.
· For
the variable "age," the variance is 176.695.
8. Skewness:
· Skewness
measures the asymmetry of the distribution.
· When
the distribution is skewed to the right, it is referred to as a positive
skewness (0.564).
9. Kurtosis:
· A
measure of the "tailedness" of the distribution.
· A
positive kurtosis (0.117) indicates that the distribution has heavier tails
than a normal distribution.
10. Std.
Error of Mean:
· Standard
error related to the mean.
· The
standard error of mean for variable "age," is 0.634.
11. Std.
Error of Skewness:
· Standard
error associated with skewness.
· For
the variable "age," the standard error of skewness is 0.157.
The
second step is to check whether any assumptions of the individual tests are not
violated. To ensure data meets the required requirement, an assumption check is
important for any statistical procedures.
- Normality Assumption:
- Visualizing the shape of our
distribution kurtosis and skewness provides insight, and helps in assessing
the normality assumption.
- Homogeneity of Variance:
- The homogeneity of variance assumption
is important. The variance of the residual should be the same between the
independent variable and the predicted scores.
- Linearity:
- This is an important assumption in
regression analysis. This can be achieved by scatterplots. It is used in
assessing the linearity between variables. There should be a
straight-line relationship between the residual and the predicted
dependent variable score.
- Independence:
- Descriptive statistics give an
insight into the assumption of independence between our predictor
variables. In statistical tests, the assumption of independence should not
be violated.
- Outlier Detection:
- Descriptive statistics, such as the
identification of extreme values (outliers), are important for assessing
the impact of outliers on statistical analyses.
- Sample Size:
- The sample size is important for
determining whether the sample size is adequate for the statistical
method being applied.
Understanding the relationship between Sample Mean and Population Mean
When
samples are taken repeatedly, we do not get the same mean value every time,
there will always be a discrepancy between the true and the estimated
population mean, (i.e. ) because of sampling variation.
But
in most cases, the population mean (μ, ‘mu’) is unknown
INFERENTIAL STATISTICS
Inferential
statistics involves drawing inferences on the broader population based on
inferences made on the sample. For an estimation to be meaningful, we need to
understand our measuring scale. To measure is to know https://quotestats.com/author/lord-kelvin-quotes/
. This is simply a way to categorize and understand the nature of the data we are
dealing with. The four primary types of measurement scales are:
1. Nominal:
Nominal scales are the most basic of measurement where variables are named or
labeled in no specific order. The categories have no inherent order. For
example, Gender (Male or Female) and colors (Red, Yellow, Green, Blue).
2. Ordinal:
The ordinal scale has all its variables in a specific order or rank. However,
the interval between the categories is not uniform or meaningful. The
categories have a relative order indicating which is higher or lower and the
differences between the categories are not measurable nor consistent. For
example, educational levels (PhD, Master’s, Bachelor’s), social class (Upper,
Middle, Lover), and customer satisfaction (Excellent, Good, Fair, Poor).
3. Interval:
Interval scales have labels, orders, as well as uniform intervals between them,
but they lack zero true points. The intervals between them are consistent and
measurable.
4. Ratio
Scale: Ratio scales have the same features as interval scales
except that they have a true zero point.
It is important to understand the level of measurement and data types. If you cannot measure it, you cannot improve it https://quotestats.com/author/lord-kelvin-quotes/.
For
Summary statistics: To measure average and measure of spread
1. Continuous
2. Categorical
To select appropriate
statistical methods: Dependent Variable
1. Continuous
2. Categorical
Demonstrate some Quantitative Analysis using SPSS
In this case, I will analyze
this dataset by making use of binary logistic regression in evaluating the
impact of all the independent variables (age, highest education level, weight,
height, general health, alcohol drinks per day, caffeine drinks per day,
physical fitness, current weight, hours sleep/weekends, how many hours sleep
needed) on the dependent variable (problem with sleep), because my outcomes are
binary/dichotomous, and binary logistics only assumes two possible outcomes i.e
YES or NO. Binary logistic regression tries to predict whether someone has a problem with sleep (score = 1) or someone does not have a problem with sleep (score
= 0).
Some assumptions need to be met in logistic regression analysis.
·
In logistic regression a linear relationship
between dependent and independent variables is not required.
·
Homoscedasticity of variance, linearity, and interval of the independent variable are not required.
·
The residuals (error term) need not be
normally distributed.
·
The binary logistic regression typically
requires little or no multicollinearity among independent variables.
· Larger samples are required for logistic regression.
The
whole Demonstration of quantitative Analysis using SPPS can be found in my blog
https://twinniegrace.blogspot.com/2024/01/demonstrating-quantitative-analysis.html
Comments
Post a Comment