Level 234 Level 236
36 words 0 ignored
Ready to learn Ready to review
Check the boxes below to ignore/unignore words, then click save at the bottom. Ignored words will never appear in any learning session.
The objects described by a set of data.
an alphabetic character representing a number, called the value, which is either arbitrary or not fully specified or unknown. It is usually a letter like x or y.
the entire group of items or individuals for which a sample is taken (the entire American _______________________, New jersey is a sample)
a randomly selected group chosen for the purpose of collecting data. ie 3 middle schools in Monmouth county to determine information regarding typical Middle School student in Monmouth County
A variable that names categories (words/numbers)
A variable in which the numbers act as numerical values - always have units.
A quantitative variable that can take any real numerical value over an interval.
A quantitative variable that can only take a limited, finite number of values, ex. the number of petals on a flower.
Qualitative and unordered variables, ex. flower color.
Data that can be ranked, like star ratings. Although they can be ranked, they are not true quantitative variables because the intervals between consecutive ranks are often not identical.
A way of recording data; usually in a table in which each row is an individual and each column is a variable.
Exploratory Data Analysis
An examination of data in order to describe their main features. Starting by examining the variables, their relationships, and creating a graph are usually a good idea.
The _____________________________ of a var. gives: possible values of the variance; the relative frequency of each value.
Distribution of a Categorical Variable
This lists the categories and gives either the count or the persent of individuals that fall into each category.
how many time you get it
the ratio of the frequency of a category to the total frequency
Errors in rounding up or down that lead to inconsistent data, ex. not totalling to 100%.
What graphs are appropriate for categorical data?
Bars do not touch; categorical data is typically on the horizontal axis; to describe: comment on which occurred the most often or least often
Graphical representation of data in the form of a circle containing wedges.
Bar graph for qualitative data, with the bars arranged in order according to frequencies.
A bar graph depicting a frequency distribution. The height of the bars indicates the frequency of a group of scores.
Creating a Histogram
Bin sizes are important; rule of thumb says start with 5-10 bins and refine accordingly.
The overall pattern of a histogram can be described in terms of its shape, center, and spread.
Can be unimodel (single-peaked) or bimodel (double-peaked).
Can be symmetric, left-skewed (left side extends out more), or right-skewed (right side extends out more). Skewed only applies to unimodel graphs.
The range of values; often altered by deviations such as outliers.
a data value that is either much greater or much less than the median
A stemplot with two sets of leaves, one on the right, one on the left.
Consists of a graph in which each data value is plotted as a point (or dot) along a scale of values. Dots representing equal values are stacked.
Plots each observation against the time at which it was measured. Time is always on the horizontal scale, and the variable on the vertical scale. Time plots can reveal trends, cycles, and other patterns.
Time Series Data
Data collected over time.
the sum of all the values divided by the number of values
y tends to decrease, as x increases
y tends to increase, as x increases