1 / 39

Data Variables & Units

Data Variables & Units. Statistics. (The field of) Statistics is the systematic study of data. The word “data” is plural… “The data are the price gains of 200 stocks on the NYSE.” Singular? “Datum.” (Uncommon.) Shares of Exxon-Mobil gained 2.3%. The datum is 2.3%.

martha
Download Presentation

Data Variables & Units

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. DataVariables & Units

  2. Statistics (The field of) Statistics is the systematic study of data. The word “data” is plural… “The data are the price gains of 200 stocks on the NYSE.” Singular? “Datum.” (Uncommon.) Shares of Exxon-Mobil gained 2.3%. The datum is 2.3%. What characterizes data is variability.

  3. Variables / Statistical Units Units of observation: Set of entities (things / objects) being studied Variable: An attribute of each unit Suppose X describes a variable and U describes the units. “X varies among the (statistical) units.”

  4. Units: Math 158-800 students. Variable: Gender. Gender is a Categorical Variable Gender varies among Math 158-800 students.

  5. Units: Math 158-800 students. Variable: Number of FB friends. Number of FB friends is a Quantitative Variable Number of FB friends varies among Math 158-800 students.

  6. 1. An experiment was conducted to test the performance of four brands of batteries in three different environments (room temperature; hot and humid; cold). For each combination of brand and environment, batteries were put into a flashlight. The flashlight was then turned on and allowed to run until the light went out. The amount of time until the flashlight stopped shining (in minutes) was recorded. Do brand and environment play a role in the lifetime of these batteries? Minutes are measurement units. Most quantitative variables have a measurement unit. If I want the measurement unit, I’ll say exactly that. By “unit” I mean “unit of observation” = thing / object that is studied.

  7. 2. 55 year old men are recruited into a study about heart attacks. The heart rate of each man is recorded. Each is tracked for a one-year period, and whether or not he has a heart attack is determined.

  8. 3. A student runs an experiment to study the effect of tire pressure on gas mileage. He devises a system so that his car uses gasoline from a one-liter container. Each time the container is filled, he randomly selects a tire pressure between 20 and 35 psi, then drives the car at 60 mph on a divided highway. When he runs out of gas, he records the distance driven on that fill. Does tire pressure impact the distance driven? Something like “drives” would also suffice for the units.

  9. Variables / Statistical Units The units are the countries (of the world). Describe a variable. Write the sentence _________ varies among __________. Is the variable quantitative or categorical?

  10. Variables / Statistical Units The units are the countries (of the world). Describe a variable. Write the sentence _________ varies among countries. Is the variable quantitative or categorical?

  11. GDP per capita and Longevity

  12. GDP per capita and Longevity NOT a unit

  13. Types of variables Quantitative Variable Naturally measured as numbers for which ordering and at least some of the usual operations (addition, multiplication, subtraction, etc.) make sense. Discrete All the possible values are easily listed Frequent “ties” Often count or related to counts Continuous Technically: “ties” are impossible In practice ties are uncommon

  14. Types of variables Categorical Variable Not quantitative (usually verbal, but sometimes expressed as numbers having little or no number meaning). Virtually all categorical variables are discrete. So, the term discrete is rarely used in speaking about categorical variables – it is redundant.

  15. Distribution A variable’s distribution is a description of what values it takes and how often it takes them. Categorical Variables Distributions are always summarized in terms of percents (falling into each category). Quantitative Variables There are many ways to summarize quantitative variables. Among them: Mean + Standard Deviation Median + Interquartile Range

  16. Purposes of variables Explanatory and Response Variable Changing the value of the explanatory variable (EV) results in a change in the distribution of the response variable (RV). Loosely: A change in the explanatory variable alters the prediction of the response variable.

  17. Variable: Form of study. Units: The (200) college students involved in the experiment. Form of study varies from student to student.

  18. Variable: Score on the short answer test. Units: The (200) college students involved in the experiment. Score on the short answer test varies from student to student.

  19. Experimental study The explanatory variable is assigned (often by the people conducting the study). Units do not enter the study with a value for this variable. Observational study The explanatory variable is a characteristic of the unit.

  20. Statistics Data vary A population is a collection of all the units of interest. If we have information on all the units of a population we have a complete description of the variation in the data. Such a description of a population is a census. Characteristics of populations are parameters. A sample is an incomplete collection of units from the population. A sample necessarily provides incomplete information. Characteristics of samples are called (the word) statistics.

  21. In Class Survey Your gender (M or F) Guess your instructor’s age Which finger is longer? I = Index R = Ring S = same How many people are there who have the same mother and father as you do? What company is your cell phone carrier? How long was the last call you received on your phone? G: A: F: S: C: D:

  22. In a data table each unit takes a row; each variable occupies a column. Column headers identify variable names. There are other ways to organize data, and some are preferable when the idea is to display the data efficiently. However, in most cases, a data table is how data are organized in a spreadsheet.

  23. Here are the monthly fees (in $) paid by a random sample of 50 users of internet service providers in 2008: 42 31 33 34 65 47 37 38 32 40 32 36 31 42 32 32 72 42 45 37 41 46 39 38 34 31 41 51 42 37 32 42 31 43 40 32 37 34 44 41 36 34 45 45 42 35 39 83 30 39 VARIABLE: ____________ UNITS: ____________

  24. Here are the monthly fees (in $) paid by a random sample of 50 users of internet service providers in 2008: 42 31 33 34 65 47 37 38 32 40 32 36 31 42 32 32 72 42 45 37 41 46 39 38 34 31 41 51 42 37 32 42 31 43 40 32 37 34 44 41 36 34 45 45 42 35 39 83 30 39 VARIABLE: Monthly fee (for use of internet) UNITS: Users of internet service

  25. You can start almost any problem in this course by first asking: What are the units? What is the variable? *Perhaps identified by name? (Names aren’t given here.) Often, unit identifiers will not be given or displayed.

  26. Variables / Statistical Units The units are the companies listed on the New York Stock Exchange. Describe a variable. Write the sentence variable _________ varies from company to company. Is the variable quantitative or categorical? If quantitative, is it discrete or continuous?

  27. GDP per capita and Longevity Parameters Not statistics NOT a unit

  28. Distribution The distribution of a variable tells us what values it takes and the likelihood of those values. What the fees are. How often those fees occur.

  29. VARIABLES (there are 5) UNITS X (Variable) varies from unit to unit.

  30. VARIABLES (there are 5) UNITS City MPG varies from car model to car model.

  31. VARIABLES (there are 5) UNITS Number of cylinders varies from car model to car model.

  32. VARIABLES (there are 5) UNITS Transmission type varies from car model to car model.

  33. VARIABLES (there are 5) UNITS Transmission type CATEGORICAL VARIABLE

  34. VARIABLES (there are 5) UNITS City MPG QUANTITATIVE VARIABLE

  35. VARIABLES (there are 4) UNITS

More Related