ch2 - organizing and presenting data if you cant share your work with other people so they understand it, then what was the point - frequency distribution (table) ...how often does each data value appear ex) is this your first semester at BMCC? yes 12 no 10 frequency: total number of occurrences of a data value relative frequency: also: probability when written "out of 100", its a percentage note: relative freqencies add up to 1. sometimes, because of rounding, that might not happen on the chart presenting qualitative data (words) Bar Graph ex) what is your major? ex) # of people in poverty in Lower East Side can include frequency labels to be precise -Pie Chart ex) # of people with a particular major label each slice with: category; freq (or relative freq or percentage) note: if the textbook asks "how many degrees in a slice?" ignore that - if you need to be that precise, use software presenting quantitative data a frequency distribution (table) looks the same when each data value gets its own frequency when might it look different? ex) what is your age? 31 19 22 33 25 32 22 51 21 20 21 22 31 26 43 break data up into data classes (from low to high) (what class you choose is up to you) here, break it up into 10s, 20s, 30s 40s 50s ...this is typical for dealing with ages each age range or "data group" is called a data class the class width is the distance between classes ex) here its 10 20-10=10 or 30-20=10 note: the class width must stay the same (in general) the class limits (or class boundaries) are the highest and lowest values in the class (there are other technical definitions for this, we will use this simple one) ex) what are the lower and upper class limits of the third data class? how do you make a frequency distribution table with continuous data? ...adjust class limits according to your precision (number of decimal places) ex) if you record people's ages with 2 decimal places (20.47years, 21.65, etc) then write your class limits with 2 decimal places: 10-19.99, 20-29.99, 30-30.99 etc - histogram ..bar graph for quantitative data # math classes note: there is no gap between the bars (some software programs get this wrong) what if each class contains several values? ex) what about a graph for relative frequency? note: for all these presentations, anywhere you put frequency, you can instead put relative frequency (you get same shape, 1 replaced by 1/15 ) what if your data has a big gap? ex) 19,22,56,20,21,27,32,22,47,25,24,42, 105,107,115,117, * frequency polygon (line graph) histogram, but instead of bars, connect points with a line 10 20 30 40 50 60 * cumulative frequency graph (ogive: "oh-jive") 19,22,56,20,21,27,32,22,47,25,24,42 great for small data sets: it is visual (like a histogram) and it preserves all your data ex) 105, 107, 112, 114 we can make: bar graph using Excel / OpenOffice / StatDisk pie chart using Excel / OpenOffice / StatDisk histogram using StatDisk there are lots of creative ways to represent data visually. want to see a better visual representation we can all use? how about this NYC subway map: http://www.kickmap.com/comparison/ for one guy, that creativity is his full-time job: http://www.informationisbeautiful.net/ this fellow tells the story of the world with animated statistics: http://www.youtube.com/watch? v=jbkSRLYSojo 2.4 misleading representations - a graph which is correct, but gives the wrong impression > 3-dimensional pictures exaggerate the appearance of change ex) money pile * misleading graph ex) salary graph "look at the raises we give!" short y-axis range - dont start from zero exaggerates change we can make: bar graph using Excel / OpenOffice / StatDisk pie chart using Excel / OpenOffice / StatDisk histogram using StatDisk