130 likes | 330 Views
Scatter plots. Mini lessons. Clustering. What is clustering? Finding patterns in data, or grouping similar groups of data-points together into clusters. Clustering. Example. http://glowingpython.blogspot.com/2012/04/k-means-clustering-with-scipy.html. OUTLIERS.
E N D
Scatter plots Mini lessons
Clustering • What is clustering? • Finding patterns in data, or grouping similar groups of • data-points together into • clusters.
Clustering • Example http://glowingpython.blogspot.com/2012/04/k-means-clustering-with-scipy.html
OUTLIERS • What are outliers in data? • An outlier is an observation that lies an abnormal distance from other values in a random sample from a population. In a sense, this definition leaves it up to the analyst
…OUTLIERS continued • (or a consensus process) to decide what will be considered abnormal. Before abnormal observations can be singled out, it is necessary to characterize normal observations.
OUTLIERS • Example • / http://www.theanalysisfactor.com/outliers-to-drop-or-not-to-drop
Positive • If the data displayed on the graph resembles a line rising from left to right the slope of the line is positive. There is a positive correlation between the two sets of data.
Positive example http://www4.uwsp.edu/psych/stat/7/correlat.htm
Negative • If the data set resembles a line falling from left to right, the slope of the line is negative. There is a negative correlation would exist.
Negative Example http://www.psychologynoteshq.com/correlation/
Linear association • A statistical term used to describe the relationship between a variable and a constant. This Constant is a proportion or slope. Linear relationships can be expressed in a graphical format where the variable and the constant are connected via a straight line or Y = 2x + 5
Linear association • in a mathematical format where the independent variable is multiplied by the slope coefficient, added by a constant, which determines the dependent variable. Slope Coefficient Y = 2x + 5 Independent Variable Dependent Variable
Nonlinear association • When graphed points do not form a straight line (linear) is called a Nonlinearassociation. The points could form a bell curve for example. http://gilgamesh.cheme.cmu.edu/doc/software/jacapo/9-numerics/9.4-curve-fitting/9.4.0-curve-fitting.html