Stat 3rd test

Dependent variable

goes on the y-axis, variable being estimated, variable that is being predicted, and response variable in regression analysis.

Independent variable

goes on the x-axis, variable being used to estimate, and used to predict dependent variable, can be manipulated by the investigator, explanatory, and predictor.

Least squares

minimizes the term E(Y-Y^) to find a and b.

Slope

amount Y' increases when X increases by 1, represents the change in Y per unit change in X.

Intercept

value when X=0.

Standard Error of estimate

measure of the variation around the regression line, is zero if all the points on the scatter diagram lie on the regression line.

Coefficient of Determination

percent of explained variation, usually labeled r^2, determined by SSR/SS total, tells us the proportion of the total variation that is explained, and ranges from 0 to +1.00, is 1 if all the points lie on the least squares regression line.

Correlation Coefficient

may range from -1.00 to +1.00, interval measurements are required to compute, is zero if there is no correlation between two variables.

Scatter Diagram

graphic tool to show the relationship between X and Y.

Confidence interval

shows the range of values of Y^.

Regression equation

used to estimate the value of the dependent variable based on the independent variable.

Regression analysis

describes how one dependent variable and one or more independent variables are related, used for the purpose of prediction.

Least square method

provides the best approximation for the relationship between the independent and dependent variables, used to arrive at the "best fitting" straight line in regression analysis.

Regression model

describes how the dependent and independent variables are related.

Correlation analysis

determines the strength of the relationship between the dependent and independent variables.

Total sum of squares

E(Yi-Ybar)^2.

Unexplained sum of squares

E(Yi-Y^)^2.

Explained sum of squares

E(Y^-Ybar)^2.

Regression surface

relationship among several variables.

Regression where b=0 means

regression equation is parallel to the X-axis.

Regression where a=0 means

regression equation is parallel to the Y-axis.

The ________ and the __________ always have the same sign

slope of the regression line, and the coefficient of correlation, (b and r).

A coefficient of correlation was computed to be -.70, this means

that as the X value increases, the Y value decreases.

Coefficient of Correlation

equal to the coefficient of determination but it can be negative or positive, has the same sign as the slope in simple linear regression, measures the strength of the linear relationship between two variables, and is the square root of the coefficient of

Testing for the existence of correlation is qual to

testing for the existence of the slope (b1).

The width of the confidence interval estimate for the predicted value of Y is dependent on:

the sample size, the value of X for which the prediction is being made, and the standard error of the estimate.

Total variation

Explained+Unexplained variation.

The Y intercept

represents the predicted value of Y when X=0.

If correlation coefficient (r)=1.00 then,

there is no unexplained variation.

Multiple regression analysis

there can be several independent variables but only one dependent variable.

Interpretation of the standard error of the estimate is equal to,

the interpretation of the standard deviation.

Direct linear relationship

if B is positive, and if X increases and Y increases.

Inverse linear relationship

if B is negative, and if X increases and Y decreases.

No linear relationship

if B=0, and if X increases and Y can increase or decrease.

Given: y^=a+bx
a is?
b is?

the y intercept and the slope of the regression line.

Simple linear regression

if the coefficient of correlation is zero, the the regression line can go through the origin but it doesn't have to.

Correlation means

the degree of association between two variables.

Regression analysis involving two quantitative variables, we assume

that the variation around the regression line are the same for each X.

Larger values of r^2 mean that

observations are more closely grouped about the least square line.

Given: Y^=7-.5X, Coefficient of Determination=.81
At what point does the equation cross the y-axis?
What is the Coefficient of Correlation?
Increasing the independent variable by 1 will result in what amount of increase or decrease to the dependent variab

7
-.9
decrease of .5