You decided to join a fantasy Baseball league and you think the best way to pick your players is to look at their Batting Averages.
You want to use data from the previous season to help predict Batting Averages to know which players to pick for the upcoming season. You want to use Runs Score, Doubles, Triples, Home Runs and Strike Outs to determine if there is a significant linear relationship for Batting Averages.
You collect data to, to help estimate Batting Average, to see which players you should choose. You collect data on 45 players to help make your decision.
x1 = Runs Score/Times at Bat
x2 = Doubles/Times at Bat
x3 = Triples/Times at Bat
x4 = Home Runs/Times at Bat
x5= Strike Outs/Times at Bat
Is there a significant linear relationship between these 5 variables and the Batting Average?
If so, what is/are the significant predictor(s) for determining the Batting Average?
You are thinking about opening up a Starbucks in your area but what to know if it is a good investment. How much money do Starbucks actually make in a year? You collect data to, to help estimate Annual Net Sales, in thousands, of dollars to know how much money you will be making.
You collect data on 27 stores to help make your decision.
x1 = Rent in Thousand per month
x2 = Amount spent on Inventory in Thousand per month
x3 = Amount spent on Advertising in Thousand per month
x4 = Sales in Thousand per month
x5= How many Competitors stores are in the Area
Approximately what percentage of the variation in Annual Net Sales is accounted for by these 5 variables in this model?
With Obesity on the rise, a Doctor wants to see if there is a linear relationship between the Age and Weight and estimating a person’s Systolic Blood Pressure. Is there a significant linear relationship between Age and Weight and a person’s Systolic Blood Pressure?
If so, what is/are the significant predictor(s) for Systolic Blood Pressure?
You move out into the country and you notice every Spring there are more and more Deer Fawns that appear. You decide to try and predict how many Fawns there will be for the up coming Spring.
You collect data to, to help estimate Fawn Count for the upcoming Spring season. You collect data on over the past 10 years.
x1 = Adult Deer Count
x2 = Annual Rain in Inches
x3 = Winter Severity
Where Winter Severity Index:
1 = Warm
2 = Mild
3 = Cold
4 = Freeze
5 = Severe
Estimate Fawn Count when Adult Deer Count = 10, Annual Rain = 13.5 and Winter Severity = 4
In the context of regression analysis, what is the definition of an influential point?
Question 8 (1 point)
The least squares regression line for a data set is yˆ=2.3−0.1x and the standard deviation of the residuals is 0.13.
Does a case with the values x = 4.1, y = 2.34 qualify as an outlier?
Question 9 (1 point)
The following data represent the weight of a child riding a bike and the rolling distance achieved after going down a hill without pedaling.
Weight (lbs.)
Rolling Distance (m.)
59
26
84
43
97
48
56
20
103
59
87
44
88
48
92
46
53
28
66
32
71
39
100
49
Can it be concluded at a 0.05 level of significance that there is a linear correlation between the two variables?
Question 10 (1 point)
A negative linear relationship implies that larger values of one variable will result in smaller values in the second variable.
Question 11 (1 point)
You determine there is a strong linear relationship between two variables using a test for linear regression. Can you immediately claim that one variable is causing the second variable to act in a certain way?
Question 12 (1 point)
Which of the following describes how the scatter plot appears? Select all that apply.
Question 13 (1 point)
The following data represent the weight of a child riding a bike and the rolling distance achieved after going down a hill without pedaling.
Weight (lbs.)
Rolling Distance (m.)
59
26
84
43
97
48
56
20
103
59
87
44
88
48
92
46
53
28
66
32
71
39
100
49
Regression Statistics
Multiple R
0.956806
R Square
0.915477
Adjusted R Square
0.907025
Standard Error
3.483483
Observations
12
ANOVA
df
SS
MS
F
Significance F
Regression
1
1314.32
1314.32
108.3113
1.1E-06
Residual
10
121.3466
12.13466
Total
11
1435.667
Coefficients
Standard Error
t Stat
P-value
Lower 95%
Upper 95%
Intercept
-8.56472
4.7892
-1.78834
0.104007
-19.2357
2.106284
Weight (lbs.)
0.611691
0.058775
10.40727
1.1E-06
0.480731
0.742651
Find the standard error of estimate. Round answer to 4 decimal places.