# Use technology to find the regression equation for the linear association between the “percent of…

Part 1: Correlation and Linear Regression It is widely believed that the more education one receives the higher the income earned at the time of first employment and over the course of a career. However, due to varying reasons, many people never complete high school and, thus, never receive their high-school diploma. Although individuals without a high-school diploma are often able to find employment, they experience economic outcomes quite different from those who finish high school before entering the workforce to earn a living. Across the nation, there are millions of individuals with families who are now working but do not possess the credentials of a high-school diploma. Many of these individuals and their families are considered to be a part of the working poor that make up a considerable portion of this nation’s labor force. 1. Use technology to create and provide a scatterplot of the association between the “percent of low-income working families” and the “percent of 18-64 yr-olds with no high school diploma” data for each jurisdiction. Write at least two sentences explaining how/why it is appropriate to create such a scatterplot, and describe the characteristics of the association seen in the scatterplot. Be sure to use the actual names of the variables in their appropriate places in your response(s). (Print or copy-and-paste the scatterplot and be sure to clearly identify the predictor and response variables based on the possible believed association.) 2. Use technology to find the regression equation for the linear association between the “percent of low-income working families” and the “percent of 18-64 yr-olds with no high school diploma.” (Round final values to two decimal places.) Provide this equation and write a brief interpretation of the slope using the variable names. (Print or copy-and-paste the printout that identified the equation of the linear regression line, or any other form of evidence that technology was used.) 3. A student states that a decrease in the “percent of 18-64 yr-olds with no high school diploma” will lead to a decrease in the “percent of low-income working families.” Write at least two concise sentences addressing the key uses of linear correlation and comment on its limitations in a response to the student’s statement. 4. Calculate and provide the R-squared value for the regression equation. Provide a statement about its meaning, in general, and, its specific interpretation in the context of this assignment. 5. After examining these data for all the jurisdictions, someone notes that certain areas have an unusually high “percent of 18-64 yr-olds with no high school diploma.” Based on this finding, this individual concludes that the high percentages are due to the rising population of immigrants in those areas. Further, the individual argues that any estimates of the associated “percent of low-income working families” in those areas should be recalculated after removing this sub-population from the data set, as they are causing the area to “look bad”. In addition to thinking critically, use the key rules about linear regression and extrapolation to write a statistically appropriate and socially responsible response to the individual’s conclusion and argument.