Case Study #4 will assess your ability to apply the concepts of chapter 10 to conduct simple and multiple regression analyses to create a prediction model for home prices based on up to four independent variables. You will calculate various descriptive statistics, create summary tables, create various charts and develop five regression prediction models. Finally, you will create a written report summarizing your findings. You will need to use the Data Analysis ToolPak Add-in as you did for the previous two case studies.
The data file contains data for a random sample of 1,000 houses located in the greater Baltimore, MD area. The data fields included are as follows:

•   Home Price
•   Living area (square feet)
•   Number of bedrooms
•   Number of bathrooms
•   Age (years)

In developing both your model and the report, address the items below.

1. There are numerous variables that are believed to be predictors of housing prices, including the ones in the data set for this project. Using the web, find the key variables that determine home price, including any not included in this data set.
2. Using Data>Data Analysis>Descriptive Statistics in Excel, calculate the mean, median, range and standard deviation of each variable and summarize the results in a table.
3. Using Excel, create histograms for price of the home, living area (square feet) and age of the home. Be sure to give each chart a title and label the axes clearly.
4. Using Excel, create scatterplots of each variable with each other variable. Be sure to give each chart a title and label the axes clearly.
5. Using Data>Data Analysis>Correlation in Excel, calculate the correlation coefficient of each variable with each other variable.
6. Using Data>Data Analysis>Regression in Excel, run 4 separate simple regression models to predict the dependent variable (price of the home) with each of the independent variables. Use an alpha level of 0.05 to determine significance.
7. Using Data>Data Analysis>Regression in Excel, run a multiple regression model to predict the dependent variable with all 4 independent variables. Use an alpha level of 0.05 to determine significance.
8. In Word, write a summary report of the findings that includes the tables, charts and regression analyses from steps 1-7 and includes the following:
   a. An introductory paragraph that summarizes the purpose of the analysis. Also include the information that you found in your web search about the key variables that determine home price.
   b. A section (1 or more paragraphs) describing what the tabular data from step 2 indicate about the central tendency, variability and distribution of each variable. For example, do the variables appear to be distributed in a symmetric or skewed pattern?
   c. A section (1 or more paragraphs) describing how the frequency histograms from step 3 support and clarify the findings of the tabular data. Include in this section any evidence suggesting outliers in the data.
   d. A section (1 or more paragraphs) describing what the scatterplots from step 4 and correlations from step 5 indicate about the relationship between the various pairs of variables (e.g., are the variables related? Does the relationship appear to be linear or nonlinear? Is the direction of the relationship positive or negative?).
   e. A section (1 or more paragraphs) summarizing the findings of the 4 simple regression models from step 6. Which models (if any) show that the independent variable in the model is a significant predictor of price of the home? Which models (if any) show that the independent variable in the model is not a significant predictor of price of the home? Which model is the best fitting? Which model is the poorest fitting?
   f. A section (1 or more paragraphs) summarizing the findings of the multiple regression model from step 7. Which variables in the model (if any) are significant predictors of price of the home? Which variables in the model (if any) are not significant predictors of price of the home? Does the multiple regression model provide a better fit than the best fitting simple regression model?
   g. A concluding paragraph summarizing the key findings of the analysis and identifying which model is the best fitting. Based on your web research, indicate any other variables that are not included in the current best fitting model that might improve the fit if they were included.
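
Excel's Regression tool remains the required method for steps 6 and 7. As an optional cross-check, a simple regression like those in step 6 can be sketched in Python. The function name `simple_regression` and the five data points are hypothetical and are not taken from the case-study data file; the sketch assumes ordinary least squares with a single predictor.

```python
from statistics import mean


def simple_regression(x, y):
    """Least-squares fit of y = intercept + slope * x for one predictor."""
    n = len(x)
    x_bar, y_bar = mean(x), mean(y)
    sxx = sum((xi - x_bar) ** 2 for xi in x)          # variation in x
    syy = sum((yi - y_bar) ** 2 for yi in y)          # variation in y
    sxy = sum((xi - x_bar) * (yi - y_bar) for xi, yi in zip(x, y))
    slope = sxy / sxx
    intercept = y_bar - slope * x_bar
    r = sxy / (sxx * syy) ** 0.5                      # Pearson correlation
    r_squared = r * r                                 # share of variation explained
    # t statistic for the slope, with n - 2 degrees of freedom
    t_stat = r * ((n - 2) / (1 - r_squared)) ** 0.5
    return {"slope": slope, "intercept": intercept, "r": r,
            "r_squared": r_squared, "t_stat": t_stat}


# Hypothetical values for illustration only (NOT from the data file).
living_area = [1200, 1500, 1800, 2100, 2400]            # square feet
price = [200_000, 240_000, 275_000, 330_000, 355_000]   # dollars

fit = simple_regression(living_area, price)
for name, value in fit.items():
    print(f"{name}: {value:.4f}")
```

Running the same function once per independent variable and comparing the `r_squared` values mirrors the "best fitting" versus "poorest fitting" comparison asked for in the report. The p-value needed for the alpha = 0.05 decision is not computed here and should be read from the Excel output.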

Submit a single Excel workbook showing all work for Steps 2-7 and a Word document of your summary report that addresses all parts of Step 8 and that also includes/interweaves all supporting tables and charts from Steps 2-7 (to tell a story with the data and through visualization means).

CaseStudy6-DataFile-Fall2020

Understanding descriptive statistics and their variability is a fundamental aspect of statistical analysis. On their own, descriptive statistics tell us how frequently an observation occurs, what is considered “average”, and how far data in our sample deviate from being “average.” With descriptive statistics, we are able to provide a summary of characteristics from both large and small datasets. In addition to the valuable information they provide on their own, measures of central tendency and variability become important components in many of the statistical tests that we will cover. Therefore, we can think about central tendency and variability as the cornerstone of the quantitative structure we are building.
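
As a small illustration of these measures, the sketch below computes central tendency and variability for a continuous variable using Python's standard `statistics` module. The `ages` list is a made-up sample, not drawn from the General Social Survey or any course dataset, and SPSS remains the tool the Discussion asks for.

```python
import statistics

# Hypothetical sample of ages in years (illustration only).
ages = [23, 25, 25, 31, 38, 44, 52, 67]

summary = {
    "mean": statistics.mean(ages),
    "median": statistics.median(ages),
    "mode": statistics.multimode(ages),   # list, since data can have several modes
    "range": max(ages) - min(ages),
    "sample_sd": statistics.stdev(ages),  # sample standard deviation (n - 1)
}

for name, value in summary.items():
    print(name, value)
```

In this made-up sample the mean is larger than the median, which is the usual sign of a right-skewed distribution and one reason the median may be the better measure of central tendency for such data.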

For this Discussion, you will examine central tendency and variability based on two separate variables. You will also explore the implications for positive social change based on the results of the data.
To prepare for this Discussion:

• Review this week’s Learning Resources and the Descriptive Statistics media program.
• For additional support, review the Skill Builder: Visual Displays for Categorical Variables and the Skill Builder: Visual Displays for Continuous Variables, which you can find by navigating back to your Blackboard Course Home Page. From there, locate the Skill Builder link in the left navigation pane.
• Review Chapter 4 of the Wagner text and the examples in the SPSS software related to central tendency and variability.
• From the General Social Survey dataset found in this week’s Learning Resources, use the SPSS software and choose one continuous and one categorical variable. (Note: this dataset will be different from your Assignment dataset.)
• As you review, consider the implications for positive social change based on the results of your data.

#### By Day 3

Post, present, and report a descriptive analysis for your variables, specifically noting the following:

1. Report the mean, median, and mode.
2. What might be the better measure of central tendency (i.e., mean, median, or mode), and why?
3. Report the standard deviation.
4. How variable are the data?
5. How would you describe this data?
6. What sort of research question would this variable help answer that might inform social change?

Post the following information for your categorical variable:

1. A frequency distribution.
2. An appropriate measure of variation.
3. How variable are the data?
4. How would you describe this data?
5. What sort of research question would this variable help answer that might inform social change?
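
For the categorical variable, a frequency distribution and a measure of variation can be sketched as follows. The sketch uses the Index of Qualitative Variation (IQV), which is one common choice for nominal data; this is an assumption, since the course materials may prescribe a different measure. The `responses` list and the helper name `iqv` are hypothetical.

```python
from collections import Counter


def iqv(counts):
    """Index of Qualitative Variation: 0 = no variation, 1 = maximum variation."""
    n = sum(counts.values())
    k = len(counts)                       # number of categories
    sum_sq_props = sum((c / n) ** 2 for c in counts.values())
    return k * (1 - sum_sq_props) / (k - 1)


# Hypothetical marital-status responses (illustration only).
responses = ["married"] * 5 + ["never married"] * 3 + ["divorced"] * 2

counts = Counter(responses)
n = len(responses)

# Frequency distribution: count and percentage per category.
for category, count in counts.most_common():
    print(f"{category:15s} {count:3d} {100 * count / n:5.1f}%")

variation = iqv(counts)
print(f"IQV = {variation:.2f}")
```

A value near 1 means the cases are spread almost evenly across categories, while a value near 0 means most cases fall in a single category.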

Be sure to support your Main Post and Response Post with reference to the week’s Learning Resources and other scholarly evidence in APA Style.

#### By Day 5

Respond to at least one colleague’s post with a comment on the presentation and interpretation of their analysis. In your response, address the following questions:

1. Was the presentation of results clear? If so, provide some specific comments on why. If not, provide constructive suggestions.
2. Are you able to understand how the results might relate back to positive social change? Do you think there are other aspects of positive social change related to the results?

Before beginning work on this week’s discussion post, review the following resources:

From the list below, select one topic for which you will lead the discussion in the forum this week. Early in the week, reserve your selected topic by posting your response (reservation post) to the Discussion Area, identifying key words about your topic in the subject line.
By the due date assigned, research your topic and start a scholarly conversation as you respond with your initial or primary post to your own reservation post in the Discussion Area. Make sure your response does not duplicate your colleagues’ responses:
Topic:

• Analyze why we conduct pairwise comparisons of treatment means in ANOVA.
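
One standard consideration for this topic is how the chance of at least one false positive grows with the number of pairwise comparisons. The sketch below illustrates that arithmetic; it assumes the comparisons are independent, which is only an approximation for pairwise tests on the same groups, and the function name `familywise_error` is hypothetical.

```python
from math import comb


def familywise_error(k_groups, alpha=0.05):
    """Return (number of pairs, approximate familywise error, Bonferroni alpha)."""
    m = comb(k_groups, 2)                 # number of distinct pairs of group means
    fwer = 1 - (1 - alpha) ** m           # P(at least one Type I error), if independent
    bonferroni_alpha = alpha / m          # per-comparison alpha under Bonferroni
    return m, fwer, bonferroni_alpha


for k in (3, 4, 5):
    m, fwer, adj = familywise_error(k)
    print(f"{k} groups: {m} comparisons, familywise error ~ {fwer:.3f}, "
          f"Bonferroni alpha = {adj:.4f}")
```

With four groups there are six pairwise comparisons, and running each at alpha = 0.05 without adjustment gives an approximate familywise error rate above 0.26, which is why post hoc procedures adjust for multiple comparisons.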

As the beginning of a scholarly conversation, your initial post should be:

• Succinct—no more than 500 words.
• Provocative—use concepts and combinations of concepts from the readings to propose relationships, causes, and/or consequences that inspire others to engage (inquire, learn). In other words, take a scholarly stand.
• Supported—scholarly conversations are more than opinions. Ideas, statements, and conclusions are supported by clear research and citations from course materials as well as other credible, peer-reviewed resources.

Provide two 200-word responses, one for each of RESPONSE 1 and RESPONSE 2 below. Each post must address a different topic: your initial post covers one topic, your first follow-up post covers a different topic, and your second follow-up post covers one of the remaining topics. Each response should discuss the subject further or provide more insight. For context, the discussion posts to which you are responding are reproduced below. The work must be 100% original, not plagiarized, and submitted by the deadline.
RESPONSE 1:
For the entire class I think that the resources that I liked best were the PDFs included each week that consolidated the information that we had gone over for the week and provided examples and even broke down the process for Excel as well.
Question two is one I always struggle with, seeing the bigger picture. I know that the math classes that are required for my Natural Science degree will become useful in future classes I take, especially those with lab requirements.
For me, the discussion posts were very useful, as they allowed me to see how other individuals approached each week’s work, and sometimes those posts shed new light on the subject, making it easier for me to understand the week’s work.
From here, I am keeping up my trend of taking two classes at a time; this coming semester I will begin Chemistry (with lab) and Biology (with lab). Both of these classes are 16 weeks long and begin the day after this class and my Physical Geography class close this Sunday. Now, with Statistics and Physical Geography completed, I am over 50% complete with my degree.
Again, congrats to everyone for making it this far in the class and good luck on your last week’s tests and the final paper. For those of you who have more classes, keep going strong.
Thomas
RESPONSE 2:
Which resources did you like best? Why?
My favorite resources were the ones that were attached to our discussion each week. Those helped me the most when it came to prepping for the homework quizzes and tests.
How will you use the information in the future? This is your opportunity to consider–and then share–how you will employ the knowledge that you have gained during our course.
The information I learned in this class won’t make its way into my everyday life but I can see where I would use this knowledge to make big purchases in my life. I can use statistics when comparing potential vehicles, houses, and even stocks if I were inclined to invest in the stock market.
Were the discussions useful?  Will they be useful for you in real life?  Why or why not?
I found the discussions very helpful in the course because it allowed me to apply the new concepts learned each week. That was helpful when it came to the homework and test each week. I can only speak for myself but those homework quizzes and tests were no joke. I needed all the help I could get.
Where are you going from here? What course(s) do you plan to take next?
This is the first time I’m saying this at the end of a course, but I’m not signed up for any other classes after this one concludes. I am due to have my second baby in roughly 40 days and this mama needs a break. I need to rest and prep for having another baby. I plan to take about 4 months off from school while I acclimate to having a second baby and see how that is going to alter my daily schedule. I plan to start classes back up again in July/August. At that time, I will be taking BUSN312 – Operations Research and BUSN313 – Global and Competitive Strategy. I have eight classes plus my senior seminar left until I finish my degree. I’m sad to be taking a break because I could be done with my degree by the end of this year. I know I’ll need the break though.
Feel free to share something interesting about yourself and/or how it relates to the course.
I don’t have anything interesting about myself that pertains to statistics.
Something interesting about myself just in general, though, is that I can solve a regular Rubik’s Cube in under a minute. I have a Rubik’s obsession. I have a Pyraminx (the pyramid one), a 5×5 Rubik’s Cube, as well as a 12-sided Megaminx (it has 12 pentagons). The pyramid one is the easiest one to solve, but the other two take me a while to solve. There are a lot of algorithms to remember with those.
Does anyone else know how to solve the Rubik’s Cube?
It was great conversing with everyone throughout this course and it was great that we could rely on each other to check our math.
– Emily

## Getting Started

A template has been provided for you to use as you write the research report throughout this course. This activity explains the sections and what they are for.
In order to successfully complete this assignment, you should be able to:

• Identify sections and the purposes of those sections in a research report.

## Resources

• File: Research Report Template Information – Workshops 1 and 2

## Background Information

When asked to perform research using a data set, there are three main areas of focus. The first is the background, which includes any sources that explain the data, variable definitions, research questions, and descriptive statistics in the form of graphs, central tendency statistics, and variation statistics. The second is the analysis of the raw data using hypothesis testing. In this course, regression, means comparison, and chi-square testing are employed. The third is the summary of the research results and any future research that could be beneficial.

• ResearchReportTemplateInformationWorkshops1and25.docx

