Math/Physic/Economic/Statistic
Computer Assignment
Some critics of television complain that the amount of violence shown on television contributes to violence in our society. Others point out that television contributes to the high level of obesity among children. Now, we may have to add financial problems to the list. A sociologist theorised that people who watch television frequently are exposed to many commercials, which in turn lead them to buy, resulting in increasing debt. To test this belief, a researcher plans to survey a sample of families across the country.
QUESTION 1
(a) What type of survey method the researcher could use and why?
(b) What sampling method could the researcher use to select his/her sample and why?
(c) What are the variables the researcher should consider collecting data for the purpose of the analysis and why? Identify the data type(s) for the variables.
(d) What kind of issues the researcher may face in this data collection?
Suppose the researcher collected data from 395 randomly selected families. For each family, the total debt and the number of hours the television is turned-on per week were recorded. The data are stored in file TVDEBT2015SP2.XLS which is available in the “Assessment” ? “ASSIGNMENT 2: COMPUTER APPLICATION PROJECT” section of the AFE134 unit website. Using this data and EXCEL, answer the questions below.
QUESTION 2
First, the researcher wishes to use the graphical descriptive methods to present the data.
(a) He suggests using class intervals such as 0-6, 6-12, 12-18, … for one variable and class intervals 0-30000, 30000-60000, 60000-90000, …. , for the other variable. Explain how he would decide on the number of classes and the above class intervals.
(b) Use appropriate BIN values to draw a histogram for each variable and comment on the shape of the two distributions.
(c) Use an appropriate plot to investigate the relationship between the two variables. Briefly explain the selection of each variable on the X and Y axes and why? On the same plot, fit a linear trend line including the equation and the coefficient of determination.
QUESTION 3
Second, the researcher wishes to use the numerical descriptive measures to summarize the data.
(a) Prepare a numerical summary report about the data on the two variables the researcher has considered by including the summary measures, mean, median, range, variance, standard deviation, smallest and largest values and the three quartiles, for each variable.
(b) Use five of the above summary measures to represent the summary information in a box plot for each variable. Draw the box plot by hand.
(c) Compute a numerical summary measure to measure the strength of the relationship between the two variables. Interpret this value.
QUESTION 4
The researcher considers using regression analysis to establish a linear relationship between the two variables.
(a) What is his dependent variable and independent variable? Why?
(b) Estimate a simple linear regression model and present the estimated linear equation. Interpret the coefficient estimates of the linear relationship.
(c) Interpret the coefficient of determination, R-squared (R2) value.
QUESTION 5 (Show all working in EXCEL by setting up a table)
A shopping mall estimates the probability distribution of the number of stores mall customers actually enter (X), as shown below:
X 0 1 2 3 4 5 6
p(x) 0.04 0.16 0.22 0.28 2k 0.09 k
(a) Find the value of k.
(b) Find the mean of number of stores entered.
(c) Find the standard deviation of the number of stores entered.

+1 862 207 3288 