MIM 700 - Data Analytics

Announcements

Day 1: Introduction to Data Analytics

Successful completion of the MIM degree is far from providing the full skill set for data analytics, but we will provide you with the basics. The reason is the continuing and growing need for Data Analytics in Business.

Introduction to Data Analysis

Sample Data [Excel]

Sample Data [csv]

Day 2

Content

Day 3

Mean and Standard Deviation

SD

Excel Content

Homework for Week 1

For any questions about the homework: gerbing@pdx.edu

Turn-in the D2L Dropbox named: Stats_Week_1.

Short-answer questions.

  1. What is data analytics? How does it fit into the task of management decision making?
  2. Specify and briefly describe the 4 basic steps of a complete data analysis.
  3. Describe how data are organize for a data analysis. Why is this organization so amenable to a worksheet program such as Excel?
  4. In data analysis, what is the concept of degrees of freedom mean?

The following questions apply to the data set analyzed in class: employee data.

  1. Convert the data to a formal Excel data table and sort by Gender.
  2. Create the frequency distribution and bar chart for HealthPlan. Interpret.
  3. Create the frequency distribution and histogram of Years worked at the company. Interpret.

For the following questions, construct your own two small distributions of numerical data, with n=5 (number of data values) in each distribution. Have the distributions represent measurements of an actual business situation.

  1. Define the type and characteristics of the data entered. Formalize this characterization with data validation. What happens when you enter data out of range?

  2. Calculate the mean of each distribution manually (as we did in class) and then with the specific Excel function. Interpret each and compare.
  3. Calculate the standard deviation of each distribution manually (as we did in class) and then with the specific Excel function. Interpret each and compare.

Day 4

Populations

Day 5

Normal Distribution

Multiple Samples

Excel Content

Day 6

Inference for the Mean

Confidence Interval Applied

CI: Interpretation

Hypothesis Test

Excel Template for the Inference of the Mean

Inference for the Mean Difference

Mean Difference

Excel Template for the Inference of the Mean Difference

Homework for Week 2

For any questions about the homework: gerbing@pdx.edu

Turn-in the D2L Dropbox named: Stats_Week_1.

Short-answer questions.

  1. Explain the concept of simulated data.
  2. What is the reason for computing a confidence interval?
  3. What is accomplished with a hypothesis test?

For the following questions, simulate the data from a normal distribution of IQ scores, which have a population mean of 100 and a population standard deviation of 15.

  1. Draw a simulated sample of 8 scores (data values).
  2. Calculate the sample mean and standard deviation of the 8 scores
  3. Compare the sample values to their corresponding population values. Do the sample values equal or approximate the corresponding population values? Why?
  4. Simulate 5 more distributions of 8 scores and calculate the corresponding sample means and sample standard deviations.
  5. Calculate the standard deviation the sample means (estimated standard error of the mean). How does this standard deviation compare to the standard deviation of the data? Why?

The following questions apply to the first set of 8 simulated IQ scores.

  1. Calculate the confidence interval of the mean.
  2. Interpret the confidence interval.
  3. Does the interval contain the (in this case known) population mean?
  4. Conduct the hypothesis test against the (in this case known) population mean?
  5. Interpret the hypothesis test.

Looking Ahead

For our MIM 535 course in Data Analytics this coming Spring, here is the text we will use.

Data Analysis Explained: Accessible R without Programming