联系方式

  • QQ:99515681
  • 邮箱:99515681@qq.com
  • 工作时间:8:00-21:00
  • 微信:codinghelp

您当前位置:首页 >> Database作业Database作业

日期:2025-07-02 08:42

•  Complete all tasks in this assignment.

• Some of the tasks will begin with the word Excel, which means you need to use Excel to complete the task and copy the necessary output to the word document. If the task begins with Written, that means you need to work out the calculation and write/type your steps in the document. If the task begins with SQL, it means you need to write SQL queries to complete the task, and the screenshot of the SQL queries should be included in the document. If the task begins with Weka, it means you need to use Weka to complete the task, and the screenshot of the Weka should be included in the document.


Question 1.

You are given the following data about the water consumption of a household.


1. (Excel) Visualize the data using an appropriate chart. Justify your choice

of chart so that it can illustrate enough information in a correct way.

2. (Written) Can you suggest a relationship between the average monthly

temperature and water consumption? Please quantify such a relationship.

(Precision of your numbers should be in 2 decimal places).

3. (Excel) Normalize the column Average Monthly Temperature using Zscore

normalization.

4. (Excel) Normalize the column Water Consumption using min-max normalization.


Question 2.

You are given the following data about the performance review scores of employees.


1. (Written) To predict the performance result for Henry, is it better to

apply K-NN algorithm or K-means algorithm? Explain your reason.

2. (Weka) Conduct a prediction for Henry using an appropriate algorithm.

3. (Written) Demonstrate the prediction result for Henry using the perceptron

algorithm.


Question 3.

You are given the course feedback data of Kevin in COMP7990 in 2023-24

semester 1. They are real data.

1. (Excel) Compute the average score for each question in the feedback form.

The score should be in the scale of 0 to 5.

2. (Excel) Contrast the result from DAAI students and non-DAAI students.

Note: ITM students enrolled in section 1, DAAI students enrolled in section

2, and MPA students enrolled in section 7.

3. (Written) By looking at question 11 only, can you argue that DAAI

students are more satisffed than non-DAAI students? Justify your answer

using an appropriate statistical test.

4. (SQL) Convert the two sheets into two tables in a database. Also, insert

the data into the tables.

5. (SQL) Write a SQL query to compute the average score for each question

in the feedback form.

6. (SQL) Write a SQL query to retrieve the average score for Q11 for each

section.

7. (SQL) Write a SQL query to retrieve the average score for Q11 for each

section but exclude the students who attend less than 70% of the class.

8. (SQL) Write a SQL query to display the section (section 1, 2, or 7) that

has the highest average score for Q11 and also the average score.


Question 4.

Given a set of sample data, calculate the following statistics.

12, 18, 23, 29, 35, 42, 48, 50, 54, 60, 67, 74, 82, 90

1. (Excel) Calculate the median.

2. (Written) Calculate the range.

3. (Written) Calculate the IQR.

4. (Written) Identify any outliers using the IQR.

5. (Excel) Calculate the mean of the data set.

6. (Excel) Calculate the variance of the data set.


Question 5.

The table below shows the scores of three students on 8 assessments. We would like to know if these three students have signiffcantly different average scores. Assume the scores of Student A, Student B, and Student C follow normal distributions.


Questions

1. (Written) How should we handle the missing data for Student C in Assessment

7 and Assessment 8? What impact might this have on our analysis?

2. (Written) What is the null hypothesis for comparing the average scores

of the three students?

3. (Written) Select an appropriate statistical test to determine if there are

signiffcant differences in the average scores of the students. Explain your

choice.

4. (Written) If the test indicates a signiffcant difference, what follow-up

analysis could be performed to identify which students’ scores differ from

each other?


相关文章

【上一篇】:到头了
【下一篇】:没有了

版权所有:编程辅导网 2021 All Rights Reserved 联系方式:QQ:99515681 微信:codinghelp 电子信箱:99515681@qq.com
免责声明:本站部分内容从网络整理而来,只供参考!如有版权问题可联系本站删除。 站长地图

python代写
微信客服:codinghelp