R Programming Help and Tutoring: IS4240 Business Intelligence Systems Assignment 1 (with Answers)

2022-11-30 12:18 Author: 拓端tecdat

Full text link: http://tecdat.cn/?p=30629

Learning Objectives

· Use the R environment to do data exploration and data preparation.

Submission Information

· This assignment contributes 5% to the final course grade. It is marked out of 20.

· Please ensure that you have written your name and matric number in the document.

1. This question is based on the Heart Disease dataset (processed.va.data). The dataset consists of 200 instances, each with 14 numeric attributes. The description of the dataset can be found at http://archive.ics.uci.edu/ml/machine-learning-databases/heart-disease/heart-disease.names (Long Beach VA).

a) Provide the R code for loading the dataset into a variable heart. The attributes should be given reasonable names based on the description above. Ensure that all the attributes are of numeric (or integer) type.
(Hint: you should be able to convert missing values to NA easily by using an appropriate function argument.) (3 marks)

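# Assumed loading step (not shown in the source); the file is taken to be in the working directory.
heart = read.csv("processed.va.data", header = FALSE)
colnames(heart) = c("age","sex","cp","trestbps","chol","fbs","restecg","thalach","exang","oldpeak","slope","ca","thal","num")
# Replace every "?" placeholder with NA, one row at a time.
for (i in 1:nrow(heart)) {
  heart[i, which(heart[i, ] == "?")] = NA
}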
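The hint in the question points to a function argument that converts missing values on load. A minimal sketch of that route uses the `na.strings` and `colClasses` arguments of `read.csv`. The two data rows below are made up for illustration and are read through `text =` so the snippet runs without the file; `heart_demo`, `attr_names` and `sample_text` are hypothetical names, not from the source.

```r
# Attribute names taken from the answer above.
attr_names <- c("age", "sex", "cp", "trestbps", "chol", "fbs", "restecg",
                "thalach", "exang", "oldpeak", "slope", "ca", "thal", "num")

# Two illustrative rows in the comma-separated layout of processed.va.data
# (made-up values, with "?" marking a missing entry).
sample_text <- "63,1,4,140,260,0,1,112,1,3,2,?,?,2
44,1,4,130,209,0,1,127,0,0,?,?,?,0"

# na.strings turns "?" into NA while reading; colClasses forces every column
# to numeric, even a column that is entirely missing in this tiny sample.
heart_demo <- read.csv(text = sample_text, header = FALSE,
                       col.names = attr_names,
                       na.strings = "?", colClasses = "numeric")

str(heart_demo)
```

With the real file, `text = sample_text` would be replaced by the file path, assuming the file is comma-separated with no header row.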

b) Provide the R code for getting the number of missing values for each attribute. Fill in the table below. (5 marks)

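# Count the missing values in each attribute (column).
sumna = numeric(ncol(heart))
for (i in 1:ncol(heart)) {
  for (j in 1:nrow(heart)) {
    if (is.na(heart[j, i])) sumna[i] = sumna[i] + 1
  }
}

Attribute    Number of missing values
age          0
sex          0
cp           0
trestbps     56
chol         7
fbs          7
restecg      0
thalach      53
exang        53
oldpeak      56
slope        102
ca           198
thal         166
num          0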
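The same per-attribute count can be written without explicit loops. The sketch below combines `is.na` with `colSums` on a small made-up data frame; `demo` and `missing_per_attribute` are hypothetical names used only for illustration.

```r
# Small made-up data frame with a few missing entries.
demo <- data.frame(age  = c(63, 44, 60),
                   chol = c(260, NA, 218),
                   ca   = c(NA, NA, 1))

# is.na() yields a logical matrix; colSums() counts the TRUE values per column.
missing_per_attribute <- colSums(is.na(demo))
print(missing_per_attribute)
```

Applied to the full dataset, `colSums(is.na(heart))` should give one count per attribute in the column order of `heart`.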

c) Based on the number of missing values for each attribute, discuss one potential issue if we were to remove instances with one or more missing attributes.
(4 marks)

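# Count the missing values in each instance (row).
# The counter is named row_na because the original name, sum, shadows base R's sum().
row_na = numeric(nrow(heart))
for (i in 1:nrow(heart)) {
  for (j in 1:ncol(heart)) {
    if (is.na(heart[i, j])) row_na[i] = row_na[i] + 1
  }
}
# ca alone is missing in 198 of the 200 instances, so dropping every
# instance with a missing value would leave at most 2 instances.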
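To see how many instances would survive row-wise deletion, one option is base R's `complete.cases`, sketched below on a small made-up data frame. The names `demo`, `n_complete` and `kept` are hypothetical.

```r
# Small made-up data frame; only the third row has no missing value.
demo <- data.frame(age  = c(63, 44, 60),
                   chol = c(260, NA, 218),
                   ca   = c(NA, NA, 1))

# complete.cases() flags the rows without any NA; summing counts them.
n_complete <- sum(complete.cases(demo))

# na.omit() drops every row that has at least one NA.
kept <- na.omit(demo)

cat("Complete rows:", n_complete, "of", nrow(demo), "\n")
```

The same two calls on `heart` would show how much of the data is lost when any missing attribute disqualifies an instance.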

d) Instead of removing instances with one or more missing attributes, propose an alternative approach for handling this problem. (4 marks)

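# Sum the non-missing values of each attribute (column); assumes the columns are numeric.
col_sum = numeric(ncol(heart))
for (i in 1:ncol(heart)) {
  for (j in 1:nrow(heart)) {
    if (!is.na(heart[j, i])) col_sum[i] = col_sum[i] + heart[j, i]
  }
}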
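Column sums over the non-missing values are consistent with mean imputation, although the source does not confirm that this is the intended answer. A minimal sketch of mean imputation, on a made-up data frame with the hypothetical names `demo` and `imputed`, might look like this:

```r
# Small made-up data frame with one missing cholesterol value.
demo <- data.frame(age  = c(63, 44, 60),
                   chol = c(260, NA, 218))

# Work on a copy so the original values stay available for comparison.
imputed <- demo
for (name in names(imputed)) {
  # Mean of the observed (non-missing) values in this column.
  col_mean <- mean(imputed[[name]], na.rm = TRUE)
  # Fill only the missing positions with that mean.
  imputed[[name]][is.na(imputed[[name]])] <- col_mean
}

print(imputed)
```

For attributes that encode categories as numbers, such as ca or thal, the mean may not be a meaningful value; the median or the most frequent value could be substituted in the same loop.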

e) Provide the R code for generating the correlation matrix for the attributes age, sex, cp, restecg and num. Show the correlation matrix. (4 marks)

cor(heart[c("age", "sex", "cp", "restecg", "num")])

