SDSC5001 - Question of Assignment 1
#assignment #sdsc5001
Author: Eric_Chen
Copyright notice: Unless otherwise stated, all articles on this blog are licensed under CC BY-NC-SA 4.0. Please credit the source, 迷麟の小站, when reposting.
