Relation between statistics and machine learning


Excerpt from 

https://www.jiqizhixin.com/articles/2019-04-24-16

May share similar methods, but main difference is their purpose. Machine learning models aim at more accurate prediction. Statistic models aim at infering relations between variables.

Machine learning sacrifices interpretability, statistics model may predict but not so satisfactory.

A example

Linear regression is a statistics method. Through the same linear regression method, we can train a linear classifier, we can also build a regression model by minimum least squares. The former needs the step of training, which means it requires a training set, then it will be tested on a test set to evaluate its performance. In machine learning, the ultimate goal is reaching a high accuracy. For the latter, we will assume it is a linear regressor with gaussian noise, and try to find a line to minimize the mean square error of all the data. The purpose of modeling is to describe the relation between variables and output, instead of making prediction. We can use this model for prediction, but model evaluation is not test set performance, but the robustness and signifance of parameters in the model. 


An interesting table

GlossaryRobert Tibshiriani

Machine   learning

Statistics

network, graphs

model

weights

parameters

learning

fitting

generalization

test set performance

supervised learning

regression/classification

unsupervised learning

density estimation, clustering

large grant = $1,000,000

large grant = $50,000

nice place to have a meeting:
    Snowbird, Utah, French Alps

nice place to have a meeting:
    Las Vegas in August


Last Article

Comment 评论



Share 分享

New Users 最新加入

  • hokurikustr

  • refrain

New comments 最新评论

test123: aasdas Details Apr 13 16:39
admin: Thanks! Details Apr 09 11:46
admin: Google map api Details Apr 09 11:46
lqj12: cooooooooool Details Apr 08 21:34
Yunhan Huang: 这个功能是如何实现的? Details Apr 08 13:23