新西兰机械学习与统计的相对客观分析


在新西兰



我当然写不出这么叼B的东西,但以下是相关的一些读后感和节选。。。
原文是以下地址:
http://brenocon.com/blog/2008/12/statistics-vs-machine-learning-fight/



## 一个图表比较



## 前文提及:machine learning 大部分建基于统计的probability theory...
##.以下是我认为比较贴切的一点,特别是最后几句。。。
## 在实际中,双方的目标是不同的。。
I’ll also note that there are definitely a number of topics in ML that aren’t very related to statistics or probability. Max-margin methods: if all we care about is prediction, why bother using a probability model at all? Why not just optimize the spatial geometry instead? SVM’s don’t require a lick of probability theory to understand. (Of course probability-based approaches are huge in ML, but it’s important to remember they’re not the only game in town, and there is no necessary reason they must be.) And then there are non-traditional settings such as online learning, reinforcement learning, and active learning, where the structure of access to information is in play. There are certainly plenty of things in statistics that aren’t considered part of ML — say, regression diagnostics and significance testing. Finally, many ML problems involve large, high dimensional data and models, where computational issues are very important. For example, in statistical machine translation, alignment models are described with probability theory and fit to data, but their structure is complex enough that optimal inference is intractable, and how you do approximate inference (EM, Viterbi, beam search, etc.) is a very major issue.

这一点也相当有趣:
think this is reflective of the differences in institutional culture between CS and Stats. There’s an interesting John Langford post on part of the issue, which he calls “The Stats Handicap”. He points out that stats Ph.D.’s have a big disadvantage in the job market because statistics has an old-school journal-oriented publishing culture, so students publish much less and have less experience engaging with a research community. CS is conference-oriented — certain conferences have a higher prestige than many journals (e.g. NIPS in ML, CHI in HCI) — and this results in faster turnaround, dissemination, and collaboration. (I’ve heard others make similar comparisons between CS and psychology.) I’d expect any discipline with a larger conference emphasis to have better courses since they should reward presentation/teaching skills — or at least encourage practice — more than in journal world.

## 用machine learning的算法(当然这些很多的算法是基于统计理论的完善的)做data mining
## 以下是一些统计与data mining的看法
Another issue is the definition of statistics itself. In 1997, Jerome Friedman wrote an extremely interesting analysis of the situation: “Data Mining and Statistics: What’s the Connection?”. He points out, quite correctly, the statistical impoverishment of some common approaches to data mining. You can certainly blame statistics for not marketing its ideas well enough, or blame CS for ignoring statistics.

## 以下是一些看法:统计人都被打成这样了,怎么可以阿Q精神一下。
That is not to say statistics is not important — it’s incredibly important. He quotes Efro(boostraping(统计) 的主要贡献人)n as saying “Statistics has been the most successful information science.” However, information science is becoming bigger and broader and more exciting, thanks to computation and ever-increasing amounts of data. What should statisticians do? Friedman continues (light editing and emphasis is mine):


One view says that our field should concentrate on that small part of information science that we do best, namely probabilistic inference based on mathematics. If this view is adopted, we should become resigned to the fact that the role of Statistics as a player in the “information revolution” will steadily diminish over time.

Another point of view holds that statistics ought to be concerned with data analysis. The field should be defined in terms of a set of problems — rather than a set of tools — that pertain to data. Should this point of view ever become the dominant one, a big change would be required in our practice and academic programs.
First and foremost, we would have to make peace with computing. It’s here to stay; that’s where the data is. This has been one of the most glaring omissions in the set of tools that have so far defined Statistics. Had we incorporated computing methodology from its inception as a fundamental statistical tool (as opposed to simply a convenient way to apply our existing tools) many of the other data related fields would not have needed to exist. They would have been part of our field.

Friedman wrote this article more than 10 years ago. All his observations about the importance and increasing prevalence of data and computing power are even more true today than back then. Has the field of statistics changed? Not clear. (I’d appreciate seeing evidence to the contrary.)


## 总结,真心话,其实奥大经济系的计量经济亦有“类统计分析”的效果。。
## 类统计分析指,你会学到为什么会这样在统计系了,但其它系都在用,而且给你相关数据告诉你怎么用。。。
## 奥大的统计往往会令不少人失望,他们会期望教得像澳洲精算那样都是概率模型,或者,教得像中国那样大部分都是数学。
## 没有!奥大的统计现在主要贡献生物,医疗等自然科学。想学偏社会科学的统计,还是早登极乐,脱离苦海,选择经济,社会,心理学(奥大心理学其实更偏向于脑/认知科学。) 吧
I know that I’m interested in quantitative information science, including statistics and data analysis. Machine learning has many strengths, but it is definitely an odd way to go about analysis. But there’s a good case that statistics, as traditionally defined, is only going to have a smaller role in the future. “Data mining” sounds more relevant, but does it even exist as a coherent subject? Maybe it’s time to study a more applied statistical field like econometrics.


评论
以下是一些非电脑,非统计的学生的讨论,他们会应用到统计以及电脑,这比单方面一个统计系学生说统计好,CS学生说CS好,黄婆卖瓜的逻辑来得好.

chemometric : 化学计量学
I come from yet another closely related field: chemometrics which is usually defined as applying statistics to chemical problems/data. Never heard machine learning in the place of statistics here. But chemometrics is heavily focused on prediction (also DoE, but far less about hypothesis testing)

I don't think it is fair to exclude prediction from statistics.  

I rather see a difference in the approach (Ahmed's culture): My guess would be that machine learning is maybe more pragmatic than "pure statistics": if machine learning has an algorithm that solves a problem that's good. Statisticians tend to want thorough theoretical foundations as well. Chemometrics would also be more on the pragmatic side.
(Source: personal experience with chemometrics, where e.g. partial least squares regression has an extremely successful track of records for some 30 years now, including industrial application. Statistics now start to take the approach seriously because finally some statisticians bothered to have a look at the mathematical properties - before it was just an algorithm that happened to work very well with the chemometric data sets).

评论

.......................................

新西兰移民留学

在中国换护照,PR签证怎么办

新西兰人在国内,想在中国换护照,但PR签证怎么办好,求解 评论 非常简单,把pr转到新护照上去 评论 https://www.immigration.govt.nz/ ... a-to-a-new-passport 评论 谢谢 评论 谢谢 评论 先按正常办理换新 ...

新西兰移民留学

什么时候还能大射

新西兰come in soon 评论 多死些人就可以大赦了, 比如64, 比如疫情, 总之死的人越多, 大赦得概率越高。。。。。。。。 评论 别着急,世界越乱越可能大赦,当然也可能都进集中营 评论 不 ...

新西兰移民留学

现在的工签可以办pr吗

新西兰就是那种5年的工签,可以办pr吗? 据说前段时间发了很多工签,他们那些人在国内花了大钱办了工钱,接着带着孩子来读免费书的,他们这种工签可以办到pr吗? 评论 现在都办肉卡 评论 ...

新西兰移民留学

请问一下绿名单上的职业

新西兰想帮国内朋友问一下,如果职业是属于移民局一类绿名单的职业,想技术移民时时需要有这边雇主的job offer 吗?如果相关专业工作经验丰富,雇主会要求有本地学历吗?二类名单上的是 ...

新西兰移民留学

绿卡和PR有啥差别?

新西兰为什么跟新西兰的一些老华侨说拿新西兰绿卡会被他们鄙视呢? 拿新西兰绿卡和拿新西兰PR有差别吗? 评论 你有吗?没有别打听。拿绿卡pr不换护照的都是纽奸。对新西兰不忠心。随时 ...

新西兰移民留学

我是PR,给两个孩子申请RV

新西兰请问一下,我拿到PR后就回国了,后来在国内生了孩子,现在准备回新西兰,如果要给自己的两个孩子申请新西兰RV的话,移民局对我的收入要求是不是比只申请一个孩子RV的要求要更高 ...

新西兰移民留学

出生证明困局

新西兰大家好,最近要着手申请签证。需要出生证明。但是父母的那个年代没有出生证。该怎么办啊 评论 那也没法 证明 给的不是啊。 评论 不是 我都给不出来 评论 去派出所开一个证明,都 ...

新西兰移民留学

父母移民团聚群

新西兰有没有父母移民团聚群可以拉一下 谢谢!!!顺便问一下父母出生证明可以用户口本公证代替吗? 评论 - 户口本上的信息并不一定完全提供了出生公证上的信息,严格来说是没法替代的 ...

新西兰移民留学

PR等待期可以出国吗?

新西兰RV登陆后回到国内,如果卡在签证2年有效期前回纽西兰常住,在等待PR的时间内还可以回国或出国旅游吗? 评论 答案是可以出国,但是如果想换领PR,你必须要在2年内每年留在新西兰超 ...

新西兰移民留学

父母访客签证体检问题

新西兰如果申请人有肺结节,而申请这个父母访客签证要求提供X光片,所以签证官是可以看到申请人有肺结节这个情况的,会影响签证申请通过吗? 谢谢 评论 请问您说的父母访客签证是普通 ...

新西兰移民留学

孩子是否有必要入籍?

新西兰孩子1岁来的新西兰,现在快成年了,一直是中国护照,十多年来除了跟着父母回国探亲了2次,就一直在新西兰。 想了解一下,是否有必要给孩子入籍新西兰? 对孩子以后的发展有帮助 ...

新西兰移民留学

雇主移民项目靠谱吗

新西兰现在国内移民中介宣传的雇主担保项目像六分制,绿名单这些靠谱吗?中介说成功率很高,有知道的吗 评论 有办过的朋友吗 评论 那要看自己的资历啊,如果是优秀的应该靠谱不 评论 ...

新西兰移民留学

工签的有什么办法能拿到pr啊

新西兰工签有什么办法能拿pr 啊 评论 高工资就可以了吧 评论 满足普通6分制要求或者绿名单要求 评论 肉卡最快现在 评论 找个大射的谈谈价就能拿到pr。。 评论 肉卡现在啥行情,多钱 评论 ...

新西兰移民留学

入籍移民监

新西兰入籍后要做移民监吗,还是入籍后直接可以走人了? 评论 直接走人就可以 评论 拿到护照就完全不管你了,绝对没有入籍监了。 评论 这位哥哥 那我听别人说入籍后 12个月内不能离境, ...