大數(shù)據(jù)時(shí)代,似乎數(shù)據(jù)科學(xué)也越來(lái)越引起人們的關(guān)注。那么到底什么是數(shù)據(jù)科學(xué)呢?

維基百科說(shuō):
In general terms,Data Science is the extraction of knowlege from ata,which is a continuation of the fiel ata mining an preictive analytics,also known as knowlege iscovery an ata mining.(一般來(lái)講,數(shù)據(jù)科學(xué)就是從數(shù)據(jù)中提取信息知識(shí),即是數(shù)據(jù)挖掘與預(yù)測(cè)分析的延伸,亦是發(fā)掘知識(shí)與數(shù)據(jù)的過(guò)程。)所以,通俗來(lái)講,數(shù)據(jù)科學(xué),就是通過(guò)分析數(shù)據(jù),來(lái)挖掘獲得這些數(shù)據(jù)中的潛在信息。
Data science還有其他幾個(gè)類似的別稱,比如Data Mining(NJIT)、Data Analytics(Cornell University)、Data Stuies、Data Science an Management(Imperial College (Lonon,UK))、Preictive Analytics(DePaul University)、Business Analysis(NYU/Forham University)、Business Intelligence an Data Analytics (Carnegie Mellon University)等等。
所以,在針對(duì)這個(gè)專業(yè)選校時(shí),一定要注意,不要只看名稱,而是要重點(diǎn)看課程設(shè)置。
那么,Data Science的主要學(xué)什么呢?
根據(jù)對(duì)于一些課程的整合,大概是以下三個(gè)方面:
1.模型,算法;
2.數(shù)據(jù)結(jié)構(gòu)
3.visualization(可視化)。
由此課程設(shè)置,可以預(yù)測(cè)該專業(yè)的背景要求。仔細(xì)觀察,可以看出這些課程,都是與計(jì)算機(jī)密切相關(guān)的。并且,比如可視化,目前應(yīng)用比較多的,當(dāng)屬machine learning,也就是通過(guò)計(jì)算機(jī)圖形與圖像處理,從而將我們所需要的數(shù)據(jù)在電腦屏幕上顯示出來(lái)。所以,整個(gè)過(guò)程,需要一定的計(jì)算機(jī)技能,如編程、算法。另一方面,通過(guò)數(shù)據(jù),分析挖掘出有用信息,因此,如果申請(qǐng)者具有一定的數(shù)學(xué)、統(tǒng)計(jì)分析基礎(chǔ)的話,更有利于獲得申請(qǐng)成功。
根據(jù)這個(gè)專業(yè)的開(kāi)設(shè)情況,一般是開(kāi)設(shè)在計(jì)算機(jī)相關(guān)院系下,或者商學(xué)院下,所以其就業(yè)方向,大多是計(jì)算機(jī)領(lǐng)域,或者商業(yè)領(lǐng)域。
下面為大家介紹一下哥倫比亞大學(xué)和康奈爾大學(xué)的數(shù)據(jù)科學(xué)專業(yè)
Columbia University
項(xiàng)目名稱: Master of Science in Data Science
項(xiàng)目鏈接:
該學(xué)校專門(mén)開(kāi)設(shè)了Data Science Institute,由此可見(jiàn)其對(duì)于這一專業(yè)的重視,而該項(xiàng)目是2014年秋季新開(kāi)的。
課程設(shè)置:
一共需要修完30學(xué)分,其中21學(xué)分為必修課:
STAT W4105 PROBABILITY
CSOR W4246 ALGORITHMS FOR DATA SCIENCE
STAT W4702 STATISTICAL INFERENCE AND MODELING
COMS W4121 COMPUTER SYSTEMS FOR DATA SCIENCE
COMS W4721 MACHINE LEARNING FOR DATA SCIENCE
STAT W4701 EXPLORATORY DATA ANALYSIS AND VISUALIZATION
ENGI E4800 DATA SCIENCE CAPSTONE AND ETHICS
申請(qǐng)要求:
*ELIGIBILITY REQUIREMENTS
Unergrauate egree
Prior quantitative coursework (calculus,linear algebra,etc…)
Prior introuctory to computer programming coursework
- online application
- Uploae transcripts from every post-seconary institution attene
*Three recommenation letters
*Personal statement
*Curriculum vitae / resumé
- Official Grauate Recor Examination (GRE) general test scores
*$85 non-refunable application fee
- TOEFL,IELTS or PTE Acaemic test scores
- 截止日期:2月15日
Cornell University
項(xiàng)目名稱:
Masters of Engineering in Operations Research an Information Engineering – Data Analytics Concentration
該項(xiàng)目開(kāi)設(shè)在School of Operations Research an Information Engineering下面。The Data Analytics Concentration focuses on the theory an tools neee to make fact-base,>Tuition for 2014-15 is currently $23,525 per semester,or $47,050 for the acaemic year.
該項(xiàng)目分為如下三個(gè)方向:
申請(qǐng)要求:
1.M.Eng. Program Prerequisites:
A stanar engineering calculus sequence,incluing linear algebra an vector calculus
A calculus-base probability an statistics course equivalent to ENGRD 2700**
An intermeiate-level computer programming course equivalent to ENGRD 2110**,conucte in a wiely use language such as C,C++,Java,or MATLAB
**ENGRD 2700 an ENGRD 2110 are offere Fall,Spring,an Summer at Cornell
2.截止日期:秋季-12月1日;春季-9月1日
3.托福要求:總分不低于100分,其中Writing 20,Listening 15,Reaing 20,Speaking 22。特別注意,這是硬性要求,達(dá)不到這個(gè)要求,是不會(huì)被錄取的。準(zhǔn)確的說(shuō),是沒(méi)有辦法提交網(wǎng)申的。
免托的條件:
- 英語(yǔ)國(guó)家的公民;
- 在說(shuō)英語(yǔ)的國(guó)家在申請(qǐng)期間內(nèi)的近5年內(nèi)上了至少兩年英語(yǔ)授課的課程。
4.GRE
5.常規(guī)材料:CV、2 RL、PS、成績(jī)單
Fall 2014 MEng Class
?71 new MEng stuents (其中,ata analytics、Applie Operations Research、Information Technology這三個(gè)分支的學(xué)生總數(shù)占比35%-50%)
?37 universities
?23 unergrauate majors
?9 countries
?Class Statistics
Meian Unergrauate GPA: 3.68 / 4.00
Meian GRE - Q: 169 GRE - V: 156 GRE - A: 3.5
55% Male,45% Female
75% International,25% US Citizens
55% have UG egrees from US institutions
這里大概算一下,假如ata analytics、Applie Operations Research、Information Technology這三個(gè)分支的學(xué)生總數(shù)占比50%,學(xué)生數(shù)平均分布的,加之國(guó)際生占比75%,那么Data Analytics的錄取學(xué)生數(shù)大概是7175%50%/3=8.875,也就是大概是9個(gè)人,那么中國(guó)學(xué)生會(huì)錄取幾個(gè)呢?由此可想而知,這個(gè)學(xué)校該專業(yè)的申請(qǐng)難度。
【微語(yǔ)】他在知識(shí)的海洋里遨游,每一滴汗水都澆灌著未來(lái)的花朵,綻放著希望的芬芳。