走进数据科学英文版课件.pptx
《走进数据科学英文版课件.pptx》由会员分享,可在线阅读,更多相关《走进数据科学英文版课件.pptx(92页珍藏版)》请在三一办公上搜索。
1、Data Mining: Theory & Algorithms,Mining? Warehousing?,5,Technology Advancement,6,Technology Advancement,7,The World of Data,8,Data Rich, Information Poor,9,10,Learning Resources,11,International Conference on Data MiningInternational Conference on Data EngineeringInternational Conference on Machine
2、LearningInternational Joint Conference on Artificial IntelligencePacific-Asia Conference on Knowledge Discovery and Data MiningACM SIGKDD Conference on Knowledge Discovery and Data Mining,Learning Resources,12,Learning Resources,13,Xindong Wu,Zhihua Zhou,Jiawei Han,Jian Pei,Qiang Yang,Chih-Jen Lin,P
3、hilip S. Yu,Changshui Zhang,Learning Resources,14,Interdisciplinary,15,Ubiquitous,16,Comprehensive Learning,17,Learning Listening,18,20,Data,Definition“Data are pieces of information that represent the qualitative or quantitative attributes of a variable or set of variables. Data are often viewed as
4、 the lowest level of abstraction from which information and knowledge are derived.”Data TypesContinuous, BinaryDiscrete, StringSymbolicStoragePhysicalLogicalMajor IssuesTransformationErrors and Corruption,21,What is Big Data?,“Big data is high-volume, high-velocity and high-variety information asset
5、s that demand cost-effective, innovative forms of information processing for enhanced insight and decision making.” Gartner“Big data refers to datasets whose size is beyond the ability of typical database software tools to capture, store, manage, and analyze.” Mckinsey & Company,22,Big Data,23,Publi
6、c Security,24,Health Care Application,25,Effectiveness Research,Personalized Medicine,Location Data: Urban Planning,26,Location Data: Mobile User,27,Location Data: Shopper,28,Retail Data: Targeted Marketing,29,Retail Data: Sentiment Analysis,30,Social Networks,31,Sports,32,Attractiveness Mining,33,3
7、4,Open Data,Technically Open: available in a machine-readable standard format, which means it can be retrieved and meaningfully processed by a computer application.Legally Open: explicitly licensed in a way that permits commercial and non-commercial use without restrictions.,35,Where to find data?,3
8、6,Open Government Data,37,Data Mining,People have been analysing and investigating data for centuries.StatisticsMean, Variance, Correlation, Distribution In modern days, data are often far beyond human comprehension.Diversity, Volume, DimensionalityDefinitionData Mining is the process of automatical
9、ly extracting interesting and useful hidden patterns from usually massive, incomplete and noisy data.Not a fully automatic processHuman interventions are often inevitable.Domain KnowledgeData Collection and Pre-processingSynonym: Knowledge Discovery,38,“If you are looking for a career where your ser
10、vices will be in high demand, you should find something where you provide a scarce, complementary service to something that is getting ubiquitous and cheap. So whats getting ubiquitous and cheap? Data. And what is complementary to data? Analysis. So my recommendation is to take lots of courses about
11、 how to manipulate and analyze data: databases, machine learning, econometrics, statistics, visualization, and so on.”An interview with Google Chief Economist Hal Varian from the New York Times,Is DM really important?,39,40,Business Intelligence,41,From Data To Intelligence,42,Data Integration & Ana
- 配套讲稿:
如PPT文件的首页显示word图标,表示该PPT已包含配套word讲稿。双击word图标可打开word文档。
- 特殊限制:
部分文档作品中含有的国旗、国徽等图片,仅作为作品整体效果示例展示,禁止商用。设计者仅对作品中独创性部分享有著作权。
- 关 键 词:
- 走进 数据 科学 英文 课件

链接地址:https://www.31ppt.com/p-1922174.html