大型网站所使用的工具.ppt
《大型网站所使用的工具.ppt》由会员分享,可在线阅读,更多相关《大型网站所使用的工具.ppt(33页珍藏版)》请在三一办公上搜索。
1、大型網站所使用的工具,Perlbal-http:/-http:/分散式檔案系統有公司認為 MogileFS 比起 Hadoop 適合拿來處理小檔案 memcached-http:/memcached.org/共享記憶體?把資料庫或其他需要經常讀取的部分,用記憶體快取(Cache)方式存放Moxi-http:/的 PROXYMore Resource:http:/,How to scale up web service in the past?,Source:http:/,Source:http:/,Source:http:/,Source:http:/,6,HBase Intro,王耀聰 陳威
2、宇jazznchc.org.twwauenchc.org.tw,教育訓練課程,HBase is a distributed column-oriented database built on top of HDFS.,HBase is.,A distributed data store that can scale horizontally to 1,000s of commodity servers and petabytes of indexed storage.Designed to operate on top of the Hadoop distributed file system
3、(HDFS)or Kosmos File System(KFS,aka Cloudstore)for scalability,fault tolerance,and high availability.Integrated into the Hadoop map-reduce platform and paradigm.,Benefits,Distributed storageTable-like in data structure multi-dimensional mapHigh scalabilityHigh availabilityHigh performance,Who use HB
4、ase,Adobe 內部使用(Structure data)Kalooga 圖片搜尋引擎 http:/Meetup 社群聚會網站 http:/Streamy 成功從 MySQL 移轉到 Hbase http:/Trend Micro 雲端掃毒架構 http:/Yahoo!儲存文件 fingerprint 避免重複 http:/-http:/wiki.apache.org/hadoop/Hbase/PoweredBy,Backdrop,Started toward by Chad Walters and Jim2006.11Google releases paper on BigTable200
5、7.2Initial HBase prototype created as Hadoop contrib.2007.10First useable HBase2008.1Hadoop become Apache top-level project and HBase becomes subproject2008.10HBase 0.18,0.19 released,HBase Is Not,Tables have one primary index,the row key.No join operators.Scans and queries can select a subset of av
6、ailable columns,perhaps by using a wildcard.There are three types of lookups:Fast lookup using row key and optional timestamp.Full table scanRange scan from region start to end.,HBase Is Not(2),Limited atomicity and transaction support.HBase supports multiple batched mutations of single rows only.Da
7、ta is unstructured and untyped.No accessed or manipulated via SQL.Programmatic access via Java,REST,or Thrift APIs.Scripting via JRuby.,Why Bigtable?,Performance of RDBMS system is good for transaction processing but for very large scale analytic processing,the solutions are commercial,expensive,and
8、 specialized.Very large scale analytic processingBig queries typically range or table scans.Big databases(100s of TB),Why Bigtable?(2),Map reduce on Bigtable with optionally Cascading on top to support some relational algebras may be a cost effective solution.Sharding is not a solution to scale open
9、 source RDBMS platformsApplication specificLabor intensive(re)partitionaing,Why HBase?,HBase is a Bigtable clone.It is open sourceIt has a good community and promise for the futureIt is developed on top of and has good integration for the Hadoop platform,if you are using Hadoop already.It has a Casc
10、ading connector.,HBase benefits than RDBMS,No real indexesAutomatic partitioningScale linearly and automatically with new nodesCommodity hardwareFault toleranceBatch processing,Data Model,Tables are sorted by RowTable schema only define its column families.Each family consists of any number of colum
11、nsEach column consists of any number of versionsColumns only exist when inserted,NULLs are free.Columns within a family are sorted and stored togetherEverything except table names are byte(Row,Family:Column,Timestamp)Value,Row key,Column Family,value,TimeStamp,Members,MasterResponsible for monitorin
- 配套讲稿:
如PPT文件的首页显示word图标,表示该PPT已包含配套word讲稿。双击word图标可打开word文档。
- 特殊限制:
部分文档作品中含有的国旗、国徽等图片,仅作为作品整体效果示例展示,禁止商用。设计者仅对作品中独创性部分享有著作权。
- 关 键 词:
- 大型 网站 使用 工具
链接地址:https://www.31ppt.com/p-5814390.html