teradata_参考资料(某著名外企内部培训所用资料)
合集下载
相关主题
- 1、下载文档前请自行甄别文档内容的完整性,平台不提供额外的编辑、内容补充、找答案等附加服务。
- 2、"仅部分预览"的文档,不可在线预览部分如存在完整性等问题,可反馈申请退款(可完整预览的文档不适用该条件!)。
- 3、如文档侵犯您的权益,请联系客服反馈,我们会尽快为您处理(人工客服工作时间:9:00-18:30)。
How Large is a Trillion?
1 Kilobyte 1 Megabyte 1 Gigabyte 1 Terabyte 1 Petabyte = 103 = 106 = 109 = 1012 = 1015 = 1000 bytes = 1,000,000 bytes = 1,000,000,000 bytes = 1,000,000,000,000 bytes = 1,000,000,000,000,000 bytes = 11.57 days = 31.6 years = 31,688 years = 15.7 miles = 15,700,000 miles
STAGE 5
ACTIVE WAREHOUSING MAKING it happen!
Primarily Batch
Increase in Ad Hoc Queries
Ad Hoc
Analytical Modeling Grows
Analytics
Continuous Update & Time Sensitive Queries Become Important
• • • • •
Based on enterprise-wide model Can begin small but may grow large rapidly Populated by extraction/loading data from operational systems Responds to end-user “what if” queries Can store detailed as well as summary data Operational Data
ATM
PeopleSoft ®
Point of Service (POS)
Data Warehouse Teradata Database Teradata Warehouse Miner
Cognos ®
MicroStrategies ®
Examples of Access Tools End Users
DSS
Large
Seco源自文库ds or minutes
OLCP
T o d a y
Instant credit – How much credit can be extended to this person? Show the top ten selling items across all stores for 2003.
Type
T R A D I T I O N A L
Example
Number of Rows Accessed Small
Response Time Seconds
OLTP
Update a checking account to reflect a deposit How many child size blue jeans were sold across all of the our Eastern stores in the month of March?
Module 1: Teradata Product Overview
After completing this module, you will be able to: • Describe the purpose of the Teradata product • Give a brief history of the product • List major architectural features of the product
Query Language (SQL)
• Manageable growth via modularity • Fault tolerance at all levels of hardware and
software
• Data integrity and reliability
Evolution of Data Processing
What is a Data Warehouse?
A Data Warehouse is a central, enterprise-wide database that contains information extracted from Operational Data Stores (ODS).
Designed for Today’s Business
Teradata’s Charter meets the business needs of today and tomorrow with:
• Relational database – standard for database design • Enormous capacity – billions of rows, terabytes of
What is Teradata?
Teradata is a Relational Database Management System (RDBMS). Designed to run the world’s largest commercial databases.
• • • • • • •
Preferred solution for enterprise data warehousing Executes on UNIX MP-RAS and Windows 2000 operating systems Compliant with ANSI industry standards Runs on a single or multiple nodes Acts as a “database server” to client applications throughout the enterprise Uses parallelism to manage “terabytes” of data Capable of supporting many concurrent users from various client platforms (over a TCP/IP or IBM channel connection).
• Primarily batch feeds and updates • Ad hoc queries to support strategic decisions that return in minutes and maybe
hours
Active Data Warehousing … is the timely, integrated, logically consistent store of detailed data available for strategic, tactical driven business decisions.
Data Warehouse Usage Evolution
STAGE 1
REPORTING WHAT happened?
STAGE 2
ANALYZING WHY did it happen?
STAGE 3
PREDICTING WHY will it happen?
STAGE 4
OPERATIONALIZING WHAT IS Happening?
1 million seconds 1 billion seconds 1 trillion seconds 1 million inches 1 trillion inches
(30 roundtrips to the moon)
1 million square inches = .16 acres = .0002 square miles 1 trillion square inches = 249 square miles (larger than Singapore) $1 million $1 billion $1 trillion = < $ .01 for every person in U.S. = $ 3.64 for every person is U.S. = $ 3,636 for every person in U.S.
data
• High performance parallel processing • Single database server for multiple clients – “Single
Version of the Truth”
• Network and mainframe connectivity • Industry standard access language – Structured
Continuous Update Short Queries
Event Based Triggering Takes Hold
Event-Based Triggering
Batch
What is Active Data Warehousing?
Data Warehousing … is the timely, integrated, logically consistent store of detailed data available for analytic business decision making.
Win 2000 Win XP
Teradata DATABASE
UNIX Client
Mainframe Client
Teradata – A Brief History
1979 – Teradata Corp founded in Los Angeles, California – Development begins on a massively parallel computer 1982 – YNET technology is patented 1984 – Teradata markets the first database computer DBC/1012 – First system purchased by Wells Fargo Bank of Cal. – Total revenue for year - $3 million 1987 – First public offering of stock 1989 – Teradata and NCR partner on next generation of DBC 1991 – NCR Corporation is acquired by AT&T – Teradata revenues at $280 million 1992 – Teradata is merged into NCR 1996 – AT&T spins off NCR Corp. with Teradata product 1997 – Teradata database becomes industry leader in data warehousing 2000 – 100+ Terabyte system in production 2002 – Teradata V2R5 released 12/2002; major release including features such as PPI, roles and profiles, multi-value compression, and more. 2003 – Teradata V2R5.1 released 12/2003; includes UDFs, BLOBs, CLOBs, and more.
Small to moderate; possibly across multiple databases Large number of detail rows or moderate number of summary rows
Minutes
OLAP
Seconds or minutes
The need to process DSS, OLCP, and OLAP type requests across an enterprise and its data leads to the concept of a “Data Warehouse”.