---
title: 科技英文:EMR Training:Getting Started with EMR
---
#EMR Training: Getting Started with EMR (2 of 11)
12/13(五)
1
00:00:00,170 --> 00:00:02,079
so let's get started there's a lot of
因此,讓我們開始吧,有很多
2
00:00:02,279 --> 00:00:03,458
acronyms that get thrown around when
首字母縮略詞
Acronyms: 像AWS=縮寫
3
00:00:03,658 --> 00:00:04,960
you're talking about Amazon Web Services
您在談論亞馬遜網路服務
4
00:00:05,160 --> 00:00:05,939
like AWS
像AWS
5
00:00:06,139 --> 00:00:08,410
in this class we're going to be using
在本課程中,我們將使用
6
00:00:08,609 --> 00:00:10,300
them so I wanted to cover the key ones
他們,所以我想涵蓋關鍵的
7
00:00:10,500 --> 00:00:14,309
now s3 simple storage service so that is
現在s3簡單的存儲服務,所以是
s3= simple storage service
8
00:00:14,509 --> 00:00:18,640
the distributed replicated high
分散式複製高
Replicated=copy
9
00:00:18,839 --> 00:00:22,679
availability key value store that Amazon
亞馬遜的可用性金鑰值存儲
10
00:00:22,879 --> 00:00:25,568
provides and is the basis for any kind
提供並且是任何種類的基礎
11
00:00:25,768 --> 00:00:27,519
of persistent data when you're working
工作時的持久性資料
12
00:00:27,719 --> 00:00:30,249
with elastic MapReduce now elastic
具有彈性的MapReduce現在具有彈性
13
00:00:30,449 --> 00:00:32,339
MapReduce as I mentioned actually uses
我提到的MapReduce實際上使用了
Mentioned=said
14
00:00:32,539 --> 00:00:35,320
servers that are part of Amazon's
屬於亞馬遜的伺服器
15
00:00:35,520 --> 00:00:37,959
elastic compute cloud or ec2 which is
彈性計算雲或ec2
ec2= elastic compute cloud
16
00:00:38,159 --> 00:00:41,018
the fundamental foundation of being able
能力的基本基礎
Fundamental(Adj) foundation(N)
Fundamental=空氣/水
Foundation=base
17
00:00:41,219 --> 00:00:43,000
to get a server of the appropriate class
獲得適當級別的伺服器
Appropriate=適合
18
00:00:43,200 --> 00:00:44,649
provision with the software you need and
提供所需的軟體,以及
Provision=provide
19
00:00:44,850 --> 00:00:46,149
then finally on top of that you have
最後,最重要的是
20
00:00:46,350 --> 00:00:48,038
elastic MapReduce which as I mentioned
就像我提到的那樣,彈性MapReduce
21
00:00:48,238 --> 00:00:52,869
is Amazon's system for automatically
是亞馬遜的自動系統
22
00:00:53,070 --> 00:00:55,209
creating clusters of servers provisioned
創建已配置的伺服器集群
23
00:00:55,409 --> 00:00:56,739
with Hadoop that you can use to run your
與Hadoop,您可以用來運行您的
24
00:00:56,939 --> 00:01:01,989
jobs so how do you actually interact
工作,所以你如何實際互動
25
00:01:02,189 --> 00:01:03,640
with Amazon Web service well there's
借助Amazon Web服務,
26
00:01:03,840 --> 00:01:05,980
three ways there's the first way which
三種方式是第一種
27
00:01:06,180 --> 00:01:09,099
is via a web browser the AWS console and
通過Web流覽器,AWS控制台和
Via=through=by
Console=就是像Eclipse的Console
28
00:01:09,299 --> 00:01:10,599
that's we're going to use for most of
那就是我們將用於大多數
29
00:01:10,799 --> 00:01:12,700
this class because you don't have to
這節課是因為你不必
30
00:01:12,900 --> 00:01:15,250
worry about things like you know what
擔心像你知道的事情
31
00:01:15,450 --> 00:01:19,230
version of Ruby do you have installed
版本的Ruby是否已安裝
32
00:01:19,430 --> 00:01:22,808
now once you get beyond sort of learning
現在,一旦您超越了學習範圍
Beyond=past
sort of=kind of
33
00:01:23,009 --> 00:01:25,689
how to use AWS and elastic MapReduce
如何使用AWS和Elastic MapReduce
34
00:01:25,890 --> 00:01:28,450
then typically you'll wind up using
然後通常您會使用
wind up=probably use
35
00:01:28,650 --> 00:01:30,159
command-line tools and most commonly
命令列工具,最常見
36
00:01:30,359 --> 00:01:32,378
this elastic MapReduce command-line tool
這個彈性的MapReduce命令列工具
37
00:01:32,578 --> 00:01:35,140
that is actually a ruby script and then
這實際上是一個紅寶石腳本,然後
38
00:01:35,340 --> 00:01:37,359
finally depending on your use case if
最終取決於您的用例
39
00:01:37,560 --> 00:01:39,250
you need to you can directly interact
您需要直接互動
40
00:01:39,450 --> 00:01:42,189
with Amazon Web Services using the API
使用API使用Amazon Web Services
41
00:01:42,390 --> 00:01:44,500
and they've got clients in Java Python
他們有Java Python客戶
42
00:01:44,700 --> 00:01:49,000
Ruby etc now if you do want to start
Ruby等,如果您想開始
Etc(只有寫會寫)=etcetera(說的時候用這個)
43
00:01:49,200 --> 00:01:50,738
digging into the API and the command
深入研究API和命令
Digging into=try something difficult
44
00:01:50,938 --> 00:01:51,609
line tool there's documentation
線工具有文檔
45
00:01:51,810 --> 00:01:54,308
available online in where else
在其他地方線上可用
46
00:01:54,509 --> 00:01:57,099
aws.amazon.com slash documentation slash
aws.amazon.com斜線文檔斜線
aws.的.=dot
Slash=/
47
00:01:57,299 --> 00:01:58,959
elastic MapReduce that's the point where
彈性MapReduce就是關鍵
48
00:01:59,159 --> 00:01:59,980
you want to start from when you're
你想從當你開始
49
00:02:00,180 --> 00:02:06,009
finding a lot of PDFs and HTML pages so
找到很多PDF和HTML頁面,因此
PDF=一張/PDFs=很多張
50
00:02:06,209 --> 00:02:07,569
it's the first step it's actually
這是第一步,實際上
51
00:02:07,769 --> 00:02:10,330
getting an account all services require
獲得所有服務所需的帳戶
52
00:02:10,530 --> 00:02:11,890
an account
一個帳戶
53
00:02:12,090 --> 00:02:13,910
the sign-up process is pretty
註冊過程很漂亮
54
00:02:14,110 --> 00:02:14,570
straightforward
直截了當
55
00:02:14,770 --> 00:02:16,240
you go in there and you say here's my
你去那裡,你說這是我的
56
00:02:16,439 --> 00:02:18,380
email address and password I want to use
我要使用的電子郵寄地址和密碼
57
00:02:18,580 --> 00:02:20,530
it goes through this kind of interesting
它經歷了這種有趣的
58
00:02:20,729 --> 00:02:22,520
validation where you provide a phone
驗證您提供電話的位置
59
00:02:22,719 --> 00:02:24,070
number they call you on the phone number
他們用電話號碼打給你的號碼
60
00:02:24,270 --> 00:02:26,510
you tell them what number you're seeing
你告訴他們你看到什麼號碼
61
00:02:26,710 --> 00:02:28,130
on the screen and that way they know
在螢幕上,他們知道
62
00:02:28,330 --> 00:02:29,870
that it's a real person or they hope it
那是一個真實的人,或者他們希望
63
00:02:30,069 --> 00:02:32,870
is with a telephone number that they can
他們有一個電話號碼
64
00:02:33,069 --> 00:02:35,210
use and end result is you wind up with
使用和最終結果是你最終
65
00:02:35,409 --> 00:02:39,170
an account and accounts have these three
一個帳戶和三個帳戶
66
00:02:39,370 --> 00:02:41,120
four digit numbers that create an
四個數字構成一個
67
00:02:41,319 --> 00:02:45,380
account ID so let's go get an account so
帳戶ID,讓我們去獲取一個帳戶
68
00:02:45,580 --> 00:02:47,210
we're going to go to the website AVO
我們要去AVO網站
69
00:02:47,409 --> 00:02:49,250
amazon.com and we're going to click the
amazon.com,我們將點擊
click=滑鼠點左鍵2下
70
00:02:49,449 --> 00:02:53,270
sign up now button all right let's go
立即註冊按鈕,好的,走吧
71
00:02:53,469 --> 00:02:56,000
sign up so where do you sign up well you
註冊,以便您在哪裡註冊得好
72
00:02:56,199 --> 00:03:02,030
go to aws.amazon.com and they got a nice
去aws.amazon.com,他們得到了一個很好的
73
00:03:02,229 --> 00:03:04,750
sign up now button that you click on
按一下立即註冊按鈕
74
00:03:04,949 --> 00:03:06,560
you're going to enter an email address
您將輸入一個電子郵寄地址
75
00:03:06,759 --> 00:03:11,719
like AWS test at scale unlimited you say
就像您說的AWS無限規模測試
76
00:03:11,919 --> 00:03:15,410
you're a new user you sign in so that's
您是登錄的新用戶,因此
77
00:03:15,610 --> 00:03:19,930
going to ask you for your real name and
要問你的真實姓名,
78
00:03:20,129 --> 00:03:22,189
it's going to ask you for that email
會問你那封電子郵件
79
00:03:22,389 --> 00:03:27,530
address again and a password and I'm
再次輸入位址和密碼,我
80
00:03:27,729 --> 00:03:30,560
going to grab a handy-dandy password
將獲得一個方便的密碼
handy-dandy=Convenience
81
00:03:30,759 --> 00:03:31,700
that none of you're going to be able to
你們所有人都將無法
82
00:03:31,900 --> 00:03:37,670
guess and I've just created an account
猜猜,我剛剛創建了一個帳戶
83
00:03:37,870 --> 00:03:39,560
though it's still waiting for me to
雖然它還在等我
84
00:03:39,759 --> 00:03:41,689
verify that I'm a real person and I've
驗證我是一個真實的人並且我已經
85
00:03:41,889 --> 00:03:51,170
got the expected set of information so
得到了預期的資訊,因此
86
00:03:51,370 --> 00:03:53,240
I'm actually going to enter real stuff
我實際上要輸入真實的東西
87
00:03:53,439 --> 00:03:56,300
here so you can see there's nothing up
在這裡,您可以看到沒有任何反應
there's nothing up=no problem
88
00:03:56,500 --> 00:04:00,680
my sleeve hmm do I really want to do
我的袖子我真的很想做嗎
my sleeve=真實的…(我把我的真實面都說了~美國人都這樣說)
89
00:04:00,879 --> 00:04:03,850
this let's just try that number there
這個讓我們嘗試一下那個數字
90
00:04:04,050 --> 00:04:08,790
and if I can actually figure out
如果我真的能弄清楚
91
00:04:11,629 --> 00:04:14,620
and now it wants credit card information
現在它想要信用卡資訊
92
00:04:14,819 --> 00:04:18,310
and that's of course to pay for any
那當然是要付出任何
93
00:04:18,509 --> 00:04:19,509
expenses that you incur when you're
您花費的費用
Incur=pay money
94
00:04:19,709 --> 00:04:22,360
actually using AWS so in this case I
實際上使用AWS,所以在這種情況下,我
95
00:04:22,560 --> 00:04:25,150
think I might stop the recording right
想我可能會停止錄製
96
00:04:25,350 --> 00:04:27,430
here and enter that and then resume
在這裡輸入,然後繼續
Resume=播影片時按stop-start again-
97
00:04:27,629 --> 00:04:30,240
after I've completed this phase
在完成此階段之後
Phase=part
98
00:04:30,439 --> 00:04:32,530
all right I've entered my credit card
好的,我已經輸入了我的信用卡
99
00:04:32,730 --> 00:04:34,600
information and now we're in the next
資訊,現在我們在下一個
100
00:04:34,800 --> 00:04:36,970
phase of identity verification and here
身份驗證階段和此處
101
00:04:37,170 --> 00:04:38,560
it's asking me to provide a telephone
要求我提供電話
102
00:04:38,759 --> 00:04:40,030
number it's going to call me with the
號碼會打電話給我
103
00:04:40,230 --> 00:04:43,600
special pin code essentially and ask me
特殊的個人識別碼,然後問我
104
00:04:43,800 --> 00:04:46,629
to enter that so I'm going to also do
輸入那個,所以我也要做
105
00:04:46,829 --> 00:04:50,020
this offline all right I assume by now
我現在認為這個離線狀態
106
00:04:50,220 --> 00:04:52,770
you've got an account this account has a
您有一個帳戶,該帳戶有一個
107
00:04:52,970 --> 00:04:55,840
name it's got an email address and a
名稱,其中有一個電子郵寄地址和一個
108
00:04:56,040 --> 00:04:57,400
password and has a bunch of other
密碼,還有很多其他密碼
109
00:04:57,600 --> 00:05:00,100
attributes beyond the account ID that we
超出我們的帳戶ID的屬性
110
00:05:00,300 --> 00:05:01,689
talked about previously it's got an
之前談到它有一個
111
00:05:01,889 --> 00:05:04,360
access key ID and sometimes you'll hear
訪問金鑰ID,有時您會聽到
112
00:05:04,560 --> 00:05:06,370
people talk about this as your public
人們以您的公眾身份談論此事
113
00:05:06,569 --> 00:05:08,290
key but it's not a public private key
金鑰,但這不是公共私密金鑰
public key->這是一個詞
114
00:05:08,490 --> 00:05:10,840
it's just the less secret key there's
這只是那裡的秘密鑰匙
Less=opposite of more
115
00:05:11,040 --> 00:05:13,810
also a secret access key and there's
還有一個秘密訪問金鑰,
116
00:05:14,009 --> 00:05:15,939
something called a canonical user ID and
所謂的規範使用者ID和
117
00:05:16,139 --> 00:05:19,060
each of these has different uses so
這些都有不同的用途,所以
118
00:05:19,259 --> 00:05:19,930
we're going to do now is we're going to
我們現在要做的是
119
00:05:20,129 --> 00:05:22,300
switch back to the web browser so we can
切換回網路流覽器,以便我們可以
120
00:05:22,500 --> 00:05:23,800
go take a look at all these different
去看看所有這些不同
121
00:05:24,000 --> 00:05:28,180
attributes of the account let's go take
該帳戶的屬性讓我們開始
122
00:05:28,379 --> 00:05:29,410
a look at all those bits of information
看所有這些資訊
Bits=(生活中)小東西/電腦用語BIT
123
00:05:29,610 --> 00:05:31,090
we have about the account we just
我們有關於我們剛剛使用的帳戶
124
00:05:31,290 --> 00:05:33,430
created here we are at the top level of
在這裡創建,我們位於
125
00:05:33,629 --> 00:05:36,100
the IDS console and it shows all of the
IDS控制台,它顯示了所有
126
00:05:36,300 --> 00:05:37,930
different Amazon Web Services that we
我們提供的不同的Amazon Web Services
127
00:05:38,129 --> 00:05:40,028
have access to what we're interested in
可以訪問我們感興趣的內容
128
00:05:40,228 --> 00:05:41,980
is the account menu that's over here at
是這裡的帳戶菜單
129
00:05:42,180 --> 00:05:44,410
the top right if I click here I can
如果按一下此處,我可以在右上角
130
00:05:44,610 --> 00:05:47,110
select security credentials that's
選擇的安全憑證
131
00:05:47,310 --> 00:05:48,750
actually going to open a new window at
實際上將在打開一個新視窗
132
00:05:48,949 --> 00:05:53,560
AWS portal amazon.com and here we can
AWS門戶amazon.com,在這裡我們可以
Portal=http~(網址最前面)
133
00:05:53,759 --> 00:05:55,389
see information about the account so one
查看有關該帳戶的資訊,這樣一來
134
00:05:55,589 --> 00:05:58,410
of the things is the account name
事情就是帳戶名
135
00:05:58,610 --> 00:06:01,090
followed by the account number and this
然後是帳號和這個
136
00:06:01,290 --> 00:06:04,060
was that series of three series of four
是那個三個系列的四個
137
00:06:04,259 --> 00:06:06,670
digits that I mentioned scrolling down
我提到的數字向下滾動
138
00:06:06,870 --> 00:06:10,060
you can see access keys we have this
您可以看到便捷鍵,我們有這個
139
00:06:10,259 --> 00:06:13,240
access key ID right here which is public
此處的訪問金鑰ID是公開的
140
00:06:13,439 --> 00:06:15,370
if I clicked here it would actually pop
如果我點擊這裡,它實際上會彈出
141
00:06:15,569 --> 00:06:16,840
up a little window that shows me the
上一個小視窗,向我展示
pop up:這是一個詞/彈出
142
00:06:17,040 --> 00:06:18,790
secret access key which I'm not going to
我不會使用的秘密訪問金鑰
143
00:06:18,990 --> 00:06:21,278
reveal to you scrolling down
向您展示向下滾動
144
00:06:21,478 --> 00:06:23,860
see a little bit more information here
在這裡看到更多資訊
145
00:06:24,060 --> 00:06:24,550
this is
這是
146
00:06:24,750 --> 00:06:26,290
the email address that we use to sign-in
我們用於登錄的電子郵寄地址
147
00:06:26,490 --> 00:06:27,910
that the account is really keyed off of
該帳戶確實已被鎖定
148
00:06:28,110 --> 00:06:30,370
and it has a password if we go down near
如果我們靠近附近,它有一個密碼
149
00:06:30,569 --> 00:06:33,040
the bottom once again you see the AWS
在底部,您再次看到AWS
150
00:06:33,240 --> 00:06:35,020
account ID and then there's this
帳戶ID,然後有這個
151
00:06:35,220 --> 00:06:36,639
canonical user ID which is what you use
所使用的規範使用者ID
152
00:06:36,839 --> 00:06:38,470
when you're controlling access to
當您控制對
153
00:06:38,670 --> 00:06:41,230
buckets and files in s3 and if you click
s3中的存儲桶和檔,如果您按一下
buckets=老師說應該是packages
154
00:06:41,430 --> 00:06:43,060
on that it pops up a window where you
在它上面彈出一個視窗
155
00:06:43,259 --> 00:06:44,680
can get access to this very long ID
可以訪問這個非常長的ID
156
00:06:44,879 --> 00:06:47,460
string so we've got all these different
字串,所以我們有所有這些不同
157
00:06:47,660 --> 00:06:51,280
settings for AWS all these different
所有這些不同的AWS設置
158
00:06:51,480 --> 00:06:53,740
credentials but we need more the issue
憑據,但我們需要更多的問題
159
00:06:53,939 --> 00:06:55,480
is that in order to use elastic
是為了使用彈性
160
00:06:55,680 --> 00:06:59,079
MapReduce we need an ec2 key pair once
MapReduce我們一次需要一個ec2金鑰對
161
00:06:59,279 --> 00:07:01,270
we have a real account we can then ask
我們有一個真實帳戶,然後我們可以詢問
162
00:07:01,470 --> 00:07:03,670
Amazon to create a key pair for us and
亞馬遜為我們和
163
00:07:03,870 --> 00:07:06,629
these this key pair is a classic
這對鑰匙很經典
164
00:07:06,829 --> 00:07:08,800
public/private key pair when you ask
詢問時的公開金鑰/私密金鑰對
165
00:07:09,000 --> 00:07:11,770
Amazon to create a key pair for you it
亞馬遜為您創建金鑰對
166
00:07:11,970 --> 00:07:13,990
will save the public portion of the key
將保存金鑰的公開部分
167
00:07:14,189 --> 00:07:16,540
pair with your account and it will let
與您的帳戶配對,它將讓
168
00:07:16,740 --> 00:07:18,129
you download the private portion of the
您下載了
169
00:07:18,329 --> 00:07:20,980
key pair you need this private portion
金鑰對,您需要此私人部分
170
00:07:21,180 --> 00:07:23,379
of the key pair in order to log into an
金鑰對中的一個,以便登錄到
171
00:07:23,579 --> 00:07:26,500
ec2 or elastic MapReduce cluster so how
ec2或彈性MapReduce群集如何
172
00:07:26,699 --> 00:07:29,410
do we get a key pair well you go back to
我們能得到一對鑰匙對嗎?
173
00:07:29,610 --> 00:07:32,500
the Amazon Web Services console you go
您要使用的Amazon Web Services控制台
174
00:07:32,699 --> 00:07:35,110
to the ec2 section not the EMR section
到ec2部分而不是EMR部分
175
00:07:35,310 --> 00:07:37,028
on the bottom left is going to be a link
左下角將是一個連結
176
00:07:37,228 --> 00:07:39,400
to key pairs once you click on that link
按一下該連結以配對金鑰
177
00:07:39,600 --> 00:07:42,009
you're going to be in the key pair
您將成為金鑰對
178
00:07:42,209 --> 00:07:44,139
section of ec2 where you can actually
ec2的部分,您實際上可以
179
00:07:44,339 --> 00:07:46,449
click on a create key pair button and
按一下創建金鑰對按鈕,然後
180
00:07:46,649 --> 00:07:48,400
from there it's pretty straightforward
從那裡很簡單
181
00:07:48,600 --> 00:07:50,590
to give it a name and you're done so
給它起個名字就好了
182
00:07:50,790 --> 00:07:54,400
let's go ahead and do that so if I click
讓我們繼續操作,如果我按一下
183
00:07:54,600 --> 00:08:00,730
over here on ec2 and it just so happens
在這裡在ec2上就發生了
184
00:08:00,930 --> 00:08:03,218
to open up showing me my key pairs
打開給我看我的鑰匙對
185
00:08:03,418 --> 00:08:04,778
because I'd previously clicked on the
因為我以前點擊過
186
00:08:04,978 --> 00:08:06,278
key pairs link down here in the bottom
金鑰對在底部連結到這裡
187
00:08:06,478 --> 00:08:09,040
left of the navigation pane it's not
功能窗格的左側不是
navigation pane:上下左右按鍵/選單
188
00:08:09,240 --> 00:08:10,329
showing any key pairs because I haven't
顯示任何金鑰對,因為我沒有
189
00:08:10,529 --> 00:08:13,060
created any yet if I click on create key
如果我按一下創建金鑰,則創建任何
Any:幾個 yet:幾個
190
00:08:13,259 --> 00:08:16,150
pair it will let me enter a name for the
配對,我將輸入一個名稱
191
00:08:16,350 --> 00:08:17,980
key pair in this case I'm going to call
在這種情況下,我要打電話給鑰匙對
192
00:08:18,180 --> 00:08:20,949
it again AWS test and I'll click the
再次進行AWS測試,然後點擊
193
00:08:21,149 --> 00:08:23,659
Create button
創建按鈕
194
00:08:24,480 --> 00:08:25,900
this generates a new key pair which
這將生成一個新的金鑰對
Generates:to make by computer
195
00:08:26,100 --> 00:08:27,730
means I've got both a public and a
意味著我既有公眾也有
196
00:08:27,930 --> 00:08:31,449
private key the public key Amazon hangs
私密金鑰亞馬遜掛起的公開金鑰
197
00:08:31,649 --> 00:08:34,329
on to and it's downloading now the
繼續,現在正在下載
198
00:08:34,529 --> 00:08:35,169
private key
私密金鑰
199
00:08:35,370 --> 00:08:37,539
this PEM file and yes I do want to save
這個PEM檔,是的,我確實要保存
Yes:choose OK
200
00:08:37,740 --> 00:08:38,949
that because I will need it at various
因為我會在各種場合需要它
201
00:08:39,149 --> 00:08:43,089
times so now you can see that amazon
所以現在你可以看到亞馬遜
202
00:08:43,289 --> 00:08:45,609
knows about this key pair called AWS
知道這個稱為AWS的金鑰對
203
00:08:45,809 --> 00:08:50,109
test there's one final step we need to
測試我們需要完成的最後一步
204
00:08:50,309 --> 00:08:52,209
do to set up to run a job and that is we
做準備工作,那就是我們
205
00:08:52,409 --> 00:08:55,719
need to create an s3 bucket so elastic
需要創建一個具有彈性的s3桶
206
00:08:55,919 --> 00:08:59,799
MapReduce uses s3 both as the source for
MapReduce使用s3作為來源
207
00:09:00,000 --> 00:09:04,449
your job and also as the destination for
您的工作,也是工作的目的地
208
00:09:04,649 --> 00:09:08,019
where it puts log files and where it
它放置日誌檔的位置以及位置
log files:history and deteil files
209
00:09:08,220 --> 00:09:09,549
assumes you're going to put your results
假設您要發表結果
Assumes:
210
00:09:09,750 --> 00:09:13,870
so s3 organizes all of its data in
因此s3將所有資料整理到
211
00:09:14,070 --> 00:09:16,539
buckets at the very top level in s3 you
在s3的最高級別上存儲分區
212
00:09:16,740 --> 00:09:18,309
have some number of buckets you're going
你要去一些水桶
213
00:09:18,509 --> 00:09:20,229
to need to create a bucket that has
需要創建一個具有
214
00:09:20,429 --> 00:09:22,209
directories for where you're going to
您要去的目錄
215
00:09:22,409 --> 00:09:24,579
put your job jar and where you're going
把你的工作罐和你要去的地方
216
00:09:24,779 --> 00:09:28,149
to ask EMR to put the logs from the job
要求EMR將作業中的日誌放入
217
00:09:28,350 --> 00:09:30,370
and the results from the job so let's go
以及工作的結果,讓我們開始吧
218
00:09:30,570 --> 00:09:32,620
ahead and do that now once again we
向前,現在再做一次,我們
219
00:09:32,820 --> 00:09:34,569
start at the top level of the AWS
從AWS的頂層開始
220
00:09:34,769 --> 00:09:36,250
console and this time we're going to
控制台,這次我們要
221
00:09:36,450 --> 00:09:42,490
click on the link to s3 and here it
按一下連結到s3,在這裡
s3:simple/storage/service
222
00:09:42,690 --> 00:09:44,589
shows me a list of all the buckets that
顯示給我所有水桶的清單
223
00:09:44,789 --> 00:09:46,389
I've got defined in my account currently
我已經在我的帳戶中定義了
Currently=now
224
00:09:46,590 --> 00:09:48,189
I have none so I'm going to create a
我沒有,所以我要創建一個
225
00:09:48,389 --> 00:09:49,990
bucket so I'm clicking on this create
桶,所以我點擊此創建
226
00:09:50,190 --> 00:09:53,139
bucket link and ask me for a name as we
桶形連結,問我一個名字,因為我們
227
00:09:53,340 --> 00:09:54,609
mentioned the bucket name has to be
提到存儲桶名稱必須是
Mentioned:you said already
228
00:09:54,809 --> 00:09:57,219
unique across all users so if I go in
在所有用戶中都是唯一的,所以如果我進去
Unique:only one
229
00:09:57,419 --> 00:10:00,219
here and I give it some common lame name
在這裡,我給它取一個常見的la腳名稱
230
00:10:00,419 --> 00:10:03,189
like test and I try and create I'm going
喜歡測試,我嘗試創造我要去的地方
231
00:10:03,389 --> 00:10:05,199
to get an error it says this name is not
為了得到一個錯誤,它說這個名字不是
232
00:10:05,399 --> 00:10:08,049
available as a note you can see that I'm
作為便箋,您可以看到我在
233
00:10:08,250 --> 00:10:10,659
trying to create the bucket in a region
試圖在一個區域中創建存儲桶
Region:location
234
00:10:10,860 --> 00:10:13,149
and the region selected here should
並且此處選擇的區域應
235
00:10:13,350 --> 00:10:15,939
match the region that I'm using to run
匹配我正在運行的區域
236
00:10:16,139 --> 00:10:19,449
my EMR jobs I'm running in the US east
我在美國東部地區從事的EMR工作
E(Elastic)M(map)R(reduce)
Reduce:make it small
237
00:10:19,649 --> 00:10:21,429
region so I'm going to use US standard
地區,所以我要使用美國標準
238
00:10:21,629 --> 00:10:24,250
and now to create a unique name I'm
現在要創建一個唯一的名稱
239
00:10:24,450 --> 00:10:27,250
going to call this AWS test and my
會叫這個AWS測試和我的
240
00:10:27,450 --> 00:10:34,059
initials so now I've got an AWS - test -
縮寫,所以現在我有一個AWS-測試-
Initials:第1個字縮寫
241
00:10:34,259 --> 00:10:36,579
KK bucket over here on the left and it
KK桶在這裡在左邊,它
242
00:10:36,779 --> 00:10:37,659
has nothing in it
裡面什麼都沒有
243
00:10:37,860 --> 00:10:39,878
so it's ready now to be used as a
所以現在就可以用作
244
00:10:40,078 --> 00:10:43,059
destination for elastic MapReduce job
彈性MapReduce作業的目的地
245
00:10:43,259 --> 00:10:46,979
output and also John job logging
輸出以及約翰作業記錄
246
00:10:47,179 --> 00:10:49,328
excellent we are now ready to run a
太好了,我們現在可以運行一個
Excellent=awesome
247
00:10:49,528 --> 00:10:51,250
Hadoop job we've got everything we need
Hadoop工作,我們擁有所需的一切
248
00:10:51,450 --> 00:10:53,589
we have an account that account has
我們有一個該帳戶擁有的帳戶
249
00:10:53,789 --> 00:10:55,929
credentials we've created a key pair and
憑證,我們創建了一個金鑰對,並且
250
00:10:56,129 --> 00:10:58,809
we've created the bucket we need now the
我們已經創建了我們現在需要的存儲桶
251
00:10:59,009 --> 00:10:59,859
next module we're actually going to use
我們將實際使用的下一個模組
Module:part
252
00:11:00,059 --> 00:11:02,979
all of that setup in order to run a
所有這些設置,以便運行
253
00:11:03,179 --> 00:11:08,179
custom Hadoop job
定制Hadoop作業