--- title: 科技英文:EMR Training:Getting Started with EMR --- #EMR Training: Getting Started with EMR (2 of 11) 12/13(五) 1 00:00:00,170 --> 00:00:02,079 so let's get started there's a lot of 因此,讓我們開始吧,有很多 2 00:00:02,279 --> 00:00:03,458 acronyms that get thrown around when 首字母縮略詞 Acronyms: 像AWS=縮寫 3 00:00:03,658 --> 00:00:04,960 you're talking about Amazon Web Services 您在談論亞馬遜網路服務 4 00:00:05,160 --> 00:00:05,939 like AWS 像AWS 5 00:00:06,139 --> 00:00:08,410 in this class we're going to be using 在本課程中,我們將使用 6 00:00:08,609 --> 00:00:10,300 them so I wanted to cover the key ones 他們,所以我想涵蓋關鍵的 7 00:00:10,500 --> 00:00:14,309 now s3 simple storage service so that is 現在s3簡單的存儲服務,所以是 s3= simple storage service 8 00:00:14,509 --> 00:00:18,640 the distributed replicated high 分散式複製高 Replicated=copy 9 00:00:18,839 --> 00:00:22,679 availability key value store that Amazon 亞馬遜的可用性金鑰值存儲 10 00:00:22,879 --> 00:00:25,568 provides and is the basis for any kind 提供並且是任何種類的基礎 11 00:00:25,768 --> 00:00:27,519 of persistent data when you're working 工作時的持久性資料 12 00:00:27,719 --> 00:00:30,249 with elastic MapReduce now elastic 具有彈性的MapReduce現在具有彈性 13 00:00:30,449 --> 00:00:32,339 MapReduce as I mentioned actually uses 我提到的MapReduce實際上使用了 Mentioned=said 14 00:00:32,539 --> 00:00:35,320 servers that are part of Amazon's 屬於亞馬遜的伺服器 15 00:00:35,520 --> 00:00:37,959 elastic compute cloud or ec2 which is 彈性計算雲或ec2 ec2= elastic compute cloud 16 00:00:38,159 --> 00:00:41,018 the fundamental foundation of being able 能力的基本基礎 Fundamental(Adj) foundation(N) Fundamental=空氣/水 Foundation=base 17 00:00:41,219 --> 00:00:43,000 to get a server of the appropriate class 獲得適當級別的伺服器 Appropriate=適合 18 00:00:43,200 --> 00:00:44,649 provision with the software you need and 提供所需的軟體,以及 Provision=provide 19 00:00:44,850 --> 00:00:46,149 then finally on top of that you have 最後,最重要的是 20 00:00:46,350 --> 00:00:48,038 elastic MapReduce which as I mentioned 就像我提到的那樣,彈性MapReduce 21 00:00:48,238 --> 00:00:52,869 is Amazon's system for automatically 是亞馬遜的自動系統 22 00:00:53,070 --> 00:00:55,209 creating clusters of servers provisioned 創建已配置的伺服器集群 23 00:00:55,409 --> 00:00:56,739 with Hadoop that you can use to run your 與Hadoop,您可以用來運行您的 24 00:00:56,939 --> 00:01:01,989 jobs so how do you actually interact 工作,所以你如何實際互動 25 00:01:02,189 --> 00:01:03,640 with Amazon Web service well there's 借助Amazon Web服務, 26 00:01:03,840 --> 00:01:05,980 three ways there's the first way which 三種方式是第一種 27 00:01:06,180 --> 00:01:09,099 is via a web browser the AWS console and 通過Web流覽器,AWS控制台和 Via=through=by Console=就是像Eclipse的Console 28 00:01:09,299 --> 00:01:10,599 that's we're going to use for most of 那就是我們將用於大多數 29 00:01:10,799 --> 00:01:12,700 this class because you don't have to 這節課是因為你不必 30 00:01:12,900 --> 00:01:15,250 worry about things like you know what 擔心像你知道的事情 31 00:01:15,450 --> 00:01:19,230 version of Ruby do you have installed 版本的Ruby是否已安裝 32 00:01:19,430 --> 00:01:22,808 now once you get beyond sort of learning 現在,一旦您超越了學習範圍 Beyond=past sort of=kind of 33 00:01:23,009 --> 00:01:25,689 how to use AWS and elastic MapReduce 如何使用AWS和Elastic MapReduce 34 00:01:25,890 --> 00:01:28,450 then typically you'll wind up using 然後通常您會使用 wind up=probably use 35 00:01:28,650 --> 00:01:30,159 command-line tools and most commonly 命令列工具,最常見 36 00:01:30,359 --> 00:01:32,378 this elastic MapReduce command-line tool 這個彈性的MapReduce命令列工具 37 00:01:32,578 --> 00:01:35,140 that is actually a ruby script and then 這實際上是一個紅寶石腳本,然後 38 00:01:35,340 --> 00:01:37,359 finally depending on your use case if 最終取決於您的用例 39 00:01:37,560 --> 00:01:39,250 you need to you can directly interact 您需要直接互動 40 00:01:39,450 --> 00:01:42,189 with Amazon Web Services using the API 使用API使用Amazon Web Services 41 00:01:42,390 --> 00:01:44,500 and they've got clients in Java Python 他們有Java Python客戶 42 00:01:44,700 --> 00:01:49,000 Ruby etc now if you do want to start Ruby等,如果您想開始 Etc(只有寫會寫)=etcetera(說的時候用這個) 43 00:01:49,200 --> 00:01:50,738 digging into the API and the command 深入研究API和命令 Digging into=try something difficult 44 00:01:50,938 --> 00:01:51,609 line tool there's documentation 線工具有文檔 45 00:01:51,810 --> 00:01:54,308 available online in where else 在其他地方線上可用 46 00:01:54,509 --> 00:01:57,099 aws.amazon.com slash documentation slash aws.amazon.com斜線文檔斜線 aws.的.=dot Slash=/ 47 00:01:57,299 --> 00:01:58,959 elastic MapReduce that's the point where 彈性MapReduce就是關鍵 48 00:01:59,159 --> 00:01:59,980 you want to start from when you're 你想從當你開始 49 00:02:00,180 --> 00:02:06,009 finding a lot of PDFs and HTML pages so 找到很多PDF和HTML頁面,因此 PDF=一張/PDFs=很多張 50 00:02:06,209 --> 00:02:07,569 it's the first step it's actually 這是第一步,實際上 51 00:02:07,769 --> 00:02:10,330 getting an account all services require 獲得所有服務所需的帳戶 52 00:02:10,530 --> 00:02:11,890 an account 一個帳戶 53 00:02:12,090 --> 00:02:13,910 the sign-up process is pretty 註冊過程很漂亮 54 00:02:14,110 --> 00:02:14,570 straightforward 直截了當 55 00:02:14,770 --> 00:02:16,240 you go in there and you say here's my 你去那裡,你說這是我的 56 00:02:16,439 --> 00:02:18,380 email address and password I want to use 我要使用的電子郵寄地址和密碼 57 00:02:18,580 --> 00:02:20,530 it goes through this kind of interesting 它經歷了這種有趣的 58 00:02:20,729 --> 00:02:22,520 validation where you provide a phone 驗證您提供電話的位置 59 00:02:22,719 --> 00:02:24,070 number they call you on the phone number 他們用電話號碼打給你的號碼 60 00:02:24,270 --> 00:02:26,510 you tell them what number you're seeing 你告訴他們你看到什麼號碼 61 00:02:26,710 --> 00:02:28,130 on the screen and that way they know 在螢幕上,他們知道 62 00:02:28,330 --> 00:02:29,870 that it's a real person or they hope it 那是一個真實的人,或者他們希望 63 00:02:30,069 --> 00:02:32,870 is with a telephone number that they can 他們有一個電話號碼 64 00:02:33,069 --> 00:02:35,210 use and end result is you wind up with 使用和最終結果是你最終 65 00:02:35,409 --> 00:02:39,170 an account and accounts have these three 一個帳戶和三個帳戶 66 00:02:39,370 --> 00:02:41,120 four digit numbers that create an 四個數字構成一個 67 00:02:41,319 --> 00:02:45,380 account ID so let's go get an account so 帳戶ID,讓我們去獲取一個帳戶 68 00:02:45,580 --> 00:02:47,210 we're going to go to the website AVO 我們要去AVO網站 69 00:02:47,409 --> 00:02:49,250 amazon.com and we're going to click the amazon.com,我們將點擊 click=滑鼠點左鍵2下 70 00:02:49,449 --> 00:02:53,270 sign up now button all right let's go 立即註冊按鈕,好的,走吧 71 00:02:53,469 --> 00:02:56,000 sign up so where do you sign up well you 註冊,以便您在哪裡註冊得好 72 00:02:56,199 --> 00:03:02,030 go to aws.amazon.com and they got a nice 去aws.amazon.com,他們得到了一個很好的 73 00:03:02,229 --> 00:03:04,750 sign up now button that you click on 按一下立即註冊按鈕 74 00:03:04,949 --> 00:03:06,560 you're going to enter an email address 您將輸入一個電子郵寄地址 75 00:03:06,759 --> 00:03:11,719 like AWS test at scale unlimited you say 就像您說的AWS無限規模測試 76 00:03:11,919 --> 00:03:15,410 you're a new user you sign in so that's 您是登錄的新用戶,因此 77 00:03:15,610 --> 00:03:19,930 going to ask you for your real name and 要問你的真實姓名, 78 00:03:20,129 --> 00:03:22,189 it's going to ask you for that email 會問你那封電子郵件 79 00:03:22,389 --> 00:03:27,530 address again and a password and I'm 再次輸入位址和密碼,我 80 00:03:27,729 --> 00:03:30,560 going to grab a handy-dandy password 將獲得一個方便的密碼 handy-dandy=Convenience 81 00:03:30,759 --> 00:03:31,700 that none of you're going to be able to 你們所有人都將無法 82 00:03:31,900 --> 00:03:37,670 guess and I've just created an account 猜猜,我剛剛創建了一個帳戶 83 00:03:37,870 --> 00:03:39,560 though it's still waiting for me to 雖然它還在等我 84 00:03:39,759 --> 00:03:41,689 verify that I'm a real person and I've 驗證我是一個真實的人並且我已經 85 00:03:41,889 --> 00:03:51,170 got the expected set of information so 得到了預期的資訊,因此 86 00:03:51,370 --> 00:03:53,240 I'm actually going to enter real stuff 我實際上要輸入真實的東西 87 00:03:53,439 --> 00:03:56,300 here so you can see there's nothing up 在這裡,您可以看到沒有任何反應 there's nothing up=no problem 88 00:03:56,500 --> 00:04:00,680 my sleeve hmm do I really want to do 我的袖子我真的很想做嗎 my sleeve=真實的…(我把我的真實面都說了~美國人都這樣說) 89 00:04:00,879 --> 00:04:03,850 this let's just try that number there 這個讓我們嘗試一下那個數字 90 00:04:04,050 --> 00:04:08,790 and if I can actually figure out 如果我真的能弄清楚 91 00:04:11,629 --> 00:04:14,620 and now it wants credit card information 現在它想要信用卡資訊 92 00:04:14,819 --> 00:04:18,310 and that's of course to pay for any 那當然是要付出任何 93 00:04:18,509 --> 00:04:19,509 expenses that you incur when you're 您花費的費用 Incur=pay money 94 00:04:19,709 --> 00:04:22,360 actually using AWS so in this case I 實際上使用AWS,所以在這種情況下,我 95 00:04:22,560 --> 00:04:25,150 think I might stop the recording right 想我可能會停止錄製 96 00:04:25,350 --> 00:04:27,430 here and enter that and then resume 在這裡輸入,然後繼續 Resume=播影片時按stop-start again- 97 00:04:27,629 --> 00:04:30,240 after I've completed this phase 在完成此階段之後 Phase=part 98 00:04:30,439 --> 00:04:32,530 all right I've entered my credit card 好的,我已經輸入了我的信用卡 99 00:04:32,730 --> 00:04:34,600 information and now we're in the next 資訊,現在我們在下一個 100 00:04:34,800 --> 00:04:36,970 phase of identity verification and here 身份驗證階段和此處 101 00:04:37,170 --> 00:04:38,560 it's asking me to provide a telephone 要求我提供電話 102 00:04:38,759 --> 00:04:40,030 number it's going to call me with the 號碼會打電話給我 103 00:04:40,230 --> 00:04:43,600 special pin code essentially and ask me 特殊的個人識別碼,然後問我 104 00:04:43,800 --> 00:04:46,629 to enter that so I'm going to also do 輸入那個,所以我也要做 105 00:04:46,829 --> 00:04:50,020 this offline all right I assume by now 我現在認為這個離線狀態 106 00:04:50,220 --> 00:04:52,770 you've got an account this account has a 您有一個帳戶,該帳戶有一個 107 00:04:52,970 --> 00:04:55,840 name it's got an email address and a 名稱,其中有一個電子郵寄地址和一個 108 00:04:56,040 --> 00:04:57,400 password and has a bunch of other 密碼,還有很多其他密碼 109 00:04:57,600 --> 00:05:00,100 attributes beyond the account ID that we 超出我們的帳戶ID的屬性 110 00:05:00,300 --> 00:05:01,689 talked about previously it's got an 之前談到它有一個 111 00:05:01,889 --> 00:05:04,360 access key ID and sometimes you'll hear 訪問金鑰ID,有時您會聽到 112 00:05:04,560 --> 00:05:06,370 people talk about this as your public 人們以您的公眾身份談論此事 113 00:05:06,569 --> 00:05:08,290 key but it's not a public private key 金鑰,但這不是公共私密金鑰 public key->這是一個詞 114 00:05:08,490 --> 00:05:10,840 it's just the less secret key there's 這只是那裡的秘密鑰匙 Less=opposite of more 115 00:05:11,040 --> 00:05:13,810 also a secret access key and there's 還有一個秘密訪問金鑰, 116 00:05:14,009 --> 00:05:15,939 something called a canonical user ID and 所謂的規範使用者ID和 117 00:05:16,139 --> 00:05:19,060 each of these has different uses so 這些都有不同的用途,所以 118 00:05:19,259 --> 00:05:19,930 we're going to do now is we're going to 我們現在要做的是 119 00:05:20,129 --> 00:05:22,300 switch back to the web browser so we can 切換回網路流覽器,以便我們可以 120 00:05:22,500 --> 00:05:23,800 go take a look at all these different 去看看所有這些不同 121 00:05:24,000 --> 00:05:28,180 attributes of the account let's go take 該帳戶的屬性讓我們開始 122 00:05:28,379 --> 00:05:29,410 a look at all those bits of information 看所有這些資訊 Bits=(生活中)小東西/電腦用語BIT 123 00:05:29,610 --> 00:05:31,090 we have about the account we just 我們有關於我們剛剛使用的帳戶 124 00:05:31,290 --> 00:05:33,430 created here we are at the top level of 在這裡創建,我們位於 125 00:05:33,629 --> 00:05:36,100 the IDS console and it shows all of the IDS控制台,它顯示了所有 126 00:05:36,300 --> 00:05:37,930 different Amazon Web Services that we 我們提供的不同的Amazon Web Services 127 00:05:38,129 --> 00:05:40,028 have access to what we're interested in 可以訪問我們感興趣的內容 128 00:05:40,228 --> 00:05:41,980 is the account menu that's over here at 是這裡的帳戶菜單 129 00:05:42,180 --> 00:05:44,410 the top right if I click here I can 如果按一下此處,我可以在右上角 130 00:05:44,610 --> 00:05:47,110 select security credentials that's 選擇的安全憑證 131 00:05:47,310 --> 00:05:48,750 actually going to open a new window at 實際上將在打開一個新視窗 132 00:05:48,949 --> 00:05:53,560 AWS portal amazon.com and here we can AWS門戶amazon.com,在這裡我們可以 Portal=http~(網址最前面) 133 00:05:53,759 --> 00:05:55,389 see information about the account so one 查看有關該帳戶的資訊,這樣一來 134 00:05:55,589 --> 00:05:58,410 of the things is the account name 事情就是帳戶名 135 00:05:58,610 --> 00:06:01,090 followed by the account number and this 然後是帳號和這個 136 00:06:01,290 --> 00:06:04,060 was that series of three series of four 是那個三個系列的四個 137 00:06:04,259 --> 00:06:06,670 digits that I mentioned scrolling down 我提到的數字向下滾動 138 00:06:06,870 --> 00:06:10,060 you can see access keys we have this 您可以看到便捷鍵,我們有這個 139 00:06:10,259 --> 00:06:13,240 access key ID right here which is public 此處的訪問金鑰ID是公開的 140 00:06:13,439 --> 00:06:15,370 if I clicked here it would actually pop 如果我點擊這裡,它實際上會彈出 141 00:06:15,569 --> 00:06:16,840 up a little window that shows me the 上一個小視窗,向我展示 pop up:這是一個詞/彈出 142 00:06:17,040 --> 00:06:18,790 secret access key which I'm not going to 我不會使用的秘密訪問金鑰 143 00:06:18,990 --> 00:06:21,278 reveal to you scrolling down 向您展示向下滾動 144 00:06:21,478 --> 00:06:23,860 see a little bit more information here 在這裡看到更多資訊 145 00:06:24,060 --> 00:06:24,550 this is 這是 146 00:06:24,750 --> 00:06:26,290 the email address that we use to sign-in 我們用於登錄的電子郵寄地址 147 00:06:26,490 --> 00:06:27,910 that the account is really keyed off of 該帳戶確實已被鎖定 148 00:06:28,110 --> 00:06:30,370 and it has a password if we go down near 如果我們靠近附近,它有一個密碼 149 00:06:30,569 --> 00:06:33,040 the bottom once again you see the AWS 在底部,您再次看到AWS 150 00:06:33,240 --> 00:06:35,020 account ID and then there's this 帳戶ID,然後有這個 151 00:06:35,220 --> 00:06:36,639 canonical user ID which is what you use 所使用的規範使用者ID 152 00:06:36,839 --> 00:06:38,470 when you're controlling access to 當您控制對 153 00:06:38,670 --> 00:06:41,230 buckets and files in s3 and if you click s3中的存儲桶和檔,如果您按一下 buckets=老師說應該是packages 154 00:06:41,430 --> 00:06:43,060 on that it pops up a window where you 在它上面彈出一個視窗 155 00:06:43,259 --> 00:06:44,680 can get access to this very long ID 可以訪問這個非常長的ID 156 00:06:44,879 --> 00:06:47,460 string so we've got all these different 字串,所以我們有所有這些不同 157 00:06:47,660 --> 00:06:51,280 settings for AWS all these different 所有這些不同的AWS設置 158 00:06:51,480 --> 00:06:53,740 credentials but we need more the issue 憑據,但我們需要更多的問題 159 00:06:53,939 --> 00:06:55,480 is that in order to use elastic 是為了使用彈性 160 00:06:55,680 --> 00:06:59,079 MapReduce we need an ec2 key pair once MapReduce我們一次需要一個ec2金鑰對 161 00:06:59,279 --> 00:07:01,270 we have a real account we can then ask 我們有一個真實帳戶,然後我們可以詢問 162 00:07:01,470 --> 00:07:03,670 Amazon to create a key pair for us and 亞馬遜為我們和 163 00:07:03,870 --> 00:07:06,629 these this key pair is a classic 這對鑰匙很經典 164 00:07:06,829 --> 00:07:08,800 public/private key pair when you ask 詢問時的公開金鑰/私密金鑰對 165 00:07:09,000 --> 00:07:11,770 Amazon to create a key pair for you it 亞馬遜為您創建金鑰對 166 00:07:11,970 --> 00:07:13,990 will save the public portion of the key 將保存金鑰的公開部分 167 00:07:14,189 --> 00:07:16,540 pair with your account and it will let 與您的帳戶配對,它將讓 168 00:07:16,740 --> 00:07:18,129 you download the private portion of the 您下載了 169 00:07:18,329 --> 00:07:20,980 key pair you need this private portion 金鑰對,您需要此私人部分 170 00:07:21,180 --> 00:07:23,379 of the key pair in order to log into an 金鑰對中的一個,以便登錄到 171 00:07:23,579 --> 00:07:26,500 ec2 or elastic MapReduce cluster so how ec2或彈性MapReduce群集如何 172 00:07:26,699 --> 00:07:29,410 do we get a key pair well you go back to 我們能得到一對鑰匙對嗎? 173 00:07:29,610 --> 00:07:32,500 the Amazon Web Services console you go 您要使用的Amazon Web Services控制台 174 00:07:32,699 --> 00:07:35,110 to the ec2 section not the EMR section 到ec2部分而不是EMR部分 175 00:07:35,310 --> 00:07:37,028 on the bottom left is going to be a link 左下角將是一個連結 176 00:07:37,228 --> 00:07:39,400 to key pairs once you click on that link 按一下該連結以配對金鑰 177 00:07:39,600 --> 00:07:42,009 you're going to be in the key pair 您將成為金鑰對 178 00:07:42,209 --> 00:07:44,139 section of ec2 where you can actually ec2的部分,您實際上可以 179 00:07:44,339 --> 00:07:46,449 click on a create key pair button and 按一下創建金鑰對按鈕,然後 180 00:07:46,649 --> 00:07:48,400 from there it's pretty straightforward 從那裡很簡單 181 00:07:48,600 --> 00:07:50,590 to give it a name and you're done so 給它起個名字就好了 182 00:07:50,790 --> 00:07:54,400 let's go ahead and do that so if I click 讓我們繼續操作,如果我按一下 183 00:07:54,600 --> 00:08:00,730 over here on ec2 and it just so happens 在這裡在ec2上就發生了 184 00:08:00,930 --> 00:08:03,218 to open up showing me my key pairs 打開給我看我的鑰匙對 185 00:08:03,418 --> 00:08:04,778 because I'd previously clicked on the 因為我以前點擊過 186 00:08:04,978 --> 00:08:06,278 key pairs link down here in the bottom 金鑰對在底部連結到這裡 187 00:08:06,478 --> 00:08:09,040 left of the navigation pane it's not 功能窗格的左側不是 navigation pane:上下左右按鍵/選單 188 00:08:09,240 --> 00:08:10,329 showing any key pairs because I haven't 顯示任何金鑰對,因為我沒有 189 00:08:10,529 --> 00:08:13,060 created any yet if I click on create key 如果我按一下創建金鑰,則創建任何 Any:幾個 yet:幾個 190 00:08:13,259 --> 00:08:16,150 pair it will let me enter a name for the 配對,我將輸入一個名稱 191 00:08:16,350 --> 00:08:17,980 key pair in this case I'm going to call 在這種情況下,我要打電話給鑰匙對 192 00:08:18,180 --> 00:08:20,949 it again AWS test and I'll click the 再次進行AWS測試,然後點擊 193 00:08:21,149 --> 00:08:23,659 Create button 創建按鈕 194 00:08:24,480 --> 00:08:25,900 this generates a new key pair which 這將生成一個新的金鑰對 Generates:to make by computer 195 00:08:26,100 --> 00:08:27,730 means I've got both a public and a 意味著我既有公眾也有 196 00:08:27,930 --> 00:08:31,449 private key the public key Amazon hangs 私密金鑰亞馬遜掛起的公開金鑰 197 00:08:31,649 --> 00:08:34,329 on to and it's downloading now the 繼續,現在正在下載 198 00:08:34,529 --> 00:08:35,169 private key 私密金鑰 199 00:08:35,370 --> 00:08:37,539 this PEM file and yes I do want to save 這個PEM檔,是的,我確實要保存 Yes:choose OK 200 00:08:37,740 --> 00:08:38,949 that because I will need it at various 因為我會在各種場合需要它 201 00:08:39,149 --> 00:08:43,089 times so now you can see that amazon 所以現在你可以看到亞馬遜 202 00:08:43,289 --> 00:08:45,609 knows about this key pair called AWS 知道這個稱為AWS的金鑰對 203 00:08:45,809 --> 00:08:50,109 test there's one final step we need to 測試我們需要完成的最後一步 204 00:08:50,309 --> 00:08:52,209 do to set up to run a job and that is we 做準備工作,那就是我們 205 00:08:52,409 --> 00:08:55,719 need to create an s3 bucket so elastic 需要創建一個具有彈性的s3桶 206 00:08:55,919 --> 00:08:59,799 MapReduce uses s3 both as the source for MapReduce使用s3作為來源 207 00:09:00,000 --> 00:09:04,449 your job and also as the destination for 您的工作,也是工作的目的地 208 00:09:04,649 --> 00:09:08,019 where it puts log files and where it 它放置日誌檔的位置以及位置 log files:history and deteil files 209 00:09:08,220 --> 00:09:09,549 assumes you're going to put your results 假設您要發表結果 Assumes: 210 00:09:09,750 --> 00:09:13,870 so s3 organizes all of its data in 因此s3將所有資料整理到 211 00:09:14,070 --> 00:09:16,539 buckets at the very top level in s3 you 在s3的最高級別上存儲分區 212 00:09:16,740 --> 00:09:18,309 have some number of buckets you're going 你要去一些水桶 213 00:09:18,509 --> 00:09:20,229 to need to create a bucket that has 需要創建一個具有 214 00:09:20,429 --> 00:09:22,209 directories for where you're going to 您要去的目錄 215 00:09:22,409 --> 00:09:24,579 put your job jar and where you're going 把你的工作罐和你要去的地方 216 00:09:24,779 --> 00:09:28,149 to ask EMR to put the logs from the job 要求EMR將作業中的日誌放入 217 00:09:28,350 --> 00:09:30,370 and the results from the job so let's go 以及工作的結果,讓我們開始吧 218 00:09:30,570 --> 00:09:32,620 ahead and do that now once again we 向前,現在再做一次,我們 219 00:09:32,820 --> 00:09:34,569 start at the top level of the AWS 從AWS的頂層開始 220 00:09:34,769 --> 00:09:36,250 console and this time we're going to 控制台,這次我們要 221 00:09:36,450 --> 00:09:42,490 click on the link to s3 and here it 按一下連結到s3,在這裡 s3:simple/storage/service 222 00:09:42,690 --> 00:09:44,589 shows me a list of all the buckets that 顯示給我所有水桶的清單 223 00:09:44,789 --> 00:09:46,389 I've got defined in my account currently 我已經在我的帳戶中定義了 Currently=now 224 00:09:46,590 --> 00:09:48,189 I have none so I'm going to create a 我沒有,所以我要創建一個 225 00:09:48,389 --> 00:09:49,990 bucket so I'm clicking on this create 桶,所以我點擊此創建 226 00:09:50,190 --> 00:09:53,139 bucket link and ask me for a name as we 桶形連結,問我一個名字,因為我們 227 00:09:53,340 --> 00:09:54,609 mentioned the bucket name has to be 提到存儲桶名稱必須是 Mentioned:you said already 228 00:09:54,809 --> 00:09:57,219 unique across all users so if I go in 在所有用戶中都是唯一的,所以如果我進去 Unique:only one 229 00:09:57,419 --> 00:10:00,219 here and I give it some common lame name 在這裡,我給它取一個常見的la腳名稱 230 00:10:00,419 --> 00:10:03,189 like test and I try and create I'm going 喜歡測試,我嘗試創造我要去的地方 231 00:10:03,389 --> 00:10:05,199 to get an error it says this name is not 為了得到一個錯誤,它說這個名字不是 232 00:10:05,399 --> 00:10:08,049 available as a note you can see that I'm 作為便箋,您可以看到我在 233 00:10:08,250 --> 00:10:10,659 trying to create the bucket in a region 試圖在一個區域中創建存儲桶 Region:location 234 00:10:10,860 --> 00:10:13,149 and the region selected here should 並且此處選擇的區域應 235 00:10:13,350 --> 00:10:15,939 match the region that I'm using to run 匹配我正在運行的區域 236 00:10:16,139 --> 00:10:19,449 my EMR jobs I'm running in the US east 我在美國東部地區從事的EMR工作 E(Elastic)M(map)R(reduce) Reduce:make it small 237 00:10:19,649 --> 00:10:21,429 region so I'm going to use US standard 地區,所以我要使用美國標準 238 00:10:21,629 --> 00:10:24,250 and now to create a unique name I'm 現在要創建一個唯一的名稱 239 00:10:24,450 --> 00:10:27,250 going to call this AWS test and my 會叫這個AWS測試和我的 240 00:10:27,450 --> 00:10:34,059 initials so now I've got an AWS - test - 縮寫,所以現在我有一個AWS-測試- Initials:第1個字縮寫 241 00:10:34,259 --> 00:10:36,579 KK bucket over here on the left and it KK桶在這裡在左邊,它 242 00:10:36,779 --> 00:10:37,659 has nothing in it 裡面什麼都沒有 243 00:10:37,860 --> 00:10:39,878 so it's ready now to be used as a 所以現在就可以用作 244 00:10:40,078 --> 00:10:43,059 destination for elastic MapReduce job 彈性MapReduce作業的目的地 245 00:10:43,259 --> 00:10:46,979 output and also John job logging 輸出以及約翰作業記錄 246 00:10:47,179 --> 00:10:49,328 excellent we are now ready to run a 太好了,我們現在可以運行一個 Excellent=awesome 247 00:10:49,528 --> 00:10:51,250 Hadoop job we've got everything we need Hadoop工作,我們擁有所需的一切 248 00:10:51,450 --> 00:10:53,589 we have an account that account has 我們有一個該帳戶擁有的帳戶 249 00:10:53,789 --> 00:10:55,929 credentials we've created a key pair and 憑證,我們創建了一個金鑰對,並且 250 00:10:56,129 --> 00:10:58,809 we've created the bucket we need now the 我們已經創建了我們現在需要的存儲桶 251 00:10:59,009 --> 00:10:59,859 next module we're actually going to use 我們將實際使用的下一個模組 Module:part 252 00:11:00,059 --> 00:11:02,979 all of that setup in order to run a 所有這些設置,以便運行 253 00:11:03,179 --> 00:11:08,179 custom Hadoop job 定制Hadoop作業