--- title: 科技英文:Managing Machine Learning in Production with Kubeflow and DevOps --- #Managing Machine Learning in Production with Kubeflow and DevOps - David Aronchick, Microsoft 1.Managing Machine Learning in Production with Kubeflow and DevOps - David Aronchick, Microsoft Dev:development /Ops:operations=keep going 1 00:00:00,030 --> 00:00:01,990 hi I apologize I was not working on my 嗨,我很抱歉,我沒有為我工作 Apologize(V):sorry 2 00:00:02,189 --> 00:00:03,428 slides one minute I'm not one of those 滑了一分鐘,我不是其中之一 3 00:00:03,629 --> 00:00:05,049 presenters but I was very late so I 主持人,但我來晚了,所以我 4 00:00:05,250 --> 00:00:08,560 apologize thank you so much I am David 很抱歉,我是大衛 5 00:00:08,759 --> 00:00:10,390 Ron chick I lead open source machine 羅恩·小妞我領導開源機器 open source:free 6 00:00:10,589 --> 00:00:13,330 learning strategy and ml at Microsoft 微軟的學習策略和毫升 Ml:maching Learning 7 00:00:13,529 --> 00:00:16,780 and Azure and I was previously the lead 和Azure,而我以前是負責人 Pentagon:top secret 8 00:00:16,980 --> 00:00:18,940 p.m. for kubernetes and I helped start 下午為kubernetes,我幫助開始 p.m.:project manager 9 00:00:19,140 --> 00:00:20,890 the Q flow project and I'm here to talk Q flow項目,我在這裡談 10 00:00:21,089 --> 00:00:23,470 to you about how to bring your machine 向您介紹如何攜帶您的機器 11 00:00:23,670 --> 00:00:26,260 learning to production using Q flow and 使用Q流學習生產 12 00:00:26,460 --> 00:00:33,219 mo ops so at Microsoft because the 在Microsoft運作的原因是 13 00:00:33,420 --> 00:00:35,320 widget is not working all right I will 小部件無法正常工作,我會 14 00:00:35,520 --> 00:00:38,049 be operating from my laptop and 通過我的筆記型電腦操作 Operating:ops 15 00:00:38,250 --> 00:00:39,969 Microsoft's we do have a lot of 微軟的我們確實有很多 16 00:00:40,170 --> 00:00:43,178 experience bringing ml to production we 將ml投入生產的經驗 17 00:00:43,378 --> 00:00:44,739 bring together your data we bring 彙集您帶來的資料 18 00:00:44,939 --> 00:00:48,759 together cloud your models and our 一起遮蓋您的模型,以及 Cloud:雲端 19 00:00:48,960 --> 00:00:51,009 job is really to help customers large 真的是説明大客戶的工作 20 00:00:51,210 --> 00:00:52,869 customers small customers whatever it is 客戶小客戶,無論是什麼 21 00:00:53,070 --> 00:00:55,869 help you move through and this is 幫助您通過,這是 22 00:00:56,070 --> 00:00:57,579 something that we have a lot of 我們有很多東西 23 00:00:57,780 --> 00:01:00,038 experience doing we've have a lot of 經歷過,我們有很多 24 00:01:00,238 --> 00:01:02,979 internal experience around Microsoft mic Microsoft的內部經驗 25 00:01:03,179 --> 00:01:06,189 Research and an ml generally with many 研究和毫升一般與許多 26 00:01:06,390 --> 00:01:08,168 of the most recent benchmarks and 最新的基準 Benchmarks:test for your computer speed 27 00:01:08,368 --> 00:01:09,730 achievements from coming out of 的成就來自 28 00:01:09,930 --> 00:01:11,439 Microsoft Research we're really 微軟研究院,我們真的非常 29 00:01:11,640 --> 00:01:13,509 proud of that and of course we do give 為此感到自豪,我們當然會給予 30 00:01:13,709 --> 00:01:15,009 all those back to the research community 所有那些回到研究界的人 31 00:01:15,209 --> 00:01:17,528 in the form of open papers and notebooks 以打開的紙和筆記本的形式 Form:way 32 00:01:17,728 --> 00:01:21,909 and data and the reality is that ml does 和數據,而現實是ml確實 33 00:01:22,109 --> 00:01:24,849 touch every aspect of Microsoft today 觸及Microsoft的各個方面 Aspect:a part 34 00:01:25,049 --> 00:01:26,679 literally every one of these logos and 實際上,這些徽標中的每一個和 Literally:actually 35 00:01:26,879 --> 00:01:30,450 many more from your customers clients 客戶的更多客戶 36 00:01:30,650 --> 00:01:33,878 rich clients dead clients are you know 有錢人客戶死了你知道嗎 :everybody use ml 37 00:01:34,078 --> 00:01:35,230 thin clients whatever they may be 瘦客戶無論他們是什麼 38 00:01:35,430 --> 00:01:39,039 Xbox phone you name it we're using ml in 您命名的Xbox手機,我們正在使用ml in Xbox:like switch 39 00:01:39,239 --> 00:01:41,349 all of these various places and we're 所有這些不同的地方,我們 40 00:01:41,549 --> 00:01:45,219 using it at enormous scale you know a 大規模使用它,您知道 Enormous:so big 41 00:01:45,420 --> 00:01:47,259 hundred and eighty million office users 一億八千萬辦公室用戶 42 00:01:47,459 --> 00:01:52,209 today every day use often features 今天每天經常使用我的功能 43 00:01:52,409 --> 00:01:55,659 in office we have 18 billion queries 在辦公室,我們有180億個查詢 Queries:question and other question to…. 44 00:01:55,859 --> 00:01:58,058 asked of Cortana which is obviously rich 問到Cortana,這顯然很豐富 45 00:01:58,259 --> 00:02:00,730 NLP and other things and six point five NLP和其他東西和六點五 46 00:02:00,930 --> 00:02:03,549 trillion security events evaluated every 每萬億安全事件評估一次 trillion :1,000,000,000,000 evaluated:how good something is 47 00:02:03,750 --> 00:02:06,698 day there's simply no way that we could 一天根本沒有辦法 48 00:02:06,899 --> 00:02:09,130 operate and process this data 操作和處理此資料 49 00:02:09,330 --> 00:02:10,980 without something like machine learning 沒有像機器學習這樣的東西 50 00:02:11,180 --> 00:02:14,020 so this is the point in the slide 這就是幻燈片中的重點 51 00:02:14,219 --> 00:02:14,920 where everyone's like wow that does 每個人都喜歡哇 52 00:02:15,120 --> 00:02:17,590 sound really great except they also say 聽起來真的很棒,除了他們也說 53 00:02:17,789 --> 00:02:21,370 this right ml is hard and it's really on 正確的毫升很難,而且確實在 54 00:02:21,569 --> 00:02:23,560 us that those that build these platforms 我們那些建立這些平臺的人 55 00:02:23,759 --> 00:02:26,530 in the ml community to help folks get 在ml社區中幫助人們獲得 Folks:people 56 00:02:26,729 --> 00:02:29,590 there better because we know something 那裡更好,因為我們知道一些 57 00:02:29,789 --> 00:02:32,200 that a lot of new people to ml don't 有很多新朋友不願意 58 00:02:32,400 --> 00:02:35,650 know and that's the following today a 知道,那就是今天的下一個 59 00:02:35,849 --> 00:02:39,340 lot of people new to ml think that it's 很多剛接觸毫升的人都認為這是 60 00:02:39,539 --> 00:02:42,070 all about the model and I understand 關於模型的一切,我瞭解 61 00:02:42,270 --> 00:02:44,350 why every new article out there talks 為什麼每一篇新文章都在談論 62 00:02:44,550 --> 00:02:46,300 about well you know Google just released 關於您知道Google剛剛發佈的 63 00:02:46,500 --> 00:02:48,460 Bert or Microsoft to release this and 伯特或微軟發佈此 64 00:02:48,659 --> 00:02:50,650 you know alphago did this and it's 而且您知道alphago做到了,這是 Alpha:the leader and best 65 00:02:50,849 --> 00:02:52,180 all about this amazing model that they 他們關於這個驚人的模型的一切 66 00:02:52,379 --> 00:02:54,189 built but it's not it's about the data 內置的,但這不是關於資料的 67 00:02:54,389 --> 00:02:56,920 processing and cleaning and all the 處理和清潔以及所有 68 00:02:57,120 --> 00:02:58,509 various things that are involved and 涉及的各種事物 69 00:02:58,709 --> 00:03:01,150 actually bringing something to 實際上帶來了一些東西 70 00:03:01,349 --> 00:03:04,689 production because that's the nature of 生產,因為這就是 71 00:03:04,889 --> 00:03:05,860 machine learning 機器學習 72 00:03:06,060 --> 00:03:08,860 it is these many many micro services 這是許多許多微服務 73 00:03:09,060 --> 00:03:10,360 each of which have a very specific 每個都有一個非常具體的 74 00:03:10,560 --> 00:03:12,310 functionality often very specific 功能通常非常具體 75 00:03:12,509 --> 00:03:14,710 tooling that do very specific things 做非常具體的事情的工具 76 00:03:14,909 --> 00:03:16,960 well but then need to be coupled 好,但是需要結合 77 00:03:17,159 --> 00:03:19,330 together in an intelligent way and if 以一種聰明的方式在一起,如果 Intelligent:smart 78 00:03:19,530 --> 00:03:22,180 you just focus on the model then you're 您只關注模型,然後 79 00:03:22,379 --> 00:03:25,450 gonna be in trouble and I know what 會遇到麻煩,我知道 80 00:03:25,650 --> 00:03:27,580 you're saying you're saying your data 你是說你是在說你的數據 81 00:03:27,780 --> 00:03:29,620 scientists and you don't care and I 科學家,你不在乎,我 82 00:03:29,819 --> 00:03:33,280 believe you a little bit but I'm here to 相信你一點,但我在這裡 83 00:03:33,479 --> 00:03:36,069 tell you you actually do and the reason 告訴你你實際做的事以及原因 84 00:03:36,269 --> 00:03:40,500 is tweets like this right models are 是這樣的推文,正確的模型是 IDGAF:I don’t give a fuck= I don’t care 85 00:03:40,699 --> 00:03:43,750 relatively easy to build but they are 相對容易構建,但它們 Relatively:more or less 86 00:03:43,949 --> 00:03:47,380 very hard to roll out because more often 很難推出,因為更多時候 hard to roll out:make people use 87 00:03:47,580 --> 00:03:49,689 than not data scientists operate in a 而不是資料科學家在 88 00:03:49,889 --> 00:03:51,460 way that they are familiar with they 他們熟悉的方式 89 00:03:51,659 --> 00:03:53,319 understand their tools and they build 瞭解他們的工具,他們建立 90 00:03:53,519 --> 00:03:55,660 using their tooling and local laptops 使用他們的工具和本地筆記型電腦 91 00:03:55,860 --> 00:03:58,210 and local clusters but they don't know 和本地集群,但他們不知道 92 00:03:58,409 --> 00:03:59,770 how to reach out and roll it to 如何伸出手並推向 93 00:03:59,969 --> 00:04:01,480 production and the reason is because 生產的原因是因為 94 00:04:01,680 --> 00:04:04,719 you have this separation right the data 你有這種分離權的資料 95 00:04:04,919 --> 00:04:06,219 scientists over here they're trying to 他們正在嘗試的科學家 96 00:04:06,419 --> 00:04:08,500 iterate as quickly as she can she wants 盡可能快地反覆運算 Iterate:explain 97 00:04:08,699 --> 00:04:10,120 to use frameworks and tooling she 使用框架和工具 98 00:04:10,319 --> 00:04:12,520 understands she wants to mix and match 知道她想混搭 99 00:04:12,719 --> 00:04:15,069 tools the absolute latest build of 工具的絕對最新版本 100 00:04:15,269 --> 00:04:17,170 tensorflow or PI Torture or tensorflow或PI Torture y瑪瑙或 101 00:04:17,370 --> 00:04:19,689 whatever it may be you know somebody 不管你認識誰 102 00:04:19,889 --> 00:04:21,430 from Carnegie Mellon just launched a 來自卡耐基梅隆大學的 Carnegie Mellon:有名的大學 103 00:04:21,629 --> 00:04:24,699 brand new tool around reproducibility 圍繞重現性的全新工具 Reproducibility:how easy to make copy 104 00:04:24,899 --> 00:04:26,540 and it's respectability and that's 這是受人尊敬的 105 00:04:26,740 --> 00:04:28,309 she should have the capabilities to do 她應該有能力去做 106 00:04:28,509 --> 00:04:30,829 that she also wants to not worry about 她也不想擔心 107 00:04:31,029 --> 00:04:32,210 management because it's just her laptop 管理因為這只是她的筆記型電腦 108 00:04:32,410 --> 00:04:33,800 you know something goes wrong she 刮鬍子你知道出事了她 109 00:04:34,000 --> 00:04:36,170 can flatten it and restart and on the 可以將其展平並重新啟動,然後在 110 00:04:36,370 --> 00:04:37,579 other hand she wants unlimited scale 另一方面,她想要無限的規模 111 00:04:37,779 --> 00:04:39,319 she's got a paper due you know on 她有一份論文,因為你知道 112 00:04:39,519 --> 00:04:41,718 Thursday at 5:00 p.m. she wants all the 星期四下午5:00,她想要所有 113 00:04:41,918 --> 00:04:43,100 GPUs in the world in order to achieve 為了實現世界上的GPU GPUs:graphics(圖形) processing unit 114 00:04:43,300 --> 00:04:46,369 the results she needs on the other hand 另一方面,她需要的結果 115 00:04:46,569 --> 00:04:49,100 you have the right and she needs 你有權利,她需要 116 00:04:49,300 --> 00:04:51,800 consistency she needs observability she 一致性,她需要可觀察性,她 117 00:04:52,000 --> 00:04:53,420 needs to reuse tooling that's already 需要重用已經存在的工具 118 00:04:53,620 --> 00:04:55,490 been approved by her organization 被她的組織批准 119 00:04:55,689 --> 00:04:57,350 because they have support and things 因為他們有支持和東西 120 00:04:57,550 --> 00:05:00,410 built in and she needs uptime if the if 內置,如果需要,她需要正常執行時間 Uptime:time in the computer 121 00:05:00,610 --> 00:05:02,240 things are constantly changing under the 在不斷變化的情況下 122 00:05:02,439 --> 00:05:05,778 hood with no records of what's going on 引擎蓋,沒有發生任何事情的記錄 under the hood:parts inside 123 00:05:05,978 --> 00:05:09,649 then that's not gonna work now I am here 那那不行了我現在在這裡 124 00:05:09,848 --> 00:05:12,139 to propose that we can bring them 提議我們可以帶他們去 125 00:05:12,339 --> 00:05:14,210 together and we're gonna do it 在一起,我們要做 126 00:05:14,410 --> 00:05:19,610 through ml ops now before I begin I want 在我開始之前,現在要通過ml ops 127 00:05:19,810 --> 00:05:22,100 to talk about what the e2e machine 說說什麼e2e機 e2e: end to end=from here to there 128 00:05:22,300 --> 00:05:23,389 learning lifecycle looks like now you 學習生命週期看起來像現在 129 00:05:23,589 --> 00:05:24,860 saw a bunch of like boxes connected 看到一堆像盒子一樣的盒子 130 00:05:25,060 --> 00:05:26,749 together with arrows there but the main 與那裡的箭頭,但主要 Arrows-> 131 00:05:26,949 --> 00:05:28,759 things that I think we need to identify 我認為我們需要確定的事情 132 00:05:28,959 --> 00:05:31,459 and solve for in an lifecycle is 並在生命週期中解決 133 00:05:31,658 --> 00:05:34,249 what you see here first you do have the 首先您在這裡看到的是 134 00:05:34,449 --> 00:05:36,050 development and deployment of the model 模型的開發和部署 135 00:05:36,250 --> 00:05:37,939 then you're gonna have to package it in 那麼你將不得不打包 136 00:05:38,139 --> 00:05:40,490 a way that can be used and migrated to migrated:move to new place 一種可以使用和遷移到的方式 137 00:05:40,689 --> 00:05:42,499 in production you want to validate that 在生產中,您要驗證 138 00:05:42,699 --> 00:05:44,499 model behavior before you roll it out 在推出之前模擬行為 139 00:05:44,699 --> 00:05:46,699 then you want to deploy the model and 然後您要部署模型並 Deploy:調派 140 00:05:46,899 --> 00:05:48,170 then you want to monitor it and in 然後您要監視它並在 141 00:05:48,370 --> 00:05:49,819 monitoring it is not just about 監控它不僅僅是 142 00:05:50,019 --> 00:05:51,410 having it out there and making sure it's 把它放在那裡並確保它 143 00:05:51,610 --> 00:05:53,930 up but it's in fact taking all that data 上升,但實際上是在吸收所有資料 144 00:05:54,129 --> 00:05:56,809 and feeding it back into the original so 並把它送回原來的 145 00:05:57,009 --> 00:05:58,430 that you can now train it again and be 現在您可以再次訓練它並成為 146 00:05:58,629 --> 00:05:59,350 smarter about it 對此更聰明 147 00:05:59,550 --> 00:06:01,639 so you may say you might have heard this 所以你可能會說你可能聽說過 148 00:06:01,839 --> 00:06:04,819 before and the reality is you have you 以前,現實是你擁有你 149 00:06:05,019 --> 00:06:06,680 heard this several years ago when things 幾年前聽說過 150 00:06:06,879 --> 00:06:08,240 started getting kicked off around get 開始在周圍開始 kicked off: kickstart:there has many new project in WEBSITE 151 00:06:08,439 --> 00:06:10,968 ops and that was the idea that you could 操作,那是您可以 152 00:06:11,168 --> 00:06:14,749 start with get and record everything 從獲取開始並記錄一切 153 00:06:14,949 --> 00:06:16,129 that you were doing relative to your 你相對於你所做的 154 00:06:16,329 --> 00:06:19,218 overall pipeline iterate very quickly on 整個管道非常快速地反覆運算 155 00:06:19,418 --> 00:06:20,629 that pipeline and then once that 該管道,然後一旦 156 00:06:20,829 --> 00:06:22,730 pipeline had passed all its tests and 管道通過了所有測試,並且 157 00:06:22,930 --> 00:06:24,319 humans had looked at it and said yes 人們看著它說是 158 00:06:24,519 --> 00:06:26,809 this is ready to go you trigger a second 準備好了,您觸發第二個 Trigger:to start something 159 00:06:27,009 --> 00:06:29,149 cycle and that second cycle is rolling 週期,第二個週期正在滾動 160 00:06:29,348 --> 00:06:31,338 it out to production but again it's all 投入生產,但這又是全部 161 00:06:31,538 --> 00:06:33,259 driven off get you don't have someone 被趕走,你沒有人 162 00:06:33,459 --> 00:06:35,209 in the middle there introducing any new 在中間介紹任何新的 163 00:06:35,408 --> 00:06:37,160 changes because those wouldn't have been 變化,因為那些不會 164 00:06:37,360 --> 00:06:39,050 tested and relied or 經過測試並依賴 165 00:06:39,250 --> 00:06:41,090 tested and observed and made sure that 測試並觀察並確保 166 00:06:41,290 --> 00:06:43,340 they pass all your bars and what that 他們通過了你所有的酒吧,那 167 00:06:43,540 --> 00:06:45,170 really gets to you at is those lines 真正讓您著迷的是那些臺詞 168 00:06:45,370 --> 00:06:47,000 at the bottom you get velocity plus 在底部,您獲得速度加 Velocity:speed for something 169 00:06:47,199 --> 00:06:51,590 security so what we need is ml ops right 安全性,所以我們需要的是ml ops 170 00:06:51,790 --> 00:06:53,629 and ml ops is going to look like this 和ml ops看起來像這樣 171 00:06:53,829 --> 00:06:56,840 right you start with the data scientist 從資料科學家開始 172 00:06:57,040 --> 00:06:58,250 the data scientist is able to iterate 資料科學家能夠反覆運算 Iterate:explain 173 00:06:58,449 --> 00:07:00,319 exactly as quickly as she could and 盡其所能 174 00:07:00,519 --> 00:07:02,030 potentially even faster because you're 可能甚至更快,因為您 Potentially:something could but not sure 175 00:07:02,230 --> 00:07:03,470 giving her the Best of Breed tools and 給她最好的工具 176 00:07:03,670 --> 00:07:05,270 you're giving in to her in a system 你在系統中屈服於她 177 00:07:05,470 --> 00:07:06,560 where she doesn't have to think about 她不必思考的地方 178 00:07:06,759 --> 00:07:08,870 well this Python dependency didn't work 很好,這個Python依賴項不起作用 179 00:07:09,069 --> 00:07:10,370 with that other package that I had over 與我已經結束的其他包裹 180 00:07:10,569 --> 00:07:12,889 there you give her a path to install and 在那裡,您給她安裝的路徑, 181 00:07:13,089 --> 00:07:14,600 manage these things in a very regular 定期管理這些東西 182 00:07:14,800 --> 00:07:17,660 way and then from that you start the 方式,然後從那開始 183 00:07:17,860 --> 00:07:18,350 second half 下半場 184 00:07:18,550 --> 00:07:20,600 right you have her finish you have 對,你有她的成就,你有 185 00:07:20,800 --> 00:07:23,000 her check in and move forward through 她簽入並繼續前進 186 00:07:23,199 --> 00:07:25,040 dev that's integrating it into the 開發人員將其集成到 ML+DEV+OPS Integrating:整合 187 00:07:25,240 --> 00:07:27,259 application itself and then integrate 應用程式本身,然後集成 188 00:07:27,459 --> 00:07:29,569 again when it comes to rolling out to 再次涉及到 189 00:07:29,769 --> 00:07:32,629 production and when you do that you get 生產,當您這樣做時,您會得到 190 00:07:32,829 --> 00:07:34,430 these benefits right you get great 這些好處使您受益匪淺 191 00:07:34,629 --> 00:07:37,129 observability that means you exactly 可觀察性意味著您完全 192 00:07:37,329 --> 00:07:39,020 what's being rolled out so you're able 正在推出什麼,以便您能夠 193 00:07:39,220 --> 00:07:41,240 to observe it using standard toolings 使用標準工具進行觀察 194 00:07:41,439 --> 00:07:43,879 and reproducibility you get validation 和重現性得到驗證 195 00:07:44,079 --> 00:07:45,560 again because the stuff is checked in 再次因為東西被簽入 196 00:07:45,759 --> 00:07:48,560 and good you can do static validation 很好,您可以進行靜態驗證 Static:not moving validation:驗證 197 00:07:48,759 --> 00:07:50,870 you could do runtime validation by 您可以通過執行運行時驗證 198 00:07:51,069 --> 00:07:52,520 rolling it out to Canaries in a very 很快地將它推廣到金絲雀 199 00:07:52,720 --> 00:07:54,860 regimented way and then of course you 規整的方式,然後你當然 Regimented:array 200 00:07:55,060 --> 00:07:56,840 get reproducibility and auditability you 獲得可重複性和可審核性 Auditability:audit 201 00:07:57,040 --> 00:07:59,240 know that whatever that query was that 知道那個查詢是什麼 202 00:07:59,439 --> 00:08:01,069 prediction that happened at the end of 發生在末尾的預測 203 00:08:01,269 --> 00:08:03,079 that very long cycle you're able to 如此長的週期,您能夠 204 00:08:03,279 --> 00:08:05,300 trace back exactly what code rolled it 追溯確切的代碼 205 00:08:05,500 --> 00:08:07,790 out at every step of the way and in this 在這個過程中的每一步 206 00:08:07,990 --> 00:08:10,790 byte you get velocity plus security for 位元組,您可以獲得速度和安全性 207 00:08:10,990 --> 00:08:13,610 ml so some of the specific components 毫升,所以一些特定成分 208 00:08:13,810 --> 00:08:15,050 here and I'll just walk through this 在這裡,我將逐步流覽 209 00:08:15,250 --> 00:08:17,090 really quickly like I said you have the 真的很快,就像我說過的 210 00:08:17,290 --> 00:08:19,100 data scientist she checks in to get at 她簽到的資料科學家 211 00:08:19,300 --> 00:08:21,350 that point you do code set version or 那點你做代碼設置版本或 212 00:08:21,550 --> 00:08:23,210 you do code dataset and environment 您編寫資料集和環境代碼 213 00:08:23,410 --> 00:08:24,949 versioning you snapshot all of those 對所有這些版本進行快照 214 00:08:25,149 --> 00:08:27,230 things from there it automatically 從那裡自動進行的事情 215 00:08:27,430 --> 00:08:30,280 builds both the app and trains the model 同時構建應用程式和訓練模型 216 00:08:30,480 --> 00:08:32,750 and in that you're able to do hyper 而且你可以做超 Hyper:super 217 00:08:32,950 --> 00:08:34,218 parameter sweeps and all those kind of 參數掃描以及所有此類 Parameter: Sweeps:掃毒 218 00:08:34,418 --> 00:08:36,949 things in an automated way you validate 驗證的自動化方式 219 00:08:37,149 --> 00:08:38,629 the model at that testing point using 使用該測試點的模型 220 00:08:38,830 --> 00:08:41,299 things like model validation and 諸如模型驗證和 221 00:08:41,500 --> 00:08:43,309 certification and things like that you 認證和類似的東西 222 00:08:43,509 --> 00:08:45,139 release the app and then finally you 釋放應用程式,然後最終您 223 00:08:45,339 --> 00:08:47,029 roll it out and so where that's what 推出它,那在哪裡 224 00:08:47,230 --> 00:08:48,169 we're gonna try and get to here today 我們今天要去這裡 225 00:08:48,370 --> 00:08:49,879 and we're gonna do it right here on 我們將在這裡繼續 226 00:08:50,080 --> 00:08:52,578 stage for you well technically 從技術上為您做好舞臺 Technically:具體 227 00:08:52,778 --> 00:08:56,659 so the question is this sounds pretty 所以問題是聽起來很漂亮 228 00:08:56,860 --> 00:08:58,969 good I'd like to take it on what should 好,我想拿什麼做 229 00:08:59,169 --> 00:09:01,519 I do in order to achieve that well I 我為了做到這一點而努力 230 00:09:01,720 --> 00:09:03,378 have a suggestion for you go join one of 有一個建議給你去參加其中之一 231 00:09:03,578 --> 00:09:05,029 these big companies because they already 這些大公司,因為他們已經 232 00:09:05,230 --> 00:09:08,029 have and they have it it's you know at 有,他們有,這是你知道的 233 00:09:08,230 --> 00:09:10,428 Google it's tf-x and at Facebook it's FB Google是TF-X,Facebook是FB 234 00:09:10,629 --> 00:09:13,488 Lerner flow at uber it's Michelangelo 尤納的勒納潮流是米開朗基羅 235 00:09:13,688 --> 00:09:15,318 Microsoft has our own which is called 微軟有我們自己的被稱為 236 00:09:15,519 --> 00:09:19,039 ether and what this is it's a very 乙太,這是一個非常 Ether:magic in the air 237 00:09:19,240 --> 00:09:21,738 standard data science platform where a 標準資料科學平臺 238 00:09:21,938 --> 00:09:25,698 data scientist will come in and you know 資料科學家會進來,你知道 239 00:09:25,899 --> 00:09:27,469 be able to interact at a very high level 能夠進行高水準的互動 240 00:09:27,669 --> 00:09:30,139 see other people's experiments see her 看別人的實驗見她 241 00:09:30,339 --> 00:09:32,868 own experiments iterate and version on 自己的實驗進行反覆運算並在其上進行版本控制 242 00:09:33,068 --> 00:09:35,448 her own experiments and then basically 她自己的實驗,然後基本上 243 00:09:35,649 --> 00:09:37,099 download and begin to use those 下載並開始使用那些 244 00:09:37,299 --> 00:09:39,019 experiments rather than having to start 實驗而不是必須開始 245 00:09:39,220 --> 00:09:40,578 from scratch every time she can start 每次她可以從頭開始 Start from scratch:star from nothing 246 00:09:40,778 --> 00:09:42,618 there and and move forward rather than 在那裡,前進而不是 247 00:09:42,818 --> 00:09:46,669 you know resetting everything so you 你知道重置一切,所以你 248 00:09:46,870 --> 00:09:48,799 have rejected my suggestions 拒絕了我的建議 249 00:09:49,000 --> 00:09:50,479 that's very harsh of you I don't want to 對你來說我很苛刻 Harsh:too difficult 250 00:09:50,679 --> 00:09:51,498 work at a company so I'm gonna help you 在公司工作,所以我會幫助你 251 00:09:51,698 --> 00:09:55,039 by building your own ml ops platform 通過建立自己的ml ops平臺 252 00:09:55,240 --> 00:09:57,128 using first we'll do it with C ICD 首先使用C ICD 253 00:09:57,328 --> 00:10:01,099 second we'll add in git and third we'll 第二,我們添加git,第三,我們添加 254 00:10:01,299 --> 00:10:04,459 add a soon you start with Q plug that 添加一個很快您開始使用Q外掛程式 255 00:10:04,659 --> 00:10:07,608 was get in the middle or get lab or 在中間或實驗室 256 00:10:07,808 --> 00:10:09,019 bitbucket or whatever it might be and 比特桶或其他可能的東西 257 00:10:09,220 --> 00:10:11,659 then finally we'll do it with a C ICD 最後,我們將使用C ICD 258 00:10:11,860 --> 00:10:14,868 platform by the way I hope I did my best 平臺,希望我能做到最好 259 00:10:15,068 --> 00:10:16,818 I'm trying to be very neutral here all 我試圖在這裡保持中立 Neutral: 中立 260 00:10:17,019 --> 00:10:20,058 logos are the same size and by the way 徽標的大小和方式相同 261 00:10:20,259 --> 00:10:23,628 if you are thinking of inventing a new C 如果您正在考慮發明新的C 262 00:10:23,828 --> 00:10:27,289 ICD platform there are so many there are ICD平臺有很多 263 00:10:27,490 --> 00:10:31,849 so many C ICD platforms it's enough just 這麼多的C ICD平臺就足夠了 264 00:10:32,049 --> 00:10:34,128 help another one help one that already 幫助另一個已經幫助的人 265 00:10:34,328 --> 00:10:39,318 exists I promise you okay so joking 存在,我保證你還可以,所以開玩笑 Exists:egg+sister 266 00:10:39,519 --> 00:10:42,738 aside let's go ml ops in the real I'm 拋開現實,我走了 267 00:10:42,938 --> 00:10:45,828 gonna take on a challenge that many 將會面臨許多挑戰 268 00:10:46,028 --> 00:10:48,318 people I don't know if Corey's here 我不知道寇里在這裡的人 269 00:10:48,519 --> 00:10:49,669 Corey are you here I know that you're 寇里你在這裡我知道你在 270 00:10:49,870 --> 00:10:51,108 attending cube con but I'm not sure you 參加立方體騙局,但我不確定你 Con:conference 活動 271 00:10:51,308 --> 00:10:52,789 hear Corey doesn't believe that at 聽到寇里不相信 272 00:10:52,990 --> 00:10:55,248 Multi-cloud exists I'm gonna say that it 多面存在,我要說 Multi:many 273 00:10:55,448 --> 00:10:56,959 does and I'm gonna say it does in the 會,我會說它在 274 00:10:57,159 --> 00:10:58,969 following way multi-cloud really does 按照多雲真正的方法 275 00:10:59,169 --> 00:11:00,558 exist in the real world and I'm gonna 存在于現實世界中,我要 276 00:11:00,759 --> 00:11:03,529 lay out a very standard scenario and 提出一個非常標準的方案 Scenario:common story 277 00:11:03,730 --> 00:11:05,379 then solve it for you on stage 然後在舞臺上為你解決 278 00:11:05,580 --> 00:11:08,409 so what when I say multi-cloud this is 所以當我說多雲時 279 00:11:08,610 --> 00:11:10,929 what it looks like you have at the top 頂部看起來像什麼 280 00:11:11,129 --> 00:11:12,939 you have cloud whatever cloud it might 無論有沒有雲,您都有雲 281 00:11:13,139 --> 00:11:14,889 be and at the bottom you have on Prem be and在Prem的最底端 on Prem:on the local hard drive 282 00:11:15,089 --> 00:11:17,109 and again that could be that's a special 再次可能那是一個特殊的 283 00:11:17,309 --> 00:11:18,639 case of a second cloud it could be a 第二個雲的情況可能是 284 00:11:18,839 --> 00:11:19,448 second cloud 第二朵雲 285 00:11:19,649 --> 00:11:20,769 it could be on Prem could be your local 它可能在Prem上可能是您本地的 286 00:11:20,970 --> 00:11:22,539 laptop doesn't matter and in the middle 筆記型電腦沒關係,在中間 287 00:11:22,740 --> 00:11:26,529 you have get the reason you've chosen 你有選擇的理由 288 00:11:26,730 --> 00:11:28,750 that top cloud is because it's 那頂雲是因為 289 00:11:28,950 --> 00:11:30,008 distributed is something that your 分發是你的事 Distributed:to give to many people 290 00:11:30,208 --> 00:11:31,899 datacenter can't offer you today maybe 資料中心今天可能無法為您提供服務 291 00:11:32,100 --> 00:11:32,828 it's up time 時間到了 up time:how much time it takes on the computer 292 00:11:33,028 --> 00:11:34,959 maybe it's locality because of 也許是因為 Locality(adv) 293 00:11:35,159 --> 00:11:37,059 regulatory reasons maybe it's just 監管方面的原因可能僅僅是 Regulatory(V):規制 294 00:11:37,259 --> 00:11:39,189 closer to your IOT deployments right you 更接近您的物聯網部署 deploy =Distributed:to give to many people 295 00:11:39,389 --> 00:11:41,139 don't want to have that far big Latancy 不想擁有那麼大的萊頓 Latency:ping = ping test 296 00:11:41,339 --> 00:11:43,419 sees and so on but the reason used to 看到等等,但以前的原因 297 00:11:43,620 --> 00:11:44,679 have stuff on prem is because you 在prem上有東西是因為你 298 00:11:44,879 --> 00:11:45,909 actually have a lot of data that you've 實際上有很多資料 299 00:11:46,110 --> 00:11:48,128 been collecting for many years and it's 已經收集了很多年了 300 00:11:48,328 --> 00:11:49,899 just a lot a big pain in the buttom to 屁股上的痛苦很大 a big pain:a big problem again 301 00:11:50,100 --> 00:11:52,599 move it to the cloud we see this 將其移動到雲中,我們看到了 302 00:11:52,799 --> 00:11:54,399 scenario all the time and it's whether 情況一直存在,這是否 Scenario:common situation 303 00:11:54,600 --> 00:11:56,198 or not you know maybe that's a poor 還是你不知道那是一個窮人 304 00:11:56,399 --> 00:11:57,698 latency connection or maybe you just 延遲連接,也許你只是 Latency: 305 00:11:57,899 --> 00:12:00,159 don't want to spend you know many many 不想花你知道很多很多 306 00:12:00,360 --> 00:12:01,689 millions of dollars on petabytes of data PB級數據的數百萬美元 小到大Megabyte->Gigabyte->terabyte(1000GB)->Petabytes(1000000GB) 307 00:12:01,889 --> 00:12:02,769 being held in the cloud because you 被困在雲端,因為你 308 00:12:02,970 --> 00:12:03,849 already have a bunch of disks and it's 已經有一堆光碟了 309 00:12:04,049 --> 00:12:06,149 doing its job you don't want to do that 做它的工作,你不想那樣做 310 00:12:06,350 --> 00:12:09,008 so and then we have our to cast of 所以然後我們要 311 00:12:09,208 --> 00:12:10,659 characters we have our data scientist 我們擁有資料科學家的角色 312 00:12:10,860 --> 00:12:13,448 she wants to iterate very quickly on the 她想非常快速地反覆運算 Iterate:explain 313 00:12:13,649 --> 00:12:15,819 the model and make sure that it works 模型並確保其有效 314 00:12:16,019 --> 00:12:17,799 and she's gonna want to train next to 她要在旁邊訓練 315 00:12:18,000 --> 00:12:20,198 the model but then the sres 模型,然後是sres 316 00:12:20,399 --> 00:12:21,698 and the ml engineers are going to want 和毫升工程師會想要 317 00:12:21,899 --> 00:12:25,899 to host in the cloud so let's do this so 託管在雲中,所以我們這樣做 318 00:12:26,100 --> 00:12:28,508 what she does is first she's gonna check 她要做的是首先要檢查 319 00:12:28,708 --> 00:12:32,500 in her code and to get will be 在她的代碼,並獲得將是 320 00:12:32,700 --> 00:12:34,809 watched by the CI CD pipeline for a new CI CD管道正在監視新的 321 00:12:35,009 --> 00:12:37,179 check-in and that will automatically 簽入,這將自動 322 00:12:37,379 --> 00:12:41,198 trigger a cube flow pipeline it that cue 觸發一個提示的多維資料集流管道 323 00:12:41,399 --> 00:12:43,029 flow pipeline will process the data 流管道將處理資料 324 00:12:43,230 --> 00:12:44,769 first in whatever way it needs 首先以任何需要的方式 325 00:12:44,970 --> 00:12:46,959 processing or if it's already been 正在處理或已經處理過 326 00:12:47,159 --> 00:12:48,819 processed and check pointed then it can 處理並檢查指出,然後可以 327 00:12:49,019 --> 00:12:50,319 skip to the next step and cue flows 跳到下一步並提示流程 328 00:12:50,519 --> 00:12:51,429 smart enough to do that 足夠聰明地做到這一點 329 00:12:51,629 --> 00:12:54,479 it then will run a training run 然後它將進行一次訓練 330 00:12:54,679 --> 00:12:58,120 automatically on that process data again 再次自動處理該過程資料 331 00:12:58,320 --> 00:12:59,859 using whatever parameters you've passed 使用您傳遞的任何參數 332 00:13:00,059 --> 00:13:02,409 in via get right again this isn't 通過再次獲得權利這不是 333 00:13:02,610 --> 00:13:03,609 something where the data scientist is 資料科學家所在的地方 334 00:13:03,809 --> 00:13:05,318 picking it on the fly you want to be 隨時隨地挑選它 335 00:13:05,519 --> 00:13:08,258 extremely prescriptive about this once 關於這一次非常規範 Prescriptive:don’t give wrong code 336 00:13:08,458 --> 00:13:10,059 that's done it's gonna register the 這樣就可以註冊 Register:報名 337 00:13:10,259 --> 00:13:12,459 model at a central point which can be 可以在一個中心點建模 338 00:13:12,659 --> 00:13:15,669 picked up and then rolled out to the 拿起然後展開到 339 00:13:15,870 --> 00:13:17,139 cloud and in this case you would roll it 雲,在這種情況下,您將其滾動 340 00:13:17,339 --> 00:13:19,120 out to something like a canary end point 到金絲雀終點 341 00:13:19,320 --> 00:13:21,009 or a staging endpoint to make sure that 或暫存端點以確保 342 00:13:21,210 --> 00:13:24,069 it runs a human being often will after 它使一個人經常會 343 00:13:24,269 --> 00:13:26,500 the tests of paws come by make sure 通過檢查爪子來確保 344 00:13:26,700 --> 00:13:29,469 everything is operating properly that 一切運行正常 345 00:13:29,669 --> 00:13:31,809 the human being will then say okay it's 然後,人類會說,好的 346 00:13:32,009 --> 00:13:33,669 time for me to trigger the next step and 是我觸發下一步的時間了, 347 00:13:33,870 --> 00:13:35,799 that's at that point it gets rolled out 到那時它被推出 348 00:13:36,000 --> 00:13:38,259 to a public service endpoint and use for 到公共服務端點並用於 349 00:13:38,460 --> 00:13:42,639 the model for production okay so that's 生產模型還可以,所以 350 00:13:42,840 --> 00:13:44,319 the high level let's see what this 高層次,讓我們看看這是什麼 351 00:13:44,519 --> 00:13:47,689 actually looks like when it's running 實際上看起來像是在運行時 352 00:13:47,779 --> 00:13:53,889 and yes you can okay well this is gonna 是的,你可以很好,這將會 353 00:13:54,090 --> 00:13:55,359 be very hard because I shoulder the 很難,因為我肩負著 354 00:13:55,559 --> 00:13:57,819 whole time alright so this is um you 整個時間都很好,所以這是你 355 00:13:58,019 --> 00:13:59,919 know the one downside of doing we're 知道做我們的一個缺點 Downside:壞處 356 00:14:00,120 --> 00:14:02,079 moving into ml is you have to record all 進入毫升是你必須記錄所有 357 00:14:02,279 --> 00:14:04,049 your videos because everything takes forever 您的視頻會永遠解決一切 358 00:14:04,250 --> 00:14:07,089 so I promise you can look at the 所以我保證你可以看看 359 00:14:07,289 --> 00:14:08,379 date I don't know where the data is on 日期,我不知道資料在哪裡 360 00:14:08,580 --> 00:14:10,000 anyone see a date on there this is 有人在那裡看到約會 361 00:14:10,200 --> 00:14:12,189 literally run last night so I promise 昨天晚上確實跑了,所以我保證 362 00:14:12,389 --> 00:14:14,109 you can go to this repo which is 你可以去這個倉庫 363 00:14:14,309 --> 00:14:15,879 public and download all the code and 公開並下載所有代碼, 364 00:14:16,080 --> 00:14:18,939 have a good time okay where are you 玩得開心吧,你在哪裡 365 00:14:19,139 --> 00:14:23,199 okay so first is this running it is okay 好的,所以首先運行它是可以的 366 00:14:23,399 --> 00:14:25,539 so first what you see here this is a 首先,您在這裡看到的是 367 00:14:25,740 --> 00:14:28,719 very standard repo out there and it's 非常標準的倉庫在那裡 368 00:14:28,919 --> 00:14:30,309 got all the code in it and I let me show 裡面有所有代碼,我讓我展示 369 00:14:30,509 --> 00:14:31,870 you actually exactly what this code 您實際上正是這段代碼 370 00:14:32,070 --> 00:14:36,920 looks like here see this you cannot oh 看起來像這裡看到你不能哦 371 00:14:41,389 --> 00:14:46,059 okay so they always say never show code 好吧,所以他們總是說從不顯示代碼 372 00:14:46,259 --> 00:14:48,159 when you're doing a demo because 在進行演示時,因為 373 00:14:48,360 --> 00:14:49,779 everyone's eyes glaze over but these are 每個人的眼睛都凝視著,但這是 eyes glaze over:they don’t understand 374 00:14:49,980 --> 00:14:51,639 very smart room so I'm gonna trust you 非常聰明的房間,所以我會相信你 375 00:14:51,840 --> 00:14:53,949 your eyes are not going to leave over so 你的眼睛不會離開所以 376 00:14:54,149 --> 00:14:55,449 what you have here is this is a very 你在這裡是一個非常 377 00:14:55,649 --> 00:14:57,939 very standard CI CD pipeline this is 非常標準的CI CD管道,這是 378 00:14:58,139 --> 00:14:59,949 using Azure dev ops which is something 使用Azure開發人員操作 379 00:15:00,149 --> 00:15:02,409 that we use hosted dev ops CI CD on 我們使用託管的dev ops CI CD Host: the leader 380 00:15:02,610 --> 00:15:04,659 Azure but the Yambol is gonna look super 蔚藍但Yambol看起來超級好 381 00:15:04,860 --> 00:15:06,250 familiar because it follows very 熟悉,因為它遵循 382 00:15:06,450 --> 00:15:08,769 standard cid CD practices looks just 標準CID CD的做法看起來只是 383 00:15:08,970 --> 00:15:11,589 like jenkins and what you see here is at 就像詹金斯,你在這裡看到的是 384 00:15:11,789 --> 00:15:13,089 the top you see a trigger that means 在頂部,您會看到一個觸發器,這意味著 385 00:15:13,289 --> 00:15:14,620 it's triggering off the master once you 一旦你觸發了主人 386 00:15:14,820 --> 00:15:18,279 check in and here you have individual 簽入,這裡有個人 387 00:15:18,480 --> 00:15:20,199 steps this in fact does a build of 步驟這實際上是建立 388 00:15:20,399 --> 00:15:23,829 container and we do three builds here 容器,我們在這裡進行三個構建 389 00:15:24,029 --> 00:15:26,039 because that's what cube flow requires 因為那是多維資料集流所需要的 390 00:15:26,240 --> 00:15:28,809 cube flow needs every one of your steps 多維資料集流需要您執行的每一步 391 00:15:29,009 --> 00:15:31,089 to be built into a container and then 放在一個容器裡然後 392 00:15:31,289 --> 00:15:32,939 you roll that for 你為 393 00:15:33,139 --> 00:15:34,829 and into your cute flow pipeline and 並進入您可愛的流程管道, 394 00:15:35,029 --> 00:15:37,469 what you're rolling in what the pipeline 您在管道中滾動什麼 395 00:15:37,669 --> 00:15:42,449 looks like is this and actually why 看起來是這樣,實際上是為什麼 396 00:15:42,649 --> 00:15:43,879 don't I get to this in just a second so 我不是一秒鐘就能做到這一點嗎 397 00:15:44,080 --> 00:15:48,329 we'll go back to our video here and like 我們將在這裡回到我們的視頻 398 00:15:48,529 --> 00:15:52,859 I said what you see is a standard repo 我說你看到的是標準回購 399 00:15:53,059 --> 00:15:56,969 or is my thing here and that was just 還是我在這裡的東西,那僅僅是 400 00:15:57,169 --> 00:15:58,709 the code that you saw me pointing at 您看到我指向的代碼 401 00:15:58,909 --> 00:16:01,289 earlier with standard pipelines and this 早期使用標準管道,這 402 00:16:01,490 --> 00:16:03,779 is a very important one it does burritos 是一個非常重要的墨西哥卷餅 403 00:16:03,980 --> 00:16:05,909 versus tacos you upload and it'll tell 與您上傳的炸玉米餅,它將告訴您 404 00:16:06,110 --> 00:16:06,870 you whether or not something Sabrina 你是否有薩布麗娜的東西 405 00:16:07,070 --> 00:16:08,519 virgin taco we're just a hardware heart 原始的炸玉米餅,我們只是硬體的心臟 406 00:16:08,720 --> 00:16:10,289 problem that you would think you know a 你以為自己知道的問題 407 00:16:10,490 --> 00:16:13,229 half-open burrito what you get the idea 半開卷餅你有什麼主意 408 00:16:13,429 --> 00:16:15,659 all right so we moving on here and you 好吧,我們繼續在這裡和你 409 00:16:15,860 --> 00:16:17,309 can see what it's doing is this code 可以看到這段代碼在做什麼 410 00:16:17,509 --> 00:16:19,289 that's in the repo right now is doing 現在在回購中 411 00:16:19,490 --> 00:16:21,569 transfer learning using very standard 使用非常標準的轉移學習 412 00:16:21,769 --> 00:16:23,459 transfer learning tool called mobile net 轉移學習工具稱為移動網 413 00:16:23,659 --> 00:16:28,579 and so we layer on top of that so 所以我們在此之上 414 00:16:28,779 --> 00:16:31,169 like I said this is just incredibly 就像我說的那樣 Incredibly:so much can’t believe 415 00:16:31,370 --> 00:16:34,289 pipeline code and these are the pipeline 管道代碼,這些是管道 416 00:16:34,490 --> 00:16:35,849 steps that you'll see here and I've into 您將在這裡看到的步驟,我已經進入 417 00:16:36,049 --> 00:16:38,099 those in just a second but the idea is 那些只是一秒鐘,但想法是 418 00:16:38,299 --> 00:16:41,849 that we want to show how when you check 我們想告訴您如何檢查 419 00:16:42,049 --> 00:16:43,139 in it's going to kick all these things 在其中踢所有這些東西 Kick:throw 420 00:16:43,340 --> 00:16:45,750 off so we come back here and we're gonna 離開,所以我們回到這裡,我們要 421 00:16:45,950 --> 00:16:52,259 hit merge on our pull request and it's 在我們的拉取請求上點擊合併, Merge:put somethings together Request:要求 pull request:ask something from server 422 00:16:52,460 --> 00:16:53,279 gonna kick off there we go 要去那裡開始 kick off:to start 423 00:16:53,480 --> 00:16:57,120 so we're hitting merge and off we go 所以我們要合併然後離開 424 00:16:57,320 --> 00:16:59,759 and there you go so this is the UI for a 然後去,所以這是 425 00:16:59,960 --> 00:17:02,490 DevOps again all we did was hit merge DevOps我們所做的一切再次被合併 426 00:17:02,690 --> 00:17:05,309 and it all this kicked off so at this 這一切開始了 427 00:17:05,509 --> 00:17:07,200 point we're gonna go forward you can see 點,我們要前進,你可以看到 428 00:17:07,400 --> 00:17:09,059 there right there build number nine has 那邊有九號樓 429 00:17:09,259 --> 00:17:11,549 just kicked off and that is you know May 剛剛開始,那就是你知道的梅 430 00:17:11,750 --> 00:17:13,649 21st so you know I'm being honest here 21號所以你知道我在這裡很誠實 21st:20+first Honest: onest 431 00:17:13,849 --> 00:17:15,059 and you can see how the things are 你會看到事情如何 432 00:17:15,259 --> 00:17:17,339 aspiring here so in this case it's going 嚮往這裡,所以在這種情況下 Aspiring:to dream for something 433 00:17:17,539 --> 00:17:18,960 to go through those three build steps 完成這三個構建步驟 434 00:17:19,160 --> 00:17:20,220 where it builds each one of the 它在其中構建每個 435 00:17:20,420 --> 00:17:22,740 containers using standard as your build 使用標準容器的容器 436 00:17:22,940 --> 00:17:25,499 tools and again this can be docker this 工具,這又可以是docker this 437 00:17:25,699 --> 00:17:26,759 could be builder this could be anything 可能是建設者,這可能是任何事情 438 00:17:26,959 --> 00:17:29,399 that makes sense for you and you've all 這對您有意義,而您所有人 makes sense:有道理 439 00:17:29,599 --> 00:17:31,139 seen docker build many many times so I'm 看到碼頭工人建造了很多次,所以我 440 00:17:31,339 --> 00:17:35,220 going to fast forward through those okay 快點走過去 441 00:17:35,420 --> 00:17:37,589 and then this last step is where you 然後這最後一步是您 442 00:17:37,789 --> 00:17:39,809 take all those containers and now you've 拿走所有這些容器,現在您已經 443 00:17:40,009 --> 00:17:42,899 uploaded them to in this case a CR but 在這種情況下將它們上傳到CR, 444 00:17:43,099 --> 00:17:45,930 it could be any docker registry okay 可能是任何Docker註冊表都可以 445 00:17:46,130 --> 00:17:48,598 so now that is done and now we have the 所以現在完成了,現在我們有了 446 00:17:48,798 --> 00:17:50,129 interesting bit because that is the 有趣的是,因為那是 Bit:small thing 447 00:17:50,329 --> 00:17:52,559 build pipeline our steps here are first 建立管道我們的步驟是首先 448 00:17:52,759 --> 00:17:54,419 off a build pipeline where it takes 離開需要的構建管道 449 00:17:54,619 --> 00:17:57,509 those containers them and then we do 這些容器,然後我們做 450 00:17:57,710 --> 00:17:59,129 what's called a release pipeline and you 所謂的發佈管道,你 Release:to let something go 451 00:17:59,329 --> 00:18:01,049 want those to be separate right you want 想要那些分開的權利 452 00:18:01,250 --> 00:18:02,250 to make sure the build is complete that 確保構建完成 453 00:18:02,450 --> 00:18:04,169 all the artifacts pass and already go 所有的文物都過去了,已經過去了 454 00:18:04,369 --> 00:18:05,940 and now we're going to do this release 現在我們要發佈此版本 455 00:18:06,140 --> 00:18:07,829 pipeline and the release pipeline is 管道和發佈管道是 456 00:18:08,029 --> 00:18:11,279 where it reaches out to cube flow and 它伸向立方體流動的地方 reaches out:try to catch something from far away 457 00:18:11,480 --> 00:18:13,680 executes and you can see this here it's 執行,您可以在這裡看到它 Executes:.exe 458 00:18:13,880 --> 00:18:16,769 reaching out using a swagger client that 使用招搖的客戶 459 00:18:16,970 --> 00:18:19,139 we wrote to connect directly to the cue 我們寫了直接連接到提示 460 00:18:19,339 --> 00:18:20,789 flow API you can see there it's 流API,您可以看到它的 461 00:18:20,990 --> 00:18:24,479 forwarding the connection and here you 轉發連接,在這裡您 462 00:18:24,679 --> 00:18:26,098 can see the run these are the rough 可以看到運行這些是粗糙的 463 00:18:26,298 --> 00:18:28,769 he had previously we run 84 we had a lot 他以前有我們跑84我們有很多 464 00:18:28,970 --> 00:18:31,019 of debugging not cue flows fault our 調試不提示流故障我們 Fault:mistake 465 00:18:31,220 --> 00:18:33,429 fault 故障 466 00:18:34,390 --> 00:18:37,858 and presto we're at run eighty-five 前提是我們現在快八十五了 Presto(口語):immediately馬上 467 00:18:38,058 --> 00:18:41,430 right so again I want to call back 對,所以我想再次致電 468 00:18:41,630 --> 00:18:43,108 because this is a small but special 因為這很小但是很特別 469 00:18:43,308 --> 00:18:45,180 thing all I did was kick hit get all I 我所做的就是踢得到我所有 470 00:18:45,380 --> 00:18:46,919 did was check in and merge my final PR 做的是檢查併合並我的最終PR 471 00:18:47,119 --> 00:18:48,799 and it kicked off all of this goodness 它開始了所有的美好 472 00:18:49,000 --> 00:18:51,389 so now we were sending through here and 所以現在我們通過這裡發送 473 00:18:51,589 --> 00:18:54,119 you can see a rerunning know okay so 你可以看到一個重新運行的知道,所以 Rerunning:run again 474 00:18:54,319 --> 00:18:56,579 this is a standard cue put pipeline I'd 這是我想要的標準提示放置管道 475 00:18:56,779 --> 00:18:58,019 already described it does pre 已經描述過了 476 00:18:58,220 --> 00:18:59,789 processing here it's mounting in Azure 在這裡處理它正在Azure中安裝 477 00:18:59,990 --> 00:19:02,190 blob and you can see it watching the 一滴,你可以看到它看著 478 00:19:02,390 --> 00:19:03,180 logs right there 記錄在那裡 479 00:19:03,380 --> 00:19:05,309 and you can see the artifacts once 你可以一次看到文物 480 00:19:05,509 --> 00:19:06,899 that's done it automatically moves 完成後,它會自動移動 481 00:19:07,099 --> 00:19:09,750 forward the containers been created and 轉發已創建的容器,並 482 00:19:09,950 --> 00:19:11,399 here it's actually doing a training run 這實際上是在進行訓練 483 00:19:11,599 --> 00:19:13,259 right now it's using the latest 現在正在使用最新的 484 00:19:13,460 --> 00:19:16,049 tensorflow 2.0 you can see and not just 你可以看到的不僅僅是tensorflow 2.0 2.0:two point o 485 00:19:16,250 --> 00:19:17,700 tensorflow 2.0 but let me just highlight tensorflow 2.0,但讓我強調一下 486 00:19:17,900 --> 00:19:19,680 that right there it's using GPUs 就在那裡,它正在使用GPU 487 00:19:19,880 --> 00:19:22,559 natively on kubernetes directly through 直接通過kubernetes在本地 488 00:19:22,759 --> 00:19:27,180 this check it ok and it's going to 這項檢查確定,它將 489 00:19:27,380 --> 00:19:29,190 operate and run quite fast much faster 快速運行和運行 490 00:19:29,390 --> 00:19:30,899 than it ran when it was running on my 比它在我的電腦上運行時所運行的 491 00:19:31,099 --> 00:19:36,180 local machine forward there and now it's 本地機器向前走,現在 492 00:19:36,380 --> 00:19:38,190 doing registration and this is where you 做註冊,這是你的地方 493 00:19:38,390 --> 00:19:40,588 start to mix and match between things 開始在事物之間融合和匹配 494 00:19:40,788 --> 00:19:42,000 running in queue flow and things that 在佇列流中運行以及 495 00:19:42,200 --> 00:19:44,608 you may want to run in the cloud so in 您可能想在雲中運行,所以在 496 00:19:44,808 --> 00:19:46,379 this case it's going to do this last 在這種情況下,它將持續到最後 497 00:19:46,579 --> 00:19:49,889 register step in Azure we have a service 在Azure中註冊步驟我們有一項服務 498 00:19:50,089 --> 00:19:51,809 called the model management service this 稱為模型管理服務 499 00:19:52,009 --> 00:19:54,419 is basically a sophisticated storage 基本上是一個複雜的存儲 Sophisticated:very developed 500 00:19:54,619 --> 00:19:56,399 location which understands machine 瞭解機器的位置 501 00:19:56,599 --> 00:19:59,159 learning concepts first-class way 學習概念的一流方法 Concepts:idea 502 00:19:59,359 --> 00:20:01,229 and by that what I mean is you're able 我的意思是說你有能力 503 00:20:01,429 --> 00:20:02,789 to understand whether or not something 瞭解是否有東西 504 00:20:02,990 --> 00:20:05,369 needs versioning how to compare versions 需要版本控制如何比較版本 505 00:20:05,569 --> 00:20:06,960 within each other what the performance 彼此之間的表現 506 00:20:07,160 --> 00:20:08,460 of something is and you're able to 的東西是,你能夠 507 00:20:08,660 --> 00:20:11,819 visualize it in a first-class way and so 以一流的方式視覺化 Visualize:to see something with your eye close 508 00:20:12,019 --> 00:20:13,049 what it's going to do here at this last 這最後要在這裡做什麼 509 00:20:13,250 --> 00:20:15,659 step here are three versions that we ran 這是我們運行的三個版本 510 00:20:15,859 --> 00:20:19,379 already and once it's done you can see 已經,一旦完成,您可以看到 511 00:20:19,579 --> 00:20:22,129 here this is the model registry service 這是模型註冊服務 512 00:20:22,329 --> 00:20:27,428 it's going to kick it off any minute 它會在任何時候開始 any minute:very soon 513 00:20:28,599 --> 00:20:32,069 yeah okay so you can see here it's using 是的,所以你可以看到這裡正在使用 514 00:20:32,269 --> 00:20:35,039 in this case the azure SDK natively 在這種情況下,本機azure SDK 515 00:20:35,240 --> 00:20:37,079 inside of cube flow it's able to reach 在多維資料集流內部,它可以達到 516 00:20:37,279 --> 00:20:38,879 out and you can see they're using our 出去,你可以看到他們正在使用我們的 517 00:20:39,079 --> 00:20:40,829 service principles service passwords the 服務原則服務密碼 Principles:rules 518 00:20:41,029 --> 00:20:42,960 subscription information it's able to 能夠訂閱的資訊 Subscription:訂閱 519 00:20:43,160 --> 00:20:45,539 take that hc5 file and simply push it up 提取該hc5文件,然後將其向上推 520 00:20:45,740 --> 00:20:48,119 to the model registry service which then 到模型註冊服務,然後 521 00:20:48,319 --> 00:20:50,039 will kick off the second half of the 將在下半場開始 522 00:20:50,240 --> 00:20:55,559 solution and you can see there so that's 解決方案,您可以在那裡看到,所以 523 00:20:55,759 --> 00:20:57,690 roughly what it is you know rolling 大概是什麼,你知道滾動 524 00:20:57,890 --> 00:21:00,029 things out here's where you can see some 在這裡可以看到一些東西 525 00:21:00,230 --> 00:21:01,079 of the richness that we're able to 我們能夠做到的豐富 526 00:21:01,279 --> 00:21:02,598 upload and the metadata that we have 上傳以及我們擁有的中繼資料 Metadata:data about data 527 00:21:02,798 --> 00:21:06,990 relative to these things and no I don't 相對於這些東西,不,我不 528 00:21:07,190 --> 00:21:13,108 need you they were real so that's what 需要你,他們是真實的,那就是 529 00:21:13,308 --> 00:21:15,389 model that's what Ops looks like in 這就是ops的樣子 530 00:21:15,589 --> 00:21:16,950 reality right and from that point 現實的權利,從那時起 531 00:21:17,150 --> 00:21:18,089 forward you know I just rolled out a 轉發,你知道我剛剛推出了 532 00:21:18,289 --> 00:21:19,289 service endpoint everyone seen that 每個人都看到的服務端點 533 00:21:19,490 --> 00:21:23,069 right nothing special there but that's 沒什麼特別的,但是那是 534 00:21:23,269 --> 00:21:24,358 what we're talking about this is 我們正在談論的是 535 00:21:24,558 --> 00:21:26,159 something where you had a cube flow 立方體流動的地方 536 00:21:26,359 --> 00:21:29,190 cluster that was in no way connected to 完全沒有連接的集群 537 00:21:29,390 --> 00:21:31,289 you know any of the more sophisticated 你知道任何更複雜的 538 00:21:31,490 --> 00:21:33,598 azor or other components and with just azor或其他組件 539 00:21:33,798 --> 00:21:36,539 running standard kubernetes and standard 運行標準的kubernetes和標準 540 00:21:36,740 --> 00:21:38,819 cube flow and the reality is that 立方體流動,而現實是 541 00:21:39,019 --> 00:21:41,309 gives you the opportunity to do the 給你機會去做 542 00:21:41,509 --> 00:21:42,869 training in a way that makes sense for 進行有意義的培訓 543 00:21:43,069 --> 00:21:45,299 your organization you don't have to take 您的組織,您不必 544 00:21:45,500 --> 00:21:46,769 your data you don't have to change the 您的資料,您不必更改 545 00:21:46,970 --> 00:21:48,450 way your processes work you don't have 您沒有的流程工作方式 546 00:21:48,650 --> 00:21:51,598 to adopt some crazy new standard just 採取一些瘋狂的新標準 Adopt:to take something not yours and raise it 547 00:21:51,798 --> 00:21:53,848 because that looks slightly faster this 因為這看起來快一點 Slightly:a little 548 00:21:54,048 --> 00:21:55,979 is an open platform where you can pick 是一個開放的平臺,您可以在其中選擇 549 00:21:56,179 --> 00:21:57,839 and choose and plug in the components 然後選擇並插入元件 550 00:21:58,039 --> 00:21:59,789 that make sense to you and that's the 對你有意義,那就是 551 00:21:59,990 --> 00:22:01,319 power that things like open source 像開源這樣的東西 552 00:22:01,519 --> 00:22:03,750 and things like Q flow provide now I 現在像Q flow這樣的東西 553 00:22:03,950 --> 00:22:05,219 know what you're saying it sounds like a 知道你在說什麼聽起來像 554 00:22:05,419 --> 00:22:07,529 lot of work I would like to recall this 我想回顧很多工作 Recall:remember 555 00:22:07,730 --> 00:22:09,659 statement here you know what's a lot of 聲明在這裡,你知道很多 556 00:22:09,859 --> 00:22:10,829 work eleven months sitting around 工作十一個月 557 00:22:11,029 --> 00:22:11,940 without your model shipping out to 無需將模型運送到 558 00:22:12,140 --> 00:22:12,399 production 生產 559 00:22:12,599 --> 00:22:15,039 that's a lot of work this you know okay 這是很多工作,你知道嗎 560 00:22:15,240 --> 00:22:16,509 you have to spend a few minutes making 你必須花幾分鐘的時間 561 00:22:16,710 --> 00:22:18,129 sure that service principles are able to 確保服務主體能夠 Principles: 主體 562 00:22:18,329 --> 00:22:20,558 cross clouds but other than that you'll 跨雲,但除此之外, 563 00:22:20,759 --> 00:22:24,698 be fine and you're not just fine 很好,你不只是很好 564 00:22:24,898 --> 00:22:27,549 right you're actually moving faster and 對,實際上您的移動速度更快, 565 00:22:27,750 --> 00:22:29,438 and that's what we're really trying to 這就是我們真正想要的 566 00:22:29,638 --> 00:22:32,318 provide here data scientists understand 提供資料科學家瞭解 data scientists:algorithm 567 00:22:32,519 --> 00:22:36,519 their problem areas but too often they 他們的問題領域,但他們經常 568 00:22:36,720 --> 00:22:38,318 have thrown things over the wall and 把東西扔在牆上, 569 00:22:38,519 --> 00:22:41,169 asked production to ultimately you know 要求生產,最終你知道 Ultimately:in the end 570 00:22:41,369 --> 00:22:43,568 deal with a lot of the hardness what 處理很多硬度什麼 Hardness:difficult thing 571 00:22:43,769 --> 00:22:45,188 we're saying is we want to bring 我們是說我們要帶 572 00:22:45,388 --> 00:22:47,799 through things like ml ops we want to 通過像ml ops這樣的事情,我們想要 573 00:22:48,000 --> 00:22:50,108 bring the same advances that get ops 帶來同樣的進步 574 00:22:50,308 --> 00:22:52,808 brought to app developers we want to 帶給我們想要的應用程式開發人員 575 00:22:53,009 --> 00:22:55,568 help data scientists be part of how to 説明資料科學家成為如何 576 00:22:55,769 --> 00:22:57,789 bring things to production and get into 把東西帶入生產並進入 577 00:22:57,990 --> 00:23:00,039 end ownership and even better than that 最終擁有權甚至比這更好 578 00:23:00,240 --> 00:23:02,469 help them learn real software 説明他們學習真正的軟體 579 00:23:02,669 --> 00:23:04,659 engineering best practices like for too 工程最佳實踐,例如 580 00:23:04,859 --> 00:23:06,308 long we've said hey data 很久以來,我們已經說過嘿資料 581 00:23:06,509 --> 00:23:08,108 scientists you know don't worry about 你認識的科學家不用擔心 582 00:23:08,308 --> 00:23:11,259 you know sweet best practices you know 你知道甜蜜的最佳實踐 583 00:23:11,460 --> 00:23:13,959 come back when you're older no that is 當你長大回來時,那不是 584 00:23:14,159 --> 00:23:16,000 not okay these are these are software 不好,這些是軟體 585 00:23:16,200 --> 00:23:17,740 engineers they want to help get things 他們想要幫助的工程師 586 00:23:17,940 --> 00:23:19,240 into reduction they want their models to 減少他們想要他們的模型 587 00:23:19,440 --> 00:23:21,399 be used just as much as you do we need 盡可能多地使用,我們需要 588 00:23:21,599 --> 00:23:23,709 to help them bridge the gap but not by 幫助他們彌合差距,但不能 bridge the gap: 彌合差距