1 00:00:05,850 --> 00:00:07,713 欢迎来到 Hugging Face 课程。 Welcome to the Hugging Face Course. 2 00:00:08,550 --> 00:00:10,320 本课程旨在带您了解 This course has been designed to teach you 3 00:00:10,320 --> 00:00:12,750 关于 Hugging Face 生态系统的一切 all about the Hugging Face ecosystem, 4 00:00:12,750 --> 00:00:14,700 包括如何使用数据集和模型中心 how to use the dataset and model hub 5 00:00:14,700 --> 00:00:16,803 以及我们所有的开源库。 as well as all our open-source libraries. 6 00:00:18,300 --> 00:00:19,950 这是本课程的目录。 Here is the Table of Contents. 7 00:00:19,950 --> 00:00:22,770 它共分为三个部分 As you can see, it's divided in three sections 8 00:00:22,770 --> 00:00:25,110 由浅入深地带您学习。 which become progressively more advanced. 9 00:00:25,110 --> 00:00:28,500 到目前为止,前两部分已经发布。 At this stage, the first two sections have been released. 10 00:00:28,500 --> 00:00:30,120 在课程的一开始,我们会教您基础知识 So first, we'll teach you the basics 11 00:00:30,120 --> 00:00:32,250 包括如何使用 Transformer 模型, of how to use a Transformer model, 12 00:00:32,250 --> 00:00:34,230 以及如何基于您自己的数据集上进行微调 fine-tune it on your own data set 13 00:00:34,230 --> 00:00:36,960 并与社区分享结果。 and share the result with the community. 14 00:00:36,960 --> 00:00:39,420 然后,我们将带您深入了解我们的开源库 So second, we'll dive deeper into our libraries 15 00:00:39,420 --> 00:00:42,360 并教您如果能够处理任何 NLP 任务。 and teach you how to tackle any NLP task. 16 00:00:42,360 --> 00:00:44,430 我们正在积极研究最后一部分 We're actively working on the last one 17 00:00:44,430 --> 00:00:47,280 并希望在 2022 年春季完成并发布。 and hope to have it ready for you for the spring of 2022. 18 00:00:48,510 --> 00:00:50,880 第一章不需要技术知识 The first chapter requires no technical knowledge 19 00:00:50,880 --> 00:00:52,320 只会为您介绍一些基础知识 and is a good introduction to learn 20 00:00:52,320 --> 00:00:54,180 例如 Transformers 模型可以做什么 what Transformers models can do 21 00:00:54,180 --> 00:00:56,883 以及它如何帮助到您以及应用到您公司的业务中。 and how it could be of use to you or your company. 22 00:00:58,050 --> 00:01:01,110 第一部分之后的章节需要具备 Python 的相关知识 The next chapters require a good knowledge of Python 23 00:01:01,110 --> 00:01:02,130 以及机器学习和深度学习的 and some basic knowledge of 24 00:01:02,130 --> 00:01:04,350 一些基础知识。 Machine Learning and Deep Learning. 25 00:01:04,350 --> 00:01:07,110 如果您不知道什么是训练集和验证集 If you don't know what a training and validation set are 26 00:01:07,110 --> 00:01:09,360 或者梯度下降法意味着什么, or what gradient decent means, 27 00:01:09,360 --> 00:01:11,340 您应该看看一些 you should look at an introductory course 28 00:01:11,340 --> 00:01:14,863 诸如 deeplearning.ai 或 fast.ai 发布的入门课程 such as the ones published by deeplearning.ai or fast.ai. 29 00:01:16,200 --> 00:01:17,910 如果您有一些关于某个 It's also best if you have some basics 30 00:01:17,910 --> 00:01:21,150 深度学习框架、PyTorch 或 TensorFlow 中的基础知识那就更好了。 in one Deep Learning Framework, PyTorch or TensorFlow. 31 00:01:21,150 --> 00:01:23,520 本课程介绍的各部分资料 Each part of the material introduced in this course 32 00:01:23,520 --> 00:01:25,590 在 PyTorch 和 TensorFlow 中都有一个相对应的版本, has a version in both those frameworks, 33 00:01:25,590 --> 00:01:26,730 这样您就可以选择一个 so you will be able to pick the one 34 00:01:26,730 --> 00:01:28,230 您最熟悉的版本。 you are most comfortable with. 35 00:01:29,550 --> 00:01:31,740 这是开发这门课程的团队。 This is the team that developed this course. 36 00:01:31,740 --> 00:01:33,120 接下来每位讲师会先 I'll now let each of the speakers 37 00:01:33,120 --> 00:01:34,570 简单介绍一下自己。 introduce themselves briefly. 38 00:01:37,230 --> 00:01:38,880 - Hi,我叫马修, - Hi, my name is Matthew, 39 00:01:38,880 --> 00:01:41,610 我是 Hugging Face 的机器学习工程师。 and I'm a Machine Learning Engineer at Hugging Face. 40 00:01:41,610 --> 00:01:43,200 我在开源团队工作 I work on the open-source team 41 00:01:43,200 --> 00:01:45,180 我负责维护 and I'm responsible for maintaining particularly 42 00:01:45,180 --> 00:01:47,280 团队内的 TensorFlow 代码。 the TensorFlow code there. 43 00:01:47,280 --> 00:01:50,130 在此之前,我是 Parsley 的机器学习工程师, Previously, I was a Machine Learning Engineer at Parsley, 44 00:01:50,130 --> 00:01:52,620 最近该公司被 Automatic 收购, who've recently been acquired by Automatic, 45 00:01:52,620 --> 00:01:54,210 我是一名博士后研究员 and I was a postdoctoral researcher 46 00:01:54,210 --> 00:01:57,000 之前在爱尔兰都柏林三一学院 before that at Trinity College, Dublin in Ireland 47 00:01:57,000 --> 00:02:00,093 致力于计算遗传学和视网膜疾病的研究。 working on computational genetics and retinal disease. 48 00:02:02,400 --> 00:02:03,870 - Hi,我是 Lysandre - Hi, I'm Lysandre. 49 00:02:03,870 --> 00:02:05,640 我是 Hugging Face 的机器学习工程师 I'm a Machine Learning Engineer at Hugging Face 50 00:02:05,640 --> 00:02:08,700 我是开源团队的一员。 and I'm specifically part of the open-source team. 51 00:02:08,700 --> 00:02:10,890 我已经在 Hugging Face 团队和我的团队成员一起 I've been at Hugging Face for a few years now 52 00:02:10,890 --> 00:02:12,300 工作了好几年, and alongside my team members, 53 00:02:12,300 --> 00:02:13,890 我一直致力于研究大多数您将在 I've been working on most of the tools 54 00:02:13,890 --> 00:02:15,790 本课程中看到的工具。 that you'll get to see in this course. 55 00:02:18,270 --> 00:02:20,130 - Hi,我是 Sylvain - Hi, I'm Sylvain. 56 00:02:20,130 --> 00:02:22,140 我是 Hugging Face 的研究工程师 I'm a Research Engineer at Hugging Face 57 00:02:22,140 --> 00:02:25,830 也是 Transformers 代码库的主要维护者之一。 and one of the main maintainers of the Transformers Library. 58 00:02:25,830 --> 00:02:28,110 之前,我在 fast.ai 工作 Previously, I worked at fast.ai 59 00:02:28,110 --> 00:02:30,420 我帮助开发了 fast.ai 库 where I helped develop the fast.ai Library 60 00:02:30,420 --> 00:02:32,220 以及在线图书。 as well as the online book. 61 00:02:32,220 --> 00:02:35,340 在那之前,我是一名在法国的 Before that, I was a math and computer science teacher 62 00:02:35,340 --> 00:02:36,173 数学和计算机科学老师。 in France. 63 00:02:38,550 --> 00:02:41,340 - Hi,我叫 Sasha,是 Hugging Face 的一名研究员, - Hi, my name is Sasha and I'm a Researcher at Hugging Face, 64 00:02:41,340 --> 00:02:42,420 致力于道德, working on the ethical, 65 00:02:42,420 --> 00:02:46,230 机器学习模型的环境和社会影响相关的研究。 environmental and social impacts of machine learning models. 66 00:02:46,230 --> 00:02:49,020 之前,我是 Mila 蒙特利尔大学的 Previously, I was a postdoctoral researcher at Mila, 67 00:02:49,020 --> 00:02:50,400 博士后研究员 University in Montreal 68 00:02:50,400 --> 00:02:53,040 我还为联合国全球脉搏计划担任过 and I also worked as an Applied AI Researcher 69 00:02:53,040 --> 00:02:55,140 应用人工智能研究员。 for the United Nations Global Pulse. 70 00:02:55,140 --> 00:02:57,300 参与过 CodeCarbon 和 I've been involved in projects such as CodeCarbon 71 00:02:57,300 --> 00:02:59,790 机器学习影响计算器等项目 and the Machine Learning Impacts Calculator 72 00:02:59,790 --> 00:03:02,390 致力于衡量机器学习的碳足迹的研究。 to measure the carbon footprint of machine learning. 73 00:03:05,160 --> 00:03:07,650 - Hi,我是 Merve,我是 Hugging Face 团队的 - Hi, I'm Merve and I'm a Developer Advocate 74 00:03:07,650 --> 00:03:09,390 开发技术推广工程师 at Hugging Face. 75 00:03:09,390 --> 00:03:12,480 在此之前,我是一名机器学习工程师 Previously, I was working as a Machine Learning Engineer 76 00:03:12,480 --> 00:03:15,360 负责构建 NLP 工具和聊天机器人。 building NLP tools and chat bots. 77 00:03:15,360 --> 00:03:17,670 目前,我正在努力改进模型中心 Currently, I'm working to improve the hub 78 00:03:17,670 --> 00:03:19,563 并使机器学习民主化。 and democratize machine learning. 79 00:03:22,140 --> 00:03:23,670 - 大家好。 - Hello everyone. 80 00:03:23,670 --> 00:03:27,210 我叫 Lucile,是 Hugging Face 团队的 My name is Lucile and I'm a Machine Learning Engineer 81 00:03:27,210 --> 00:03:28,353 一名机器学习工程师 at Hugging Face. 82 00:03:29,580 --> 00:03:32,550 用两句话告诉您我是谁, To tell you in two sentences who I am, 83 00:03:32,550 --> 00:03:35,590 我致力于开源工具的开发和支持 I work on the development and support of open-source tools 84 00:03:36,600 --> 00:03:39,595 我也参与了在自然语言处理领域的 and I also participate in several research project 85 00:03:39,595 --> 00:03:41,795 几个研究项目。 in the field of Natural Language Processing. 86 00:03:44,610 --> 00:03:45,540 - 大家好。 - Good day there. 87 00:03:45,540 --> 00:03:47,550 我是 Lewis,我是 Hugging Face 开源团队中的 I'm Lewis and I'm a Machine Learning Engineer 88 00:03:47,550 --> 00:03:50,130 一名机器学习工程师。 in the open-source team at Hugging Face. 89 00:03:50,130 --> 00:03:53,490 我热衷于为 NLP 社区开发工具 I'm passionate about developing tools for the NLP community 90 00:03:53,490 --> 00:03:55,050 您可以在很多 Hugging Face and you'll see me at many of Hugging Face's 91 00:03:55,050 --> 00:03:56,910 对外的活动中见到我 outreach activities. 92 00:03:56,910 --> 00:03:58,470 在加入 Hugging Face 之前, Before joining Hugging Face, 93 00:03:58,470 --> 00:03:59,790 我花了几年时间 I spent several years developing 94 00:03:59,790 --> 00:04:01,860 为初创公司和 NLP 领域的企业 machine learning applications for startups 95 00:04:01,860 --> 00:04:04,230 开发机器学习应用程序, and enterprises in the domains of NLP, 96 00:04:04,230 --> 00:04:07,260 以及拓扑数据分析和时间序列。 topological data analysis and time series. 97 00:04:07,260 --> 00:04:10,110 在此之前,我是一名理论物理学家, In a former life, I was a theoretical physicist, 98 00:04:10,110 --> 00:04:11,760 负责在大型强子对撞机等 where I researched particle collisions 99 00:04:11,760 --> 00:04:13,560 研究粒子碰撞。 at the Large Hadron Collider and so. 100 00:04:15,900 --> 00:04:18,450 - Hey,我是 Leandro,我是一名 Hugging Face 开源团队中 - Hey, I'm Leandro and I'm a Machine Learning Engineer 101 00:04:18,450 --> 00:04:21,030 的一名机器学习工程师 in the open-source team at Hugging Face. 102 00:04:21,030 --> 00:04:23,460 在加入 Hugging Face 之前,我是一名在瑞士的数据科学家 Before joining Hugging Face, I worked as a Data Scientist 103 00:04:23,460 --> 00:04:26,733 并在大学教授数据科学。 in Switzerland and have taught Data Science at University.