subtitles/zh-CN/00_welcome-to-the-hugging-face-course.srt (412 lines of code) (raw):
1
00:00:05,850 --> 00:00:07,713
欢迎来到 Hugging Face 课程。
Welcome to the Hugging Face Course.
2
00:00:08,550 --> 00:00:10,320
本课程旨在带您了解
This course has been designed to teach you
3
00:00:10,320 --> 00:00:12,750
关于 Hugging Face 生态系统的一切
all about the Hugging Face ecosystem,
4
00:00:12,750 --> 00:00:14,700
包括如何使用数据集和模型中心
how to use the dataset and model hub
5
00:00:14,700 --> 00:00:16,803
以及我们所有的开源库。
as well as all our open-source libraries.
6
00:00:18,300 --> 00:00:19,950
这是本课程的目录。
Here is the Table of Contents.
7
00:00:19,950 --> 00:00:22,770
它共分为三个部分
As you can see, it's divided in three sections
8
00:00:22,770 --> 00:00:25,110
由浅入深地带您学习。
which become progressively more advanced.
9
00:00:25,110 --> 00:00:28,500
到目前为止,前两部分已经发布。
At this stage, the first two sections have been released.
10
00:00:28,500 --> 00:00:30,120
在课程的一开始,我们会教您基础知识
So first, we'll teach you the basics
11
00:00:30,120 --> 00:00:32,250
包括如何使用 Transformer 模型,
of how to use a Transformer model,
12
00:00:32,250 --> 00:00:34,230
以及如何基于您自己的数据集上进行微调
fine-tune it on your own data set
13
00:00:34,230 --> 00:00:36,960
并与社区分享结果。
and share the result with the community.
14
00:00:36,960 --> 00:00:39,420
然后,我们将带您深入了解我们的开源库
So second, we'll dive deeper into our libraries
15
00:00:39,420 --> 00:00:42,360
并教您如果能够处理任何 NLP 任务。
and teach you how to tackle any NLP task.
16
00:00:42,360 --> 00:00:44,430
我们正在积极研究最后一部分
We're actively working on the last one
17
00:00:44,430 --> 00:00:47,280
并希望在 2022 年春季完成并发布。
and hope to have it ready for you for the spring of 2022.
18
00:00:48,510 --> 00:00:50,880
第一章不需要技术知识
The first chapter requires no technical knowledge
19
00:00:50,880 --> 00:00:52,320
只会为您介绍一些基础知识
and is a good introduction to learn
20
00:00:52,320 --> 00:00:54,180
例如 Transformers 模型可以做什么
what Transformers models can do
21
00:00:54,180 --> 00:00:56,883
以及它如何帮助到您以及应用到您公司的业务中。
and how it could be of use to you or your company.
22
00:00:58,050 --> 00:01:01,110
第一部分之后的章节需要具备 Python 的相关知识
The next chapters require a good knowledge of Python
23
00:01:01,110 --> 00:01:02,130
以及机器学习和深度学习的
and some basic knowledge of
24
00:01:02,130 --> 00:01:04,350
一些基础知识。
Machine Learning and Deep Learning.
25
00:01:04,350 --> 00:01:07,110
如果您不知道什么是训练集和验证集
If you don't know what a training and validation set are
26
00:01:07,110 --> 00:01:09,360
或者梯度下降法意味着什么,
or what gradient decent means,
27
00:01:09,360 --> 00:01:11,340
您应该看看一些
you should look at an introductory course
28
00:01:11,340 --> 00:01:14,863
诸如 deeplearning.ai 或 fast.ai 发布的入门课程
such as the ones published by deeplearning.ai or fast.ai.
29
00:01:16,200 --> 00:01:17,910
如果您有一些关于某个
It's also best if you have some basics
30
00:01:17,910 --> 00:01:21,150
深度学习框架、PyTorch 或 TensorFlow 中的基础知识那就更好了。
in one Deep Learning Framework, PyTorch or TensorFlow.
31
00:01:21,150 --> 00:01:23,520
本课程介绍的各部分资料
Each part of the material introduced in this course
32
00:01:23,520 --> 00:01:25,590
在 PyTorch 和 TensorFlow 中都有一个相对应的版本,
has a version in both those frameworks,
33
00:01:25,590 --> 00:01:26,730
这样您就可以选择一个
so you will be able to pick the one
34
00:01:26,730 --> 00:01:28,230
您最熟悉的版本。
you are most comfortable with.
35
00:01:29,550 --> 00:01:31,740
这是开发这门课程的团队。
This is the team that developed this course.
36
00:01:31,740 --> 00:01:33,120
接下来每位讲师会先
I'll now let each of the speakers
37
00:01:33,120 --> 00:01:34,570
简单介绍一下自己。
introduce themselves briefly.
38
00:01:37,230 --> 00:01:38,880
- Hi,我叫马修,
- Hi, my name is Matthew,
39
00:01:38,880 --> 00:01:41,610
我是 Hugging Face 的机器学习工程师。
and I'm a Machine Learning Engineer at Hugging Face.
40
00:01:41,610 --> 00:01:43,200
我在开源团队工作
I work on the open-source team
41
00:01:43,200 --> 00:01:45,180
我负责维护
and I'm responsible for maintaining particularly
42
00:01:45,180 --> 00:01:47,280
团队内的 TensorFlow 代码。
the TensorFlow code there.
43
00:01:47,280 --> 00:01:50,130
在此之前,我是 Parsley 的机器学习工程师,
Previously, I was a Machine Learning Engineer at Parsley,
44
00:01:50,130 --> 00:01:52,620
最近该公司被 Automatic 收购,
who've recently been acquired by Automatic,
45
00:01:52,620 --> 00:01:54,210
我是一名博士后研究员
and I was a postdoctoral researcher
46
00:01:54,210 --> 00:01:57,000
之前在爱尔兰都柏林三一学院
before that at Trinity College, Dublin in Ireland
47
00:01:57,000 --> 00:02:00,093
致力于计算遗传学和视网膜疾病的研究。
working on computational genetics and retinal disease.
48
00:02:02,400 --> 00:02:03,870
- Hi,我是 Lysandre
- Hi, I'm Lysandre.
49
00:02:03,870 --> 00:02:05,640
我是 Hugging Face 的机器学习工程师
I'm a Machine Learning Engineer at Hugging Face
50
00:02:05,640 --> 00:02:08,700
我是开源团队的一员。
and I'm specifically part of the open-source team.
51
00:02:08,700 --> 00:02:10,890
我已经在 Hugging Face 团队和我的团队成员一起
I've been at Hugging Face for a few years now
52
00:02:10,890 --> 00:02:12,300
工作了好几年,
and alongside my team members,
53
00:02:12,300 --> 00:02:13,890
我一直致力于研究大多数您将在
I've been working on most of the tools
54
00:02:13,890 --> 00:02:15,790
本课程中看到的工具。
that you'll get to see in this course.
55
00:02:18,270 --> 00:02:20,130
- Hi,我是 Sylvain
- Hi, I'm Sylvain.
56
00:02:20,130 --> 00:02:22,140
我是 Hugging Face 的研究工程师
I'm a Research Engineer at Hugging Face
57
00:02:22,140 --> 00:02:25,830
也是 Transformers 代码库的主要维护者之一。
and one of the main maintainers of the Transformers Library.
58
00:02:25,830 --> 00:02:28,110
之前,我在 fast.ai 工作
Previously, I worked at fast.ai
59
00:02:28,110 --> 00:02:30,420
我帮助开发了 fast.ai 库
where I helped develop the fast.ai Library
60
00:02:30,420 --> 00:02:32,220
以及在线图书。
as well as the online book.
61
00:02:32,220 --> 00:02:35,340
在那之前,我是一名在法国的
Before that, I was a math and computer science teacher
62
00:02:35,340 --> 00:02:36,173
数学和计算机科学老师。
in France.
63
00:02:38,550 --> 00:02:41,340
- Hi,我叫 Sasha,是 Hugging Face 的一名研究员,
- Hi, my name is Sasha and I'm a Researcher at Hugging Face,
64
00:02:41,340 --> 00:02:42,420
致力于道德,
working on the ethical,
65
00:02:42,420 --> 00:02:46,230
机器学习模型的环境和社会影响相关的研究。
environmental and social impacts of machine learning models.
66
00:02:46,230 --> 00:02:49,020
之前,我是 Mila 蒙特利尔大学的
Previously, I was a postdoctoral researcher at Mila,
67
00:02:49,020 --> 00:02:50,400
博士后研究员
University in Montreal
68
00:02:50,400 --> 00:02:53,040
我还为联合国全球脉搏计划担任过
and I also worked as an Applied AI Researcher
69
00:02:53,040 --> 00:02:55,140
应用人工智能研究员。
for the United Nations Global Pulse.
70
00:02:55,140 --> 00:02:57,300
参与过 CodeCarbon 和
I've been involved in projects such as CodeCarbon
71
00:02:57,300 --> 00:02:59,790
机器学习影响计算器等项目
and the Machine Learning Impacts Calculator
72
00:02:59,790 --> 00:03:02,390
致力于衡量机器学习的碳足迹的研究。
to measure the carbon footprint of machine learning.
73
00:03:05,160 --> 00:03:07,650
- Hi,我是 Merve,我是 Hugging Face 团队的
- Hi, I'm Merve and I'm a Developer Advocate
74
00:03:07,650 --> 00:03:09,390
开发技术推广工程师
at Hugging Face.
75
00:03:09,390 --> 00:03:12,480
在此之前,我是一名机器学习工程师
Previously, I was working as a Machine Learning Engineer
76
00:03:12,480 --> 00:03:15,360
负责构建 NLP 工具和聊天机器人。
building NLP tools and chat bots.
77
00:03:15,360 --> 00:03:17,670
目前,我正在努力改进模型中心
Currently, I'm working to improve the hub
78
00:03:17,670 --> 00:03:19,563
并使机器学习民主化。
and democratize machine learning.
79
00:03:22,140 --> 00:03:23,670
- 大家好。
- Hello everyone.
80
00:03:23,670 --> 00:03:27,210
我叫 Lucile,是 Hugging Face 团队的
My name is Lucile and I'm a Machine Learning Engineer
81
00:03:27,210 --> 00:03:28,353
一名机器学习工程师
at Hugging Face.
82
00:03:29,580 --> 00:03:32,550
用两句话告诉您我是谁,
To tell you in two sentences who I am,
83
00:03:32,550 --> 00:03:35,590
我致力于开源工具的开发和支持
I work on the development and support of open-source tools
84
00:03:36,600 --> 00:03:39,595
我也参与了在自然语言处理领域的
and I also participate in several research project
85
00:03:39,595 --> 00:03:41,795
几个研究项目。
in the field of Natural Language Processing.
86
00:03:44,610 --> 00:03:45,540
- 大家好。
- Good day there.
87
00:03:45,540 --> 00:03:47,550
我是 Lewis,我是 Hugging Face 开源团队中的
I'm Lewis and I'm a Machine Learning Engineer
88
00:03:47,550 --> 00:03:50,130
一名机器学习工程师。
in the open-source team at Hugging Face.
89
00:03:50,130 --> 00:03:53,490
我热衷于为 NLP 社区开发工具
I'm passionate about developing tools for the NLP community
90
00:03:53,490 --> 00:03:55,050
您可以在很多 Hugging Face
and you'll see me at many of Hugging Face's
91
00:03:55,050 --> 00:03:56,910
对外的活动中见到我
outreach activities.
92
00:03:56,910 --> 00:03:58,470
在加入 Hugging Face 之前,
Before joining Hugging Face,
93
00:03:58,470 --> 00:03:59,790
我花了几年时间
I spent several years developing
94
00:03:59,790 --> 00:04:01,860
为初创公司和 NLP 领域的企业
machine learning applications for startups
95
00:04:01,860 --> 00:04:04,230
开发机器学习应用程序,
and enterprises in the domains of NLP,
96
00:04:04,230 --> 00:04:07,260
以及拓扑数据分析和时间序列。
topological data analysis and time series.
97
00:04:07,260 --> 00:04:10,110
在此之前,我是一名理论物理学家,
In a former life, I was a theoretical physicist,
98
00:04:10,110 --> 00:04:11,760
负责在大型强子对撞机等
where I researched particle collisions
99
00:04:11,760 --> 00:04:13,560
研究粒子碰撞。
at the Large Hadron Collider and so.
100
00:04:15,900 --> 00:04:18,450
- Hey,我是 Leandro,我是一名 Hugging Face 开源团队中
- Hey, I'm Leandro and I'm a Machine Learning Engineer
101
00:04:18,450 --> 00:04:21,030
的一名机器学习工程师
in the open-source team at Hugging Face.
102
00:04:21,030 --> 00:04:23,460
在加入 Hugging Face 之前,我是一名在瑞士的数据科学家
Before joining Hugging Face, I worked as a Data Scientist
103
00:04:23,460 --> 00:04:26,733
并在大学教授数据科学。
in Switzerland and have taught Data Science at University.