File | # lines | # units |
---|
|
|
221 |
90 |
|
1060 |
74 |
|
791 |
62 |
|
836 |
59 |
|
785 |
58 |
|
760 |
58 |
|
477 |
54 |
|
577 |
53 |
|
642 |
48 |
|
454 |
44 |
__init__.pyiin bindings/python/py_src/tokenizers/normalizers |
101 |
43 |
__init__.pyiin bindings/python/py_src/tokenizers/pre_tokenizers |
90 |
38 |
|
854 |
35 |
|
131 |
35 |
|
93 |
35 |
|
2002 |
33 |
|
799 |
32 |
|
342 |
32 |
mod.rsin tokenizers/src/tokenizer |
1185 |
31 |
|
244 |
27 |
|
834 |
26 |
|
56 |
22 |
|
685 |
21 |
|
219 |
19 |
|
513 |
18 |
model.rsin tokenizers/src/models/unigram |
509 |
18 |
|
254 |
18 |
|
40 |
17 |
|
598 |
15 |
|
588 |
13 |
|
187 |
13 |
|
209 |
13 |
|
279 |
12 |
mod.rsin tokenizers/src/models/wordlevel |
204 |
12 |
|
131 |
11 |
mod.rsin tokenizers/src/models/wordpiece |
250 |
11 |
|
303 |
11 |
|
331 |
10 |
|
780 |
10 |
bert.rsin tokenizers/src/normalizers |
102 |
9 |
|
141 |
9 |
|
111 |
8 |
|
91 |
7 |
|
72 |
7 |
|
98 |
7 |
|
114 |
7 |
|
40 |
7 |
|
133 |
6 |
|
71 |
6 |
|
132 |
6 |