| Bidirectional Encoder Representations from Transformers (BERT) | 16 | 4912 | June 13, 2025 |
| Calculus | 17 | 4716 | July 25, 2021 |
| The Transformer Architecture | 13 | 5269 | January 22, 2022 |
| Convolutions for Images | 14 | 5065 | February 16, 2025 |
| Concise Implementation of Linear Regression | 19 | 4348 | March 12, 2023 |
| Deep Convolutional Neural Networks (AlexNet) | 10 | 3296 | September 25, 2020 |
| Batch Normalization | 10 | 5765 | June 8, 2024 |
| Custom Layers | 9 | 3342 | February 18, 2025 |
| Style Transfer | 11 | 5330 | November 27, 2025 |
| Synthetic Regression Data | 14 | 4708 | July 11, 2025 |
| Auto Differentiation | 13 | 4829 | November 7, 2024 |
| Word Embedding (word2vec) | 10 | 5405 | August 21, 2024 |
| Beam Search | 14 | 4586 | April 28, 2025 |
| Environment and Distribution Shift | 15 | 4429 | December 19, 2023 |
| Parameter Management | 14 | 4558 | February 17, 2025 |
| Deep Recurrent Neural Networks | 12 | 4816 | May 13, 2025 |
| Text Preprocessing | 16 | 4134 | November 21, 2021 |
| Large-Scale Pretraining with Transformers | 9 | 5387 | June 25, 2024 |
| Full PyTorch code book for d2l.ai [help] | 15 | 4227 | January 21, 2021 |
| Gated Recurrent Units (GRU) | 14 | 4326 | December 1, 2024 |
| Self-Attention and Positional Encoding | 10 | 4954 | January 22, 2024 |
| Attention Scoring Functions | 11 | 4608 | September 27, 2024 |
| Self-Attention and Positional Encoding | 11 | 4563 | January 22, 2024 |
| Generative Adversarial Networks | 10 | 4747 | February 4, 2025 |
| Sentiment Analysis: Using Recurrent Neural Networks | 12 | 4166 | September 15, 2025 |
| Geometry and Linear Algebraic Operations | 15 | 3728 | June 22, 2024 |
| Neural Collaborative Filtering for Personalized Ranking | 15 | 3701 | December 19, 2023 |
| Compilers and Interpreters | 11 | 4276 | September 22, 2023 |
| Multi-GPU Training | 10 | 4223 | October 17, 2024 |
| Single Shot Multibox Detection (SSD) | 15 | 3504 | January 9, 2022 |
| Recurrent Neural Networks | 11 | 4032 | April 22, 2022 |
| Word Embedding (word2vec) | 10 | 4200 | June 6, 2025 |
| Deep Factorization Machines | 9 | 4400 | October 25, 2021 |
| Predicting House Prices on Kaggle | 11 | 3984 | March 12, 2022 |
| Image Augmentation | 12 | 3818 | November 20, 2020 |
| Concise Implementation of Softmax Regression | 13 | 3679 | June 19, 2020 |
| Long Short-Term Memory (LSTM) | 9 | 4338 | May 11, 2025 |
| Linear Regression Implementation from Scratch | 9 | 4245 | December 14, 2021 |
| Language Models | 12 | 3616 | November 10, 2021 |
| Gradient Descent | 9 | 3841 | April 20, 2026 |
| Pretraining word2vec | 11 | 3423 | September 10, 2024 |
| The Dataset for Pretraining Word Embedding | 11 | 3413 | August 21, 2024 |
| Data Preprocessing | 9 | 3742 | February 18, 2025 |
| Image Classification Dataset | 9 | 3715 | June 27, 2022 |
| Batch Normalization | 11 | 3167 | January 11, 2022 |
| Recurrent Neural Network Implementation from Scratch | 9 | 3416 | August 9, 2024 |
| Pretraining word2vec | 10 | 3218 | September 10, 2024 |
| Numerical Stability and Initialization | 11 | 3051 | December 31, 2022 |
| Attention Scoring Functions | 10 | 3141 | February 9, 2022 |
| Convolutions for Images | 10 | 3100 | January 14, 2022 |