Network in Network (NiN)
|
|
14
|
3744
|
December 25, 2024
|
Convolutions for Images
|
|
14
|
3705
|
February 16, 2025
|
Object-Oriented Design for Implementation
|
|
14
|
3686
|
April 24, 2025
|
深层循环神经网络
|
|
12
|
3914
|
May 13, 2025
|
Text Preprocessing
|
|
16
|
3364
|
November 21, 2021
|
The Transformer Architecture
|
|
13
|
3690
|
January 22, 2022
|
风格迁移
|
|
10
|
4149
|
September 22, 2024
|
Concise Implementation of Linear Regression
|
|
19
|
3074
|
March 12, 2023
|
Word Embedding (word2vec)
|
|
10
|
4149
|
August 21, 2024
|
Environment and Distribution Shift
|
|
15
|
3356
|
December 19, 2023
|
Large-Scale Pretraining with Transformers
|
|
9
|
4195
|
June 25, 2024
|
Parameter Management
|
|
14
|
3406
|
February 17, 2025
|
Auto Differentiation
|
|
13
|
3508
|
November 7, 2024
|
Customer Layers
|
|
9
|
2318
|
February 18, 2025
|
Batch Normalization
|
|
10
|
3784
|
June 8, 2024
|
编译器和解释器
|
|
11
|
3563
|
September 22, 2023
|
Attention Scoring Functions
|
|
11
|
3507
|
September 27, 2024
|
Gated Recurrent Units (GRU)
|
|
14
|
3071
|
December 1, 2024
|
Synthetic Regression Data
|
|
13
|
3135
|
February 1, 2025
|
多GPU训练
|
|
10
|
3451
|
October 17, 2024
|
Self-Attention and Positional Encoding
|
|
11
|
3245
|
January 22, 2024
|
Self-Attention and Positional Encoding
|
|
10
|
3336
|
January 22, 2024
|
Beam Search
|
|
14
|
2845
|
April 28, 2025
|
Single Shot Multibox Detection (SSD)
|
|
15
|
2732
|
January 9, 2022
|
词嵌入(word2vec)
|
|
10
|
3284
|
June 6, 2025
|
Full pytorch code book for d2l.ai [help]
|
|
15
|
2682
|
January 21, 2021
|
Concise Implementation of Softmax Regression
|
|
13
|
2841
|
June 19, 2020
|
Recurrent Neural Networks
|
|
11
|
3070
|
April 22, 2022
|
Neural Collaborative Filtering for Personalized Ranking
|
|
15
|
2637
|
December 19, 2023
|
Linear Regression Implementation from Scratch
|
|
9
|
3320
|
December 14, 2021
|
Geometry and Linear Algebraic Operations
|
|
15
|
2594
|
June 22, 2024
|
Image Augmentation
|
|
12
|
2823
|
November 20, 2020
|
Language Models
|
|
12
|
2762
|
November 10, 2021
|
Deep Factorization Machines
|
|
9
|
3121
|
October 25, 2021
|
Generative Adversarial Networks
|
|
10
|
2976
|
February 4, 2025
|
Predicting House Prices on Kaggle
|
|
11
|
2824
|
March 12, 2022
|
数据预处理
|
|
9
|
3017
|
February 18, 2025
|
Image Classification Dataset
|
|
9
|
3014
|
June 27, 2022
|
预训练word2vec
|
|
11
|
2730
|
September 10, 2024
|
The Dataset for Pretraining Word Embedding
|
|
11
|
2580
|
August 21, 2024
|
Long Short Term Memory (LSTM)
|
|
9
|
2647
|
May 11, 2025
|
Pretraining word2vec
|
|
10
|
2450
|
September 10, 2024
|
Batch Normalization
|
|
11
|
2329
|
January 11, 2022
|
Attention Scoring Functions
|
|
10
|
2420
|
February 9, 2022
|
Information Theory
|
|
9
|
2510
|
March 17, 2021
|
Convolutions for Images
|
|
10
|
2288
|
January 14, 2022
|
Numerical Stability and Initialization
|
|
11
|
2163
|
December 31, 2022
|
Machine Translation and the Dataset
|
|
9
|
2292
|
September 6, 2023
|
Recurrent Neural Network Implementation from Scratch
|
|
9
|
2238
|
August 9, 2024
|
Parameter Management
|
|
10
|
1963
|
January 11, 2022
|