The Transformer Architecture
|
|
13
|
3269
|
January 22, 2022
|
Q-learning
|
|
10
|
2030
|
June 20, 2024
|
Large-Scale Pretraining with Transformers
|
|
9
|
3781
|
June 25, 2024
|
Convolutions for Images
|
|
13
|
3170
|
June 6, 2024
|
风格迁移
|
|
10
|
3519
|
September 22, 2024
|
Environment and Distribution Shift
|
|
15
|
2785
|
December 19, 2023
|
Object-Oriented Design for Implementation
|
|
13
|
2965
|
June 13, 2024
|
深层循环神经网络
|
|
10
|
3247
|
June 13, 2024
|
Network in Network (NiN)
|
|
13
|
2885
|
June 8, 2024
|
编译器和解释器
|
|
11
|
3070
|
September 22, 2023
|
Attention Scoring Functions
|
|
11
|
3051
|
September 27, 2024
|
Parameter Management
|
|
12
|
2889
|
May 8, 2024
|
Auto Differentiation
|
|
13
|
2699
|
November 7, 2024
|
Self-Attention and Positional Encoding
|
|
11
|
2880
|
January 22, 2024
|
Single Shot Multibox Detection (SSD)
|
|
15
|
2436
|
January 9, 2022
|
Gated Recurrent Units (GRU)
|
|
13
|
2555
|
October 18, 2023
|
多GPU训练
|
|
10
|
2825
|
October 17, 2024
|
Self-Attention and Positional Encoding
|
|
10
|
2806
|
January 22, 2024
|
Full pytorch code book for d2l.ai [help]
|
|
15
|
2322
|
January 21, 2021
|
Neural Collaborative Filtering for Personalized Ranking
|
|
15
|
2305
|
December 19, 2023
|
Concise Implementation of Softmax Regression
|
|
13
|
2466
|
June 19, 2020
|
Linear Regression Implementation from Scratch
|
|
9
|
2902
|
December 14, 2021
|
Batch Normalization
|
|
10
|
2721
|
June 8, 2024
|
词嵌入(word2vec)
|
|
9
|
2795
|
August 21, 2024
|
Recurrent Neural Networks
|
|
11
|
2539
|
April 22, 2022
|
Image Augmentation
|
|
12
|
2439
|
November 20, 2020
|
Geometry and Linear Algebraic Operations
|
|
15
|
2183
|
June 22, 2024
|
Predicting House Prices on Kaggle
|
|
11
|
2453
|
March 12, 2022
|
Deep Factorization Machines
|
|
9
|
2679
|
October 25, 2021
|
Word Embedding (word2vec)
|
|
10
|
2529
|
August 21, 2024
|
预训练word2vec
|
|
11
|
2394
|
September 10, 2024
|
Language Models
|
|
12
|
2302
|
November 10, 2021
|
Synthetic Regression Data
|
|
10
|
2483
|
June 14, 2024
|
Image Classification Dataset
|
|
9
|
2599
|
June 27, 2022
|
Beam Search
|
|
12
|
2264
|
April 24, 2024
|
The Dataset for Pretraining Word Embedding
|
|
11
|
2210
|
August 21, 2024
|
Generative Adversarial Networks
|
|
9
|
2293
|
March 28, 2023
|
Attention Scoring Functions
|
|
10
|
2135
|
February 9, 2022
|
Batch Normalization
|
|
11
|
2042
|
January 11, 2022
|
Information Theory
|
|
9
|
2201
|
March 17, 2021
|
Pretraining word2vec
|
|
10
|
2046
|
September 10, 2024
|
Numerical Stability and Initialization
|
|
11
|
1946
|
December 31, 2022
|
Convolutions for Images
|
|
10
|
2034
|
January 14, 2022
|
Machine Translation and the Dataset
|
|
9
|
1957
|
September 6, 2023
|
Recurrent Neural Network Implementation from Scratch
|
|
9
|
1899
|
August 9, 2024
|
Parameter Management
|
|
10
|
1705
|
January 11, 2022
|
The Dataset for Pretraining BERT
|
|
9
|
1655
|
July 6, 2022
|
注意力提示
|
|
3
|
6406
|
September 12, 2024
|
多尺度目标检测
|
|
1
|
2127
|
February 22, 2022
|
Do these before you ask!
|
|
7
|
4617
|
September 11, 2020
|