| The Transformer Architecture | 35 | 8922 | December 3, 2024 |
| Gated Recurrent Units (GRU) | 14 | 3743 | December 1, 2024 |
| Multi-Branch Networks (GoogLeNet) | 7 | 3788 | November 27, 2024 |
| Pretraining BERT | 8 | 3394 | November 22, 2024 |
| Region-based CNNs (R-CNNs) | 2 | 2034 | November 21, 2024 |
| Vision Transformer | 8 | 4316 | November 21, 2024 |
| Document | 2 | 2389 | November 12, 2024 |
| Auto Differentiation | 13 | 4309 | November 7, 2024 |
| Bidirectional Encoder Representations from Transformers (BERT) | 6 | 2539 | November 7, 2024 |
| Subword Embedding | 2 | 1556 | October 29, 2024 |
| The Base Classification Model | 4 | 2181 | October 24, 2024 |
| Image Augmentation | 5 | 2683 | October 24, 2024 |
| Multivariable Calculus | 8 | 2322 | October 24, 2024 |
| Geometry and Linear Algebraic Operations | 15 | 3224 | June 22, 2024 |
| Some questions on Linear Regression Exercises | 0 | 567 | October 13, 2024 |
| Factorization Machines | 5 | 2494 | October 12, 2024 |
| Encoder-Decoder Architecture | 5 | 3225 | October 9, 2024 |
| Statistics | 3 | 2160 | October 2, 2024 |
| Value iteration | 5 | 1829 | September 29, 2024 |
| Attention Scoring Functions | 11 | 4125 | September 27, 2024 |
| Attention Cues | 10 | 7220 | September 9, 2023 |
| VGG implementation problem | 0 | 447 | September 17, 2024 |
| Convexity | 7 | 2486 | September 16, 2024 |
| Pretraining word2vec | 10 | 2859 | September 10, 2024 |
| Installation of Jax version. | 1 | 552 | September 7, 2024 |
| Deep Convolutional Neural Networks (AlexNet) | 16 | 5402 | September 4, 2024 |
| Generalization in Classification | 8 | 2411 | September 1, 2024 |
| Word Embedding (word2vec) | 10 | 5054 | August 21, 2024 |
| The Dataset for Pretraining Word Embedding | 11 | 3087 | August 21, 2024 |
| Recurrent Neural Network Implementation from Scratch | 9 | 2991 | August 9, 2024 |