Hoai-Chau Tran
Search
Search
Dark mode
Light mode
Explorer
notes
Adam
AdamW
Attention with Linear Biases (ALiBi)
Auto regressive decoding
Backpropagation
Batch Norm
Convolution
Convolutional networks
Decision Tree
Deep Learning
Euler's Formula
Gradient Descent
Group-Query Attention
Index
kernels
KV cache
Large Language Model (llm)
Layer Norm
LLaMA
LLaMA 2
LLaMA 3.1
Multi-Query Attention
neuron networks
Perceptron
Relative Positional Encoding
Residual Connection
Rotary Position Embeddings (RoPE)
SVM
Transformer
papers
Introduction to probability for data science
RoFormer Enhanced Transformer with Rotary Position Embedding
Towards Efficient Generative Large Language Model Serving A Survey from Algorithms to Systems
Train Short, Test Long Attention with Linear Biases Enables Input Length Extrapolation
Home
❯
notes
Folder: notes
29 items under this folder.
Sep 15, 2024
kernels
read_later
Sep 15, 2024
neuron networks
read_later
Sep 15, 2024
Adam
read_later
Sep 15, 2024
AdamW
read_later
Sep 15, 2024
Attention with Linear Biases (ALiBi)
RPE
Sep 15, 2024
Auto regressive decoding
llm
decoding_algorithms
Sep 15, 2024
Backpropagation
read_later
Sep 15, 2024
Batch Norm
read_later
Sep 15, 2024
Convolution
read_later
Sep 15, 2024
Convolutional networks
read_later
Sep 15, 2024
Decision Tree
read_later
Sep 15, 2024
Deep Learning
read_later
Sep 15, 2024
Euler's Formula
read_later
Sep 15, 2024
Gradient Descent
read_later
Sep 15, 2024
Group-Query Attention
read_later
Sep 15, 2024
Index
Sep 15, 2024
KV cache
read_later
Sep 15, 2024
LLaMA 2
read_later
Sep 15, 2024
LLaMA 3.1
read_later
Sep 15, 2024
LLaMA
read_later
Sep 15, 2024
Large Language Model (llm)
read_later
Sep 15, 2024
Layer Norm
normalization
regularlization
deep_learning
Sep 15, 2024
Multi-Query Attention
read_later
Sep 15, 2024
Perceptron
read_later
Sep 15, 2024
Relative Positional Encoding
transformers
llm
Sep 15, 2024
Residual Connection
read_later
Sep 15, 2024
Rotary Position Embeddings (RoPE)
read_later
Sep 15, 2024
SVM
read_later
Sep 15, 2024
Transformer
transformers
deep_learning
encoder_decoder