HyperAI
HyperAI
Main
Home
GPU
Console
Docs
Pricing
Pulse
News
Resources
Papers
Notebooks
Datasets
Wiki
Benchmarks
SOTA
LLM Models
GPU Leaderboard
Community
Events
Utility
Search
About
Terms of Service
Privacy Policy
English
HyperAI
HyperAI
Toggle Sidebar
⌘
K
Command Palette
Search for a command to run...
Sign In
HyperAI
Papers
Lessons on Parameter Sharing across Layers in Transformers
6 months ago
Transformer
Deep Learning
Natural Language Processing
Research Field
Method/Architecture
Natural Language Processing
Task/Problem
Summary
Paper
Benchmarks
Resources
takase/share_layer_params
Official
pytorch
jaketae/param-share-transformer
pytorch
HyperAI
HyperAI
Main
Home
GPU
Console
Docs
Pricing
Pulse
News
Resources
Papers
Notebooks
Datasets
Wiki
Benchmarks
SOTA
LLM Models
GPU Leaderboard
Community
Events
Utility
Search
About
Terms of Service
Privacy Policy
English
HyperAI
HyperAI
Toggle Sidebar
⌘
K
Command Palette
Search for a command to run...
Sign In
HyperAI
Papers
Lessons on Parameter Sharing across Layers in Transformers
6 months ago
Transformer
Deep Learning
Natural Language Processing
Research Field
Method/Architecture
Natural Language Processing
Task/Problem
Summary
Paper
Benchmarks
Resources
takase/share_layer_params
Official
pytorch
jaketae/param-share-transformer
pytorch
28
28
26
26