Content-Length: 266095 | pFad | https://github.com/cuda-mode/awesomeMLSys

6883C82E GitHub - gpu-mode/awesomeMLSys: An ML Systems Onboarding list
Skip to content

gpu-mode/awesomeMLSys

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

14 Commits
 
 
 
 

Repository files navigation

ML Systems Onboarding Reading List

This is a reading list of papers/videos/repos I've personally found useful as I was ramping up on ML Systems and that I wish more people would just sit and study carefully during their work hours. If you're looking for more recommendations, go through the citations of the below papers and enjoy!

Conferences where MLSys papers get published

Attention Mechanism

Performance Optimizations

Quantization

Long context length

Sparsity

  • Venom: Vectorized N:M Format for sparse tensor cores when hardware only supports 2:4
  • Megablocks: Efficient Sparse training with mixture of experts
  • ReLu Strikes Back: Really enjoyed this paper as an example of doing model surgery for more efficient inference

Distributed

About

An ML Systems Onboarding list

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Contributors 3

  •  
  •  
  •  








ApplySandwichStrip

pFad - (p)hone/(F)rame/(a)nonymizer/(d)eclutterfier!      Saves Data!


--- a PPN by Garber Painting Akron. With Image Size Reduction included!

Fetched URL: https://github.com/cuda-mode/awesomeMLSys

Alternative Proxies:

Alternative Proxy

pFad Proxy

pFad v3 Proxy

pFad v4 Proxy