Skip to content

A project focused on scaling test-time computation for language models using different methods to improve performance.

License

Notifications You must be signed in to change notification settings

dame-cell/Scalellm

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

8 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

ScaleLLM

This project is inspired from the blog scaling-test-time-compute and their github-code

This is purely a beginner code specfically for poor gpu people maybe that's what I intend to do atleast

Citation

@misc{beeching2024scalingtesttimecompute,
      title={Scaling test-time compute with open models},
      author={Edward Beeching and Lewis Tunstall and Sasha Rush},
      url={https://huggingface.co/spaces/HuggingFaceH4/blogpost-scaling-test-time-compute},
}
@misc{snell2024scalingllmtesttimecompute,
      title={Scaling LLM Test-Time Compute Optimally can be More Effective than Scaling Model Parameters}, 
      author={Charlie Snell and Jaehoon Lee and Kelvin Xu and Aviral Kumar},
      year={2024},
      eprint={2408.03314},
      archivePrefix={arXiv},
      primaryClass={cs.LG},
      url={https://arxiv.org/abs/2408.03314}, 
}

About

A project focused on scaling test-time computation for language models using different methods to improve performance.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages