KingsmanVince
- 22 Posts
- 8 Comments
Joined 2 years ago
Cake day: June 17th, 2023
You are not logged in. If you use a Fediverse account that is able to follow users, you can follow this user.
KingsmanVince@kbin.socialto Machine Learning@kbin.social•Think before you speak: Training Language Models With Pause Tokens1·2 years agoIIRC DeTr generate a sequence to predict boxes of objects. I think this paradigm can be applied to such models. “Think before you locate” could be a new path to explore.
KingsmanVince@kbin.socialto Machine Learning@kbin.social•Retentive Network: A Successor to Transformer for Large Language Models3·2 years agohttps://github.com/Jamie-Stirling/RetNet non-official implementation
KingsmanVince@kbin.socialto Machine Learning@kbin.social•Machine Learning Beginner Info/Resources1·2 years agoI also want to share some resources.
For Pytorch,- https://pytorch.org/tutorials/ their basic tutorials are fundamental but some more advanced tutorials might be outdated.
- https://www.learnpytorch.io/ the author guides mostly in computer vision but he gives the overview from research to production.
For TPU,
- https://github.com/ayaka14732/tpu-starter full guideline using TPUs with Jax
indeed it would be great if the authors did so. I personally found some non-official implementations: