LLMs
Lists of Models
We will eventually be hosting an API and list of LLMs. In the meantime: Open LLMs The GitH...
Research
Improving LLM Quality
Model Architecture Mixture of Experts / Ensemble Zoph, Barret, Irwan Bello, Sameer Kumar, Nan D...
Learning Resources
Getting Started If you're starting from nothing, just go to Wikipedia and start reading: https:...
Interpretability
Language Models Implement Simple Word2Vec-style Vector Arithmetic https://arxiv.org/pdf/2305.1613...
Colophon
This site runs on BookStack, a PHP-based Wiki/documentation software. While there are other docum...
Code Assistants
We'll probably move this somewhere else, but I figure it might be useful to put this in public so...
Japanese LLMs
There has been a stream of open Japanese LLMs being trained, but they are on average far behind th...
Quantization
OpenAI API Compatibility
Most inferencing packages have their own REST API, but having an OpenAI-compatible API is useful ...
Prompting
Prompting https://www.promptingguide.ai/ https://help.openai.com/en/articles/6654000-best-practi...
Translation
Google MADLAD-400 10.7B model NLLB 54B model https://research.facebook.com/publications...