Recently Updated Pages
List of Evals
MosaicML Model Gauntlet - 34 benchmarks in 6 categories HuggingFace Open LLM Leaderboard warn...
Comparing Quants
https://github.com/mit-han-lab/smoothquant https://neuralmagic.com/blog/fast-llama-2-on-cpus-with...
StyleTTS 2 Setup Guide
StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training w...
AMD GPUs
As of August 2023, AMD's ROCm GPU compute software stack is available for Linux or Windows. Linux...
Prompting
Prompting https://www.promptingguide.ai/ https://help.openai.com/en/articles/6654000-best-practi...
Performance
2023-08-14 Aman Sanger (cursor.so) comparing high batch throughput 2023-08-11 Optimizing laten...
Code Assistants
We'll probably move this somewhere else, but I figure it might be useful to put this in public so...
Lists of Models
We will eventually be hosting an API and list of LLMs. In the meantime: Open LLMs The GitH...
Fine Tuning Mistral
We'll try to fine tune Mistral 7B. Training Details The Mistral AI Discord has a #finetuning chan...
Interpretability
Language Models Implement Simple Word2Vec-styleVector Arithmetic https://arxiv.org/pdf/2305.1613...
Japanese LLMs
There has been a stream of open Japanese LLMs being trained but they are on average far behind th...
Transcription Test
This project was done 2023-08-21. Code checked in here: https://github.com/AUGMXNT/transcribe Her...
Speech-to-Text
WhisperX WhisperX is the current best version of Whisper. conda create --name whisperx python=3.1...
Learning Resources
Getting Started If you're starting from nothing. Just go to Wikipedia and start reading: https:...
Translation
Google MADLAD-400 10.7B model NLLB 54B model https://research.facebook.com/publications...
Quantization Overview
How does quantisation affect model output? - 15 basic tests on different quant levels EXL2 (Ex...
Getting Started
Large Language Models (LLM) are a type of generative AI that power chatbot systems like ChatGPT. ...
OpenAI API Compatibility
Most inferencing packages have their own REST API, but having an OpenAI compatible API is useful ...
Improving LLM Quality
Model Architecture Mixture of Experts / Ensemble Zoph, Barret, Irwan Bello, Sameer Kumar, Nan D...
Nvidia GPUs
Nvidia GPUs are the most compatible hardware for AI/ML. All of Nvidia's GPUs (consumer and profes...