Skip to main content

Recently Updated Pages

List of Evals

Evals

MosaicML Model Gauntlet - 34 benchmarks in 6 categories HuggingFace Open LLM Leaderboard warn...

Updated 1 week ago by lhl

Comparing Quants

Logbook

https://github.com/mit-han-lab/smoothquant https://neuralmagic.com/blog/fast-llama-2-on-cpus-with...

Updated 1 week ago by lhl

StyleTTS 2 Setup Guide

HOWTO Guides

StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training w...

Updated 1 week ago by lhl

AMD GPUs

HOWTO Guides Inferencing

As of August 2023, AMD's ROCm GPU compute software stack is available for Linux or Windows. Linux...

Updated 2 weeks ago by lhl

Prompting

LLMs

Prompting https://www.promptingguide.ai/ https://help.openai.com/en/articles/6654000-best-practi...

Updated 1 month ago by lhl

Performance

HOWTO Guides Inferencing

2023-08-14 Aman Sanger (cursor.so) comparing high batch throughput 2023-08-11 Optimizing laten...

Updated 1 month ago by lhl

Code Assistants

LLMs

We'll probably move this somewhere else, but I figure it might be useful to put this in public so...

Updated 1 month ago by lhl

Lists of Models

LLMs

We will eventually be hosting an API and list of LLMs.  In the meantime: Open LLMs The GitH...

Updated 2 months ago by lhl

Fine Tuning Mistral

Logbook

We'll try to fine tune Mistral 7B. Training Details The Mistral AI Discord has a #finetuning chan...

Updated 2 months ago by lhl

Interpretability

LLMs Research

Language Models Implement Simple Word2Vec-styleVector Arithmetic https://arxiv.org/pdf/2305.1613...

Updated 2 months ago by lhl

Japanese LLMs

LLMs

There has been a stream of open Japanese LLMs being trained but they are on average far behind th...

Updated 2 months ago by lhl

Transcription Test

Logbook

This project was done 2023-08-21. Code checked in here: https://github.com/AUGMXNT/transcribe Her...

Updated 2 months ago by lhl

Speech-to-Text

Evals

WhisperX WhisperX is the current best version of Whisper. conda create --name whisperx python=3.1...

Updated 2 months ago by lhl

Learning Resources

LLMs Research

Getting Started If you're starting from nothing. Just go to Wikipedia and start reading: https:...

Updated 2 months ago by lhl

Translation

LLMs

Google MADLAD-400 10.7B model NLLB 54B model https://research.facebook.com/publications...

Updated 2 months ago by lhl

Quantization Overview

LLMs Quantization

How does quantisation affect model output? - 15 basic tests on different quant levels EXL2 (Ex...

Updated 2 months ago by lhl

Getting Started

HOWTO Guides

Large Language Models (LLM) are a type of generative AI that power chatbot systems like ChatGPT. ...

Updated 2 months ago by lhl

OpenAI API Compatibility

LLMs

Most inferencing packages have their own REST API, but having an OpenAI compatible API is useful ...

Updated 2 months ago by lhl

Improving LLM Quality

LLMs Research

Model Architecture Mixture of Experts / Ensemble Zoph, Barret, Irwan Bello, Sameer Kumar, Nan D...

Updated 3 months ago by lhl

Nvidia GPUs

HOWTO Guides Inferencing

Nvidia GPUs are the most compatible hardware for AI/ML. All of Nvidia's GPUs (consumer and profes...

Updated 3 months ago by lhl