One-click Download of Hundreds of Popular Models
- Llama 3, Phi-3, Mistral, Mixtral, Gemma, Command R, and dozens more
- Download any LLM from Hugging Face
- Train and use separate embedding models
- Convert between model formats (e.g. MLX, GGUF)
Chat with Models
- Chat
- Completions
- Preset (Templated) Prompts
- Chat History
- Tweak generation parameters
- Batch Inference
- Calculate Embeddings
- Visualize LogProbs
- Visualize Tokenizers
- Inference Logs
Pre-training, Finetuning, RLHF and Preference Optimization
- Train models from scratch
- Finetuning
- DPO
- ORPO
- SimPO
- Reward Modeling
- GRPO
Comprehensive Evals
- EleutherAI LM Evaluation Harness
- LLM-as-a-Judge
- Objective Metrics
- Red Teaming Evals
- Eval visualization and graphing
Plugin Support
- Easily pull from a library of existing plugins
- Write your own plugins to extend functionality
RAG (Retrieval Augmented Generation)
- Drag and Drop File UI
- Works on Apple MLX, Hugging Face Transformers, and other engines
Cross-Platform and Hardware Support
- Windows, macOS, and Linux apps
- Training and Inference using MLX on Apple Silicon
- Training and Inference using CUDA
- Multi-GPU Training