MinText

Tutorial 1: Parallelization Basics
Tutorial 2: Data Parallel and Fully Sharded Data Parallel Training
Tutorial 3: Tensor Parallel and Transformers Scaling
Tutorial 4: Up Next

MinText

Welcome to MinText’s documentation!
View page source

Welcome to MinText’s documentation!

MinText is a minimalistic 3D-parallelism distributed training and inference framework for LLMs in JAX

Tutorials

Tutorial 1: Parallelization Basics
Tutorial 2: Data Parallel and Fully Sharded Data Parallel Training
Tutorial 3: Tensor Parallel and Transformers Scaling
Tutorial 4: Up Next
- 1. Other Forms of Parallelism in Distributed Deep Learning
- 2. What to Read Next

Indices and tables

Index
Module Index
Search Page

Next

© Copyright 2025, Shashank Shekhar.

Built with Sphinx using a theme provided by Read the Docs.