Posts, articles, and discussions

Welcome fastText to the 🤗 Hub
By June 6, 2023

The Falcon has landed in the Hugging Face ecosystem
By June 5, 2023

Smaller is better: Q8-Chat, an efficient generative AI experience on Xeon
By May 16, 2023

Run a Chatgpt-like Chatbot on a Single GPU with ROCm
By May 15, 2023

Introducing RWKV — An RNN with the advantages of a transformer
By May 15, 2023

Assisted Generation: a new direction toward low-latency text generation
By May 11, 2023

Creating a Coding Assistant with StarCoder
By May 9, 2023

StarCoder: A State-of-the-Art LLM for Code
By May 4, 2023

Training a language model with 🤗 Transformers using TensorFlow and TPUs
By April 27, 2023

Accelerating Hugging Face Transformers with AWS Inferentia2
By April 17, 2023

StackLLaMA: A hands-on guide to train LLaMA with RLHF
By April 5, 2023

Fast Inference on Large Language Models: BLOOMZ on Habana Gaudi2 Accelerator
By March 28, 2023

Federated Learning using Hugging Face and Flower
By March 27, 2023 guest

Fine-tuning 20B LLMs with RLHF on a 24GB consumer GPU
By March 9, 2023

Using Machine Learning to Aid Survivors and Race through Time
By March 3, 2023