Published onApril 18, 2024Deploy Llama 3 on Amazon SageMaker#HuggingFace#LLM#SageMaker#GenerativeAIIn this blog post you will learn how to deploy Llama 3 70B to Amazon SageMaker.Read more →
Published onApril 2, 2024Accelerate Mixtral 8x7B with Speculative Decoding and Quantization on Amazon SageMaker#HuggingFace#LLM#SageMaker#GenerativeAIIn this blog post you will learn how to accelerate Mixtral using Speculative Decoding (Medusa) and Quantization (AWQ).Read more →
Published onMarch 26, 2024Deploy Llama 2 70B on AWS Inferentia2 with Hugging Face Optimum#HuggingFace#LLM#SageMaker#GenerativeAIIn this blog post you will learn how to deploy Meta Llama 2 70B on AWS Inferentia2 with Hugging Face Optimum on Amazon SageMaker.Read more →
Published onMarch 12, 2024Fine-Tune & Evaluate LLMs in 2024 with Amazon SageMaker#HuggingFace#LLM#SageMaker#GenerativeAIIn this blog post you will learn how to fine-tune open LLMs from Hugging Face using Amazon SageMaker.Read more →
Published onMarch 5, 2024Evaluate LLMs with Hugging Face Lighteval on Amazon SageMaker#HuggingFace#LLM#SageMaker#GenerativeAIIn this blog post you will learn how to evaluate LLMs using Hugging Face lighteval on Amazon SageMaker.Read more →
Published onMarch 1, 2024How to fine-tune Google Gemma with ChatML and Hugging Face TRL#HuggingFace#LLM#RLHF#GenerativeAIIn this blog post you will learn how to fine tune Google Gemma using Hugging Face Transformers, Datasets and TRL.Read more →
Published onJanuary 23, 2024RLHF in 2024 with DPO & Hugging Face#HuggingFace#LLM#RLHF#GenerativeAIIn this blog post you will learn how to align LLMs using Hugging Face TRL and RLHF through Direct Preference Optimization (DPO).Read more →
Published onJanuary 23, 2024How to Fine-Tune LLMs in 2024 with Hugging Face#HuggingFace#LLM#Opensource#GenerativeAIIn this blog post you will learn how to fine-tune LLMs using Hugging Face TRL, Transformers and Datasets in 2024. We will fine-tune a LLM on a text to SQL dataset.Read more →
Published onJanuary 11, 2024Scale LLM Inference on Amazon SageMaker with Multi-Replica Endpoints#LLAMA#HuggingFace#LLM#SageMakerIn this blog post you will learn how to increase the throughput of Llama 13B on Amazon SageMaker using single instance multi-replica endpoints.Read more →
Published onDecember 21, 2023Fine-tune Llama 7B on AWS Trainium #GenerativeAI#HuggingFace#LLM#AWSIn this blog post you will learn how to fine-tune Llama 7B on AWS Trainium using the Hugging Face Optimum Neuron library.Read more →