Welcome to the Large Language Models section of the AI Engineering Academy. This module provides a comprehensive overview of LLMs and their practical applications in AI engineering.
| Category | Topic | Resource |
|---|---|---|
| Introduction | Overview | Introduction to LLMs |
| Theory Behind Fine-tuning | Pre-Training | Pre-Training |
| | Supervised Fine-Tuning (SFT) | SFT Theory |
| | Proximal Policy Optimization (PPO) | PPO Theory |
| | Direct Preference Optimization (DPO) | DPO Theory |
| | Odds Ratio Preference Optimization (ORPO) | ORPO Theory |
| | Group Relative Policy Optimization (GRPO) | GRPO Theory |
| Hands-On SFT | Overview | SFT Implementation Guide |
| | Implementation | SFT Notebook |
| Hands-On GRPO | Guide | Hacker Guide to GRPO |
| | Implementation | Qwen 0.5B GRPO |
| Gemma | Overview | Gemma Guide |
| | Implementation | Gemma Fine-tuning |
| Llama2 | Overview | Llama2 Guide |
| | Implementation | Llama2 Fine-tuning |
| | Advanced | QLoRA Fine-tuning |
| Llama3 | Implementation | Llama3 Fine-tuning |
| Mistral-7B | Overview | Mistral Guide |
| | Implementation | Mistral Fine-tuning |
| | Evaluation | Evaluation Harness |
| | DPO | DPO Fine-tuning |
| | SFT | SFT Trainer |
| | Inference | ChatML Inference |
| Mixtral | Implementation | Mixtral Fine-tuning |
| Visual Language Models | Florence2 | Florence2 Fine-tuning |
| | PaliGemma | PaliGemma Fine-tuning |
| Architecture | Parameter Analysis | Parameter Count |
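To give a flavor of what the hands-on SFT material covers, here is a minimal, illustrative sketch using Hugging Face TRL's `SFTTrainer`. The model and dataset names below are placeholders, and the exact configuration fields vary between TRL versions; see the SFT Implementation Guide and SFT Notebook listed above for the full walkthrough.

```python
# Illustrative SFT sketch (not the module's exact notebook code).
# Assumes `transformers`, `trl`, and `datasets` are installed; the model and
# dataset names are placeholders. SFTConfig fields differ between TRL
# versions, so treat this as a minimal outline rather than a recipe.
from datasets import load_dataset
from trl import SFTConfig, SFTTrainer

# Placeholder instruction dataset with a text column suitable for SFT.
dataset = load_dataset("your-org/your-instruction-dataset", split="train")

config = SFTConfig(
    output_dir="./sft-demo",
    per_device_train_batch_size=2,
    num_train_epochs=1,
)

trainer = SFTTrainer(
    model="Qwen/Qwen2-0.5B",  # any small causal LM works for a first run
    args=config,
    train_dataset=dataset,
)
trainer.train()
```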
The recommended learning path, organized by level:

| Level | Steps | Resources |
|---|---|---|
| Beginner | 1. Introduction to LLMs | Introduction |
| | 2. Understanding core theory | Pre-Training, SFT Theory |
| | 3. First implementation | SFT Guide |
| | 4. Practical application | Llama2 Fine-tuning |
| Intermediate | 1. Advanced techniques | DPO Theory, PPO Theory |
| | 2. Model implementation | Mistral Fine-tuning |
| | 3. Architecture concepts | Parameter Count |
| Advanced | 1. Cutting-edge methods | ORPO Theory, GRPO Theory |
| | 2. Advanced implementation | GRPO Implementation |
| | 3. Multimodal models | Florence2, PaliGemma |
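As a small taste of the architecture material (the Parameter Count resource), the sketch below counts the total and trainable parameters of a pretrained model with the `transformers` library; the checkpoint name is a placeholder.

```python
# Illustrative parameter-count check, assuming the `transformers` library.
# The model name is a placeholder; swap in whichever checkpoint you are studying.
from transformers import AutoModelForCausalLM

model = AutoModelForCausalLM.from_pretrained("gpt2")

# Sum parameter element counts; requires_grad distinguishes trainable weights.
total = sum(p.numel() for p in model.parameters())
trainable = sum(p.numel() for p in model.parameters() if p.requires_grad)

print(f"Total parameters:     {total:,}")
print(f"Trainable parameters: {trainable:,}")
```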
We welcome contributions to expand this repository. Please follow the standard pull request process and ensure your contributions align with the overall structure.
This project is licensed under the MIT License - see the LICENSE file for details.
AI Engineering Academy - Advancing the frontier of language model understanding and implementation