LOADING

AI Engineering Institute: 3Fine-tuning (fine-tuning of large language models)

AI Knowledge Base7mos agoupdate Sharenet.ai

1.1K 0

📚 Structure of the database

Models/Catalog	Description and content
Axolotl	A framework for fine-tuning language models
Gemma	Google's latest implementation of the Big Language Model
- `finetune-gemma.ipynb` - `gemma-sft.py` - `Gemma_finetuning_notebook.ipynb`	Fine-tuning notebooks and scripts
LLama2	Meta's Open Source Large Language Model
- `generate_response_stream.py` - `Llama2_finetuning_notebook.ipynb` - `Llama_2_Fine_Tuning_using_QLora.ipynb`	Implementation and fine-tuning guidelines
Llama3	Upcoming Meta Large Language Modeling Experiments
- `Llama3_finetuning_notebook.ipynb`	Initial fine-tuning experiments
LlamaFactory	A Framework for Training and Deployment of Large Language Models
LLMArchitecture/ParameterCount	Technical details of the model architecture
Mistral-7b	Mistral AI The 7 billion parameter model
- `LLM_evaluation_harness_for_Arc_Easy_and_SST.ipynb` - `Mistral_Colab_Finetune_ipynb_Colab_Final.ipynb` - `notebooks_chatml_inference.ipynb` - `notebooks_DPO_fine_tuning.ipynb` - `notebooks_SFTTrainer TRL.ipynb` - `SFT.py`	Integrated notebook for assessment, fine-tuning and reasoning
Mixtral	Mixtral's Expert Mixing Model
- `Mixtral_fine_tuning.ipynb`	Fine-tuning Realization
VLM	visual language model
- `Florence2_finetuning_notebook.ipynb` - `PaliGemma_finetuning_notebook.ipynb`	Visual language model implementation

🎯 Module Overview

1. LLM architecture

Explore the following model implementations:
- Llama2 (Meta's open source model)
- Mistral-7b (efficient 7 billion parameter model)
- Mixtral (expert hybrid architecture)
- Gemma (Google's latest contribution)
- Llama3 (upcoming experiment)

2. 🛠️ fine-tuning technology

implementation strategy
The LoRA (Low Rank Adaptation) approach
Advanced Optimization Methods

3. 🏗️ model architecture analysis

An in-depth study of the model structure
Parameter calculation method
Scalability Considerations

4. 🔧 Professional realization

Code Llama for programming tasks
Visual language modeling:
- Florence2
- PaliGemma

5. 💻 Practical applications

Integrated Jupyter Notebook
Response Generation Pipeline
Reasoning Implementation Guide

6. 🚀 Advanced Themes

DPO (Direct Preference Optimization)
SFT (supervised fine tuning)
Assessment methodology

AI Knowledge Base

© Copyright notes

The copyright of the article belongs to the author, please do not reprint without permission.

Related posts

朴素、有效的RAG检索策略：稀疏+密集混合检索并重排，并利用“提示缓存”为文本块生成整体文档相关的上下文

Simple, effective RAG retrieval strategy: sparse + dense hybrid search and rearrangement, and use "cue caching" to generate overall document-relevant context for text chunks.

AI Knowledge Base # Knowledge Retrieval with RAG Framework

6mos ago

01.2K

法律翻译领域：ChatGPT 与神经网络翻译 (NMT) 系统性能深度评测

Legal Translation: An In-Depth Review of ChatGPT and Neural Network Translation (NMT) System Performance

AI Knowledge Base

5mos ago

0749

Intents 意图：用zep解释如何让大模型理解客户意图

Intents : Explain with zep how to make a big model understand customer intents.

AI Knowledge Base

6mos ago

0965

LLM OCR 的局限性：光鲜外表下的文档解析难题

Limitations of LLM OCR: The Document Parsing Challenge Behind the Glossy Surface

AI Knowledge Base

5mos ago

0983

No comments

none

No comments...