Bonsai : un modèle de langage pondéré à trois valeurs adapté aux appareils périphériques

Derniers outils d'IAPosté il y a 4 mois Sharenet.ai

692 0

Introduction générale

Bonsai 是 deepgrove-ai 开发的一个开源语言模型，参数规模为 5 亿，采用三值权重（ternary weights）技术。它基于 Llama 架构和 Mistral 分词器设计，线性层经过调整以支持三值权重。模型主要使用 DCLM-Pro 和 Fineweb-Edu 数据集训练，总计不到 50 亿个标记。尽管训练数据少，Bonsai 性能依然出色，是首批达到竞争水平的轻量三值模型之一。用户可以通过 Huggingface Transformers 库调用它。项目代码在 GitHub 上公开，适合开发者探索高效 AI 模型。

Liste des fonctions

轻量高效运行：采用三值权重技术，模型小巧，运行速度快，适合低资源设备。
生成自然语言：支持生成流畅文本，可用于对话、问答等任务。
开源访问：在 GitHub 上提供完整代码，允许用户下载、修改和优化。
兼容 Huggingface：无缝集成到 Transformers 库，便于加载和部署。
性能优异：在少量训练数据下，性能可媲美同级别模型。

Utiliser l'aide

Processus d'installation

要使用 Bonsai，需要先搭建运行环境。以下是详细步骤：

检查 Python 环境
确保电脑安装了 Python 3.8 或以上版本。在终端输入：

python --version

如果未安装，可从 Python 官网下载。

安装 Transformers 库
Bonsai 依赖 Huggingface 的 Transformers 库。在终端运行：

pip install transformers

安装后，用 pip show transformers Confirmer la version.

下载 Bonsai 模型
模型托管在 Huggingface 上。推荐通过代码自动加载（见下文），也可手动下载。
安装可选依赖
如果需要微调或加速，可安装 torch répondre en chantant datasets: :

pip install torch datasets

Comment l'utiliser

Bonsai 使用 Python 脚本调用。以下是基本操作步骤：

加载模型和分词器

在 Python 中运行以下代码：

from transformers import AutoTokenizer, AutoModelForCausalLM
tokenizer = AutoTokenizer.from_pretrained("deepgrove/Bonsai", trust_remote_code=True)
model = AutoModelForCausalLM.from_pretrained("deepgrove/Bonsai", trust_remote_code=True)

生成文本

输入文本并生成结果：

text = "中国的首都是哪里？"
inputs = tokenizer(text, return_tensors="pt")
outputs = model.generate(**inputs, max_length=100)
result = tokenizer.decode(outputs[0], skip_special_tokens=True)
print(result)

输出可能是“中国的首都是北京。”。

Paramètres de réglage

可修改生成参数，例如：

outputs = model.generate(**inputs, max_length=50, temperature=0.7)

max_length：设置输出长度。
temperature：控制输出随机性，值越小越稳定。

Fonction en vedette Fonctionnement

高效运行

Bonsai 的三值权重让它在 16 位精度下运行良好。若有 GPU，可自动加速：

import torch
print(torch.cuda.is_available())  # 返回 True 表示 GPU 可用

GPU 会显著提升性能，但 CPU 也能正常运行。

Évaluation des performances

Bonsai 在多个基准测试中表现优异。以下是官方数据：

modélisation	ARC-c	ARC-e	HS.	OBQA	PiQA	Wino.	MMLU	平均分
MobiLlama 0.5B	26.62	46.68	51.66	30.00	71.65	54.50	28.61	44.25
Qwen 2 0.5B	28.84	50.29	49.12	33.00	69.26	56.99	31.78	45.61
MobileLLM 600M	29.01	56.65	55.35	34.00	71.65	59.75	31.40	48.13
Qwen 2.5 0.5B	32.25	58.29	52.18	35.40	69.91	56.12	33.40	48.22
Bonsai	33.36	57.95	48.04	34.00	70.24	54.85	30.28	46.96
这些测试包括 ARC、OBQA、MMLU 等，显示 Bonsai 在轻量模型中名列前茅。

微调模型

Bonsai 未经过指令微调，适合通用生成任务。若需优化特定用途（如问答），可自行微调：

准备数据：用文本文件或 datasets 库加载。
配置参数：用 TrainingArguments 设置。
训练模型：

from transformers import Trainer, TrainingArguments
training_args = TrainingArguments(
output_dir="./bonsai_finetuned",
num_train_epochs=3,
per_device_train_batch_size=4
)
trainer = Trainer(model=model, args=training_args, train_dataset=your_dataset)
trainer.train()

更多细节见 Huggingface 文档.