大模型微调 | Sharenet

Sorting

GraphGen: Fine-tuning Language Models Using Knowledge Graphs to Generate Synthetic Data

Comprehensive Introduction GraphGen is an open source framework developed by OpenScienceLab, an AI lab in Shanghai, hosted on GitHub, focused on optimizing supervised fine-tuning of Large Language Models (LLMs) by guiding synthetic data generation through knowledge graphs. It was developed from ...

2mos ago

0467

MiniMind-V: 1 hour training of a 26M parameter visual language model

综合介绍 MiniMind-V 是一个开源项目，托管于 GitHub，旨在帮助用户在 1 小时内训练一个仅 2600 万参数的轻量级视觉语言模型（VLM）。它基于 MiniMind 语言模型，新增视觉...

Latest AI tools # AI Java Open Source Projecct # Large model fine-tuning

3mos ago

0449

DeepCoder-14B-Preview: an open source model that specializes in code generation

综合介绍 DeepCoder-14B-Preview 是由 Agentica 团队开发并在 Hugging Face 平台发布的开源代码生成模型。它基于 DeepSeek-R1-Distilled-Q...

Latest AI tools # AI Java Open Source Projecct # Large model fine-tuning

3mos ago

0541

WeClone: training digital doppelgangers with WeChat chats and voices

Comprehensive introduction WeClone is an open source project that uses WeChat chat logs and voice messages, combined with large language models and speech synthesis technology, to allow users to create personalized digital doppelgangers. The project can analyze the user's chat habits to train the model , but also a small number of voice samples to generate realistic sound...

Latest AI tools # AI Java Open Source Projecct # Large model fine-tuning

3mos ago

0599

Search-R1: A Tool for Reinforcement Learning to Train Large Models for Search and Reasoning

综合介绍 Search-R1 是一个开源项目，由 PeterGriffinJin 在 GitHub 上开发，基于 veRL 框架构建。它通过强化学习（RL）技术训练大语言模型（LLM），让模型自主学会...

Latest AI tools # AI Java Open Source Projecct # Large model fine-tuning

4mos ago

0655

Optexity: an open-source project to train AI to perform web actions with human demonstrations

General Introduction Optexity is an open source project on GitHub, developed by the Optexity team. Its core is to use human demonstration data to train AI to complete computer tasks, especially web page operations. The project contains three code libraries : Compute...

Latest AI tools # AI Java Open Source Projecct # Large model fine-tuning # Desktop Automation Intelligence

4mos ago

0746

Bonsai: A three-valued weighted language model suitable for operation on edge devices

综合介绍 Bonsai 是 deepgrove-ai 开发的一个开源语言模型，参数规模为 5 亿，采用三值权重（ternary weights）技术。它基于 Llama 架构和 Mistral 分词器...

Latest AI tools # AI Java Open Source Projecct # Large model fine-tuning

4mos ago

0662

Second Me: Locally trained AI doppelgangers with personal memories and habits

综合介绍 Second Me 是 Mindverse 团队开发的一个开源项目，它能让你在自己电脑上打造一个像“数字分身”的 AI。这个 AI 通过你的文字和记忆学会你的说话方式和习惯，变成一个懂你的智...

Latest AI tools # AI Java Open Source Projecct # AI Life Efficiency Assistant # Large model fine-tuning

4mos ago

01K

Easy Dataset: an easy tool for creating fine-tuned datasets for large models

综合介绍 Easy Dataset 是一个专门为大模型（LLM）微调设计的开源工具，托管在 GitHub 上。它提供了一个简单易用的界面，让用户可以上传文件、自动分割内容、生成问题和答案，最终输出适合...

Latest AI tools # AI Java Open Source Projecct # Large model fine-tuning

4mos ago

0927

MM-EUREKA: A Multimodal Reinforcement Learning Tool for Exploring Visual Reasoning

综合介绍 MM-EUREKA 是一个由上海人工智能实验室、上海交通大学等多方合作开发的开源项目。它通过基于规则的强化学习技术，把文本推理能力扩展到多模态场景，帮助模型处理图像和文字信息。这个工具的核心...

Latest AI tools # AI Java Open Source Projecct # Large model fine-tuning

4mos ago

0672

AI Toolkit by Ostris：Stable Diffusion与FLUX.1模型训练工具包

AI Toolkit by Ostris: Stable Diffusion with FLUX.1 Model Training Toolkit

综合介绍 AI Toolkit by Ostris 是一个开源的AI工具集，专注于支持Stable Diffusion及FLUX.1模型的训练与图像生成任务。该工具集由开发者Ostris创建并维护，托...

Latest AI tools # AI Image Generation Aids # AI Java Open Source Projecct # Large model fine-tuning

4mos ago

0821

X-R1: Low-cost training of 0.5B models in common devices

综合介绍 X-R1 是一个由 dhcode-cpp 团队在 GitHub 上开源的强化学习框架，旨在为开发者提供一个低成本、高效的工具，用于训练基于端到端强化学习的模型。该项目受到 DeepSeek...

Latest AI tools # AI Java Open Source Projecct # Large model fine-tuning

4mos ago

0589

OpenManus-RL: Fine-tuning Large Models to Enhance Intelligent Body Reasoning and Decision Making

综合介绍 OpenManus-RL是由UIUC-Ulab与 MetaGPT 社区的OpenManus团队联合开发的开源项目，托管于GitHub。该项目通过强化学习（RL）技术提升大型语言模型（LLM...

Latest AI tools # AI Java Open Source Projecct # Large model fine-tuning

4mos ago

0753

TPO-LLM-WebUI: An AI framework where you can input questions to train a model to output results in real time

综合介绍 TPO-LLM-WebUI 是由 Airmomo 在 GitHub 上开源的一个创新项目，通过直观的 Web 界面实现大语言模型（LLM）的实时优化。它采用 TPO（Test-Time Pr...

Latest AI tools # AI Java Open Source Projecct # Large model fine-tuning

5mos ago

0761

Open-Reasoner-Zero: Open Source Large-Scale Reasoning Reinforcement Learning Training Platform

General Introduction Open-Reasoner-Zero is an open source project focused on reinforcement learning (RL) research, developed by the Open-Reasoner-Zero team on GitHub. It aims to provide efficient, scalable and easy-to-use training ...

Latest AI tools # AI Java Open Source Projecct # Large model fine-tuning

5mos ago

0796

Chinese based full-blooded DeepSeek-R1 distillation dataset, supports Chinese R1 distillation SFT dataset

Comprehensive Introduction The Chinese DeepSeek-R1 distillation dataset is an open source Chinese dataset containing 110K pieces of data designed to support machine learning and natural language processing research. The dataset is released by Cong Liu's NLP team. The dataset contains not only mathematical data, but also a large number of general types...

Latest AI tools # AI Java Open Source Projecct # Large model fine-tuning

5mos ago

0836