secondKarlMarx

A Marxist theory assistant built on a large language model, with support for distributed training and remote access over MCP.

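Project Overview

secondKarlMarx is a large language model fine-tuning project focused on Marxist theory. It uses SFT (supervised fine-tuning) so that the model can understand and explain Marxist theory in depth while retaining its natural conversation ability and its ability to call RAG tools.

Key features:

Fine-tuned from a mainstream large language model (such as Llama 3)
Supports multi-GPU distributed training
Uses LoRA, a parameter-efficient fine-tuning method
Remote access through MCP (Model Context Protocol)
Supports RAG (retrieval-augmented generation) tool calls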
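
To make the LoRA item concrete, the sketch below counts trainable parameters for a single adapted weight matrix. LoRA freezes a d_out x d_in weight and learns two low-rank factors of shapes d_out x r and r x d_in, so it trains r * (d_in + d_out) values instead of d_in * d_out. The 4096 x 4096 layer shape and rank 8 are illustrative assumptions, not values taken from this project's configuration.

```python
def lora_trainable_params(d_in: int, d_out: int, rank: int) -> int:
    """Parameters in the two low-rank factors B (d_out x r) and A (r x d_in)."""
    return rank * (d_in + d_out)


def full_params(d_in: int, d_out: int) -> int:
    """Parameters in the frozen dense weight matrix."""
    return d_in * d_out


if __name__ == "__main__":
    # Hypothetical layer shape and rank, chosen only for illustration.
    d_in, d_out, rank = 4096, 4096, 8
    lora = lora_trainable_params(d_in, d_out, rank)
    full = full_params(d_in, d_out)
    print(f"LoRA: {lora:,} trainable vs {full:,} frozen ({lora / full:.2%})")
```

The ratio shrinks as the layer grows, which is why the adapter stays small even on large base models.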

Project Structure

secondKarlMarx/
├── configs/                # Configuration files
│   ├── training_config.py  # Training configuration
│   └── ds_config.json      # DeepSpeed configuration
├── training/               # Training code
│   ├── data_utils.py       # Data processing utilities
│   └── trainer.py          # Trainer implementation
├── model/                  # Model code
│   └── model_loader.py     # Model loader
├── mcp/                    # MCP service
│   ├── server.py           # MCP server
│   ├── client.py           # MCP client
│   └── mcp_config.json     # MCP configuration
├── utils/                  # Utility functions
├── train.py                # Main training script
├── run_distributed_training.sh  # Launch script for distributed training
├── start_mcp_server.py     # Script that starts the MCP server
└── requirements.txt        # Dependency list

Installation

1. Environment setup

# Clone the repository
git clone https://github.com/yourusername/secondKarlMarx.git
cd secondKarlMarx

# Create a virtual environment
python -m venv venv
source venv/bin/activate  # Linux/Mac
# or
venv\Scripts\activate  # Windows

# Install dependencies
pip install -r requirements.txt

2. Hugging Face authentication (required only for gated models)

# Set the Hugging Face token
export HUGGING_FACE_HUB_TOKEN=your_token_here
# or, on Windows
set HUGGING_FACE_HUB_TOKEN=your_token_here
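
Before launching a long training job, it can help to confirm that the token is actually visible to Python. The following is a minimal sketch, not part of the repository; the helper name find_hf_token is hypothetical. It checks HUGGING_FACE_HUB_TOKEN, the variable set above, and also HF_TOKEN, the newer name read by recent huggingface_hub releases.

```python
import os
from typing import Mapping, Optional


def find_hf_token(env: Mapping[str, str] = os.environ) -> Optional[str]:
    """Return the first non-empty Hugging Face token found in the environment."""
    # HUGGING_FACE_HUB_TOKEN is the variable this README sets; HF_TOKEN is the
    # newer name used by recent huggingface_hub releases.
    for name in ("HF_TOKEN", "HUGGING_FACE_HUB_TOKEN"):
        value = env.get(name, "").strip()
        if value:
            return value
    return None


if __name__ == "__main__":
    if find_hf_token() is None:
        print("No Hugging Face token set; downloads of gated models may fail.")
    else:
        print("Hugging Face token found.")
```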

Usage

1. Train the model (on the cloud server)

Single-GPU training

python train.py

Multi-GPU distributed training

# Edit the GPU settings in run_distributed_training.sh first
chmod +x run_distributed_training.sh
./run_distributed_training.sh
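
When moving from one GPU to several, the global batch size per optimizer step changes unless gradient accumulation is adjusted, because in data-parallel training each GPU processes its own micro-batch. The sketch below shows the arithmetic. The function names and the numbers are illustrative assumptions; the actual values live in this project's training and DeepSpeed configuration.

```python
def effective_batch_size(per_device_batch: int, grad_accum_steps: int, num_gpus: int) -> int:
    """Samples consumed per optimizer step under data parallelism."""
    return per_device_batch * grad_accum_steps * num_gpus


def grad_accum_for_target(target_batch: int, per_device_batch: int, num_gpus: int) -> int:
    """Accumulation steps needed to reach target_batch exactly."""
    per_step = per_device_batch * num_gpus
    if target_batch % per_step != 0:
        raise ValueError("target_batch must be a multiple of per_device_batch * num_gpus")
    return target_batch // per_step


if __name__ == "__main__":
    # Hypothetical settings: micro-batch of 4 with 8 accumulation steps.
    print(effective_batch_size(4, 8, num_gpus=1))   # single GPU
    print(effective_batch_size(4, 8, num_gpus=4))   # four GPUs, same settings
    # Accumulation steps that keep the single-GPU global batch on four GPUs.
    print(grad_accum_for_target(32, 4, num_gpus=4))
```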

2. Start the MCP service (on the cloud server)

# Start the MCP server
python start_mcp_server.py --model_path ./results/final_model --host 0.0.0.0 --port 8000
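
The three flags in the command above could be parsed along the following lines. This is a sketch under stated assumptions, not the contents of start_mcp_server.py: only the flag names come from the command, while the defaults, help text, and function names are hypothetical.

```python
import argparse
from typing import Optional, Sequence


def build_parser() -> argparse.ArgumentParser:
    """Build a parser for the flags shown in the README command."""
    parser = argparse.ArgumentParser(description="Start the MCP server (sketch).")
    parser.add_argument("--model_path", required=True,
                        help="Directory containing the fine-tuned model.")
    # Assumed default: bind to localhost unless told otherwise.
    parser.add_argument("--host", default="127.0.0.1",
                        help="Interface to bind; 0.0.0.0 listens on all interfaces.")
    parser.add_argument("--port", type=int, default=8000,
                        help="TCP port to listen on.")
    return parser


def parse_args(argv: Optional[Sequence[str]] = None) -> argparse.Namespace:
    """Parse argv, or sys.argv[1:] when argv is None."""
    return build_parser().parse_args(argv)


if __name__ == "__main__":
    args = parse_args(["--model_path", "./results/final_model",
                       "--host", "0.0.0.0", "--port", "8000"])
    print(args.model_path, args.host, args.port)
```

Binding to 0.0.0.0 exposes the service on every network interface, so it is worth pairing with a firewall rule that limits who can reach the port.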

3. Use the model from a local laptop

Configure the MCP client

Edit mcp/mcp_config.json and set the server IP and path:

{
  "mcpServers": {
    "secondKarlMarx": {
      "command": "python",
      "args": ["/path/to/your/server.py"],
      "host": "your-server-ip",
      "port": 8000
    }
  }
}
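
A typo in this file tends to surface only as a failed connection, so a quick validation pass can save time. The sketch below parses the JSON shown above and checks that the expected fields are present. The function name parse_mcp_config is hypothetical, and how mcp/client.py actually reads the file is not shown in this README.

```python
import json
from typing import Any, Dict


def parse_mcp_config(text: str, server_name: str = "secondKarlMarx") -> Dict[str, Any]:
    """Parse the JSON text and return the validated entry for one server."""
    config = json.loads(text)
    try:
        entry = config["mcpServers"][server_name]
    except KeyError as exc:
        raise ValueError(f"missing mcpServers.{server_name}") from exc
    # These four fields are the ones that appear in the README example.
    for key in ("command", "args", "host", "port"):
        if key not in entry:
            raise ValueError(f"missing field: {key}")
    port = entry["port"]
    if isinstance(port, bool) or not isinstance(port, int) or not 1 <= port <= 65535:
        raise ValueError(f"invalid port: {port!r}")
    return entry


if __name__ == "__main__":
    sample = """
    {
      "mcpServers": {
        "secondKarlMarx": {
          "command": "python",
          "args": ["/path/to/your/server.py"],
          "host": "your-server-ip",
          "port": 8000
        }
      }
    }
    """
    entry = parse_mcp_config(sample)
    print(entry["host"], entry["port"])
```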

Start the client interface:

python mcp/client.py
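
If the client cannot connect, a frequent cause is that the server port is blocked by a firewall or cloud security group. The sketch below tests plain TCP reachability from the laptop. It only shows that something is listening on the port, not that the listener is the MCP server, and the helper name can_connect is hypothetical.

```python
import socket


def can_connect(host: str, port: int, timeout: float = 3.0) -> bool:
    """Return True if a TCP connection to host:port succeeds within timeout."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections, timeouts, and name resolution failures.
        return False


if __name__ == "__main__":
    # Replace with the host and port from mcp/mcp_config.json.
    host, port = "127.0.0.1", 8000
    state = "reachable" if can_connect(host, port) else "not reachable"
    print(f"{host}:{port} is {state}")
```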

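Customization

Changing the training configuration

Edit configs/training_config.py to adjust the following parameters:

Base model: set model_name_or_path in BASE_MODEL_CONFIG
Dataset: set dataset_name in DATASET_CONFIG
Training parameters: adjust the entries in TRAINING_CONFIG
LoRA settings: adjust the parameters in LORA_CONFIG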
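
For orientation, the four dictionaries named above might be laid out roughly as follows. This is a hypothetical sketch, not the actual file: the dictionary names and the keys model_name_or_path and dataset_name come from this README, while every other key and every value is an assumption. The extra key names mirror parameter names used by transformers TrainingArguments and peft LoraConfig.

```python
# Hypothetical layout for configs/training_config.py.
BASE_MODEL_CONFIG = {
    "model_name_or_path": "meta-llama/Meta-Llama-3-8B",  # assumed model ID
}

DATASET_CONFIG = {
    "dataset_name": "your-dataset-name",  # placeholder, not a real dataset
}

TRAINING_CONFIG = {
    "per_device_train_batch_size": 4,   # assumed value
    "gradient_accumulation_steps": 8,   # assumed value
    "learning_rate": 2e-4,              # assumed value
    "num_train_epochs": 3,              # assumed value
}

LORA_CONFIG = {
    "r": 8,                                  # adapter rank, assumed
    "lora_alpha": 16,                        # scaling factor, assumed
    "lora_dropout": 0.05,                    # assumed value
    "target_modules": ["q_proj", "v_proj"],  # assumed attention projections
}


def summarize() -> str:
    """One-line summary of the settings most often changed."""
    return (
        f"model={BASE_MODEL_CONFIG['model_name_or_path']} "
        f"dataset={DATASET_CONFIG['dataset_name']} "
        f"lora_r={LORA_CONFIG['r']} "
        f"lr={TRAINING_CONFIG['learning_rate']}"
    )


if __name__ == "__main__":
    print(summarize())
```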

Changing the DeepSpeed configuration

Edit configs/ds_config.json to adjust the distributed training parameters.
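
As a starting point, the sketch below builds a small DeepSpeed configuration and prints it as JSON. The key names are standard DeepSpeed configuration fields, but the values, the choice of ZeRO stage 2, and the helper name build_ds_config are assumptions; the contents of this project's ds_config.json are not shown in this README.

```python
import json
from typing import Any, Dict


def build_ds_config(zero_stage: int = 2, use_bf16: bool = True) -> Dict[str, Any]:
    """Return a minimal DeepSpeed config dictionary with assumed values."""
    if zero_stage not in (0, 1, 2, 3):
        raise ValueError("zero_stage must be 0, 1, 2, or 3")
    return {
        "train_micro_batch_size_per_gpu": 4,   # assumed value
        "gradient_accumulation_steps": 8,      # assumed value
        "gradient_clipping": 1.0,              # assumed value
        "bf16": {"enabled": use_bf16},
        "zero_optimization": {"stage": zero_stage},
    }


if __name__ == "__main__":
    # Print the JSON; redirect to a file to use it as a DeepSpeed config.
    print(json.dumps(build_ds_config(), indent=2))
```

Higher ZeRO stages partition more training state across GPUs, which lowers per-GPU memory use at the cost of extra communication.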

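Notes

Make sure the cloud server has enough GPU memory
For large models, 8-bit or 4-bit quantization is recommended
Check that the dataset format meets the requirements before training
The MCP server's port must be open to external access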
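
The effect of quantization on GPU memory can be estimated with simple arithmetic: weight memory is roughly the parameter count times the bits per parameter, divided by 8 to get bytes. The sketch below covers weights only. It ignores activations, gradients, optimizer state, and quantization overhead, so real usage will be higher, and the 8-billion-parameter figure is an illustrative assumption.

```python
def weight_memory_gb(num_params: float, bits_per_param: int) -> float:
    """Approximate memory for model weights only, in GB (1 GB = 1e9 bytes)."""
    return num_params * bits_per_param / 8 / 1e9


if __name__ == "__main__":
    num_params = 8e9  # hypothetical 8B-parameter model
    for bits in (16, 8, 4):
        print(f"{bits:>2}-bit weights: about {weight_memory_gb(num_params, bits):.1f} GB")
```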

Contributing

Issues and pull requests to improve the project are welcome.

License

MIT License

Repository: ChizhongWang/secondKarlMarx
