History

enlin 6c534ad7d3 update V3.0		3 years ago
..
docs	update V3.0	3 years ago

src/main	update operator	4 years ago

.gitignore	update operator	4 years ago

README.md	update operator	4 years ago

pom.xml	update distribute-train-operator	5 years ago

之江天枢-分布式训练 operator

该模块是分布式训练CRD的控制器，管理分布式训练容器生命周期，为分布式训练容器注入其他容器ip。

源码部署

安装如下软件环境。

git clone https://codeup.teambition.com/zhejianglab/distribute-train-operator.git
# 进入项目根目录
cd distribute-train-operator

# 构建，生成的 jar 包位于 ./target/distribute-train-operator-1.0.jar
mvn clean compile package

一站式算法开发平台、高性能分布式深度学习框架、先进算法模型库、视觉模型炼知平台、数据可视化分析平台等一系列平台及工具，在模型高效分布式训练、数据处理和可视分析、模型炼知和轻量化等技术上形成独特优势，目前已在产学研等各领域近千家单位及个人提供AI应用赋能

深度学习大数据处理数据可视化模型分布式训练

Java Vue Python Text JavaScript other

tianshu@zhejianglab.com 648240260@qq.com 864216432@qq.com jiangjiqiong@zhejianglab.com 1103225671@qq.com yeyue@zhejianglab.com