#kubeflow

tencentmusic/cube-studio

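Cube Studio is an open-source, cloud-native, one-stop AI platform for machine learning, deep learning, and large models. It covers the full MLOps algorithm pipeline: a compute rental platform, online notebook development, drag-and-drop pipeline orchestration, multi-node multi-GPU distributed training, hyperparameter search, inference serving with vGPU virtualization, edge computing, a labeling platform with automated annotation, SFT fine-tuning / reward model / reinforcement learning training for large models such as DeepSeek, multi-node large-model inference with vllm/ollama/mindie, a private knowledge base, and an AI model marketplace. It supports Chinese domestic CPU/GPU/NPU hardware and the Ascend ecosystem, RDMA, and distributed frameworks such as pytorch/tf/mxnet/deepspeed/paddle/colossalai/horovod/ray/volcano.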

Jupyter Notebook
4519
2 months ago
alirezadir/Production-Level-Deep-Learning

A guideline for building practical production-level deep learning systems to be deployed in real world applications.

4507
2 months ago

Standardized Distributed Generative and Predictive AI Inference Platform for Scalable, Multi-Framework Deployment on Kubernetes

Python
4455
1 day ago
Python
3913
1 hour ago

Unified Interface for Constructing and Managing Workflows on different workflow engines, such as Argo Workflows, Tekton Pipelines, and Apache Airflow.

Python
940
10 months ago

DoEKS is a tool to build, deploy and scale Data Platforms on Amazon EKS

HCL
790
3 hours ago

Kubeflow’s superfood for Data Scientists

Python
639
1 day ago

Kubernetes Operator for MPI-based applications (distributed training, HPC, etc.)

Go
491
5 days ago

deployKF builds machine learning platforms on Kubernetes. We combine the best of Kubeflow, Airflow†, and MLflow† into a complete platform.

Shell
447
1 year ago

Compare MLOps Platforms. Breakdowns of SageMaker, VertexAI, AzureML, Dataiku, Databricks, h2o, kubeflow, mlflow...

394
3 years ago

👩‍🔬 Train and Serve TensorFlow Models at Scale with Kubernetes and Kubeflow on Azure

Python
291
5 years ago