You can not select more than 25 topics Topics must start with a chinese character,a letter or number, can include dashes ('-') and can be up to 35 characters long.

architecture.md 2.7 kB

12345678910111213141516171819202122232425262728293031323334353637383940414243444546474849505152535455565758
  1. # Edge Cloud Collaborative AI Framework
  2. ## Motivation
  3. Currently, "Edge AI" in the industry is at an early stage of training on the cloud and inference on the edge. However, the future trend has emerged, and related research and practice are booming, bringing new value growth points for edge computing and AI. Also, edge AI applications have much room for optimization in terms of cost, model effect, and privacy protection. For example:
  4. This proposal provides a basic framework for edge-cloud collaborative training and inference, so that AI applications running at the edge can benefit from cost reduction, model performance improvement, and data privacy protection.
  5. ### Goals
  6. For AI applications running at the edge, the goals of edge cloud collaborative framework are:
  7. * reducing resource cost on the edge
  8. * improving model performance
  9. * protecting data privacy
  10. ## Proposal
  11. * What we propose:
  12. * an edge-cloud collaborative AI framework based on KubeEdge
  13. * with embed collaborative training and joint inferencing algorithm
  14. * working with existing AI framework like Tensorflow, etc
  15. * 3 Features:
  16. * joint inference
  17. * incremental learning
  18. * federated learning
  19. * Targeting Users:
  20. * Domain-specific AI Developers: build and publish edge-cloud collaborative AI services/functions easily
  21. * Application Developers: use edge-cloud collaborative AI capabilities.
  22. * We are NOT:
  23. * to re-invent existing ML framework, i.e., tensorflow, pytorch, mindspore, etc.
  24. * to re-invent existing edge platform, i.e., kubeedge, etc.
  25. * to offer domain/application-specific algorithms, i.e., facial recognition, text classification, etc.
  26. ### Architecture
  27. ![](./images/framework.png)
  28. * GlobalManager: implements the Edge AI features controllers based on the [k8s operator pattern](https://kubernetes.io/docs/concepts/extend-kubernetes/operator/)
  29. * Federated Learning Controller: Implements the federated learning feature based on user created CRDs
  30. * Incremental Learning Controller: Implements the incremental learning feature based on user created CRDs
  31. * Joint Inference Controller: Implements the joint inference feature based on user created CRDs
  32. * LocalController: manages the Edge AI features, the extra dataset/model resources on the edge nodes
  33. * Workers: includes the training/evaluation/inference/aggregator
  34. * do inference or training, based on existing ML framework
  35. * launch on demand, imagine they are docker containers
  36. * different workers for different features
  37. * could run on edge or cloud
  38. * Lib: exposes the Edge AI features to applications, i.e. training or inference programs

No Description