
DISCO is a code-free and installation-free browser platform that allows any non-technical user to collaboratively train machine learning models without sharing any private data.


epfml/disco


DISCO - DIStributed COllaborative Machine Learning

DISCO leverages federated 🌟 and decentralized ✨ learning to allow several data owners to collaboratively build machine learning models without sharing any original data.

The latest version is always running at the following link, directly in your browser, for web and mobile:

🕺 https://discolab.ai/ 🕺


🪄 DEVELOPERS: Have a look at our developer guide


❓ WHY DISCO?

  • To build deep learning models across private datasets without compromising data privacy, ownership, sovereignty, or model performance
  • To create an easy-to-use platform that allows non-specialists to participate in collaborative learning

⚙️ HOW DISCO WORKS

  • DISCO has a public model – private data approach
  • Private and secure model updates – not data – are communicated to either:
    • a central server : federated learning ( 🌟 )
    • directly between users : decentralized learning ( ✨ ) i.e. no central coordination
  • Model updates are then securely aggregated into a trained model
  • See more HERE
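
To make the aggregation step above concrete, here is a minimal TypeScript sketch of sample-weighted averaging of model updates, in the style of federated averaging. It is an illustration under assumptions, not DISCO's actual implementation: the names `ClientUpdate`, `numSamples`, and `aggregate` are hypothetical, and model weights are flattened into plain number arrays rather than TF.js tensors.

```typescript
// Hypothetical types and names for illustration only; not DISCO's API.
type Weights = number[];

interface ClientUpdate {
  weights: Weights;   // model parameters after local training
  numSamples: number; // size of the local dataset (the data itself is never sent)
}

// Sample-weighted average of the client updates.
function aggregate(updates: ClientUpdate[]): Weights {
  if (updates.length === 0) throw new Error("no updates to aggregate");
  const dim = updates[0].weights.length;
  const total = updates.reduce((sum, u) => sum + u.numSamples, 0);
  if (total <= 0) throw new Error("total sample count must be positive");

  const result: Weights = new Array<number>(dim).fill(0);
  for (const u of updates) {
    if (u.weights.length !== dim) throw new Error("weight shapes differ");
    for (let i = 0; i < dim; i++) {
      // Each client contributes in proportion to its share of the samples.
      result[i] += (u.weights[i] * u.numSamples) / total;
    }
  }
  return result;
}
```

For example, combining a client holding 1 sample with weights `[1, 2]` and a client holding 3 samples with weights `[3, 4]` yields `[2.5, 3.5]`. In the federated scheme a central server would run such a step; in the decentralized scheme each peer could run it over the updates it receives.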

❓ DISCO TECHNOLOGY

  • DISCO supports arbitrary deep learning tasks and model architectures via TensorFlow.js (TF.js)
  • Decentralized learning ( ✨ ) relies on peer-to-peer communication
  • Have a look at how DISCO ensures privacy and confidentiality HERE
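
This README does not say which secure aggregation scheme DISCO uses. As a general illustration of how updates can be summed without exposing any single participant's values, the following TypeScript sketch uses pairwise additive masking, chosen here as an assumption. The function names `maskUpdates` and `sumVectors` are hypothetical, and the masks are generated in one place for brevity, whereas in a real protocol each pair of peers would derive its mask from a shared secret.

```typescript
// Hypothetical sketch of pairwise additive masking; not DISCO's actual protocol.
type Vector = number[];

// Returns the vectors the peers would send: each update plus masks
// that cancel out once all masked vectors are summed.
function maskUpdates(updates: Vector[], rand: () => number = Math.random): Vector[] {
  if (updates.length === 0) return [];
  const dim = updates[0].length;
  if (updates.some((u) => u.length !== dim)) throw new Error("update shapes differ");

  const masked = updates.map((u) => [...u]);
  for (let i = 0; i < masked.length; i++) {
    for (let j = i + 1; j < masked.length; j++) {
      for (let k = 0; k < dim; k++) {
        const m = rand() * 100 - 50; // mask assumed to be agreed on by peers i and j
        masked[i][k] += m; // peer i adds the mask
        masked[j][k] -= m; // peer j subtracts the same mask
      }
    }
  }
  return masked;
}

// Element-wise sum; the aggregator only ever sees masked vectors.
function sumVectors(vectors: Vector[]): Vector {
  if (vectors.length === 0) throw new Error("nothing to sum");
  const acc = new Array<number>(vectors[0].length).fill(0);
  for (const v of vectors) {
    for (let k = 0; k < acc.length; k++) acc[k] += v[k];
  }
  return acc;
}
```

Summing the masked vectors gives the same result as summing the raw updates (up to floating-point rounding), while each individual masked vector looks unrelated to the update behind it. This toy version ignores dropouts and key agreement, which a practical scheme must handle.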

🧪 RESEARCH-BASED DESIGN

DISCO aims to enable open-access, easy-to-use distributed training that is

  • 🌪️ efficient (R1, R2)
  • 🔒 privacy-preserving (R3, R4)
  • 🛠️ fault-tolerant and dynamic over time (R5)
  • 🥷 robust to malicious actors and data poisoning (R6, R7)
  • 🍎 🍌 interpretable in imperfectly interoperable data distributions (R8)
  • 🪞 personalizable (R9)
  • 🥕 fair in incentivizing participation

🏁 HOW TO USE DISCO

  • Start by exploring our example DISCOllaboratives in the Tasks page.
  • The example models are based on popular datasets such as Titanic, MNIST or CIFAR-10
  • It is also possible to create your own task without coding on the custom training page:
    • Upload the initial model
    • You can choose from several existing dataloaders
    • Choose between federated and decentralized for your DISCO training scheme ... connect your data and... done! 📊
    • For more details on ML tasks and custom training have a look at this guide

Note: Currently, only CSV and image data types are supported. New data types, preprocessing code, and dataloaders can be added in developer mode (see the developer guide).


JOIN US

  • You are welcome to join us on Slack