Rust ML WG Meeting 00011

Meeting Info

Date: 20201006

Start time: 1600 ET

Zoom: https://ucsb.zoom.us/j/6601852842

Agenda

Transfer ZuseZ4/datasets to Rust-ML-WG?
Rust ML Optimisation
- Research on Optimisation Path (FPGA/Embedded Devices)
- CPU version of framework as first deliverable?
SmartCore
- https://smartcorelib.org/
- https://www.reddit.com/r/rust/comments/j1mj1g/smartcore_fast_and_comprehensive_machine_learning/
Review of linfa updates
chrism: updates on arewelearningyet.com
chrism: has been doing some research about open source licensing in machine learning
AndrazT: Anyone interested in on-line learning machinery in rust - like Vowpal Wabbit?
Ricky: Rust ML Meetup 20201127
- https://hackmd.io/7hStiyyHQcCCFGVCJb5pfQ

Participants

chrism
ricky
andraz
manuel

Minutes

Transfer ZuseZ4/datasets to Rust-ML-WG

Datasets repository to Rust ML Group Repository
manuel is interested in transfering that over Zuse24 dataset repository
talking about having a list of datasets or places to go for Rust data loaders for ML set in the group
chrism mentioned that it would be nice to have multiple loaders for the datasets so it is widely availble to people
- if it is a specific loader for something we might want to just link to it

Rust ML Optimisation

jamal is looking into optimisations with FPGAs and GPUs
- for a lot of FPGAs with open source we're not there yet
- cool project called symbiflow
  - https://symbiflow.github.io/
  - open source project that is taking that process and make it simplier
    - standarizing how to deploy to an FPGA
    - but that's a ways off
took a step back and looking at CPU optimizations
- RISCV
  - open source instruction set architecture
  - https://riscv.org/
- wants to start here, anything beyond FPGAs for this is a no-starter since the environment is too new
here he was looking at arrayfire and ndarray a framework for neural networks
- as they relate to embedded hardware

SmartCore

chrism looked at the documentation for the code
- docs are really good
- mostly classical ML algorithms
  - bunch of different regressions
  - decision trees and random forest
  - nearest neighbor
- linfa has one more clustering algorithm
  - SmartCore might only have k-means
Have not looked into their model evaluation
it is on arewelearningyet.com!

arewelearningyet.com updates

updated info
direct links to linfa and smartcore
- calling them out specifically as meta-repos for doing ML in Rust
we have green on the board!
- one category has been updated from red -> yellow
we need to have a sort order to the libraries on arewelearningyet.com
- talk with anthony on how to get this done
- best sorting order
  - maybe a function of crates.io downloads and last commit?
datasets on arewelearningyet.com
- another category for those
- right now there's not enough repositories for adding another category
linfa datasets
- https://github.com/rust-ml/linfa/tree/master/datasets
Talking about how crates.io blocked a publish crate with a github dependency
- just chatting about the technical aspects around loading a dataset with a crate

Open Source Licensing in Machine Learning

looking into licensing, ethics, privacy, and everythign around ML
reached out to the group of responsible AI licensesing
- not helpful responses from them
nearing the end of the first draft of the blog post on licensing around ML
- overview, in-the-weeds, and a good use case of licenses
- description of why he feels using Apache2.0 and MIT is not sufficient for the libraries
  - the distinct lack of responsibility of the technology that we're not comfortable with in publishing a ML library
talked to a lawyer family member
- IP lawyer is needed, they said it's magical and very specific
considering writing a first version of a license for open source ML libraries
- however writing your own license is very very difficult
if anyone is interested to help review, reach out
- he will find a way to send it so we can help out

Online Learning Machinery in rust

high veloicty machine learning
written implemetnation of regession
- online learning
  - most well known tool is Vowpal Wabbit
    - written in C++
    - heavily specalized and very fast
  - however, if you narrowdown the problem and use Rust
    - you can make it 3x faster ;)
they wrote an internal tool to make this very fast
- trying to open source it, licensing right now though and it's moving along
why it is interesting
- this is written in Rust that others don't have
- probably the fastest linear regression on normal CPUs
used internal for data science research
- hyperparameter tuning
- maxing out at memory bandwidth it's that fast
going to move from internal repository to github
- so they are forced to maintain it there
optmized a lot
- like instead of gzip using lz4
downside is it's very narrow in what it takes and gives

Rust ML Meetup November 27th

Rust ML WG member talks are listed here:
- https://hackmd.io/7hStiyyHQcCCFGVCJb5pfQ

Actions

chrism: talk to anthony about sort order for arewelearningyet.com
ricky: Possibly give some time to Andraz during Rust ML meetup talk to talk about the online learning tool they are open sourcing

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

meeting_00011_20201006.md

meeting_00011_20201006.md

Rust ML WG Meeting 00011

Meeting Info

Agenda

Participants

Minutes

Transfer ZuseZ4/datasets to Rust-ML-WG

Rust ML Optimisation

SmartCore

arewelearningyet.com updates

Open Source Licensing in Machine Learning

Online Learning Machinery in rust

Rust ML Meetup November 27th

Actions

Files

meeting_00011_20201006.md

Latest commit

History

meeting_00011_20201006.md

File metadata and controls

Rust ML WG Meeting 00011

Meeting Info

Agenda

Participants

Minutes

Transfer ZuseZ4/datasets to Rust-ML-WG

Rust ML Optimisation

SmartCore

arewelearningyet.com updates

Open Source Licensing in Machine Learning

Online Learning Machinery in rust

Rust ML Meetup November 27th

Actions