slidenumbers: true
The best minds of my generation are thinking about how to make people click ads. That sucks. -- Jeff Hammerbacher (Co-founder Facebook)
^ now cloudera
- predicting where services will be needed
- prioritizing resources based on expected impact
- forecasting trends and changes
- identifying supporters likely to donate
^ Given enough data
This is already happening...1
- supported by the Eric & Wendy Schmidt Foundation
- 6 month fellowship
- currently in 3rd year (started 2013)
- Partners: NGOs, Governments
^ started by Rayid Ghani (Obama’s Chief data scientist)
- World Bank Group – Prediction & Identification of Collusion in International Development Projects
- Chicago Public Schools – Student Enrollment Prediction for Budget Allocation
- Pecan Street , WikiEnergy – Building Open Source Tools to Analyze Smart Meter Data
visit http://dssg.io/projects/
- like DSSG Chicago
- mainly funded by Oracle and Georgia Tech
- started 2014 (one year after Chicago)
- University of Washington just announced their DSSG Summer Program
http://escience.washington.edu/what-we-do/data-science-for-social-good
Bayes Impact is a nonprofit that deploys data scientists to solve big social problems with civic and nonprofit organizations
- founded 2014
- started as (full-time-)fellowship
- now hiring long-term employees
- Increasing Graduation Rate And Optimizing Class Offerings For UC Riverside
- Improving Outcomes For Emotionally And Behaviorally Challenged Children With Youth Villages
- Stratification Of Parkinson's Disease Patients
- Optimizing Ambulance Response Times In Sf
One weekend, impact the world
http://bayeshack.challengepost.com/submissions
^ curious to see how many will be alive in 6 months ^ image on right shows winner
- started this year (2014)
- currently 4 open competitions
^ https://www.kaggle.com/c/kdd-cup-2014-predicting-excitement-at-donors-choose ^ donors choose lets teachers enter projects for crowdfunding
We're tackling the world's biggest problems through data science. -- http://www.datakind.org
DataKind connects charities with data scientists by organizing two-day data dives where those data scientists help solve the charities’ data problems.
DataKind helped GiveDirectly – an NGO making unconditional cash transfers to poor households via mobile phones in Kenia and Uganda2 – to identify especially needy villages through satellite image analysis3.
^ predictive model to estimate number of roofs ^ and percentage of thatched / metal roofs ^ crowdsourced training data ^ template matching ^ 100 person days of manual effort saved
[fit] View the presentation
[fit] or read the paper
To help prioritize the many calls for help reaching Amnesty International’s Urgent Action Network DataKind volunteers have created a predictive model that analyzes messages for potential escalation.45
Combining data from Shooting Star Chase, public data about the hospice and healthcare sector and demographic data DataKind volunteers calculated predicted demand against hospice capacity to reveal areas of possible shortage.6
^ + a few other things
Most of DataKinds projects have been tackled by volunteers on 2-day data dives.
^ Who has been on a data dive?
(by voluntary data ambassadors in collaboration with the challenge partner – starting ~2 month before the data dive)
- anonymization/pseudonymization
- cleaning/fixing
- ensuring proper (machine readable) data formats
Any data scientist worth their salary will tell you that you should start with a question, NOT the data. -- Jake Porway in https://hbr.org/2013/03/you-cant-just-hack-your-way-to/
- Challenge partners pitch their problems
- Volunteers create analyses, models and visualizations (led by data ambassadors) in two intense days of hacking
- solutions are being presented at the end
^ Big community event ^ Data Ambassadors important
Social organizations still don’t have the expertise: data ambassadors must help implement the solutions
^ Not yet quite clear to me ^ Sent DataKind and email to clarify
^ Not yet quite clear to me
There is currently no organization in Germany comparable to DataKind.
There is currently no organization in Germany comparable to DataKind.
- Daniel Kirsch
- Marit Brademann
- Jana Kludas
- Richard Lawrence
- Georg Walther
^ Detexify, Co-founded OK Lab Münster
- Klaas Bollhöfer, Chief Data Scientist @ The Unbelievable Machine Company
- Adam Drake, Chief Data Officer @ Skyscanner
- Dr. Alexander Weiß Head of Data Analytics @ Trademob
- to prepare data before data dives
- lead teams at data dives
- help with the implementation afterwards
The international of the Data Science for Social Good-movement shows that data scientists are eager to donate their skills.
Social organizations need to understand how we can help them. Are you in contact with NGOs? Spread the word!
http://dssg-berlin.org/ @dssgber
Daniel Kirsch [email protected]
- http://datakind.org
- http://dssg.io
- http://dssg-atl.io
- http://bayesimpact.org
- http://codefor.de
- http://datalook.io
- Foto of Jeff Hammberbacher by Fred Brenenson licensed under CC BY 2.0
Footnotes
-
...but not so much in Germany ↩
-
http://www.ted.com/talks/joy_sun_should_you_donate_differently ↩
-
http://www.datakind.org/projects/using-the-simple-to-be-radical/ ↩
-
http://www.datakind.org/projects/using-predictive-analytics-to-prevent-human-rights-abuses/ ↩
-
http://www.washingtonpost.com/business/on-it/amnesty-international-considers-using-big-data-to-predict-human-rights-violations/2013/11/22/3f4f1a1e-5388-11e3-a7f0-b790929232e1_story.html ↩