Children's Services' Data Tool

This repository holds a set of tools and utilities for processing and cleaning Children's Services' data.

Most of the utilities are centred around three core datasets:

SSDA903
CIN Census
Annex A

Introduction to LIIA project

The LIIA (London Innovation and Improvement Alliance) project brings together Children’s Services data from all the Local Authorities (LAs) in London with the aim of providing analytical insights that are uniquely possible using pan-London datasets.

Please see LIIA Child Level Data Project for more information about the project, its aims and partners.

Purpose of liia-tools-pipeline package

The package is designed to process data deposited onto the data platform by local authorities such that it can be used for analysis purposes.

This is a Dagster code server library which is setup to be used as a code server.

How to use:

Local Development

Run poetry install
Copy .env.sample to .env and fill in the variables there as needed
Run the following command:
- For LA-level pipeline work: poetry run dagster dev -f .\liiatools_pipeline\repository_la.py
- For Region-level (Organisation) pipeline work: poetry run dagster dev -f .\liiatools_pipeline\repository_org.py
Once running, navigate to http://localhost:3000/
Add the pre-commit hook by running pre-commit install. This will ensure your code is formatted before you commit something

Preparation for Production or Staging

How this will run in production is that the library will be brought into a docker container with configuration specified in the file Dockerfile_user_code. Which code servers are used can be specified in the installation. See The SFDATA Platform's Workspace definition for details

The idea is each code server will have its own setup which will be a copy of what's here.

Note: Multiple libraries, pipelines, etc can exist in a single code server. Different servers should be used if they have conflicting requirements (e.g. different python versions)

Documentation

Take a look at the documentation to understand what this code is designed to do and how to replicate it for your own dataset transformations. We recommend reading text first, followed by text.

Name		Name	Last commit message	Last commit date
Latest commit History 890 Commits
.github/workflows		.github/workflows
docs		docs
external_dataset		external_dataset
liiatools		liiatools
liiatools_pipeline		liiatools_pipeline
.coveragerc		.coveragerc
.dockerignore		.dockerignore
.env.sample		.env.sample
.gitignore		.gitignore
.gitpod.Dockerfile		.gitpod.Dockerfile
.gitpod.yml		.gitpod.yml
.mailmap		.mailmap
.pre-commit-config.yaml		.pre-commit-config.yaml
Dockerfile		Dockerfile
Dockerfile_LA		Dockerfile_LA
Dockerfile_Org		Dockerfile_Org
LICENSE		LICENSE
README.md		README.md
dagster.yaml		dagster.yaml
development-best-practices.md		development-best-practices.md
poetry.lock		poetry.lock
pyproject.toml		pyproject.toml
workspace.yaml		workspace.yaml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Children's Services' Data Tool

Introduction to LIIA project

Purpose of liia-tools-pipeline package

How to use:

Local Development

Preparation for Production or Staging

Documentation

About

Releases 5

Packages

Languages

License

SocialFinanceDigitalLabs/liia-tools-pipeline

Folders and files

Latest commit

History

Repository files navigation

Children's Services' Data Tool

Introduction to LIIA project

Purpose of liia-tools-pipeline package

How to use:

Local Development

Preparation for Production or Staging

Documentation

About

Resources

License

Stars

Watchers

Forks

Releases 5

Packages 0

Languages

Packages