Skip to content

Latest commit

 

History

History
369 lines (255 loc) · 23.5 KB

README.md

File metadata and controls

369 lines (255 loc) · 23.5 KB

Climate Action Data Trust

Please review the CAD Trust Terms and Conditions and CAD Trust User Terms and Conditions.

​This project provides the Climate Action Data Trust (CADT) API that integrates with the Chia Blockchain. For a user interface, see the CADT UI project, which will connect to the CADT API.

This project was formerly known as the Climate Warehouse, and you may see this term used interchangeably with CADT.

Pagination is now strongly recommended on the units, projects, and issuances API endpoints and will be mandatory in the future. Please adjust your API calls accordingly.

User Guide

The CADT application is designed to run 24/7, much like any other API. While it is possible to run it on-demand only when API requests need to be made, this guide assumes a permanently running solution.

The simplest way to run the CADT application is to use the same machine the Chia Full Node, Wallet, Datalayer, and Datalayer-HTTP services reside on. CADT communicates with the Chia services over an RPC interface. The RPC interface uses certificates to authenticate, which will work automatically when the CADT application is run as the same user on the same machine as the Chia services. To run CADT on a separate machine from Chia, a public certificate from the Chia node must be used to authenticate (not yet documented).

Basic Chia installation instructions are provided below, but further installation options, please see the Chia docs site. For most CADT setups, we recommend the installing the headless chia-blockchain-cli package via the apt repo and using systemd.

After the initial installation, it will take anywhere from a few days (most likely for a cloud-hosted server) to a few weeks (possible on lower-powered systems or servers with slow connections) to sync the Chia full node. During this time, CADT can start syncing, but any writes to CADT are likely to fail. For best results, we recommend waiting for the Chia full node to finish syncing before using CADT. Check the status of the full node sync with chia show -s.

Those familiar with bittorrent and have a fast connection can speed up the full node syncing by downloading the database and copying it into place manually

How to use the API

Please see the CADT RPC API Guide.

Installation

Releases are tagged in Github, and binaries are built for Windows, macOS, and Linux. ARM binaries are available for Debian versions of Linux only.

System Requirements

CADT and Chia system usage will depend on many factors, including how busy the blockchain is, how much data is being mirrored by DataLayer, and how much data CADT is ingesting and processing. The current minimum requirements for running CADT and Chia together on a system are:

  • 4 CPU cores
  • 8 GB RAM
  • 300 GB disk space

ARM and x86 systems are supported. While Windows, MacOS, and all versions of Linux are supported, Ubuntu Linux is the recommended operating system as it is used most in testing and our internal hosting.

Linux

A binary file that can run on all Linux distributions on x86 hardware can be found for each tagged release named cadt-linux-x64-<version>.zip. This zip file will extract to the cadt-linux-64 directory by default, where the cadt file can be executed to run the API.

Recommended Method: APT on Debian-based Linux Distros (Ubuntu, Mint, etc)

The CADT API and the Chia Blockchain software can be installed with apt. Both ARM and x86 versions can be installed this way.

  1. Start by updating apt and allowing repository download over HTTPS:
sudo apt-get update
sudo apt-get install ca-certificates curl gnupg
  1. Add Chia's official GPG Key (if you have installed Chia with apt, you'll have this key already and will get a message about overwriting the existing key, which is safe to do):
curl -sL https://repo.chia.net/FD39E6D3.pubkey.asc | sudo gpg --dearmor -o /usr/share/keyrings/chia.gpg
  1. Use the following command to set up the CADT repository.
echo "deb [arch=$(dpkg --print-architecture) signed-by=/usr/share/keyrings/chia.gpg] https://repo.chia.net/cadt/debian/ stable main" | sudo tee /etc/apt/sources.list.d/cadt.list > /dev/null
  1. Add the Chia repository (via the official Chia installation instructions)
echo "deb [arch=$(dpkg --print-architecture) signed-by=/usr/share/keyrings/chia.gpg] https://repo.chia.net/debian/ stable main" | sudo tee /etc/apt/sources.list.d/chia.list > /dev/null
  1. Install Chia CLI and CADT
sudo apt-get update
sudo apt-get install chia-blockchain-cli cadt
  1. Start Chia Wallet, Full Node, and Datalayer with systemd
sudo systemctl start chia-wallet@<USERNAME> chia-data-layer@<USERNAME> chia-full-node@<USERNAME>

For <USERNAME>, enter the user that Chia runs as (the user with the .chia directory in their home directory). For example, if the ubuntu is where Chia runs, start Chia with systemctl start chia-wallet@ubuntu.

Optional: If using Chia's built-in HTTP server to share datalayer files, start the chia-data-layer-http service with

sudo systemctl start chia-data-layer-http@<USERNAME>
  1. Start CADT with systemd
sudo systemctl start cadt@<USERNAME>
  1. Set Chia and CADT to run at boot
sudo systemctl enable chia-wallet@<USERNAME> chia-data-layer@<USERNAME> chia-full-node@<USERNAME> cadt@<USERNAME>

If using the built-in HTTP server for datalayer, start it at boot with

sudo systemctl enable chia-data-layer-http@<USERNAME>
  1. View CADT logs to validate
journalctl -u cadt@<USERNAME> -f
(ctrl+c to exit)

Installation from Source

Installation from source is only recommended for those contributing code to CADT and is not intended to be used in production.

You'll need: ​

  • Git
  • nvm - This app uses nvm to align node versions across development, CI and production. If you're working on Windows, you should consider nvm-windows
  • A working Chia installation running wallet and datalayer (full node recommended)

To install from source:

git clone [email protected]:Chia-Network/cadt.git
cd cadt
nvm install
nvm use
npm run start

Datalayer HTTP File Serving

CADT relies on all participants publicly sharing their data over Chia Datalayer, which includes sharing the Chia-generated .dat files over HTTP. The files are located in ~/.chia/mainnet/data_layer/db/server_files_location_<NETWORK>/ (where <NETWORK> is the Chia network, usually either "mainnet" or "testneta") and can be shared over any web-accessible HTTP endpoint, including

  • Using the built-in datalayer-http service (see Installation instructions below). Datalayer-http runs on port 8575 by default which may need to be opened in your firewall configuration or forwarded by your router. Additionally, a static IP address, or stable DNS record, will be required, which is not offered by default on some hosting providers. On AWS, assign an Elastic IP to the EC2 instance or use an Application Load Balancer to solve this.

  • Using Nginx, Apache, Caddy, or any other web server. This also requires a static IP address, or dynamically assigned DNS record. Another challenge is that the default location for the .dat files is in the user's home directory, which the web server software will not have read-access to. One simple solution is

    • mv ~/.chia/mainnet/data_layer/db/server_files_location_<NETWORK> /var/www/ - move the datalayer file directory outside of the home directory
    • chmod -R 744 /var/www/server_files_location_<NETWORK> - change permissions on all datalayer files to be read by any user
    • ln -s /var/www/server_files_location_<NETWORK> ~/.chia/mainnet/data_layer/db/server_files_location_<NETWORK> - create a shortcut from the old location to the new
    • Use Nginx, Apache, Caddy, or any web server to serve the files over HTTP. Here is a sample Nginx config:
    server {
      listen 80;
    
      root /var/www/server_files_location_<NETWORK>;
    
      server_name datalayer.example.com;
    
      index index.html;
    
      expires 30d;
      add_header Pragma "public";
      add_header Cache-Control "public";
      
    }
    
  • Use S3 or other object store. Datalayer .dat files can be synced to any cloud file storage solution that can serve them publicly over HTTP. One recommended solution using S3 is to use this script and follow the installation and usage instructions in the README.

Once the .dat files are publicly available, update DATALAYER_FILE_SERVER_URL in the CADT configuration file with the URL or IP address (always include http:// or https://) and the port, then restart CADT. CADT will begin to create mirrors at this URL for your data and all stores you are subscribed to.

Run CADT on a Testnet

Chia has a few test networks called "testnets". Testnets allow anyone to test applications using plentiful and low value TXCH instead of needing to purchace XCH. We recommend running a testnet version of CADT in order to test integrations, software updates, and experiment in a low-stakes environment.

CADT runs on a testnet called "testnetA" which is different than the main Chia testnet, testnet11. TestnetA has a CADT governance node and an observer. To configure your Chia and CADT environment to use testnetA, do the following:

Note - these instructions only work with Chia version 2.4.4 and above

  1. Follow the instructions here and install chia-tools.

  2. Use chia-tools to switch to testneta in the Chia config

    chia-tools network switch testneta

  3. Restart Chia

    sudo systemctl restart chia-wallet@<USERNAME> chia-data-layer@<USERNAME> chia-full-node@<USERNAME>

  4. Stop CADT

    sudo systemctl stop cadt@<USERNAME>

  5. Update the GOVERNANCE_BODY_ID in ~/.chia/mainnet/cadt/v1/config.yaml to be 1019153f631bb82e7fc4984dc1f0f2af9e95a7c29df743f7b4dcc2b975857409

  6. If you already were running CADT on mainnet, delete the CADT database

    rm ~/.chia/mainnet/cadt/v1/data.sqlite3*

  7. Start CADT

    sudo systemctl start cadt@<USERNAME>

Ports, Networking, and Security

By default, the CADT API will listen on localhost only on port 31310. If running a node with READ_ONLY set to false, it is highly recommended that CADT is run on a private network or with access limited by IP address. To allow remote connections to CADT, set the BIND_ADDRESS (see the Configuration section below) to the IP to listen on, or 0.0.0.0 to listen on all interfaces. The port for the CADT API can be set with the parameter CW_PORT. The default port is 31310. In many cases, users will need to access the API from their workstations for either the CADT UI or to integrate with existing tools and scripts. To add authentication to the API, use the CADT_API_KEY parameter. Alternatively, the API can be served behind an authentication proxy to restrict access and the CADT_API_KEY can be left blank. If running an observer node with READ_ONLY set to true, the CADT API will only share data from the public blockchain, and running without authentication is usually safe. If READ_ONLY is set to false, authentication must be used to prevent unauthorized writes to the blockchain.

Adding Encryption to the CADT API

The CADT API uses HTTP and is unencrypted. To add encryption, use a reverse proxy like Nginx with an SSL certificate. In this scenario, the CADT application can be set to listen only on localhost, and Nginx (on the same server) will proxy incoming requests to port 31310.

Configuration

In the CHIA_ROOT directory (usually ~/.chia/mainnet on Linux), CADT will add a directory called cadt/v1 when the application is first run (in fact, this directory could be deleted at any time and CADT will recreate it next time it is started). The main CADT configuration file is called config.yaml and can be found in this directory. The options in this file are as follows (the full list of available options can be seen in the config template):

  • MIRROR_DB: This section is for configuring the MySQL-compatible database that can be used for easy querying for report generation. This is optional and only provides a read-only mirror of the data CADT uses.
    • DB_USERNAME: MySQL username
    • DB_PASSWORD: MySQL password
    • DB_NAME: MySQL database name
    • DB_HOST: Hostname of the MySQL database
  • APP: This section is for configuring the CADT application.
    • CW_PORT: CADT port where the API will be available. 31310 by default.
    • LOG_LEVEL: Controls verbosity of logging. Common settings are info and debug. Setting to silly will log all queries.
    • BIND_ADDRESS: By default, CADT listens on localhost only. To enable remote connections to CADT, change this to 0.0.0.0 to listen on all network interfaces, or to an IP address to listen on a specific network interface.
    • DATALAYER_URL: URL and port to connect to the Chia DataLayer RPC. If Chia is installed locally with default settings, https://localhost:8562 will work.
    • WALLET_URL: URL and port to connect to the Chia Wallet RPC. If Chia is installed on the same machine as CADT with default settings, https://localhost:9256 will work.
    • USE_SIMULATOR: Developer setting to populate CADT from a governance file and enable some extra APIs. Should always be "false" under normal usage.
    • READ_ONLY: When hosting an Observer node, set it to "true" to prevent any data from being written using the CADT APIs. This makes the application safe to run with public endpoints as it is just displaying publicly available data. When running a governance node, or a participant node, set to "false" to allow data to be written to the CADT APIs. When "false", additional authentication or access restrictions must be applied to prevent unauthorized alteration of the data.
    • CADT_API_KEY: This key is used by the CADT UI to authenticate with the CADT API endpoints. This allows the API to power the UI only without allowing requests missing the API key in the header to access the API. This can be left blank to allow open access to the API or if access is restricted by other means. The CADT_API_KEY can be set to any value, but we recommend at least a 32-character random string. The CADT_API_KEY can be passed in a request using the x-api-key header. See the RPC documentation for examples.
    • CHIA_NETWORK: CADT can run on Chia mainnet or any testnet. Set to "mainnet" for production instances or "testnet" if using the main Chia testnet.
    • USE_DEVELOPMENT_MODE: Should be false in most use cases. If a developer is writing code for the app, this can be changed to "true," which will bypass the need for a governance node.
    • IS_GOVERNANCE_BODY: "True" or "false" toggle to enable/disable mode for this instance being a governing body.
    • DEFAULT_FEE: Fee for each transaction on the Chia blockchain in mojos. The default is 300000000 mojos (0.0003 XCH) and can be set higher or lower depending on how busy the Chia network is. If a fee is set very low, it may cause a delay in transaction processing.
    • DEFAULT_COIN_AMOUNT: Units are mojo. Each DataLayer transaction needs a coin amount, and the default is 300000000 mojo.
    • CERTIFICATE_FOLDER_PATH: If using a custom path for the Chia Blockchain certificates folder, enter the path here to allow CADT to find the certificates and authenticate to the Chia RPC. CADT assumes the folder structure within the directory specified matches the default Chia SSL directory of $CHIA_ROOT/config/ssl/.
    • DATALAYER_FILE_SERVER_URL: Publicly available Chia DataLayer HTTP URL and port, including schema (http:// or https://). If serving DataLayer files from S3, this would be the public URL of the S3 bucket. Port can be omitted if using standard ports for http or https requests.
    • AUTO_SUBSCRIBE_FILESTORE: Subscribing to the filestore for any organization is optional. To automatically subscribe and sync the filestore to every organization you subscribe to, set this to true.
    • AUTO_MIRROR_EXTERNAL_STORES: When set to true (the default), CADT will automatically create mirrors for each store you are subscribed to. Mirroring all subscriptions using the DATALAYER_FILE_SERVER_URL will make the entire CADT network more resiliant and distributed.
    • TASKS: Section for configuring sync intervals
      • GOVERNANCE_SYNC_TASK_INTERVAL: Syncs new organizations from the governance node. Default 86400.
      • ORGANIZATION_META_SYNC_TASK_INTERVAL: Syncs organization data from the blockchain. Default 300.
      • PICKLIST_SYNC_TASK_INTERVAL: Syncs picklist from the governance node. Default 30.
      • MIRROR_CHECK_TASK_INTERVAL: Checks if our DataLayer is advertising our DATALAYER_FILE_SERVER_URL as a mirror for all subscriptions when AUTO_MIRROR_EXTERNAL_STORES is true. Default 86460.
  • GOVERNANCE: Section on settings for the Governance body to connect to.
    • GOVERNANCE_BODY_ID: This determines the governance body your CADT network will be connected to. While there could be multiple governance body IDs, the default of 23f6498e015ebcd7190c97df30c032de8deb5c8934fc1caa928bc310e2b8a57e is the right ID for most people on mainnet.

​ Note that the CADT application will need to be restarted after any changes to the config.yaml file.

Developer Guide

A development environment for CADT assumes a synced Chia wallet running locally. Node version manager (nvm) is used to switch node environments quickly. The repo contains a .nvmrc file that specifies the node version the CADT is expected to use and developers can do nvm use to switch to the version in the .nvmrc.

Contributing

All branches should be created from the develop branch and not from main. All pull requests should be made against the develop branch unless it is a new release. The develop branch will be merged into the main branch to create a release. Automation in the CI will create the release and attach the installation files to it automatically whenever code is merged to main. Additionally, the changelog will automatically be updated in the main branch. Therefore, the main branch should always be a representation of the latest released code.

​This repo uses a commit convention. A typical commit message might read: ​

    fix: correct home screen layout

​ The first part of this is the commit "type". The most common types are "feat" for new features and "fix" for bugfixes. Using these commit types helps us correctly manage our version numbers and changelogs. Since our release process calculates new version numbers from our commits, getting this right is very important. ​

  • feat is for introducing a new feature
  • fix is for bug fixes
  • docs for documentation only changes
  • style is for code formatting only
  • refactor is for changes to code that should not be detectable by users or testers
  • perf is for a code change that improves performance
  • test is for changes that only touch test files or related tooling
  • build is for changes that only touch our develop/release tools
  • ci is for changes to the continuous integration files and scripts
  • chore is for changes that don't modify code, like a version bump
  • revert is for reverting a previous commit ​ After the type and scope, there should be a colon (:). ​ The "subject" of the commit follows. It should be a short indication of the change. The commit convention prefers that this is written in the present-imperative tense.

Versioning

This project mostly adheres to semantic versioning. The version specified in package.json will be used by the ci to create the new release in Github, so it is important to set that correctly. The major version (version 1.0, 2.0, etc) should only be changed when the data model changes and the API goes from v1 to v2. Minor version changes (version 1.2 to 1.3, etc.) are for breaking or substantial changes, usually requiring some action on the user's part.

Commit linting

​ Each time you commit the message will be checked against these standards in a pre-commit hook. Additionally, all the commits in a PR branch will be linted before it can be merged to master.

To set up the pre-commit hooks on your local, run the following: ​

npm install -g @babel/cli husky prettier lint-staged cross-env
npm set-script prepare "husky install"
npm run prepare
​
// If you are on linux or mac run
chmod ug+x .husky/*
chmod ug+x .git/hooks/*

Build Binaries

After running the "Installation from Source" steps above, do the following:

// transcompile project to es5
npm run build
​
// Output binaries to dist folder
npm run create-win-x64-dist
npm run create-mac-x64-dist
npm run create-linux-x64-dist

Sequelize Generator

Creating Model and Migration Script

​ Use the following command to create a model and a migration script ​ npx sequelize-cli model:generate --name User --attributes firstName:string,lastName:string,email:string ​

Running Migrations

​ Migrations run automatically when you run the software. There is no manual tool that needs to be used. ​

Making changes to Migrations without rolling back

​ If you want to alter, drop, or add a column or add a foreign key or anything with the table. Use the following command to create a barebones migration script ​ npx sequelize-cli migration:generate --name ​

Running Full Text Search Queries

​ To run an FTS query on a supported table, you can use the MATCH operator. A virtual column rank is also available for sorting purposes. ​ Example: ​

SELECT rank, * FROM projects_fts WHERE projects_fts MATCH "PartialMatch*" ORDER BY rank
​
The '*' in the match is needed for wildcard

​ More info: https://www.sqlite.org/fts5.html

Connecting to the WebSocket

  1. Open a WebSocket connection to http://localhost:31310/v1/ws
    1. Once subscribed, emit either emit ...['subscribe', 'units'] or ['subscribe', 'projects']. You can emit both on the same connection
  2. You will receive frames that look like ...[change:units, {orgUid: '123'}], letting you know that there has been an update to a record Coll