Skip to content

InGen is a command line tool written on top of pandas and great_expectations to perform small scale data transformations and validations without writing code.

License

Notifications You must be signed in to change notification settings

blackrock/ingen

Repository files navigation

InGen

InGen is a command line tool written on top of pandas and great_expectations to perform small scale data transformations and validations without writing code. It is designed for developers and analysts to quickly transform data by specifying their requirements in a simple YAML file.

Table of Contents

Installation

Note: This project has been tested on python version 3.9.6 on MacOS Ventura with apple chip.

To install the project locally follow the steps:

  1. Make sure you have Python 3.9.6 installed on your system.
  2. To be able to build the project locally, you will need to install the build package
    pip install build
    
  3. Clone the repository
    git clone [email protected]:blackrock/ingen.git
    
  4. Build the project
    cd ingen
    python -m build
    
  5. Install the wheel
    pip install dist/ingen-*.whl
    
  6. Run the project
    python -m ingen <metadata file path>
    

Examples

Checkout the sample metadata files in the examples directory to see how InterfaceGenerator can be used to solve common data problems.

Examples
Merge two CSV files
Database as source
Convert XML to CSV/Excel

For a complete list of configuration options, see the metadata reference documentation here.

Contributing

All contributions are welcome, please see open issues or create a new issue to discuss your ideas. Please see our contributing guidelines for more information.

License

LICENSE

About

InGen is a command line tool written on top of pandas and great_expectations to perform small scale data transformations and validations without writing code.

Resources

License

Code of conduct

Stars

Watchers

Forks

Packages

No packages published

Languages