- The Data Engineering Cookbook
- Data Engineer Handbook
- Netflix Tech Blog Data Engineering
- Uber Engineering Blog
- Cloudflare Blog
- Meta Engineering Blog
- Linkedin Engineering
- AWS Architecture Blog
- Slack Engineering Blog
- Stripe Engineering Blog
- AWS SDK Examples
- AWS Samples
- AWS Labs
- aws-solutions
- build-on-aws
- Microservices.io
- martin fowler and Gregor Hohpe
- ArjanCodes
- mCodingLLC
- Building a Poor Mans Datalake from Scratch with DuckDB
- Revisiting the Poor Man’s Data Lake with MotherDuck
- Database Transactions
- Design Patterns in the Real World
- Design Patterns
- Refactoring Guru
- Design Patterns: Elements of Reusable Software
- UI Design Patterns
- OO Design
- python-ddd
- Design Patterns Scala
- Enterprise Application Patterns
- Practical Cryptography for Developers
- Problem Solving with Algorithms and Data Structures using Python
- Computer Security
- Open DSA Data Structures and Algorithms
- Awesome ETL
- Scala Design Patterns
- Goodreads ETL Pipeline
- Around Data Engineering
- Awesome Design Patterns
- Python Patterns
- Start Data Engineering
- Formal Ontology
- 6.005 Software Construction
- Google Site Reliability Engineering
- Calm Code
- Cosmic Python
- Python for Data Analysis
- Python Data Science Handbook
- Operating Systems
- Distributed Computing
- Data Integration
- Data Tools
- Awesome Big Data
- data oriented design
- Patterns of Distributed Systems
- Data Mesh Principles and Architecture
- Patterns for API Design
- gaphor
- Data Centric Design
- open-data-fabric
- serverlessland
- Data Oriented Design
- Python Patterns
- Diagrams
- python-anti-patterns
- Python 3 Patterns, Recipes and Idioms
- awesome-ddd
- designing-data-intensive-applications
- solution-architecture-patterns
- Serverless Patterns
- awesome-system-design
- awesome-software-architecture
- Google Site Reliabilty Engineering
- Microservice API Patterns
- Algorithms
- Data Structures
- Software_architecture
- Software_engineering
- Programming_paradigms
- Software_testing
- Systems_engineering
- Systems_science
- Systems_theory
- Systems_analysis
- Cloud Computing
- Software_requirements
- Programming Paradigms
- Programming_language_concepts
- Programming_principles
- Abstract_factory_pattern
- Builder_pattern
- Singleton_pattern
- Prototype_pattern
- Object_pool_pattern
- Facade_pattern
- Chain of Responsibility Pattern
- Flyweight Pattern
- Design Factory Patterns
- Composite Pattern
- Bridge Pattern
- Mediator Pattern
- Visitor Pattern
- Adapter Pattern
- Model View Controller
- Decorator Pattern
- Chain of Responsibility Pattern
- Command Pattern
- ThreatDragon
- Kotlin Faker
- DataBunker
- awesome-IAM
- open-data-anonymizer
- Presidio
- Penetration Testing Tools
- Public Pentesting Reports
- Nettacker
- Kubesploit
- Hackerpro
- RapidScan
- Astra
- awesome-pentest-cheat-sheets
- Hacking Security Ebooks
- awesome-infosec
- awesome-web-hacking
- infosec-reference
- Web Security Testing Guide
- Infection Monkey
- awesome-web-security
- awesome-hacking-resources
- h4cker
- PayloadsAllTheThings
- threat-model-cookbook
- Awesome Threat Modeling
- awesome-hacking
- penetration testing
- awesome-pentest
- chaos-toolkit
- Python for Network Engineers
- Chaos Engineering
- awesome networking
- Python for Network Engineers
- awesome-network-automation
- Boltons
- Docker-py
- more-itertools
- Kafka Python
- ZODB
- Click
- DataConv
- DBT Core
- State Transition Machines
- Storm
- Toolz
- Paramiko
- Textblob
- Jupyter: Docker-stacks
- cookie-cutter
- Transcriber
- Pytools
- Misskey
- OpenMeta
- Chalice
- Microservice Architecture: Serverless Compute Implementation
- Python-Lambda
- Pywren
- Zappa
- Memray: Memory Profiling
- Pybossa
- Apache Samza
- filesystem_spec
- google-i18n-address
- docker-wsl
- aws-data-wrangler
- Optimus
- metricflow
- lightdash
- chaos genius
- pyrsistent
- pydash
- latexify_py
- rocketry
- pydatafaker
- pydbgen
- faker
- RateLimiting
- DateTimeRange
- tenacity
- mako
- jinjasql
- data engineering on gcp
- polars
- Vaex
- Fugure: Distributed Computation
- Funcy
- Singer
- Dateutil
- pyparsing
- psutil
- ray
- click
- flask-boilerplate
- python-packager
- python-project-skeleton
- wemake-python-package
- pyscaffold
- xmltodict
- duckdb
- Dash and Sample Apps
- Seaborn
- Plotnine
- Bokeh
- Pygal
- Geoplotlib
- Gleam
- Missingno
- Leather
- Altair
- Folium
- Plotly
- Pillow
- Superset
- Glue Visualization
- BIRT
- SpagoBI
- Seal-Report
- metabase
- Databox
- KNIME
- Datapane
- Perspective
- redash
- reportserver
- awesome-business-intelligence
- Turnilo
- SandDance
- Abixen Platform
- d3
- Dash Examples
- sweetviz
- Awesome Web Viz Frameworks
- Echarts
- Grafana
- awesome-dataviz
- python-data-visualization
- The-Python-Graph-Gallery
- rustworkx
- solara
- pygwalker
- graphic-walker
- datapane
- gleam
- streamlit
- ipywidgets
- voila
- dbeaver
- awesome-db-tools
- amazon-redshift-utils
- amazon-redshift-developer-guides
- awesome-time-series-database
- SQL Alchemy
- Pyodbc
- PyMySQL
- Redash
- SQLmap
- Pyodbc
- ddlparse
- lacquer
- omymodels
- sql-metadata
- sqlglot
- sqlparse
- Sqlbucket
- DBFread
- sqlalchemy-hana
- pymssql
- sqleyes
- data-diff
- amazon-redshift-python-driver
- spyql
- awesome-sqlalchemy
- ipython-sql
- redshift-developer-guide
- cloud-sql-python-connector
- aiosql
- sqlfluff
- sqlmodel
- pypika
- Amazon Redshift Utils
- connector-x
- psycopg2
- pg_simple
- databases
- sqlmesh
- DBUtils
- pymysql-pool
- django-db-connection-pool
- Vanna
- malloy
- Apache ORC
- Apache Pinot
- pdfplumber
- camelot
- ingestr
- Flask
- Tornado
- Tenacity
- Eve
- Flask Restful
- Google API Client
- Zeep
- Connexion
- Hug
- Falcon
- Aiohttp
- FastAPI
- OpenAPI Python Client
- requests-toolbelt
- smart_open
- Wikipedia API and Wrapper
- Office365-Rest-Python-Client
- youtube-dl
- Twisted
- simple-salesforce
- Venmo API
- Flask
- Django
- Coursera Downloader
- Public APIs
- python-oauth2
- requests-oauth2
- redo
- backoff
- Directus Data Stack
- StreamLit
- Pybossa
- starlette
- awesome-fastapi
- awesome-fastapi-projects
- Requests-futures
- Requests-threads
- grequests
- async_generator
- httpx
- requests-async
- mpire
- offspring
- multiprocessing_on_dill
- continuous threading
- Needle
- atasker
- asgiref
- concurrency-in-python-with-asyncio
- Async & Multitasking
- fast_map
- aiobotocore
- aioboto3
- aiohttp-client-cache
- aiohttp
- multiprocess
- aiofiles
- aiobotocore
- aioboto3
- dataflow
- pyfi
- dataflowkit
- data flow graph
- python flow
- gerda dataflow
- dataflows
- d6tflow
- prefect
- Schedule
- Luigi
- Faust
- Redis Queue
- Airflow-Great-Expectations
- Smart Open
- Zipstream
- multi-part-upload
- Celery
- airflow
- sftp-lambda
- lambda-s3-ftp
- Apache Beam
- Processing (I/O and Piplines)
- stream unzip
- pypyr
- Data Flow Ops
- Apache Spark Guide
- Orchest
- Mage AI
- Meltano
- DataJoint Python
- Hamilton
- Kombu
- airbyte
- ploomber
- data-diff
- Amazon Apache Airflow Managed Workflow
- Airbyte
- mage-ai
- Dagster
- Data All
- awesome-flink and Examples
- flink
- RedPanda
- Materialize
- Hazelcast
- Watermill
- Amazon Kinesis Client Python
- Faust
- Stream Processing
- Spark Streaming in Python
- Kestra
- Hamilton
- CloudQuery
- Nifi
- Pentaho-Kettle
- Camel
- Riko
- Bonobo
- Petl
- awesome-apache-airflow
- airflow provider sample
- metabase
- Flowman
- Apache Beam
- hamilton
- pachyderm
- elementary
- AWS Encryption SDK
- AWS Xray SDK
- AWS SDK Pandas
- Sagemaker SDK
- GCP Data Validator
- AWS Redshift Driver
- Cloudwatch Logging
- Former2
- Sagemaker Spark
- Secrets Manager Caching
- Spark With Python
- Learning Pyspark
- Spark Redshift
- Sagemaker Graph ER
- aws-glue-developer-guide
- pyspark examples
- pyspark cheatsheet
- aws-scheduler
- emr-serverless-samples
- Polars
- duckDB
- Dask
- SparkSQL
- cloud-experiments
- document-understanding-solution
- aws-glue-docker
- amazon-comprehend-examples
- Festin
- MinIO-py
- Bucketstore
- amazon-redshift-udfs
- PyLazyS3
- Minio
- Pandas Profiling
- WhyLogs
- PointBlank
- Hooqu
- pyDMNrules
- DQ-Meerkat
- dataqtor
- DataGristle
- versatile-data-kit
- Soda-Core
- ydata-quality
- Pydqc
- Business Rule Engine
- Python Business Logic
- Business Rules
- Hay_checker
- Great Expectations
- Feast
- Datatile
- business-rules venmo
- pydqc
- Data Gristle
- deep diff
- Great Expectations
- XML to Dict
- Pylint
- postal-address
- python-email-validator
- flatten-dict
- pytesseract
- python deequ
- ydata-profiling
- Memphis
- Benthos
- Awesome Streaming
- Storm
- bigquery-data-lineage
- multi-data-lineage-capture-py
- DataTracer
- data-lineage
- elementary
- stairlight
- OpenLineage
- Marquez
- Odd-Platform
- waltz
- sqllineage
- Spline
- grafana
- awesome observability
- Signoz
- zipkin
- kibana
- vector
- netdata
- odd platform
- Data Observability in Practice
- Monosi
- Swiple
- Elementary
- awesome-opentelemetry
- dd-trace-py
- signoz
- vector
- netdata
- awesome-observability
- malloy
- Json Classes
- Schema
- Jmespath
- XmlUtils
- ExtraTools
- Collections Extended
- More Itertools
- Lark Parser
- Json Flattener
- Scrapy
- Beautiful Soup
- marshmallow
- PyPDF2
- Pydantic
- Pyspider
- Pydantic SQLAlchemy
- simplejson
- json2parquet
- pyyaml
- Attrs
- Chardet
- simple-enum
- dataklasses
- dataclasses-json
- dataclassy
- python-choicesenum
- fastenum
- bnum
- data-enum
- superstring.py
- text2text
- pyspellchecker
- symspellpy
- python string similarity
- textdistance
- string-algorithms
- python-phonenumbers
- CommonRegex
- Addresser
- Unidecode
- whoosh
- usaddress
- jellyfish
- Postal Address
- dirtyJson
- awesome-json
- dataclass_array
- python-graphs
- python-email-validator
- dataprep
- fuzzywuzzy
- Cerberus
- PolyFuzz
- Fuzzy Search
- Pachyderm
- CleanLab
- awesome-jsonschema
- dateparser
- dateutil
- Cerebrus
- validators
- Valideer
- Pandera
- Typical
- kdatapackage
- PandasSchema
- TypedFrame
- tableschema
- validator-collection
- deepchecks
- awesome-validation-python
- attrs-strict
- pydantic-core
- dataclass-type-validator
- Jinja
- liaison
- DataCleaner